Online Conference & Challenge
October 2 & 9, 2020

How data science, digital humanities, and libraries address sources, methods, and meaning


Researchers in every field of study have access to far more data than could have been imagined even a decade ago. The scale and speed of data creation have opened up exciting new paths of inquiry while, at the same time, introducing new kinds of data bias and new challenges to the reproducibility and reliability of results.

All data are representations. Their collection, handling, reduction and transformation influence what we can and cannot learn from data. And yet our traditional methods of data management, rooted in provenance, context, and careful documentation, struggle to keep pace with big data.

Data science has made significant advances in computational and mathematical tools to organize and analyze data, recognize patterns and make predictions at scale.

Digital humanities are also concerned with scale, but approach the problem from a different angle: How do we reduce rich, layered, and uneven historical and literary sources into data representations that are ripe for computation?

Libraries and archives that produce metadata, define collections, and trace provenance, all to provide meaningful context, are in need of more powerful tools to support the production of new knowledge.

This state of things presents an opportunity, and perhaps an imperative, to learn from each other across theoretical and methodological divides, and to address the social, ethical, and political implications of the boundlessness of data today.

In response to this need, the Stanford Data Science Institute, CESTA (Center for Spatial and Textual Analysis) and Stanford Libraries are hosting two complementary trans-disciplinary events to foster communication and shared understanding.


Plenary Talks

Friday, October 2, 2020
9 a.m. - 12:30 p.m. PDT

The conference will feature three plenary talks from thought leaders in data science, the humanities, and design. The talks will be introduced by the conference organizers: Giovanna Ceserani, Chiara Sabatti, and Nicole Coleman.

This event will be hosted as an online 'watch party' with recorded talks followed by a live question and answer session with the speakers.

Challenge

Friday, October 9, 2020
10 a.m. - 3 p.m. PDT

Twenty-four Stanford PhD candidates or recent PhD recipients will be selected from disciplines across campus to participate in a challenge. The challenge includes group workshopping of papers through the lens of issues in data practices and culminates in a discussion and review of findings with faculty and library staff.

Conference Keynote Speakers

Anne Burdick

Jo Guldi

Mark Hansen