Human-Centered Data Science book cover
Credit: The MIT Press

A new book explores best practices for addressing the bias and inequality that may result from the automated collection, analysis, and distribution of large datasets.

Human-centered data science is a new interdisciplinary field that draws from human-computer interaction, social science, statistics, and computational techniques.

This book, written by founders of the field, introduces best practices for addressing the bias and inequality that may result from the automated collection, analysis, and distribution of very large datasets.

It offers a brief and accessible overview of many common statistical and algorithmic data science techniques, explains human-centered approaches to data science problems, and presents practical guidelines and real-world case studies to help readers apply these methods.

The authors explain how data scientists’ choices are involved at every stage of the data science workflow—and show how a human-centered approach can enhance each one, by making the process more transparent, asking questions, and considering the social context of the data. They describe how tools from social science might be incorporated into data science practices, discuss different types of collaboration, and consider data storytelling through visualization. The book shows that data science practitioners can build rigorous and ethical algorithms and design projects that use cutting-edge computational tools and address social concerns.

Authors

– Cecilia Aragon, Professor, Department of Human Centered Design and Engineering, University of Washington

– Shion Guha, Assistant Professor, Faculty of Information, University of Toronto

– Marina Kogan, Assistant Professor, School of Computing, University of Utah

– Michael Muller, Research staff member, IBM Research

– Professor Gina Neff, Executive Director, Minderoo Centre for Technology and Democracy, University of Cambridge

Catch up with the book launch for Human Centered Data Science

Event begins 03:15

The live transcript is available