Teaching Responsible Data Science: Charting New Pedagogical Territory

12/23/2019
by Julia Stoyanovich, et al.

Although numerous ethics courses are available, with many focusing specifically on technology and computer ethics, the pedagogical approaches employed in these courses rely exclusively on texts rather than on software development or data analysis. Technical students often consider these courses unimportant and a distraction from the "real" material. To develop instructional materials and methodologies that are thoughtful and engaging, we must strive for balance: between texts and coding, between critique and solution, and between cutting-edge research and practical applicability. Finding such balance is particularly difficult in the nascent field of responsible data science (RDS), where we are only starting to understand how to interface between the intrinsically different methodologies of engineering and social sciences. In this paper we recount a recent experience in developing and teaching an RDS course to graduate and advanced undergraduate students in data science. We then dive into an area that is critically important to RDS, namely transparency and interpretability of machine-assisted decision-making, and tie this area to the needs of emerging RDS curricula. Drawing on our own experience and on the literature on pedagogical methods in data science and beyond, we propose the notion of an "object-to-interpret-with". We link this notion to "nutritional labels", a family of interpretability tools that are gaining popularity in RDS research and practice. With this work we aim to contribute to the nascent area of RDS education, and to inspire others in the community to come together to develop a deeper theoretical understanding of the pedagogical needs of RDS and to contribute concrete educational materials and methodologies that others can use. All course materials are publicly available at https://dataresponsibly.github.io/courses.
