A fresh look at introductory data science

08/01/2020
by   Mine Çetinkaya-Rundel, et al.
0

The proliferation of vast quantities of available datasets that are large and complex in nature has challenged universities to keep up with the demand for graduates trained in both the statistical and the computational set of skills required to effectively plan, acquire, manage, analyze, and communicate the findings of such data. To keep up with this demand, attracting students early on to data science as well as providing them a solid foray into the field becomes increasingly important. We present a case study of an introductory undergraduate course in data science that is designed to address these needs. Offered at Duke University, this course has no pre-requisites and serves a wide audience of aspiring statistics and data science majors as well as humanities, social sciences, and natural sciences students. We discuss the unique set of challenges posed by offering such a course and in light of these challenges, we present a detailed discussion into the pedagogical design elements, content, structure, computational infrastructure, and the assessment methodology of the course. We also offer a repository containing all teaching materials that are open-source, along with supplemental materials and the R code for reproducing the figures found in the paper.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2016

Embracing Data Science

Statistics is running the risk of appearing irrelevant to today's underg...
research
01/29/2021

A Statistician Teaches Deep Learning

Deep learning (DL) has gained much attention and become increasingly pop...
research
01/21/2020

Integrating data science ethics into an undergraduate major: A case study

We present a programmatic approach to incorporating ethics into an under...
research
12/23/2019

Teaching Responsible Data Science: Charting New Pedagogical Territory

Although numerous ethics courses are available, with many focusing speci...
research
01/03/2023

Introducing Variational Inference in Statistics and Data Science Curriculum

Probabilistic models such as logistic regression, Bayesian classificatio...
research
12/29/2022

Deep R Programming

Deep R Programming is a comprehensive course on one of the most popular ...
research
03/25/2019

Categorical Data Integration for Computational Science

Categorical Query Language is an open-source query and data integration ...

Please sign up or login with your details

Forgot password? Click here to reset