"Playing the whole game": A data collection and analysis exercise with Google Calendar

02/23/2020
by   Albert Kim, et al.
0

We provide an exercise suitable for early introduction in an undergraduate statistics or data science course that allows students to `play the whole game' of data science: performing both data collection and data analysis. While many teaching resources exist for data analysis, such resources are not as abundant for data collection given the inherent difficulty of the task. Our proposed exercise centers around student use of Google Calendar to collect data with the goal of answering the question `How do I spend my time?' On the one hand, the exercise involves answering a question with near universal appeal, but on the other hand, the data collection mechanism is not beyond the reach of a modal undergraduate student. A further benefit of this exercise is that it provides an opportunity for discussions on ethical questions and considerations that data providers and data analysts face in today's age of large-scale internet-based data collection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2022

Prespecification of Structure for Optimizing Data Collection and Research Transparency by Leveraging Conditional Independencies

Data collection and research methodology represents a critical part of t...
research
08/04/2022

Teaching Visual Accessibility in Introductory Data Science Classes with Multi-Modal Data Representations

Although there are various ways to represent data patterns and models, v...
research
01/30/2019

Software solutions for form-based collection of data and the semantic enrichment of form data

Data collection is an important part of many citizen science projects as...
research
11/20/2017

Data Capture & Analysis to Assess Impact of Carbon Credit Schemes

Data enables Non-Governmental Organisations (NGOs) to quantify the impac...
research
12/20/2022

AI applications in forest monitoring need remote sensing benchmark datasets

With the rise in high resolution remote sensing technologies there has b...
research
01/23/2019

Three principles of data science: predictability, computability, and stability (PCS)

We propose the predictability, computability, and stability (PCS) framew...
research
05/31/2022

To Collaborate or Not in Distributed Statistical Estimation with Resource Constraints?

We study how the amount of correlation between observations collected by...

Please sign up or login with your details

Forgot password? Click here to reset