How Much Automation Does a Data Scientist Want?

01/07/2021
by   Dakuo Wang, et al.
54

Data science and machine learning (DS/ML) are at the heart of the recent advancements of many Artificial Intelligence (AI) applications. There is an active research thread in AI, , that aims to develop systems for automating end-to-end the DS/ML Lifecycle. However, do DS and ML workers really want to automate their DS/ML workflow? To answer this question, we first synthesize a human-centered AutoML framework with 6 User Role/Personas, 10 Stages and 43 Sub-Tasks, 5 Levels of Automation, and 5 Types of Explanation, through reviewing research literature and marketing reports. Secondly, we use the framework to guide the design of an online survey study with 217 DS/ML workers who had varying degrees of experience, and different user roles "matching" to our 6 roles/personas. We found that different user personas participated in distinct stages of the lifecycle – but not all stages. Their desired levels of automation and types of explanation for AutoML also varied significantly depending on the DS/ML stage and the user persona. Based on the survey results, we argue there is no rationale from user needs for complete automation of the end-to-end DS/ML lifecycle. We propose new next steps for user-controlled DS/ML automation.

READ FULL TEXT

page 2

page 13

page 14

page 15

page 16

page 17

research
01/13/2021

AutoDS: Towards Human-Centered Automation of Data Science

Data science (DS) projects often follow a lifecycle that consists of lab...
research
07/20/2023

Assessing the Use of AutoML for Data-Driven Software Engineering

Background. Due to the widespread adoption of Artificial Intelligence (A...
research
01/13/2021

Whither AutoML? Understanding the Role of Automation in Machine Learning Workflows

Efforts to make machine learning more widely accessible have led to a ra...
research
08/06/2022

Imagining Future Digital Assistants at Work: A Study of Task Management Needs

Digital Assistants (DAs) can support workers in the workplace and beyond...
research
04/17/2023

Why is AI not a Panacea for Data Workers? An Interview Study on Human-AI Collaboration in Data Storytelling

Data storytelling plays an important role in data workers' daily jobs si...
research
09/06/2018

Propheticus: Generalizable Machine Learning Framework

Due to recent technological developments, Machine Learning (ML), a subfi...
research
08/03/2019

Machinic Surrogates: Human-Machine Relationships in Computational Creativity

Recent advancements in artificial intelligence (AI) and its sub-branch m...

Please sign up or login with your details

Forgot password? Click here to reset