Fits and Starts: Enterprise Use of AutoML and the Role of Humans in the Loop

by Anamaria Crisan, et al.

AutoML systems can speed up routine data science work and make machine learning available to those without expertise in statistics and computer science. These systems have gained traction in enterprise settings where pools of skilled data workers are limited. In this study, we conduct interviews with 29 individuals from organizations of different sizes to characterize how they currently use, or intend to use, AutoML systems in their data science work. Our investigation also captures how data visualization is used in conjunction with AutoML systems. Our findings identify three usage scenarios for AutoML, which we summarize in a framework describing the level of automation desired by data workers with different levels of expertise. We surface a tension between speed and human oversight and find that data visualization can do a poor job of balancing the two. Our findings have implications for the design and implementation of human-in-the-loop visual analytics approaches.



