Testing the Robustness of AutoML Systems

05/06/2020
by Tuomas Halvari, et al.

Automated machine learning (AutoML) systems aim to automatically find the machine learning (ML) pipeline that best matches the task and data at hand. We investigate the robustness of ML pipelines generated by three AutoML systems: TPOT, H2O, and AutoKeras. In particular, we study how dirty data affects accuracy and consider whether training on dirty data can help create more robust solutions. We also analyze how the structure of the generated pipelines differs across these cases.
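
The kind of robustness check described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not say which kinds of dirty data were used, so Gaussian noise on a fraction of feature values stands in for them, and a small nearest-centroid classifier on synthetic data stands in for a pipeline produced by an AutoML system. The names `inject_noise`, `fit_centroids`, and `make_data` are hypothetical.

```python
import random


def inject_noise(rows, fraction, scale, rng):
    """Return a copy of rows where each value gets Gaussian noise with probability `fraction`.

    Assumption: additive noise is only one possible form of "dirty" data.
    """
    return [
        [x + rng.gauss(0.0, scale) if rng.random() < fraction else x for x in row]
        for row in rows
    ]


def fit_centroids(rows, labels):
    """Stand-in model: compute the mean feature vector of each class."""
    sums, counts = {}, {}
    for row, label in zip(rows, labels):
        acc = sums.setdefault(label, [0.0] * len(row))
        for i, x in enumerate(row):
            acc[i] += x
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict(centroids, row):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    return min(
        centroids,
        key=lambda label: sum((a - b) ** 2 for a, b in zip(centroids[label], row)),
    )


def accuracy(centroids, rows, labels):
    hits = sum(predict(centroids, r) == y for r, y in zip(rows, labels))
    return hits / len(rows)


def make_data(n, rng):
    """Synthetic two-class data: class 0 centred at (0, 0), class 1 at (4, 4)."""
    rows, labels = [], []
    for _ in range(n):
        label = rng.randrange(2)
        center = 0.0 if label == 0 else 4.0
        rows.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
        labels.append(label)
    return rows, labels


rng = random.Random(0)
train_x, train_y = make_data(200, rng)
test_x, test_y = make_data(200, rng)

model = fit_centroids(train_x, train_y)
clean_acc = accuracy(model, test_x, test_y)
# Corrupt about half of the test values with large noise, then re-evaluate.
dirty_acc = accuracy(model, inject_noise(test_x, 0.5, 10.0, rng), test_y)
print(f"clean: {clean_acc:.3f}  dirty: {dirty_acc:.3f}")
```

To probe whether dirty training data helps, the same comparison could be repeated with a model fitted on `inject_noise(train_x, ...)` instead of `train_x`.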

