Task Selection for AutoML System Evaluation

08/26/2022
by   Jonathan Lorraine, et al.
0

Our goal is to assess if AutoML system changes - i.e., to the search space or hyperparameter optimization - will improve the final model's performance on production tasks. However, we cannot test the changes on production tasks. Instead, we only have access to limited descriptors about tasks that our AutoML system previously executed, like the number of data points or features. We also have a set of development tasks to test changes, ex., sampled from OpenML with no usage constraints. However, the development and production task distributions are different leading us to pursue changes that only improve development and not production. This paper proposes a method to leverage descriptor information about AutoML production tasks to select a filtered subset of the most relevant development tasks. Empirical studies show that our filtering strategy improves the ability to assess AutoML system changes on holdout tasks with different distributions than development.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/12/2017

On the Synthesis of Guaranteed-Quality Plans for Robot Fleets in Logistics Scenarios via Optimization Modulo Theories

In manufacturing, the increasing involvement of autonomous robots in pro...
research
06/11/2020

The Role of Modularity and Neuro-Regulation for the Production of Multiple Behaviors

This project investigates whether functional specialization or modularit...
research
10/23/2022

Towards Pragmatic Production Strategies for Natural Language Generation Tasks

This position paper proposes a conceptual framework for the design of Na...
research
10/11/2018

Predictive Test Selection

Change-based testing is a key component of continuous integration at Fac...
research
03/27/2021

An empirical study into the relationship between class features and test smells

While a substantial body of prior research has investigated the form and...
research
12/01/2017

Closed-loop field development optimization with multipoint geostatistics and statistical assessment

Closed-loop field development (CLFD) optimization is a comprehensive fra...
research
10/13/2016

Bank Card Usage Prediction Exploiting Geolocation Information

We describe the solution of team ISMLL for the ECML-PKDD 2016 Discovery ...

Please sign up or login with your details

Forgot password? Click here to reset