Efficient Identification of Approximate Best Configuration of Training in Large Datasets

11/08/2018
by Silu Huang, et al.

A training configuration is a combination of feature engineering, a learner, and the learner's associated hyperparameters. Given a set of configurations and a large dataset randomly split into training and testing sets, we study how to efficiently identify a configuration whose testing accuracy, when trained on the training set, is approximately the highest. To guarantee a small accuracy loss, we develop a solution based on a confidence interval (CI)-based progressive sampling and pruning strategy. Compared with using the full data to find the exact best configuration, our solution achieves a speedup of more than two orders of magnitude, while the returned top configuration has identical or close test accuracy.
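The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of CI-based progressive sampling with pruning, not the authors' method. It makes several assumptions: sampling is applied to test-set evaluation only, the interval is a plain Hoeffding bound, and `delta` is applied per interval without the correction over configurations and rounds that a rigorous guarantee would need. The names `hoeffding_half_width`, `select_approx_best`, and the `correct` table of per-point 0/1 outcomes are hypothetical.

```python
import math
import random


def hoeffding_half_width(n, delta):
    """Half-width of a two-sided Hoeffding interval for the mean of n values in [0, 1]."""
    return math.sqrt(math.log(2.0 / delta) / (2.0 * n))


def select_approx_best(correct, delta=0.05, start=100, growth=2):
    """Progressively sample test points and prune clearly worse configurations.

    correct[c][i] is 1 if (hypothetical) configuration c classifies test
    point i correctly, else 0. Returns (surviving ids, points examined).
    """
    n_total = len(next(iter(correct.values())))
    order = list(range(n_total))
    random.Random(0).shuffle(order)  # fixed seed: a reproducible random order
    alive = set(correct)
    n = min(start, n_total)
    while True:
        half = hoeffding_half_width(n, delta)
        # Estimated accuracy of each surviving configuration on the first n points.
        acc = {c: sum(correct[c][i] for i in order[:n]) / n for c in alive}
        best_lower = max(acc.values()) - half
        # Prune a configuration once its upper bound falls below the best lower bound.
        alive = {c for c in alive if acc[c] + half >= best_lower}
        if len(alive) == 1 or n == n_total:
            return alive, n
        n = min(n * growth, n_total)  # enlarge the sample and tighten the intervals
```

In this sketch, a configuration that is far behind is dropped after the first small sample, while near-ties survive until the sample grows enough to separate them or the data runs out, which is where the savings over full-data evaluation would come from.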


