autoBagging: Learning to Rank Bagging Workflows with Metalearning

06/28/2017
by Fábio Pinto, et al.

Machine Learning (ML) has been successfully applied to a wide range of domains and applications. One of the techniques behind most of these successful applications is Ensemble Learning (EL), the field of ML that gave birth to methods such as Random Forests or Boosting. The complexity of applying these techniques, together with the market scarcity of ML experts, has created the need for systems that enable a fast and easy drop-in replacement for ML libraries. Automated machine learning (autoML) is the field of ML that attempts to answer these needs. Typically, these systems rely on optimization techniques such as Bayesian optimization to lead the search for the best model. Our approach differs from these systems by making use of the most recent advances in metalearning and a learning to rank approach to learn from metadata. We propose autoBagging, an autoML system that automatically ranks 63 bagging workflows by exploiting past performance and dataset characterization. Results on 140 classification datasets from the OpenML platform show that autoBagging can yield better performance than the Average Rank method and achieve results that are not statistically different from an ideal model that systematically selects the best workflow for each dataset. For the purpose of reproducibility and generalizability, autoBagging is publicly available as an R package on CRAN.
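The abstract names the Average Rank method as a baseline but does not define it. As a rough illustration only, the sketch below follows the common formulation: rank the workflows on each dataset by performance, average each workflow's rank across datasets, and recommend workflows in order of that average. It is written in Python for brevity and is not taken from the autoBagging package; the function names, dataset names, and accuracy values are all hypothetical.

```python
from statistics import mean


def rank_scores(scores):
    """Rank scores in descending order (1 = best); tied scores share the average rank."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(order):
        # Extend `end` over the run of scores tied with the one at `pos`.
        end = pos
        while end + 1 < len(order) and scores[order[end + 1]] == scores[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1  # mean of the 1-based positions pos+1 .. end+1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


def average_rank(performance):
    """Average each workflow's rank over all datasets.

    `performance` maps a dataset name to a list of scores, one per workflow,
    with workflows in the same order for every dataset.
    """
    per_dataset = [rank_scores(scores) for scores in performance.values()]
    n_workflows = len(per_dataset[0])
    return [mean(ranks[j] for ranks in per_dataset) for j in range(n_workflows)]


# Hypothetical accuracies of three made-up workflows on three made-up datasets.
performance = {
    "dataset_a": [0.80, 0.85, 0.70],
    "dataset_b": [0.60, 0.65, 0.65],
    "dataset_c": [0.90, 0.88, 0.75],
}

avg = average_rank(performance)
# Workflow indices ordered from best (lowest average rank) to worst.
recommended = sorted(range(len(avg)), key=lambda j: avg[j])
print(avg, recommended)
```

A baseline of this kind recommends the same ordering for every new dataset. The approach described in the abstract instead conditions the ranking on dataset characterization.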


