Automated Imbalanced Learning

11/01/2022
by   Prabhant Singh, et al.
0

Automated Machine Learning has grown very successful in automating the time-consuming, iterative tasks of machine learning model development. However, current methods struggle when the data is imbalanced. Since many real-world datasets are naturally imbalanced, and improper handling of this issue can lead to quite useless models, this issue should be handled carefully. This paper first introduces a new benchmark to study how different AutoML methods are affected by label imbalance. Second, we propose strategies to better deal with imbalance and integrate them into an existing AutoML framework. Finally, we present a systematic study which evaluates the impact of these strategies and find that their inclusion in AutoML systems significantly increases their robustness against label imbalance.

READ FULL TEXT

page 15

page 16

research
08/25/2022

An Empirical Analysis of the Efficacy of Different Sampling Techniques for Imbalanced Classification

Learning from imbalanced data is a challenging task. Standard classifica...
research
03/23/2023

SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology

Multiple Instance learning (MIL) models have been extensively used in pa...
research
09/29/2020

Weakly Supervised-Based Oversampling for High Imbalance and High Dimensionality Data Classification

With the abundance of industrial datasets, imbalanced classification has...
research
10/30/2018

Weak-supervision for Deep Representation Learning under Class Imbalance

Class imbalance is a pervasive issue among classification models includi...
research
12/27/2018

Generic adaptation strategies for automated machine learning

Automation of machine learning model development is increasingly becomin...
research
05/20/2022

Predicting Seriousness of Injury in a Traffic Accident: A New Imbalanced Dataset and Benchmark

The paper introduces a new dataset to assess the performance of machine ...
research
10/19/2018

Malicious Web Domain Identification using Online Credibility and Performance Data by Considering the Class Imbalance Issue

Purpose: Malicious web domain identification is of significant importanc...

Please sign up or login with your details

Forgot password? Click here to reset