Variation-Incentive Loss Re-weighting for Regression Analysis on Biased Data

09/14/2021
by   Wentai Wu, et al.
0

Both classification and regression tasks are susceptible to the biased distribution of training data. However, existing approaches are focused on the class-imbalanced learning and cannot be applied to the problems of numerical regression where the learning targets are continuous values rather than discrete labels. In this paper, we aim to improve the accuracy of the regression analysis by addressing the data skewness/bias during model training. We first introduce two metrics, uniqueness and abnormality, to reflect the localized data distribution from the perspectives of their feature (i.e., input) space and target (i.e., output) space. Combining these two metrics we propose a Variation-Incentive Loss re-weighting method (VILoss) to optimize the gradient descent-based model training for regression analysis. We have conducted comprehensive experiments on both synthetic and real-world data sets. The results show significant improvement in the model quality (reduction in error by up to 11.9

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2021

Delving into Deep Imbalanced Regression

Real-world data often exhibit imbalanced distributions, where certain ta...
research
02/11/2022

CMW-Net: Learning a Class-Aware Sample Weighting Mapping for Robust Deep Learning

Modern deep neural networks can easily overfit to biased training data c...
research
02/21/2022

Imbalanced Classification via Explicit Gradient Learning From Augmented Data

Learning from imbalanced data is one of the most significant challenges ...
research
05/30/2022

RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression

Data imbalance, in which a plurality of the data samples come from a sma...
research
08/05/2023

Generalized Oversampling for Learning from Imbalanced datasets and Associated Theory

In supervised learning, it is quite frequent to be confronted with real ...
research
07/15/2023

Learning Subjective Time-Series Data via Utopia Label Distribution Approximation

Subjective time-series regression (STR) tasks have gained increasing att...
research
12/18/2020

Classification with Strategically Withheld Data

Machine learning techniques can be useful in applications such as credit...

Please sign up or login with your details

Forgot password? Click here to reset