Learning Subjective Time-Series Data via Utopia Label Distribution Approximation

07/15/2023
by   Wenxin Xu, et al.
0

Subjective time-series regression (STR) tasks have gained increasing attention recently. However, most existing methods overlook the label distribution bias in STR data, which results in biased models. Emerging studies on imbalanced regression tasks, such as age estimation and depth estimation, hypothesize that the prior label distribution of the dataset is uniform. However, we observe that the label distributions of training and test sets in STR tasks are likely to be neither uniform nor identical. This distinct feature calls for new approaches that estimate more reasonable distributions to train a fair model. In this work, we propose Utopia Label Distribution Approximation (ULDA) for time-series data, which makes the training label distribution closer to real-world but unknown (utopia) label distribution. This would enhance the model's fairness. Specifically, ULDA first convolves the training label distribution by a Gaussian kernel. After convolution, the required sample quantity at each regression label may change. We further devise the Time-slice Normal Sampling (TNS) to generate new samples when the required sample quantity is greater than the initial sample quantity, and the Convolutional Weighted Loss (CWL) to lower the sample weight when the required sample quantity is less than the initial quantity. These two modules not only assist the model training on the approximated utopia label distribution, but also maintain the sample continuity in temporal context space. To the best of our knowledge, ULDA is the first method to address the label distribution bias in time-series data. Extensive experiments demonstrate that ULDA lifts the state-of-the-art performance on two STR tasks and three benchmark datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2020

A New Look to Three-Factor Fama-French Regression Model using Sample Innovations

The Fama-French model is widely used in assessing the portfolio's perfor...
research
05/23/2022

Time-series Transformer Generative Adversarial Networks

Many real-world tasks are plagued by limitations on data: in some instan...
research
07/24/2022

CODiT: Conformal Out-of-Distribution Detection in Time-Series Data

Machine learning models are prone to making incorrect predictions on inp...
research
02/23/2022

Towards Speaker Age Estimation with Label Distribution Learning

Existing methods for speaker age estimation usually treat it as a multi-...
research
05/30/2022

RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression

Data imbalance, in which a plurality of the data samples come from a sma...
research
06/09/2021

GP-ConvCNP: Better Generalization for Convolutional Conditional Neural Processes on Time Series Data

Neural Processes (NPs) are a family of conditional generative models tha...
research
09/14/2021

Variation-Incentive Loss Re-weighting for Regression Analysis on Biased Data

Both classification and regression tasks are susceptible to the biased d...

Please sign up or login with your details

Forgot password? Click here to reset