Improving Imbalanced Text Classification with Dynamic Curriculum Learning

10/25/2022
by Xulong Zhang, et al.

Recent advances in pre-trained language models have improved performance on text classification tasks. However, little attention has been paid to how training samples are prioritized and scheduled during training. Humans acquire knowledge gradually, moving from easy to complex concepts, and the difficulty of the same material can vary significantly across learning stages. Inspired by these insights, we propose a novel self-paced dynamic curriculum learning (SPDCL) method for imbalanced text classification, which evaluates sample difficulty using both linguistic characteristics and model capacity. Rather than using a static curriculum as in existing research, SPDCL reorders and resamples the training data according to the difficulty criterion, following an adaptive easy-to-hard pace. Extensive experiments on several classification tasks show the effectiveness of the SPDCL strategy, especially on imbalanced datasets.
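The general idea of an easy-to-hard schedule can be illustrated with a minimal sketch. This is not the SPDCL algorithm itself, whose details the abstract does not give. The linear `pace` function, the `start_fraction` parameter, and the token-count difficulty used in the usage note are all assumptions chosen for illustration, and the resampling step for class imbalance is omitted.

```python
import math
import random


def pace(epoch, total_epochs, start_fraction=0.5):
    """Hypothetical linear pace: fraction of the data exposed at this epoch.

    Starts at start_fraction and grows to 1.0 by the final epoch.
    """
    progress = min(1.0, epoch / max(1, total_epochs - 1))
    return start_fraction + (1.0 - start_fraction) * progress


def curriculum_subset(samples, difficulty, epoch, total_epochs, rng):
    """Return the easiest samples allowed by the pace, in shuffled order.

    `difficulty` is any callable that scores a sample. Passing a fresh
    callable each epoch (for example, one that reads the current model's
    loss) re-ranks the samples as training progresses, which makes the
    curriculum dynamic rather than static.
    """
    ranked = sorted(samples, key=difficulty)           # easy -> hard
    k = max(1, math.ceil(pace(epoch, total_epochs) * len(ranked)))
    subset = ranked[:k]                                # keep the k easiest
    rng.shuffle(subset)                                # avoid a fixed order within the epoch
    return subset
```

For example, with `difficulty = lambda s: len(s.split())` as a crude stand-in for linguistic difficulty, calling `curriculum_subset(texts, difficulty, epoch, total_epochs, random.Random(0))` once per epoch exposes only the shortest half of the texts at first and the full set by the last epoch.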

research
10/30/2020

Dynamic Data Selection for Curriculum Learning via Ability Estimation

Curriculum learning methods typically rely on heuristics to estimate the...
research
08/24/2021

Density-Based Dynamic Curriculum Learning for Intent Detection

Pre-trained language models have achieved noticeable performance on the ...
research
02/09/2023

Mixed-order self-paced curriculum learning for universal lesion detection

Self-paced curriculum learning (SCL) has demonstrated its great potentia...
research
08/14/2022

Text Difficulty Study: Do machines behave the same as humans regarding text difficulty?

Given a task, human learns from easy to hard, whereas the model learns r...
research
01/21/2019

Dynamic Curriculum Learning for Imbalanced Data Classification

Human attribute analysis is a challenging task in the field of computer ...
research
07/17/2023

Curriculum Learning for Graph Neural Networks: A Multiview Competence-based Approach

A curriculum is a planned sequence of learning materials and an effectiv...
research
11/20/2020

Sequential Targeting: an incremental learning approach for data imbalance in text classification

Classification tasks require a balanced distribution of data to ensure t...
