A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking

10/27/2022
by   Zilin Yuan, et al.
0

Text classification is a very classic NLP task, but it has two prominent shortcomings: On the one hand, text classification is deeply domain-dependent. That is, a classifier trained on the corpus of one domain may not perform so well in another domain. On the other hand, text classification models require a lot of annotated data for training. However, for some domains, there may not exist enough annotated data. Therefore, it is valuable to investigate how to efficiently utilize text data from different domains to improve the performance of models in various domains. Some multi-domain text classification models are trained by adversarial training to extract shared features among all domains and the specific features of each domain. We noted that the distinctness of the domain-specific features is different, so in this paper, we propose to use a curriculum learning strategy based on keyword weight ranking to improve the performance of multi-domain text classification models. The experimental results on the Amazon review and FDU-MTL datasets show that our curriculum learning strategy effectively improves the performance of multi-domain text classification models based on adversarial learning and outperforms state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2019

Dual Adversarial Co-Learning for Multi-Domain Text Classification

In this paper we propose a novel dual adversarial co-learning approach f...
research
04/26/2022

A Robust Contrastive Alignment Method For Multi-Domain Text Classification

Multi-domain text classification can automatically classify texts in var...
research
02/15/2018

Multinomial Adversarial Networks for Multi-Domain Text Classification

Many text classification tasks are known to be highly domain-dependent. ...
research
01/29/2022

Maximum Batch Frobenius Norm for Multi-Domain Text Classification

Multi-domain text classification (MDTC) has obtained remarkable achievem...
research
08/06/2016

Transferring Knowledge from Text to Predict Disease Onset

In many domains such as medicine, training data is in short supply. In s...
research
08/24/2018

Building a Robust Text Classifier on a Test-Time Budget

We propose a generic and interpretable learning framework for building r...
research
05/09/2018

Cross Domain Regularization for Neural Ranking Models Using Adversarial Learning

Unlike traditional learning to rank models that depend on hand-crafted f...

Please sign up or login with your details

Forgot password? Click here to reset