A Deep Reinforced Sequence-to-Set Model for Multi-Label Text Classification

09/10/2018
by   Pengcheng Yang, et al.
0

Multi-label text classification (MLTC) aims to assign multiple labels to each sample in the dataset. The labels usually have internal correlations. However, traditional methods tend to ignore the correlations between labels. In order to capture the correlations between labels, the sequence-to-sequence (Seq2Seq) model views the MLTC task as a sequence generation problem, which achieves excellent performance on this task. However, the Seq2Seq model is not suitable for the MLTC task in essence. The reason is that it requires humans to predefine the order of the output labels, while some of the output labels in the MLTC task are essentially an unordered set rather than an ordered sequence. This conflicts with the strict requirement of the Seq2Seq model for the label order. In this paper, we propose a novel sequence-to-set framework utilizing deep reinforcement learning, which not only captures the correlations between labels, but also reduces the dependence on the label order. Extensive experimental results show that our proposed method outperforms the competitive baselines by a large margin.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2018

SGM: Sequence Generation Model for Multi-label Classification

Multi-label classification is an important yet challenging task in natur...
research
04/14/2023

Label Dependencies-aware Set Prediction Networks for Multi-label Text Classification

Multi-label text classification aims to extract all the related labels f...
research
03/12/2019

A Sequential Set Generation Method for Predicting Set-Valued Outputs

Consider a general machine learning setting where the output is a set of...
research
06/06/2021

Enhancing Label Correlation Feedback in Multi-Label Text Classification via Multi-Task Learning

In multi-label text classification (MLTC), each given document is associ...
research
03/22/2020

Multi-Label Text Classification using Attention-based Graph Neural Network

In Multi-Label Text Classification (MLTC), one sample can belong to more...
research
08/26/2018

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification

We propose a novel model for multi-label text classification, which is b...
research
10/30/2017

Prototype Matching Networks for Large-Scale Multi-label Genomic Sequence Classification

One of the fundamental tasks in understanding genomics is the problem of...

Please sign up or login with your details

Forgot password? Click here to reset