HTCInfoMax: A Global Model for Hierarchical Text Classification via Information Maximization

by Zhongfen Deng, et al.

The current state-of-the-art model for hierarchical text classification, HiAGM, has two limitations. First, it correlates each text sample with all labels in the dataset, which introduces irrelevant information. Second, it imposes no statistical constraint on the label representations learned by the structure encoder, even though previous work has shown such constraints to be helpful for representation learning. In this paper, we propose HTCInfoMax to address these issues by introducing information maximization, which comprises two modules: text-label mutual information maximization and label prior matching. The first module explicitly models the interaction between each text sample and its ground truth labels, which filters out irrelevant information. The second encourages the structure encoder to learn better representations with desired characteristics for all labels, which helps it handle label imbalance in hierarchical text classification. Experimental results on two benchmark datasets demonstrate the effectiveness of the proposed HTCInfoMax.
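To make the first module more concrete, the following is a minimal sketch of a text-label mutual information objective. It is not the authors' implementation. It assumes a Jensen-Shannon style lower bound, a plain dot product as the scoring function, and negatives drawn from the labels that are not ground truth for the sample. All of these choices, and the names `softplus`, `dot`, and `jsd_mi_lower_bound`, are hypothetical and used only for illustration.

```python
import math


def softplus(z):
    # Numerically stable log(1 + exp(z)).
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def dot(u, v):
    # Assumed scoring function: a plain dot product between two vectors.
    return sum(a * b for a, b in zip(u, v))


def jsd_mi_lower_bound(text_vec, label_vecs, true_labels):
    """Jensen-Shannon style estimate for one text sample.

    Positive pairs: the text with each of its ground truth labels.
    Negative pairs: the text with every other label (an assumption here).
    Training would maximize this value, so only the ground truth labels
    are pulled toward the text representation.
    """
    pos = [dot(text_vec, label_vecs[i]) for i in sorted(true_labels)]
    neg = [dot(text_vec, v) for i, v in enumerate(label_vecs)
           if i not in true_labels]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    pos_term = sum(-softplus(-s) for s in pos) / len(pos)
    neg_term = sum(softplus(s) for s in neg) / len(neg)
    return pos_term - neg_term


# Toy example with three labels in a 2-dimensional space.
text = [1.0, 0.0]
labels = [[2.0, 0.0], [-2.0, 0.0], [0.0, 1.0]]
aligned = jsd_mi_lower_bound(text, labels, {0})     # true label agrees with the text
misaligned = jsd_mi_lower_bound(text, labels, {1})  # true label opposes the text
print(aligned > misaligned)
```

In this toy setting the estimate is higher when the ground truth label points in the same direction as the text vector, which is the behaviour the objective rewards. The label prior matching module is not covered by this sketch.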







Exploiting Global and Local Hierarchies for Hierarchical Text Classification

Hierarchical text classification aims to leverage label hierarchy in mul...

Incorporating Hierarchy into Text Encoder: a Contrastive Learning Approach for Hierarchical Text Classification

Hierarchical text classification is a challenging subtask of multi-label...

Label Confusion Learning to Enhance Text Classification Models

Representing a true label as a one-hot vector is a common practice in tr...

LA-HCN: Label-based Attention for Hierarchical Multi-label TextClassification Neural Network

Hierarchical multi-label text classification(HMTC) problems become popul...

Label-guided Learning for Text Classification

Text classification is one of the most important and fundamental tasks i...

Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical Text Classification

Hierarchical Text Classification (HTC), which aims to predict text label...

Metric Learning for Dynamic Text Classification

Traditional text classifiers are limited to predicting over a fixed set ...

Code Repositories


The code for our NAACL 2021 paper "HTCInfoMax: A Global Model for Hierarchical Text Classification via Information Maximization".
