HFT-ONLSTM: Hierarchical and Fine-Tuning Multi-label Text Classification

04/18/2022
by   Pengfei Gao, et al.
0

Many important classification problems in the real-world consist of a large number of closely related categories in a hierarchical structure or taxonomy. Hierarchical multi-label text classification (HMTC) with higher accuracy over large sets of closely related categories organized in a hierarchy or taxonomy has become a challenging problem. In this paper, we present a hierarchical and fine-tuning approach based on the Ordered Neural LSTM neural network, abbreviated as HFT-ONLSTM, for more accurate level-by-level HMTC. First, we present a novel approach to learning the joint embeddings based on parent category labels and textual data for accurately capturing the joint features of both category labels and texts. Second, a fine tuning technique is adopted for training parameters such that the text classification results in the upper level should contribute to the classification in the lower one. At last, the comprehensive analysis is made based on extensive experiments in comparison with the state-of-the-art hierarchical and flat multi-label text classification approaches over two benchmark datasets, and the experimental results show that our HFT-ONLSTM approach outperforms these approaches, in particular reducing computational costs while achieving superior performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2020

A Hierarchical Fine-Tuning Approach Based on Joint Embedding of Words and Parent Categories for Hierarchical Multi-label Text Classification

Many important classification problems in real world consist of a large ...
research
09/21/2023

Accelerating Thematic Investment with Prompt Tuned Pretrained Language Models

Prompt Tuning is emerging as a scalable and cost-effective method to fin...
research
04/02/2022

Constrained Sequence-to-Tree Generation for Hierarchical Text Classification

Hierarchical Text Classification (HTC) is a challenging task where a doc...
research
04/13/2022

An Ensemble Learning Based Approach to Multi-label Power Text Classification for Fault-type Recognition

With the rapid development of ICT Custom Services (ICT CS) in power indu...
research
05/05/2020

Efficient strategies for hierarchical text classification: External knowledge and auxiliary tasks

In hierarchical text classification, we perform a sequence of inference ...
research
11/22/2021

Hierarchy Decoder is All You Need To Text Classification

Hierarchical text classification (HTC) to a taxonomy is essential for va...
research
05/24/2022

Exploiting Dynamic and Fine-grained Semantic Scope for Extreme Multi-label Text Classification

Extreme multi-label text classification (XMTC) refers to the problem of ...

Please sign up or login with your details

Forgot password? Click here to reset