Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification

04/02/2022
by Ruohong Zhang, et al.

Extreme multi-label text classification (XMTC) is the task of tagging each document with the relevant labels from a very large space of predefined categories. Recently, large pre-trained Transformer models have significantly improved performance in XMTC. These models typically use the embedding of the special CLS token to represent the semantics of the entire document as a global feature vector, and match it against candidate labels. However, we argue that such a global feature vector may not be sufficient to represent the different granularity levels of semantics in a document, and that complementing it with local word-level features could bring additional gains. Based on this insight, we propose an approach that combines the local and global features produced by Transformer models to improve the predictive power of the classifier. Our experiments show that the proposed model either outperforms or is comparable to state-of-the-art methods on benchmark datasets.
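
The idea of pairing a global CLS vector with local word-level features can be illustrated with a minimal sketch. This is not the authors' architecture, which the abstract does not specify. It assumes an encoder has already produced one vector per token with the CLS token at index 0, uses max-pooling over the non-padding tokens as a stand-in for the local features, and scores labels with a plain linear layer followed by a sigmoid. The names `combine_global_local`, `score_labels`, and all array shapes are hypothetical.

```python
import numpy as np


def combine_global_local(hidden_states, attention_mask):
    """Concatenate a global (CLS) feature with a pooled local feature.

    hidden_states: (seq_len, dim) token vectors, CLS assumed at index 0.
    attention_mask: (seq_len,) with 1 for real tokens and 0 for padding.
    Assumes at least one real token besides CLS.
    """
    global_feat = hidden_states[0]  # CLS embedding as the global feature

    # Keep only real word tokens: drop padding and the CLS position itself.
    keep = np.asarray(attention_mask).astype(bool).copy()
    keep[0] = False

    # Max-pooling over tokens is an assumption made for this sketch.
    local_feat = hidden_states[keep].max(axis=0)

    return np.concatenate([global_feat, local_feat])


def score_labels(doc_feat, label_weights):
    """Independent per-label probabilities from a linear layer and sigmoid."""
    logits = label_weights @ doc_feat
    return 1.0 / (1.0 + np.exp(-logits))


# Toy inputs standing in for encoder outputs (random, for illustration only).
rng = np.random.default_rng(0)
hidden = rng.normal(size=(6, 4))         # 6 tokens, hidden size 4
mask = np.array([1, 1, 1, 1, 0, 0])      # last two positions are padding
label_weights = rng.normal(size=(5, 8))  # 5 labels, 8 = 2 * hidden size

doc_feat = combine_global_local(hidden, mask)
probs = score_labels(doc_feat, label_weights)
top_labels = np.argsort(-probs)[:3]      # indices of the 3 highest-scoring labels
```

In a real XMTC setting the label space is far too large for a small dense weight matrix like this one, and the pooling step would likely be replaced by something learned, such as label-wise attention over the token vectors.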

research
10/01/2021

Fast Multi-Resolution Transformer Fine-tuning for Extreme Multi-label Text Classification

Extreme multi-label text classification (XMC) seeks to find relevant lab...
research
05/24/2019

Label-aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification

Extreme multi-label text classification (XMTC) aims at tagging a documen...
research
09/23/2020

LA-HCN: Label-based Attention for Hierarchical Multi-label Text Classification Neural Network

Hierarchical multi-label text classification (HMTC) problems become popul...
research
01/10/2022

GUDN: A novel guide network for extreme multi-label text classification

The problem of extreme multi-label text classification (XMTC) is to reca...
research
10/29/2022

CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

Extreme Multi-label Text Classification (XMC) involves learning a classi...
research
03/12/2023

Endoscopy Classification Model Using Swin Transformer and Saliency Map

Endoscopy is a valuable tool for the early diagnosis of colon cancer. Ho...
research
07/28/2021

XFL: eXtreme Function Labeling

Reverse engineers would benefit from identifiers like function names, bu...
