Transformers are Short Text Classifiers: A Study of Inductive Short Text Classifiers on Benchmarks and Real-world Datasets

11/30/2022
by   Fabian Karl, et al.
0

Short text classification is a crucial and challenging aspect of Natural Language Processing. For this reason, there are numerous highly specialized short text classifiers. However, in recent short text research, State of the Art (SOTA) methods for traditional text classification, particularly the pure use of Transformers, have been unexploited. In this work, we examine the performance of a variety of short text classifiers as well as the top performing traditional text classifier. We further investigate the effects on two new real-world short text datasets in an effort to address the issue of becoming overly dependent on benchmark datasets with a limited number of characteristics. Our experiments unambiguously demonstrate that Transformers achieve SOTA accuracy on short text classification tasks, raising the question of whether specialized short text techniques are necessary.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2018

Topic Memory Networks for Short Text Classification

Many classification models work poorly on short texts due to data sparsi...
research
06/25/2022

Protoformer: Embedding Prototypes for Transformers

Transformers have been widely applied in text classification. Unfortunat...
research
04/14/2018

ClassiNet -- Predicting Missing Features for Short-Text Classification

The fundamental problem in short-text classification is feature sparsene...
research
08/19/2023

Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers

Customer reviews play a crucial role in assessing customer satisfaction,...
research
01/19/2018

Investigating the Working of Text Classifiers

Text classification is one of the most widely studied task in natural la...
research
01/31/2020

Benchmarking Popular Classification Models' Robustness to Random and Targeted Corruptions

Text classification models, especially neural networks based models, hav...
research
02/01/2019

tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification

The use of background knowledge remains largely unexploited in many text...

Please sign up or login with your details

Forgot password? Click here to reset