Many-Class Text Classification with Matching

05/23/2022
by   Yi Song, et al.

In this work, we formulate text classification as a matching problem between the text and the labels, and propose a simple yet effective framework named TCM. Compared with previous text classification approaches, TCM exploits the fine-grained semantic information in the class labels, which helps distinguish classes when the number of classes is large, especially in low-resource scenarios. TCM is also easy to implement and compatible with various large pretrained language models. We evaluate TCM on four text classification datasets (each with 20+ labels) in both few-shot and full-data settings, where it shows significant improvements over other text classification paradigms. We also conduct extensive experiments with different variants of TCM and discuss the factors underlying its success. Our method and analyses offer a new perspective on text classification.
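To make the matching formulation concrete, here is a minimal sketch. It is not the TCM implementation, which the abstract does not detail. It assumes each label comes with a short textual description and scores the input against every description, predicting the best match. The bag-of-words `embed` function is a toy stand-in for a pretrained language model encoder, and the names `embed`, `cosine`, `classify_by_matching`, and `LABELS` are all hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a pretrained encoder: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify_by_matching(text, label_descriptions):
    """Score the text against every label description; return the best label and all scores."""
    text_vec = embed(text)
    scores = {
        label: cosine(text_vec, embed(description))
        for label, description in label_descriptions.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical label set; the descriptions carry the label semantics (assumption).
LABELS = {
    "sports": "sports games teams players match score",
    "finance": "finance banks stocks markets money investment",
    "weather": "weather rain temperature forecast storm",
}

predicted, scores = classify_by_matching("The stocks fell as banks cut investment", LABELS)
print(predicted)  # finance
```

Because the label text itself takes part in scoring, adding a class only requires a new description rather than a new output unit, which is one plausible reason a matching view suits settings with many classes and little data.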

research
02/10/2018

TextZoo, a New Benchmark for Reconsidering Text Classification

Text representation is a fundamental concern in Natural Language Process...
research
10/23/2022

Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification

This paper proposes a novel prompt-based finetuning method (called DLM-S...
research
08/29/2021

kFolden: k-Fold Ensemble for Out-Of-Distribution Detection

Out-of-Distribution (OOD) detection is an important problem in natural l...
research
05/15/2023

Text Classification via Large Language Models

Despite the remarkable success of large-scale Language Models (LLMs) suc...
research
05/30/2023

Cross Encoding as Augmentation: Towards Effective Educational Text Classification

Text classification in education, usually called auto-tagging, is the au...
research
06/09/2022

Privacy Leakage in Text Classification: A Data Extraction Approach

Recent work has demonstrated the successful extraction of training data ...
research
05/17/2023

Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents

Labeling data is essential for training text classifiers but is often di...
