Contrastive Bootstrapping for Label Refinement

06/07/2023
by   Shudi Hou, et al.
0

Traditional text classification typically categorizes texts into pre-defined coarse-grained classes, from which the produced models cannot handle the real-world scenario where finer categories emerge periodically for accurate services. In this work, we investigate the setting where fine-grained classification is done only using the annotation of coarse-grained categories and the coarse-to-fine mapping. We propose a lightweight contrastive clustering-based bootstrapping method to iteratively refine the labels of passages. During clustering, it pulls away negative passage-prototype pairs under the guidance of the mapping from both global and local perspectives. Experiments on NYT and 20News show that our method outperforms the state-of-the-art methods by a large margin.

READ FULL TEXT

page 5

page 8

research
10/14/2022

Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning

Novel category discovery aims at adapting models trained on known catego...
research
09/12/2021

Not All Negatives are Equal: Label-Aware Contrastive Loss for Fine-grained Text Classification

Fine-grained classification involves dealing with datasets with larger n...
research
05/25/2020

AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels

Authors of research papers in the fields of mathematics, and other math-...
research
12/04/2020

Rethinking movie genre classification with fine-grained semantic clustering

Movie genre classification is an active research area in machine learnin...
research
08/03/2023

Contrastive Multi-FaceForensics: An End-to-end Bi-grained Contrastive Learning Approach for Multi-face Forgery Detection

DeepFakes have raised serious societal concerns, leading to a great surg...
research
09/12/2020

Exploring the Hierarchy in Relation Labels for Scene Graph Generation

By assigning each relationship a single label, current approaches formul...
research
09/23/2022

IDEA: Interactive DoublE Attentions from Label Embedding for Text Classification

Current text classification methods typically encode the text merely int...

Please sign up or login with your details

Forgot password? Click here to reset