TeSS: Zero-Shot Classification via Textual Similarity Comparison with Prompting using Sentence Encoder

12/20/2022
by   Jimin Hong, et al.
0

We introduce TeSS (Text Similarity Comparison using Sentence Encoder), a framework for zero-shot classification where the assigned label is determined by the embedding similarity between the input text and each candidate label prompt. We leverage representations from sentence encoders optimized to locate semantically similar samples closer to each other in embedding space during pre-training. The label prompt embeddings serve as prototypes of their corresponding class clusters. Furthermore, to compensate for the potentially poorly descriptive labels in their original format, we retrieve semantically similar sentences from external corpora and additionally use them with the original label prompt (TeSS-R). TeSS outperforms strong baselines on various closed-set and open-set classification datasets under zero-shot setting, with further gains when combined with label prompt diversification through retrieval. These results are robustly attained to verbalizer variations, an ancillary benefit of using a bi-encoder. Altogether, our method serves as a reliable baseline for zero-shot classification and a simple interface to assess the quality of sentence encoders.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/22/2023

SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

We introduce SONAR, a new multilingual and multimodal fixed-size sentenc...
research
04/14/2023

Zero-Shot Multi-Label Topic Inference with Sentence Encoders

Sentence encoders have indeed been shown to achieve superior performance...
research
01/10/2022

Language-driven Semantic Segmentation

We present LSeg, a novel model for language-driven semantic image segmen...
research
10/02/2019

SummAE: Zero-Shot Abstractive Text Summarization using Length-Agnostic Auto-Encoders

We propose an end-to-end neural model for zero-shot abstractive text sum...
research
04/20/2022

Unsupervised Ranking and Aggregation of Label Descriptions for Zero-Shot Classifiers

Zero-shot text classifiers based on label descriptions embed an input te...
research
12/16/2021

Extreme Zero-Shot Learning for Extreme Text Classification

The eXtreme Multi-label text Classification (XMC) problem concerns findi...

Please sign up or login with your details

Forgot password? Click here to reset