EXnet: Efficient In-context Learning for Data-less Text classification

05/24/2023
by Debaditya Shome, et al.
Large pre-trained language models (PLMs) have made significant progress in encoding world knowledge and have spawned a new set of learning paradigms, including zero-shot, few-shot, and in-context learning. Many language tasks can be modeled as a set of prompts (for example, "Is this text about geography?") to which a language model provides a binary answer, i.e., Yes or No. There is evidence that the next-word prediction objective used by many PLMs does not align well with zero-shot paradigms; PLMs are therefore fine-tuned as question-answering systems. In-context learning extends zero-shot learning by supplying examples alongside the prompt, which increases task accuracy. Our paper presents EXnet, a model specifically designed to perform in-context learning without any limit on the number of examples. We argue that in-context learning is an effective way to increase task accuracy and that providing examples facilitates cross-task generalization, especially for text classification tasks. With extensive experiments, we show that even our smallest model (15M parameters) generalizes to several unseen classification tasks and domains.
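
To make the prompt framing concrete, the following is a minimal sketch of how a multi-class task might be decomposed into one Yes/No question per label, with optional in-context examples prepended. It is not EXnet's architecture or input format, which the abstract does not specify. The prompt template, the function names `build_prompt` and `classify`, and the `toy_yes_score` stand-in (a keyword matcher used in place of a real model) are all assumptions made for illustration.

```python
from typing import Callable, Sequence


def build_prompt(text: str, label: str,
                 examples: Sequence[tuple[str, str]] = ()) -> str:
    """Format one binary question about `label`, preceded by answered examples.

    The template is hypothetical; any number of (text, label) examples is allowed.
    """
    blocks = []
    for ex_text, ex_label in examples:
        answer = "Yes" if ex_label == label else "No"
        blocks.append(f"Text: {ex_text}\nIs this text about {label}? {answer}")
    # The final block is the unanswered query the model must complete.
    blocks.append(f"Text: {text}\nIs this text about {label}?")
    return "\n\n".join(blocks)


def classify(text: str, labels: Sequence[str],
             yes_score: Callable[[str], float],
             examples: Sequence[tuple[str, str]] = ()) -> str:
    """Ask one Yes/No question per label and return the label with the highest score."""
    scores = {label: yes_score(build_prompt(text, label, examples))
              for label in labels}
    return max(scores, key=scores.get)


def toy_yes_score(prompt: str) -> float:
    """Toy stand-in for a model (assumption): 1.0 if the label word occurs in the query text."""
    query = prompt.rsplit("\n\n", 1)[-1]
    text_part, question = query.rsplit("\n", 1)
    label = question.removeprefix("Is this text about ").removesuffix("?")
    return 1.0 if label.lower() in text_part.lower() else 0.0


examples = [("The match ended 2-1 after extra time", "sports")]
predicted = classify("A lesson on geography and maps",
                     ["sports", "geography"], toy_yes_score, examples)
print(predicted)
```

In practice, `yes_score` would be replaced by a model that returns the probability of "Yes" for the final question; the decomposition into per-label binary prompts stays the same.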

research
10/31/2022

Zero-Shot Text Classification with Self-Training

Recent advances in large pretrained language models have increased atten...
research
10/26/2022

Don't Prompt, Search! Mining-based Zero-Shot Learning with Language Models

Masked language models like BERT can perform text classification in a ze...
research
04/18/2023

CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models

Large pre-trained language models (LLMs) have been shown to have signifi...
research
09/19/2023

In-Context Learning for Text Classification with Many Labels

In-context learning (ICL) using large language models for tasks with man...
research
05/23/2023

Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science

Instruction-tuned Large Language Models (LLMs) have exhibited impressive...
research
05/15/2023

Text Classification via Large Language Models

Despite the remarkable success of large-scale Language Models (LLMs) suc...
research
12/27/2021

What do Large Language Models Learn about Scripts?

Script Knowledge (Schank and Abelson, 1975) has long been recognized as ...
