Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

01/27/2022
by   Jixuan Wang, et al.
0

Large pretrained language models (LMs) like BERT have improved performance in many disparate natural language processing (NLP) tasks. However, fine tuning such models requires a large number of training examples for each target task. Simultaneously, many realistic NLP problems are "few shot", without a sufficiently large training set. In this work, we propose a novel conditional neural process-based approach for few-shot text classification that learns to transfer from other diverse tasks with rich annotation. Our key idea is to represent each task using gradient information from a base model and to train an adaptation network that modulates a text classifier conditioned on the task representation. While previous task-aware few-shot learners represent tasks by input encoding, our novel task representation is more powerful, as the gradient captures input-output relationships of a task. Experimental results show that our approach outperforms traditional fine-tuning, sequential transfer learning, and state-of-the-art meta learning approaches on a collection of diverse few-shot tasks. We further conducted analysis and ablations to justify our design choices.

READ FULL TEXT
research
10/22/2022

Meta-learning Pathologies from Radiology Reports using Variance Aware Prototypical Networks

Large pretrained Transformer-based language models like BERT and GPT hav...
research
05/11/2022

Towards Unified Prompt Tuning for Few-shot Text Classification

Prompt-based fine-tuning has boosted the performance of Pre-trained Lang...
research
07/19/2020

Meta-learning for Few-shot Natural Language Processing: A Survey

Few-shot natural language processing (NLP) refers to NLP tasks that are ...
research
01/04/2023

MessageNet: Message Classification using Natural Language Processing and Meta-data

In this paper we propose a new Deep Learning (DL) approach for message c...
research
01/21/2020

Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference

Some NLP tasks can be solved in a fully unsupervised fashion by providin...
research
04/20/2022

Active Few-Shot Learning with FASL

Recent advances in natural language processing (NLP) have led to strong ...
research
02/03/2023

Towards Few-Shot Identification of Morality Frames using In-Context Learning

Data scarcity is a common problem in NLP, especially when the annotation...

Please sign up or login with your details

Forgot password? Click here to reset