MEAL: Stable and Active Learning for Few-Shot Prompting

11/15/2022
by   Abdullatif Köksal, et al.
0

Few-shot classification in NLP has recently made great strides due to the availability of large foundation models that, through priming and prompting, are highly effective few-shot learners. However, this approach has high variance across different sets of few shots and across different finetuning runs. For example, we find that validation accuracy on RTE can vary by as much as 27 points. In this context, we make two contributions for more effective few-shot learning. First, we propose novel ensembling methods and show that they substantially reduce variance. Second, since performance depends a lot on the set of few shots selected, active learning is promising for few-shot classification. Based on our stable ensembling method, we build on existing work on active learning and introduce a new criterion: inter-prompt uncertainty sampling with diversity. We present the first active learning based approach to select training examples for prompt-based learning and show that it outperforms prior work on active learning. Finally, we show that our combined method, MEAL (Multiprompt finetuning and prediction Ensembling with Active Learning), improves overall performance of prompt-based finetuning by 2.3 absolute points on five different tasks.

READ FULL TEXT
research
04/20/2022

Active Few-Shot Learning with FASL

Recent advances in natural language processing (NLP) have led to strong ...
research
05/23/2023

Active Learning Principles for In-Context Learning with Large Language Models

The remarkable advancements in large language models (LLMs) have signifi...
research
05/20/2020

Batch Decorrelation for Active Metric Learning

We present an active learning strategy for training parametric models of...
research
05/31/2019

Minimum-Margin Active Learning

We present a new active sampling method we call min-margin which trains ...
research
09/04/2019

Augmented Memory Networks for Streaming-Based Active One-Shot Learning

One of the major challenges in training deep architectures for predictiv...
research
02/14/2023

ScatterShot: Interactive In-context Example Curation for Text Transformation

The in-context learning capabilities of LLMs like GPT-3 allow annotators...
research
05/13/2023

An Active Learning-based Approach for Hosting Capacity Analysis in Distribution Systems

With the increasing amount of distributed energy resources (DERs) integr...

Please sign up or login with your details

Forgot password? Click here to reset