ScatterShot: Interactive In-context Example Curation for Text Transformation

02/14/2023
by   Tongshuang Wu, et al.
0

The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when "enough" examples have been included even for known patterns. In this work, we present ScatterShot, an interactive system for building high-quality demonstration sets for in-context learning. ScatterShot iteratively slices unlabeled data into task-specific patterns, samples informative inputs from underexplored or not-yet-saturated slices in an active learning manner, and helps users label more efficiently with the help of an LLM and the current example set. In simulation studies on two text perturbation scenarios, ScatterShot sampling improves the resulting few-shot functions by 4-5 percentage points over random sampling, with less variance as more examples are added. In a user study, ScatterShot greatly helps users in covering different patterns in the input space and labeling in-context examples more efficiently, resulting in better in-context learning and less user effort.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2023

Active Learning Principles for In-Context Learning with Large Language Models

The remarkable advancements in large language models (LLMs) have signifi...
research
11/15/2022

MEAL: Stable and Active Learning for Few-Shot Prompting

Few-shot classification in NLP has recently made great strides due to th...
research
06/27/2012

Batch Active Learning via Coordinated Matching

Most prior work on active learning of classifiers has focused on sequent...
research
09/13/2023

VLSlice: Interactive Vision-and-Language Slice Discovery

Recent work in vision-and-language demonstrates that large-scale pretrai...
research
05/23/2020

Active Learning for Skewed Data Sets

Consider a sequential active learning problem where, at each round, an a...
research
12/13/2022

Structured Prompting: Scaling In-Context Learning to 1,000 Examples

Large language models have exhibited intriguing in-context learning capa...
research
10/06/2015

Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks

How can we reuse existing knowledge, in the form of available datasets, ...

Please sign up or login with your details

Forgot password? Click here to reset