Few-shot learning for sentence pair classification and its applications in software engineering

06/13/2023
by   Robert Kraig Helmeczi, et al.
0

Few-shot learning-the ability to train models with access to limited data-has become increasingly popular in the natural language processing (NLP) domain, as large language models such as GPT and T0 have been empirically shown to achieve high performance in numerous tasks with access to just a handful of labeled examples. Smaller language models such as BERT and its variants have also been shown to achieve strong performance with just a handful of labeled examples when combined with few-shot learning algorithms like pattern-exploiting training (PET) and SetFit. The focus of this work is to investigate the performance of alternative few-shot learning approaches with BERT-based models. Specifically, vanilla fine-tuning, PET and SetFit are compared for numerous BERT-based checkpoints over an array of training set sizes. To facilitate this investigation, applications of few-shot learning are considered in software engineering. For each task, high-performance techniques and their associated model checkpoints are identified through detailed empirical analysis. Our results establish PET as a strong few-shot learning approach, and our analysis shows that with just a few hundred labeled examples it can achieve performance near that of fine-tuning on full-sized data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/31/2020

Making Pre-trained Language Models Better Few-shot Learners

The recent GPT-3 model (Brown et al., 2020) achieves remarkable few-shot...
research
03/22/2021

Improving and Simplifying Pattern Exploiting Training

Recently, pre-trained language models (LMs) have achieved strong perform...
research
01/02/2023

Adaptive Fine-tuning for Multiclass Classification over Software Requirement Data

The analysis of software requirement specifications (SRS) using Natural ...
research
11/04/2022

A Prompt-based Few-shot Learning Approach to Software Conflict Detection

A software requirement specification (SRS) document is an essential part...
research
02/14/2023

Few-shot learning approaches for classifying low resource domain specific software requirements

With the advent of strong pre-trained natural language processing models...
research
11/28/2022

Revisiting Distance Metric Learning for Few-Shot Natural Language Classification

Distance Metric Learning (DML) has attracted much attention in image pro...
research
03/16/2022

Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again

The strong few-shot in-context learning capability of large pre-trained ...

Please sign up or login with your details

Forgot password? Click here to reset