Efficient Model Finetuning for Text Classification via Data Filtering

07/28/2022
by   Xu Ouyang, et al.
0

As model finetuning is central to the modern NLP, we set to maximize its efficiency. Motivated by training examples are often redundant, we design an algorithm that filters the examples in a streaming fashion. Our key techniques are two: (1) automatically determine a training loss threshold for skipping the backward propagation; and (2) maintain a meta predictor for further skipping the forward propagation. Incarnated as a three-stage process, on a diverse set of benchmarks our algorithm reduces the required training examples by up to 5× while only seeing minor degradation on average. Our method is effective even for as few as one training epoch, where each training example is encountered once. It is simple to implement and is compatible with the existing model finetuning optimizations such as layer freezing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2022

Careful Data Curation Stabilizes In-context Learning

In-context learning (ICL) enables large language models (LLMs) to perfor...
research
08/30/2023

Towards One-Shot Learning for Text Classification using Inductive Logic Programming

With the ever-increasing potential of AI to perform personalised tasks, ...
research
02/27/2023

Make Every Example Count: On Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets

Increasingly larger datasets have become a standard ingredient to advanc...
research
03/27/2013

Automated Generation of Connectionist Expert Systems for Problems Involving Noise and Redundancy

When creating an expert system, the most difficult and expensive task is...
research
08/11/2020

Intrinsic Certified Robustness of Bagging against Data Poisoning Attacks

In a data poisoning attack, an attacker modifies, deletes, and/or insert...
research
08/31/2012

Statistically adaptive learning for a general class of cost functions (SA L-BFGS)

We present a system that enables rapid model experimentation for tera-sc...
research
10/19/2011

A Reliable Effective Terascale Linear Learning System

We present a system and a set of techniques for learning linear predicto...

Please sign up or login with your details

Forgot password? Click here to reset