Revisiting the Updates of a Pre-trained Model for Few-shot Learning

05/13/2022
by   Yujin Kim, et al.
5

Most of the recent few-shot learning algorithms are based on transfer learning, where a model is pre-trained using a large amount of source data, and the pre-trained model is updated using a small amount of target data afterward. In transfer-based few-shot learning, sophisticated pre-training methods have been widely studied for universal and improved representation. However, there is little study on updating pre-trained models for few-shot learning. In this paper, we compare the two popular updating methods, fine-tuning (i.e., updating the entire network) and linear probing (i.e., updating only the linear classifier), considering the distribution shift between the source and target data. We find that fine-tuning is better than linear probing as the number of samples increases, regardless of distribution shift. Next, we investigate the effectiveness and ineffectiveness of data augmentation when pre-trained models are fine-tuned. Our fundamental analyses demonstrate that careful considerations of the details about updating pre-trained models are required for better few-shot performance.

READ FULL TEXT

page 10

page 11

page 13

page 17

research
07/19/2022

Similarity of Pre-trained and Fine-tuned Representations

In transfer learning, only the last part of the networks - the so-called...
research
10/06/2020

On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers

Fine-tuning pre-trained contextualized embedding models has become an in...
research
07/19/2022

Revealing Secrets From Pre-trained Models

With the growing burden of training deep learning models with large data...
research
05/25/2023

Representation Transfer Learning via Multiple Pre-trained models for Linear Regression

In this paper, we consider the problem of learning a linear regression m...
research
05/06/2020

Unsupervised Pre-trained Models from Healthy ADLs Improve Parkinson's Disease Classification of Gait Patterns

Application and use of deep learning algorithms for different healthcare...
research
07/14/2021

BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews

User reviews have an essential role in the success of the developed mobi...
research
11/16/2022

On Measuring the Intrinsic Few-Shot Hardness of Datasets

While advances in pre-training have led to dramatic improvements in few-...

Please sign up or login with your details

Forgot password? Click here to reset