How Does Fine-tuning Affect the Geometry of Embedding Space: A Case Study on Isotropy

09/10/2021
by   Sara Rajaee, et al.
0

It is widely accepted that fine-tuning pre-trained language models usually brings about performance improvements in downstream tasks. However, there are limited studies on the reasons behind this effectiveness, particularly from the viewpoint of structural changes in the embedding space. Trying to fill this gap, in this paper, we analyze the extent to which the isotropy of the embedding space changes after fine-tuning. We demonstrate that, even though isotropy is a desirable geometrical property, fine-tuning does not necessarily result in isotropy enhancements. Moreover, local structures in pre-trained contextual word representations (CWRs), such as those encoding token types or frequency, undergo a massive change during fine-tuning. Our experiments show dramatic growth in the number of elongated directions in the embedding space, which, in contrast to pre-trained CWRs, carry the essential linguistic knowledge in the fine-tuned embedding space, making existing isotropy enhancement methods ineffective.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2020

On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers

Fine-tuning pre-trained contextualized embedding models has become an in...
research
03/12/2023

Knowledge-integrated AutoEncoder Model

Data encoding is a common and central operation in most data analysis ta...
research
03/17/2022

On the Importance of Data Size in Probing Fine-tuned Models

Several studies have investigated the reasons behind the effectiveness o...
research
03/23/2022

A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization

Model fine-tuning and adaptation have become a common approach for model...
research
05/19/2023

Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast

Existing research has shown that a multilingual pre-trained language mod...
research
02/15/2023

Measuring the Instability of Fine-Tuning

Fine-tuning pre-trained language models on downstream tasks with varying...
research
11/19/2021

An Analysis of the Influence of Transfer Learning When Measuring the Tortuosity of Blood Vessels

Characterizing blood vessels in digital images is important for the diag...

Please sign up or login with your details

Forgot password? Click here to reset