On the Transformation of Latent Space in Fine-Tuned NLP Models

10/23/2022
by   Nadir Durrani, et al.
0

We study the evolution of latent space in fine-tuned NLP models. Different from the commonly used probing-framework, we opt for an unsupervised method to analyze representations. More specifically, we discover latent concepts in the representational space using hierarchical clustering. We then use an alignment function to gauge the similarity between the latent space of a pre-trained model and its fine-tuned version. We use traditional linguistic concepts to facilitate our understanding and also study how the model space transforms towards task-specific information. We perform a thorough analysis, comparing pre-trained and fine-tuned models across three models and three downstream tasks. The notable findings of our work are: i) the latent space of the higher layers evolve towards task-specific concepts, ii) whereas the lower layers retain generic concepts acquired in the pre-trained model, iii) we discovered that some concepts in the higher layers acquire polarity towards the output class, and iv) that these concepts can be used for generating adversarial triggers.

READ FULL TEXT

page 6

page 14

page 16

page 17

page 18

page 19

page 20

page 21

research
02/13/2023

Task-Specific Skill Localization in Fine-tuned Language Models

Pre-trained language models can be fine-tuned to solve diverse NLP tasks...
research
06/27/2022

Analyzing Encoded Concepts in Transformer Language Models

We propose a novel framework ConceptX, to analyze how latent concepts ar...
research
05/15/2022

Discovering Latent Concepts Learned in BERT

A large number of studies that analyze deep neural network models and th...
research
11/08/2020

Bait and Switch: Online Training Data Poisoning of Autonomous Driving Systems

We show that by controlling parts of a physical environment in which a p...
research
11/12/2022

ConceptX: A Framework for Latent Concept Analysis

The opacity of deep neural networks remains a challenge in deploying sol...
research
07/28/2021

An Evaluation of Generative Pre-Training Model-based Therapy Chatbot for Caregivers

With the advent of off-the-shelf intelligent home products and broader i...
research
08/20/2023

Scaled-up Discovery of Latent Concepts in Deep NLP Models

Pre-trained language models (pLMs) learn intricate patterns and contextu...

Please sign up or login with your details

Forgot password? Click here to reset