On Isotropy and Learning Dynamics of Contrastive-based Sentence Representation Learning

12/18/2022
by   Chenghao Xiao, et al.
8

Incorporating contrastive learning objectives in sentence representation learning (SRL) has yielded significant improvements on many sentence-level NLP tasks. However, It is not well understood why contrastive learning works for learning sentence-level semantics. In this paper, we take a closer look at contrastive sentence representation learning through the lens of isotropy and learning dynamics. We interpret its success stories through the geometry of the representation shifts. We show that contrastive learning brings isotropy, and surprisingly learns to converge tokens to similar positions in the semantic space if given the signal that they are in the same sentence. Also, what we formalize as "spurious contextualization" is mitigated for semantically meaningful tokens, while augmented for functional ones. The embedding space is pushed toward the origin during training, with more areas now better defined. We ablate these findings by observing the learning dynamic with different training temperatures, batch sizes and pooling methods. With these findings, we aim to shed light on future designs of sentence representation learning methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2022

InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings

Contrastive learning has been extensively studied in sentence embedding ...
research
03/07/2018

An efficient framework for learning sentence representations

In this work we propose a simple and efficient framework for learning se...
research
12/31/2020

CLEAR: Contrastive Learning for Sentence Representation

Pre-trained language models have proven their unique powers in capturing...
research
09/03/2021

Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation

Exemplar-Guided Paraphrase Generation (EGPG) aims to generate a target s...
research
08/19/2021

Batch Curation for Unsupervised Contrastive Representation Learning

The state-of-the-art unsupervised contrastive visual representation lear...
research
05/09/2023

StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure

This work explores the utility of explicit structure for representation ...
research
10/31/2022

SDCL: Self-Distillation Contrastive Learning for Chinese Spell Checking

Due to the ambiguity of homophones, Chinese Spell Checking (CSC) has wid...

Please sign up or login with your details

Forgot password? Click here to reset