A simple method for domain adaptation of sentence embeddings

08/25/2020
by   Anna Kruspe, et al.
0

Pre-trained sentence embeddings have been shown to be very useful for a variety of NLP tasks. Due to the fact that training such embeddings requires a large amount of data, they are commonly trained on a variety of text data. An adaptation to specific domains could improve results in many cases, but such a finetuning is usually problem-dependent and poses the risk of over-adapting to the data used for adaptation. In this paper, we present a simple universal method for finetuning Google's Universal Sentence Encoder (USE) using a Siamese architecture. We demonstrate how to use this approach for a variety of data sets and present results on different data sets representing similar problems. The approach is also compared to traditional finetuning on these data sets. As a further advantage, the approach can be used for combining data sets with different annotations. We also present an embedding finetuned on all data sets in parallel.

READ FULL TEXT

page 7

page 8

research
08/14/2017

Data Sets: Word Embeddings Learned from Tweets and General Data

A word embedding is a low-dimensional, dense and real- valued vector rep...
research
12/15/2022

Silhouette: Toward Performance-Conscious and Transferable CPU Embeddings

Learned embeddings are widely used to obtain concise data representation...
research
07/06/2023

Efficient Domain Adaptation of Sentence Embeddings using Adapters

Sentence embeddings enable us to capture the semantic similarity of shor...
research
11/01/2021

Domain-adaptation of spherical embeddings

Domain adaptation of embedding models, updating a generic embedding to t...
research
06/16/2018

Evaluation of sentence embeddings in downstream and linguistic probing tasks

Despite the fast developmental pace of new sentence embedding methods, i...
research
06/16/2020

Domain Adaptation with Morphologic Segmentation

We present a novel domain adaptation framework that uses morphologic seg...
research
02/13/2023

Towards Writing Style Adaptation in Handwriting Recognition

One of the challenges of handwriting recognition is to transcribe a larg...

Please sign up or login with your details

Forgot password? Click here to reset