Unsupervised Domain Adaptation using Lexical Transformations and Label Injection for Twitter Data

07/14/2023
by   Akshat Gupta, et al.
0

Domain adaptation is an important and widely studied problem in natural language processing. A large body of literature tries to solve this problem by adapting models trained on the source domain to the target domain. In this paper, we instead solve this problem from a dataset perspective. We modify the source domain dataset with simple lexical transformations to reduce the domain shift between the source dataset distribution and the target dataset distribution. We find that models trained on the transformed source domain dataset performs significantly better than zero-shot models. Using our proposed transformations to convert standard English to tweets, we reach an unsupervised part-of-speech (POS) tagging accuracy of 92.14 accuracy), which is only slightly below the supervised performance of 94.45 We also use our proposed transformations to synthetically generate tweets and augment the Twitter dataset to achieve state-of-the-art performance for POS tagging.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2020

Unsupervised Domain Adaptation in the Absence of Source Data

Current unsupervised domain adaptation methods can address many types of...
research
04/04/2023

MEnsA: Mix-up Ensemble Average for Unsupervised Multi Target Domain Adaptation on 3D Point Clouds

Unsupervised domain adaptation (UDA) addresses the problem of distributi...
research
07/06/2023

Dense Retrieval Adaptation using Target Domain Description

In information retrieval (IR), domain adaptation is the process of adapt...
research
09/11/2020

Conditional Coupled Generative Adversarial Networks for Zero-Shot Domain Adaptation

Machine learning models trained in one domain perform poorly in the othe...
research
08/26/2023

Unsupervised Domain Adaptation via Domain-Adaptive Diffusion

Unsupervised Domain Adaptation (UDA) is quite challenging due to the lar...
research
01/14/2022

Domain-shift adaptation via linear transformations

A predictor, f_A : X → Y, learned with data from a source domain (A) mig...
research
05/21/2019

Domain adaptation for part-of-speech tagging of noisy user-generated text

The performance of a Part-of-speech (POS) tagger is highly dependent on ...

Please sign up or login with your details

Forgot password? Click here to reset