Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models

10/05/2020
by Thuy-Trang Vu, et al.

Recent work has shown the importance of adapting broad-coverage contextualised embedding models to the domain of the target task of interest. Current self-supervised adaptation methods are simplistic, as the training signal comes from a small percentage of randomly masked-out tokens. In this paper, we show that careful masking strategies can bridge the knowledge gap of masked language models (MLMs) about the domains more effectively by allocating self-supervision where it is needed. Furthermore, we propose an effective training strategy that adversarially masks out the tokens that are harder for the underlying MLM to reconstruct. The adversarial objective leads to a challenging combinatorial optimisation problem over subsets of tokens, which we tackle efficiently through relaxation to a variational lower bound and dynamic programming. On six unsupervised domain adaptation tasks involving named entity recognition, our method strongly outperforms the random masking strategy, improving F1 score by up to 1.64 points.
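
The contrast between random masking and masking the tokens that are hardest to reconstruct can be illustrated with a small sketch. This is not the paper's method: the abstract describes a relaxation to a variational lower bound solved with dynamic programming, whereas the code below swaps in a simple greedy top-k selection over per-token losses. The function names, the 25% mask rate, the example sentence, and the loss values are all hypothetical and chosen only for illustration.

```python
import random


def random_mask(num_tokens, mask_rate=0.15, seed=0):
    """Baseline: pick positions to mask uniformly at random."""
    k = max(1, round(num_tokens * mask_rate))
    rng = random.Random(seed)
    return sorted(rng.sample(range(num_tokens), k))


def hardest_token_mask(token_losses, mask_rate=0.15):
    """Greedy stand-in for adversarial masking (an assumption, not the
    paper's algorithm): mask the k positions with the highest loss."""
    k = max(1, round(len(token_losses) * mask_rate))
    ranked = sorted(
        range(len(token_losses)),
        key=lambda i: token_losses[i],
        reverse=True,
    )
    return sorted(ranked[:k])


# Hypothetical sentence and made-up per-token reconstruction losses.
tokens = ["the", "patient", "received", "dexamethasone",
          "for", "covid-19", "pneumonia", "."]
losses = [0.1, 0.9, 0.7, 5.2, 0.2, 4.8, 2.1, 0.05]

masked = hardest_token_mask(losses, mask_rate=0.25)
print([tokens[i] for i in masked])  # ['dexamethasone', 'covid-19']
print(random_mask(len(tokens), mask_rate=0.25))
```

In this toy example the loss-driven selection concentrates on the domain-specific terms, while the random baseline is equally likely to spend its masking budget on function words or punctuation.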

research
03/30/2021

Geometric Unsupervised Domain Adaptation for Semantic Segmentation

Simulators can efficiently generate large amounts of labeled synthetic d...
research
04/07/2020

Inexpensive Domain Adaptation of Pretrained Language Models: A Case Study on Biomedical Named Entity Recognition

Domain adaptation of Pretrained Language Models (PTLMs) is typically ach...
research
04/14/2021

UDALM: Unsupervised Domain Adaptation through Language Modeling

In this work we explore Unsupervised Domain Adaptation (UDA) of pretrain...
research
07/20/2021

Self-Supervised Domain Adaptation for Diabetic Retinopathy Grading using Vessel Image Reconstruction

This paper investigates the problem of domain adaptation for diabetic re...
research
06/29/2023

Prompt Ensemble Self-training for Open-Vocabulary Domain Adaptation

Traditional domain adaptation assumes the same vocabulary across source ...
research
05/22/2020

Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation

We tackle the task of building supervised event trigger identification m...
research
11/29/2022

Soft Alignment Objectives for Robust Adaptation in Machine Translation

Domain adaptation allows generative language models to address specific ...
