Neural Supervised Domain Adaptation by Augmenting Pre-trained Models with Random Units

06/09/2021
by   Sara Meftah, et al.
16

Neural Transfer Learning (TL) is becoming ubiquitous in Natural Language Processing (NLP), thanks to its high performance on many tasks, especially in low-resourced scenarios. Notably, TL is widely used for neural domain adaptation to transfer valuable knowledge from high-resource to low-resource domains. In the standard fine-tuning scheme of TL, a model is initially pre-trained on a source domain and subsequently fine-tuned on a target domain and, therefore, source and target domains are trained using the same architecture. In this paper, we show through interpretation methods that such scheme, despite its efficiency, is suffering from a main limitation. Indeed, although capable of adapting to new domains, pre-trained neurons struggle with learning certain patterns that are specific to the target domain. Moreover, we shed light on the hidden negative transfer occurring despite the high relatedness between source and target domains, which may mitigate the final gain brought by transfer learning. To address these problems, we propose to augment the pre-trained model with normalised, weighted and randomly initialised units that foster a better adaptation while maintaining the valuable source knowledge. We show that our approach exhibits significant improvements to the standard fine-tuning scheme for neural domain adaptation from the news domain to the social media domain on four NLP tasks: part-of-speech tagging, chunking, named entity recognition and morphosyntactic tagging.

READ FULL TEXT
research
04/07/2019

Joint Learning of Pre-Trained and Random Units for Domain Adaptation in Part-of-Speech Tagging

Fine-tuning neural networks is widely used to transfer valuable knowledg...
research
09/01/2021

DILBERT: Customized Pre-Training for Domain Adaptation withCategory Shift, with an Application to Aspect Extraction

The rise of pre-trained language models has yielded substantial progress...
research
11/02/2022

Low-Resource Music Genre Classification with Advanced Neural Model Reprogramming

Transfer learning (TL) approaches have shown promising results when hand...
research
07/17/2023

Domain Adaptation using Silver Standard Masks for Lateral Ventricle Segmentation in FLAIR MRI

Lateral ventricular volume (LVV) is an important biomarker for clinical ...
research
05/18/2023

Parameter-Efficient Learning for Text-to-Speech Accent Adaptation

This paper presents a parameter-efficient learning (PEL) to develop a lo...
research
05/21/2019

Domain adaptation for part-of-speech tagging of noisy user-generated text

The performance of a Part-of-speech (POS) tagger is highly dependent on ...
research
02/22/2022

Model Reprogramming: Resource-Efficient Cross-Domain Machine Learning

In data-rich domains such as vision, language, and speech, deep learning...

Please sign up or login with your details

Forgot password? Click here to reset