Diversity-Based Generalization for Neural Unsupervised Text Classification under Domain Shift

02/25/2020
by Jitin Krishnan, et al.

Domain adaptation approaches seek to learn from a source domain and generalize to an unseen target domain. At present, the state-of-the-art domain adaptation approaches for subjective text classification problems are semi-supervised and use unlabeled target data along with labeled source data. In this paper, we propose a novel method for domain adaptation of single-task text classification problems based on a simple but effective idea: diversity-based generalization, which does not require unlabeled target data. Diversity encourages the model to generalize better and to be robust to domain shift by forcing it not to rely on the same features for prediction. We apply this concept to the most explainable component of neural networks, the attention layer. To generate sufficient diversity, we create a multi-head attention model and impose a diversity constraint between the attention heads so that each head learns differently. We further extend our model with tri-training, designing a procedure with an additional diversity constraint between the attention heads of the tri-trained classifiers. Extensive evaluation on the standard benchmark dataset of Amazon reviews and a newly constructed dataset of Crisis events shows that our fully unsupervised method matches the competing semi-supervised baselines. Our results demonstrate that machine learning architectures that ensure sufficient diversity can generalize better, encouraging future research to design ubiquitously usable learning models without using unlabeled target data.
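
To make the idea of a diversity constraint between attention heads more concrete, here is a minimal NumPy sketch. The abstract does not give the exact form of the constraint, so the penalty used here (the squared off-diagonal entries of the head-by-head overlap matrix A Aᵀ) is an assumption chosen for illustration, not necessarily the authors' formulation. The names `multi_head_attention`, `diversity_penalty`, and the scoring matrix `W` are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def multi_head_attention(H, W):
    """Compute one attention distribution per head.

    H: (T, d) array of token representations for one sentence.
    W: (k, d) array, one (hypothetical) scoring vector per head.
    Returns A: (k, T) array; each row is a distribution over tokens.
    """
    scores = W @ H.T  # (k, T): one score per head and token
    return softmax(scores, axis=-1)


def diversity_penalty(A):
    """Assumed penalty: sum of squared overlaps between distinct heads.

    A @ A.T holds dot products between the heads' attention distributions.
    The diagonal (each head with itself) is ignored, so the penalty is zero
    exactly when no two heads place weight on the same token.
    """
    G = A @ A.T
    off_diagonal = G - np.diag(np.diag(G))
    return float(np.sum(off_diagonal ** 2))


# Two heads attending to the same token overlap fully; two heads attending
# to different tokens do not overlap at all.
A_same = np.array([[1.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
A_disjoint = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
print(diversity_penalty(A_same))      # 2.0
print(diversity_penalty(A_disjoint))  # 0.0
```

In training, a term like this would typically be added to the classification loss with a weighting coefficient (a hypothetical hyperparameter), so that minimizing the total loss pushes the heads to attend to different tokens.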

research
07/07/2021

Learning Invariant Representation with Consistency and Diversity for Semi-supervised Source Hypothesis Transfer

Semi-supervised domain adaptation (SSDA) aims to solve tasks in target d...
research
03/16/2020

A Label Proportions Estimation technique for Adversarial Domain Adaptation in Text Classification

Many text classification tasks are domain-dependent, and various domain ...
research
10/23/2022

Unsupervised Non-transferable Text Classification

Training a good deep learning model requires substantial data and comput...
research
11/04/2021

TimeMatch: Unsupervised Cross-Region Adaptation by Temporal Shift Estimation

The recent developments of deep learning models that capture the complex...
research
05/28/2015

Domain-Adversarial Training of Neural Networks

We introduce a new representation learning approach for domain adaptatio...
research
06/28/2021

Domain Adaptation Broad Learning System Based on Locally Linear Embedding

Broad learning system (BLS) has been proposed for a few years. It demons...
research
03/04/2020

Unsupervised and Interpretable Domain Adaptation to Rapidly Filter Social Web Data for Emergency Services

During the onset of a disaster event, filtering relevant information fro...
