Active Learning Over Multiple Domains in Natural Language Tasks

02/01/2022
by   Shayne Longpre, et al.
4

Studies of active learning traditionally assume the target and source data stem from a single domain. However, in realistic applications, practitioners often require active learning with multiple sources of out-of-distribution data, where it is unclear a priori which data sources will help or hurt the target domain. We survey a wide variety of techniques in active learning (AL), domain shift detection (DS), and multi-domain sampling to examine this challenging setting for question answering and sentiment analysis. We ask (1) what family of methods are effective for this task? And, (2) what properties of selected examples and domains achieve strong results? Among 18 acquisition functions from 4 families of methods, we find H-Divergence methods, and particularly our proposed variant DAL-E, yield effective results, averaging 2-3 diverse allocation of domains, as well as room-for-improvement of existing methods on both domain and example selection. Our findings yield the first comprehensive analysis of both existing and novel methods for practitioners faced with multi-domain active learning for natural language tasks.

READ FULL TEXT
research
08/16/2018

Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study

Several recent papers investigate Active Learning (AL) for mitigating th...
research
11/27/2022

Improving Low-Resource Question Answering using Active Learning in Multiple Stages

Neural approaches have become very popular in the domain of Question Ans...
research
02/14/2023

Investigating Multi-source Active Learning for Natural Language Inference

In recent years, active learning has been successfully applied to an arr...
research
07/06/2021

Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering

Active learning promises to alleviate the massive data needs of supervis...
research
10/03/2018

Active Learning for New Domains in Natural Language Understanding

We explore active learning (AL) utterance selection for improving the ac...
research
06/25/2021

Multi-Domain Active Learning: A Comparative Study

Building classifiers on multiple domains is a practical problem in the r...
research
04/01/2022

Efficient Argument Structure Extraction with Transfer Learning and Active Learning

The automation of extracting argument structures faces a pair of challen...

Please sign up or login with your details

Forgot password? Click here to reset