Active Learning with Siamese Twins for Sequence Tagging

11/01/2019
by Rishi Hazra, et al.

Deep learning in general, and natural language processing methods in particular, rely heavily on annotated samples to achieve good performance. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labelled data by iteratively selecting a small number of examples for manual annotation based on their estimated utility in training the given model. In this paper, we argue that since AL strategies choose examples independently, they may select similar examples, not all of which aid the learning process. We propose a method, referred to as Active^2 Learning (A^2L), that actively adapts to the sequence tagging model being trained in order to further eliminate such redundant examples chosen by an AL strategy. We empirically demonstrate that A^2L improves the performance of state-of-the-art AL strategies on different sequence tagging tasks. Furthermore, we show that A^2L is widely applicable by using it in conjunction with different AL strategies and sequence tagging models. We demonstrate that A^2L reaches the full-data F-score with ≈2-16% less data than state-of-the-art AL strategies on different sequence tagging datasets.
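The idea of removing redundant picks from an AL batch can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract says A^2L adapts to the tagging model being trained, whereas the sketch below substitutes a fixed cosine similarity over precomputed sentence embeddings. The names `select_batch`, `scores`, `embeddings`, and `sim_threshold` are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def select_batch(
    scores: Sequence[float],
    embeddings: Sequence[Sequence[float]],
    k: int,
    sim_threshold: float = 0.9,
) -> List[int]:
    """Pick up to k high-utility examples, skipping near-duplicates.

    scores[i] is an AL utility estimate (e.g. model uncertainty) for example i.
    embeddings[i] is an assumed fixed vector representation of example i.
    """
    # Rank candidates by utility, highest first, as a plain AL strategy would.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    chosen: List[int] = []
    for i in ranked:
        if len(chosen) == k:
            break
        # Keep the candidate only if it is dissimilar to everything chosen so far.
        if all(cosine(embeddings[i], embeddings[j]) < sim_threshold for j in chosen):
            chosen.append(i)
    return chosen


# Toy data: examples 0 and 1 are near-duplicates with the two highest scores.
scores = [0.9, 0.85, 0.5, 0.1]
embeddings = [[1.0, 0.0], [0.99, 0.05], [0.0, 1.0], [1.0, 1.0]]
batch = select_batch(scores, embeddings, k=2)
```

With the toy data above, plain top-2 selection would send both near-duplicates to the annotator, while the filtered selection replaces the second one with the next most useful dissimilar example.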

research
03/11/2021

Active^2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

While deep learning is a powerful tool for natural language processing (...
research
01/20/2021

Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

Annotating training data for sequence tagging tasks is usually very time...
research
05/30/2019

Understanding Goal-Oriented Active Learning via Influence Functions

Active learning (AL) concerns itself with learning a model from as few l...
research
10/09/2018

Discovering General-Purpose Active Learning Strategies

We propose a general-purpose approach to discovering active learning (AL...
research
11/02/2020

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

Active learning (AL) uses a data selection algorithm to select useful tr...
research
07/12/2018

How transferable are the datasets collected by active learners?

Active learning is a widely-used training strategy for maximizing predic...
research
06/22/2011

Acquiring Word-Meaning Mappings for Natural Language Interfaces

This paper focuses on a system, WOLFIE (WOrd Learning From Interpreted E...
