Active^2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

03/11/2021
by   Rishi Hazra, et al.
0

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation based on their estimated utility in training the given model. In this paper, we argue that since AL strategies choose examples independently, they may potentially select similar examples, all of which may not contribute significantly to the learning process. Our proposed approach, Active^2 Learning (A^2L), actively adapts to the deep learning model being trained to eliminate further such redundant examples chosen by an AL strategy. We show that A^2L is widely applicable by using it in conjunction with several different AL strategies and NLP tasks. We empirically demonstrate that the proposed approach is further able to reduce the data requirements of state-of-the-art AL strategies by an absolute percentage reduction of ≈3-25% on multiple NLP tasks while achieving the same performance with no additional computation overhead.

READ FULL TEXT
research
11/01/2019

Active Learning with Siamese Twins for Sequence Tagging

Deep learning, in general, and natural language processing methods, in p...
research
08/01/2023

ALE: A Simulation-Based Active Learning Evaluation Framework for the Parameter-Driven Comparison of Query Strategies for NLP

Supervised machine learning and deep learning require a large amount of ...
research
07/12/2018

How transferable are the datasets collected by active learners?

Active learning is a widely-used training strategy for maximizing predic...
research
05/30/2019

Understanding Goal-Oriented Active Learning via Influence Functions

Active learning (AL) concerns itself with learning a model from as few l...
research
05/24/2023

Active Learning for Natural Language Generation

The field of text generation suffers from a severe shortage of labeled d...
research
10/26/2022

Eeny, meeny, miny, moe. How to choose data for morphological inflection

Data scarcity is a widespread problem in numerous natural language proce...
research
06/22/2011

Acquiring Word-Meaning Mappings for Natural Language Interfaces

This paper focuses on a system, WOLFIE (WOrd Learning From Interpreted E...

Please sign up or login with your details

Forgot password? Click here to reset