Diversity-Aware Batch Active Learning for Dependency Parsing

04/28/2021
by Tianze Shi, et al.

While the predictive performance of modern statistical dependency parsers relies heavily on the availability of expensive expert-annotated treebank data, not all annotations contribute equally to the training of the parsers. In this paper, we attempt to reduce the number of labeled examples needed to train a strong dependency parser using batch active learning (AL). In particular, we investigate whether enforcing diversity in the sampled batches, using determinantal point processes (DPPs), can improve over diversity-agnostic counterparts. Simulation experiments on an English newswire corpus show that selecting diverse batches with DPPs is superior to strong selection strategies that do not enforce batch diversity, especially during the initial stages of the learning process. Additionally, our diversity-aware strategy is robust under a corpus duplication setting, where diversity-agnostic sampling strategies exhibit significant degradation.
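To make the idea of diversity-aware batch selection concrete, here is a minimal sketch of greedy batch selection under a DPP-style kernel. It is an illustration under stated assumptions, not the procedure used in the paper: the abstract does not say how sentences are represented or scored, so the feature matrix, the per-item quality scores (for example, model uncertainty), the cosine similarity, and the function name `greedy_dpp_batch` are all hypothetical. The sketch also uses greedy determinant maximization in place of DPP sampling.

```python
import numpy as np


def greedy_dpp_batch(features, quality, batch_size):
    """Greedily pick a batch that approximately maximizes det(L_Y).

    features:   (n, d) array, one hypothetical representation per candidate.
    quality:    (n,) array of positive scores, e.g. model uncertainty (assumed).
    batch_size: number of items to select.
    """
    # Normalize rows so the similarity matrix S is cosine similarity.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)
    similarity = unit @ unit.T

    # Quality-diversity decomposition of the kernel: L_ij = q_i * S_ij * q_j.
    kernel = quality[:, None] * similarity * quality[None, :]

    selected = []
    for _ in range(min(batch_size, len(quality))):
        best_item, best_logdet = None, -np.inf
        for i in range(len(quality)):
            if i in selected:
                continue
            idx = selected + [i]
            # Log-determinant of the kernel restricted to the candidate batch.
            sign, logdet = np.linalg.slogdet(kernel[np.ix_(idx, idx)])
            if sign > 0 and logdet > best_logdet:
                best_item, best_logdet = i, logdet
        if best_item is None:
            # Every remaining candidate is redundant with the current batch.
            break
        selected.append(best_item)
    return selected
```

With an exact duplicate in the pool, the determinant of any batch containing both copies is zero, so this sketch skips the second copy even when it has a higher quality score than a dissimilar item. A plain top-k selection by quality would pick both copies, which is the kind of failure the corpus duplication setting is meant to expose.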

research
11/27/2020

Deep Active Learning for Sequence Labeling Based on Diversity and Uncertainty in Gradient

Recently, several studies have investigated active learning (AL) for nat...
research
01/17/2019

Diverse mini-batch Active Learning

We study the problem of reducing the amount of labeled training data req...
research
06/09/2019

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

We design a new algorithm for batch active learning with deep neural net...
research
07/25/2021

ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation

Despite the success of deep learning on supervised point cloud semantic ...
research
06/19/2019

Batch Active Learning Using Determinantal Point Processes

Data collection and labeling is one of the main challenges in employing ...
research
05/31/2019

Minimum-Margin Active Learning

We present a new active sampling method we call min-margin which trains ...
research
01/28/2023

Leveraging Importance Weights in Subset Selection

We present a subset selection algorithm designed to work with arbitrary ...
