A Survey of Active Learning for Text Classification using Deep Neural Networks

08/17/2020
by   Christopher Schröder, et al.
0

Natural language processing (NLP) and neural networks (NNs) have both undergone significant changes in recent years. For active learning (AL) purposes, NNs are, however, less commonly used – despite their current popularity. By using the superior text classification performance of NNs for AL, we can either increase a model's performance using the same amount of data or reduce the data and therefore the required annotation efforts while keeping the same performance. We review AL for text classification using deep neural networks (DNNs) and elaborate on two main causes which used to hinder the adoption: (a) the inability of NNs to provide reliable uncertainty estimates, on which the most commonly used query strategies rely, and (b) the challenge of training DNNs on small data. To investigate the former, we construct a taxonomy of query strategies, which distinguishes between data-based, model-based, and prediction-based instance selection, and investigate the prevalence of these classes in recent research. Moreover, we review recent NN-based advances in NLP like word embeddings or language models in the context of (D)NNs, survey the current state-of-the-art at the intersection of AL, text classification, and DNNs and relate recent advances in NLP to AL. Finally, we analyze recent work in AL for text classification, connect the respective query strategies to the taxonomy, and outline commonalities and shortcomings. As a result, we highlight gaps in current research and present open research questions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2021

Active learning for reducing labeling effort in text classification tasks

Labeling data can be an expensive task as it is usually performed manual...
research
01/09/2023

Active Learning for Abstractive Text Summarization

Construction of human-curated annotated datasets for abstractive text su...
research
08/15/2021

Deep Active Learning for Text Classification with Diverse Interpretations

Recently, Deep Neural Networks (DNNs) have made remarkable progress for ...
research
04/20/2022

Active Few-Shot Learning with FASL

Recent advances in natural language processing (NLP) have led to strong ...
research
12/18/2018

Safety and Trustworthiness of Deep Neural Networks: A Survey

In the past few years, significant progress has been made on deep neural...
research
08/26/2020

SHAP values for Explaining CNN-based Text Classification Models

Deep neural networks are increasingly used in natural language processin...
research
08/05/2022

Model Blending for Text Classification

Deep neural networks (DNNs) have proven successful in a wide variety of ...

Please sign up or login with your details

Forgot password? Click here to reset