Efficient, Uncertainty-based Moderation of Neural Networks Text Classifiers

To maximize the accuracy and increase the overall acceptance of text classifiers, we propose a framework for the efficient, in-operation moderation of classifiers' output. Our framework focuses on use cases in which F1-scores of modern Neural Networks classifiers (ca. 90 practice. We suggest a semi-automated approach that uses prediction uncertainties to pass unconfident, probably incorrect classifications to human moderators. To minimize the workload, we limit the human moderated data to the point where the accuracy gains saturate and further human effort does not lead to substantial improvements. A series of benchmarking experiments based on three different datasets and three state-of-the-art classifiers show that our framework can improve the classification F1-scores by 5.1 to 11.2 approx. 98 to 99 a random moderation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/10/2020

FIND: Human-in-the-Loop Debugging Deep Text Classifiers

Since obtaining a perfect training dataset (i.e., a dataset which is con...
research
08/10/2023

Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT

Recently, generative AIs like ChatGPT have become available to the wide ...
research
11/12/2018

Not Just Depressed: Bipolar Disorder Prediction on Reddit

Bipolar disorder, an illness characterized by manic and depressive episo...
research
11/21/2019

MSD: Multi-Self-Distillation Learning via Multi-classifiers within Deep Neural Networks

As the development of neural networks, more and more deep neural network...
research
03/11/2023

Efficient Computation of Shap Explanation Scores for Neural Network Classifiers via Knowledge Compilation

The use of Shap scores has become widespread in Explainable AI. However,...
research
08/27/2020

A benchmark of data stream classification for human activity recognition on connected objects

This paper evaluates data stream classifiers from the perspective of con...
research
04/19/2022

Unsupervised Numerical Reasoning to Extract Phenotypes from Clinical Text by Leveraging External Knowledge

Extracting phenotypes from clinical text has been shown to be useful for...

Please sign up or login with your details

Forgot password? Click here to reset