Improving Human-Labeled Data through Dynamic Automatic Conflict Resolution

12/08/2020
by   David Q. Sun, et al.
0

This paper develops and implements a scalable methodology for (a) estimating the noisiness of labels produced by a typical crowdsourcing semantic annotation task, and (b) reducing the resulting error of the labeling process by as much as 20-30 new approach to the labeling process, which we name Dynamic Automatic Conflict Resolution (DACR), does not require a ground truth dataset and is instead based on inter-project annotation inconsistencies. This makes DACR not only more accurate but also available to a broad range of labeling tasks. In what follows we present results from a text classification task performed at scale for a commercial personal assistant, and evaluate the inherent ambiguity uncovered by this annotation strategy as compared to other common labeling strategies.

READ FULL TEXT
research
07/11/2021

Learning from Crowds with Sparse and Imbalanced Annotations

Traditional supervised learning requires ground truth labels for the tra...
research
01/09/2017

Crowdsourcing Ground Truth for Medical Relation Extraction

Cognitive computing systems require human labeled data for evaluation, a...
research
09/24/2018

Empirical Methodology for Crowdsourcing Ground Truth

The process of gathering ground truth data through human annotation is a...
research
12/20/2019

Assessing Data Quality of Annotations with Krippendorff Alpha For Applications in Computer Vision

Current supervised deep learning frameworks rely on annotated data for m...
research
06/27/2023

"Is a picture of a bird a bird": Policy recommendations for dealing with ambiguity in machine vision models

Many questions that we ask about the world do not have a single clear an...
research
06/11/2022

A Decomposition-Based Approach for Evaluating Inter-Annotator Disagreement in Narrative Analysis

In this work, we explore sources of inter-annotator disagreement in narr...
research
04/11/2018

Offline Object Extraction from Dynamic Occupancy Grid Map Sequences

A dynamic occupancy grid map (DOGMa) allows a fast, robust, and complete...

Please sign up or login with your details

Forgot password? Click here to reset