SeedBERT: Recovering Annotator Rating Distributions from an Aggregated Label

11/23/2022
by   Aneesha Sampath, et al.
0

Many machine learning tasks – particularly those in affective computing – are inherently subjective. When asked to classify facial expressions or to rate an individual's attractiveness, humans may disagree with one another, and no single answer may be objectively correct. However, machine learning datasets commonly have just one "ground truth" label for each sample, so models trained on these labels may not perform well on tasks that are subjective in nature. Though allowing models to learn from the individual annotators' ratings may help, most datasets do not provide annotator-specific labels for each sample. To address this issue, we propose SeedBERT, a method for recovering annotator rating distributions from a single label by inducing pre-trained models to attend to different portions of the input. Our human evaluations indicate that SeedBERT's attention mechanism is consistent with human sources of annotator disagreement. Moreover, in our empirical evaluations using large language models, SeedBERT demonstrates substantial gains in performance on downstream subjective tasks compared both to standard deep learning models and to other current models that account explicitly for annotator disagreement.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2023

"Is a picture of a bird a bird": Policy recommendations for dealing with ambiguity in machine vision models

Many questions that we ask about the world do not have a single clear an...
research
07/06/2023

Style Over Substance: Evaluation Biases for Large Language Models

As large language models (LLMs) continue to advance, accurately and comp...
research
10/12/2021

On Releasing Annotator-Level Labels and Information in Datasets

A common practice in building NLP datasets, especially using crowd-sourc...
research
02/07/2022

Jury Learning: Integrating Dissenting Voices into Machine Learning Models

Whose labels should a machine learning (ML) algorithm learn to emulate? ...
research
01/16/2016

Brain-Inspired Deep Networks for Image Aesthetics Assessment

Image aesthetics assessment has been challenging due to its subjective n...
research
02/19/2016

A Mutual Contamination Analysis of Mixed Membership and Partial Label Models

Many machine learning problems can be characterized by mutual contaminat...
research
04/13/2022

Mitigating Bias in Facial Analysis Systems by Incorporating Label Diversity

Facial analysis models are increasingly applied in real-world applicatio...

Please sign up or login with your details

Forgot password? Click here to reset