Label quality in AffectNet: results of crowd-based re-annotation

10/09/2021
by   Doo Yon Kim, et al.
0

AffectNet is one of the most popular resources for facial expression recognition (FER) on relatively unconstrained in-the-wild images. Given that images were annotated by only one annotator with limited consistency checks on the data, however, label quality and consistency may be limited. Here, we take a similar approach to a study that re-labeled another, smaller dataset (FER2013) with crowd-based annotations, and report results from a re-labeling and re-annotation of a subset of difficult AffectNet faces with 13 people on both expression label, and valence and arousal ratings. Our results show that human labels overall have medium to good consistency, whereas human ratings especially for valence are in excellent agreement. Importantly, however, crowd-based labels are significantly shifting towards neutral and happy categories and crowd-based affective ratings form a consistent pattern different from the original ratings. ResNets fully trained on the original AffectNet dataset do not predict human voting patterns, but when weakly-trained do so much better, particularly for valence. Our results have important ramifications for label quality in affective computing.

READ FULL TEXT

page 2

page 9

research
08/03/2016

Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution

Crowd sourcing has become a widely adopted scheme to collect ground trut...
research
07/10/2021

Consensual Collaborative Training And Knowledge Distillation Based Facial Expression Recognition Under Noisy Annotations

Presence of noise in the labels of large scale facial expression dataset...
research
03/13/2017

Users prefer Guetzli JPEG over same-sized libjpeg

We report on pairwise comparisons by human raters of JPEG images from li...
research
05/10/2023

Auditing Cross-Cultural Consistency of Human-Annotated Labels for Recommendation Systems

Recommendation systems increasingly depend on massive human-labeled data...
research
07/26/2012

Identifying Users From Their Rating Patterns

This paper reports on our analysis of the 2011 CAMRa Challenge dataset (...
research
09/04/2018

The Effect of Context on Metaphor Paraphrase Aptness Judgments

We conduct two experiments to study the effect of context on metaphor pa...
research
04/12/2017

Real-time On-Demand Crowd-powered Entity Extraction

Output-agreement mechanisms such as ESP Game have been widely used in hu...

Please sign up or login with your details

Forgot password? Click here to reset