Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking

05/02/2020
by Eduardo Fonseca, et al.

The study of label noise in sound event recognition has recently gained attention with the advent of larger and noisier datasets. This work addresses the problem of missing labels, one of the main weaknesses of large audio datasets and one of the most conspicuous issues in AudioSet. We propose a simple, model-agnostic method based on a teacher-student framework with loss masking: it first identifies the most critical missing-label candidates and then ignores their contribution during learning. We find that this simple optimisation of the training label set improves recognition performance without additional compute. Most of the improvement comes from ignoring a tiny but critical portion of the missing labels. We also show that the damage done by missing labels grows as the training set shrinks, yet it can still be observed even when training with massive amounts of audio. We believe these insights can generalize to other large-scale datasets.
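
The two steps named in the abstract (identify missing-label candidates with a teacher, then mask their loss when training the student) can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' implementation: the function names `build_loss_mask` and `masked_bce`, the `discard_fraction` parameter, and the per-class "drop the highest-scored negatives" rule are all assumptions made for the example.

```python
import numpy as np

def build_loss_mask(labels, teacher_scores, discard_fraction=0.01):
    """Return a 0/1 mask over (clip, class) entries.

    Hypothetical rule: per class, the negative labels that the teacher
    scores highest are treated as likely missing labels and masked out.
    Positive labels are never masked.
    """
    mask = np.ones(labels.shape, dtype=np.float64)
    for c in range(labels.shape[1]):
        neg_idx = np.flatnonzero(labels[:, c] == 0)
        n_drop = int(np.ceil(discard_fraction * neg_idx.size))
        if n_drop == 0:
            continue
        # Sort this class's negatives by teacher score, highest first.
        order = np.argsort(teacher_scores[neg_idx, c])[::-1]
        mask[neg_idx[order[:n_drop]], c] = 0.0
    return mask

def masked_bce(labels, student_probs, mask, eps=1e-7):
    """Binary cross-entropy averaged over the unmasked entries only."""
    p = np.clip(student_probs, eps, 1.0 - eps)
    bce = -(labels * np.log(p) + (1.0 - labels) * np.log(1.0 - p))
    return float((bce * mask).sum() / max(mask.sum(), 1.0))
```

In this sketch the masked entries simply contribute nothing to the student's loss, so a clip that probably contains an unlabeled event is neither rewarded nor penalised for that class. How small `discard_fraction` should be is left open here; the abstract only states that a tiny portion of the missing labels accounts for most of the gain.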


