A Closer Look at Weak Label Learning for Audio Events

04/24/2018
by   Ankit Shah, et al.
0

Audio content analysis in terms of sound events is an important research problem for a variety of applications. Recently, the development of weak labeling approaches for audio or sound event detection (AED) and availability of large scale weakly labeled dataset have finally opened up the possibility of large scale AED. However, a deeper understanding of how weak labels affect the learning for sound events is still missing from literature. In this work, we first describe a CNN based approach for weakly supervised training of audio events. The approach follows some basic design principle desirable in a learning method relying on weakly labeled audio. We then describe important characteristics, which naturally arise in weakly supervised learning of sound events. We show how these aspects of weak labels affect the generalization of models. More specifically, we study how characteristics such as label density and corruption of labels affects weakly supervised training for audio events. We also study the feasibility of directly obtaining weak labeled data from the web without any manual label and compare it with a dataset which has been manually labeled. The analysis and understanding of these factors should be taken into picture in the development of future weak label learning methods. Audioset, a large scale weakly labeled dataset for sound events is used in our experiments.

READ FULL TEXT

page 4

page 8

research
11/25/2018

Learning Sound Events From Webly Labeled Data

In the last couple of years, weakly labeled learning for sound events ha...
research
10/25/2019

SeCoST: Sequential Co-Supervision for Weakly Labeled Audio Event Detection

Weakly supervised learning algorithms are critical for scaling audio eve...
research
04/28/2022

Pseudo strong labels for large scale weakly supervised audio tagging

Large-scale audio tagging datasets inevitably contain imperfect labels, ...
research
02/05/2020

Limitations of weak labels for embedding and tagging

While many datasets and approaches in ambient sound analysis use weakly ...
research
12/11/2016

Multiple Instance Learning: A Survey of Problem Characteristics and Applications

Multiple instance learning (MIL) is a form of weakly supervised learning...
research
07/09/2017

Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data

The development of audio event recognition models requires labeled train...
research
09/21/2023

Frame Pairwise Distance Loss for Weakly-supervised Sound Event Detection

Weakly-supervised learning has emerged as a promising approach to levera...

Please sign up or login with your details

Forgot password? Click here to reset