Framework for evaluation of sound event detection in web videos

11/02/2017
by   Rohan Badlani, et al.
0

The largest source of sound events is web videos. Most videos lack sound event labels at segment level, however, a significant number of them do respond to text queries, from a match found to their metadata by the search engine. In this paper we explore the extent to which a search query could be used as the true label for the presence of sound events in the videos. For this, we developed a framework for large-scale sound event recognition on web videos. The framework crawls videos using search queries corresponding to 78 sound event labels drawn from three datasets. The datasets are used to train three classifiers, which were then run on 3.7 million video segments. We evaluated performance using the search query as the true label and compare it (on a subset) with human labeling. Both types exhibited close performance, to within 10 increased. Hence, our experiments show potential for using search query as a preliminary true label for sound events in web videos.

READ FULL TEXT
research
01/04/2019

Learning Sound Event Classifiers from Web Audio with Noisy Labels

As sound event classification moves towards larger datasets, issues of l...
research
07/19/2019

Sound Search by Text Description or Vocal Imitation?

Searching sounds by text labels is often difficult, as text descriptions...
research
05/26/2019

Technical Report of the Video Event Reconstruction and Analysis (VERA) System - Shooter Localization, Models, Interface, and Beyond

Every minute, hundreds of hours of video are uploaded to social media si...
research
05/10/2012

Hajj and Umrah Event Recognition Datasets

In this note, new Hajj and Umrah Event Recognition datasets (HUER) are p...
research
01/17/2018

NELS - Never-Ending Learner of Sounds

Sounds are essential to how humans perceive and interact with the world ...
research
10/26/2020

Improving Sound Event Detection Metrics: Insights from DCASE 2020

The ranking of sound event detection (SED) systems may be biased by assu...
research
08/26/2015

Real-time Top-K Predictive Query Processing over Event Streams

This paper addresses the problem of predicting the k events that are mos...

Please sign up or login with your details

Forgot password? Click here to reset