Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection

11/09/2018
by   Sandeep Kothinti, et al.
0

Sound event detection is a challenging task, especially for scenes with multiple simultaneous events. While event classification methods tend to be fairly accurate, event localization presents additional challenges, especially when large amounts of labeled data are not available. Task4 of the 2018 DCASE challenge presents an event detection task that requires accuracy in both segmentation and recognition of events while providing only weakly labeled training data. Supervised methods can produce accurate event labels but are limited in event segmentation when training data lacks event timestamps. On the other hand, unsupervised methods that model the acoustic properties of the audio can produce accurate event boundaries but are not guided by the characteristics of event classes and sound categories. We present a hybrid approach that combines an acoustic-driven event boundary detection and a supervised label inference using a deep neural network. This framework leverages benefits of both unsupervised and supervised methodologies and takes advantage of large amounts of unlabeled data, making it ideal for large-scale weakly labeled event detection. Compared to a baseline system, the proposed approach delivers a 15 benefits of the hybrid bottom-up, top-down approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
07/27/2018

Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments

This paper presents DCASE 2018 task 4. The task evaluates systems for th...
research
11/01/2018

Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data

Sound event detection (SED) is typically posed as a supervised learning ...
research
04/29/2019

Semi-supervised Acoustic Event Detection based on tri-training

This paper presents our work of training acoustic event detection (AED) ...
research
05/27/2021

Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures

Sound event detection is an important facet of audio tagging that aims t...
research
03/23/2021

Joint Weakly Supervised AT and AED Using Deep Feature Distillation and Adaptive Focal Loss

A good joint training framework is very helpful to improve the performan...
research
11/17/2022

Balanced Deep CCA for Bird Vocalization Detection

Event detection improves when events are captured by two different modal...
research
09/07/2020

A Hybrid Neuro-Symbolic Approach for Complex Event Processing

Training a model to detect patterns of interrelated events that form sit...

Please sign up or login with your details

Forgot password? Click here to reset