Frame Pairwise Distance Loss for Weakly-supervised Sound Event Detection

09/21/2023
by   Rui Tao, et al.
0

Weakly-supervised learning has emerged as a promising approach to leverage limited labeled data in various domains by bridging the gap between fully supervised methods and unsupervised techniques. Acquisition of strong annotations for detecting sound events is prohibitively expensive, making weakly supervised learning a more cost-effective and broadly applicable alternative. In order to enhance the recognition rate of the learning of detection of weakly-supervised sound events, we introduce a Frame Pairwise Distance (FPD) loss branch, complemented with a minimal amount of synthesized data. The corresponding sampling and label processing strategies are also proposed. Two distinct distance metrics are employed to evaluate the proposed approach. Finally, the method is validated on the standard DCASE dataset. The obtained experimental results corroborated the efficacy of this approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/27/2023

Defect detection using weakly supervised learning

In many real-world scenarios, obtaining large amounts of labeled data ca...
research
11/02/2020

Learning generic feature representation with synthetic data for weakly-supervised sound event detection by inter-frame distance loss

Due to the limitation of strong-labeled sound event detection data set, ...
research
04/24/2018

A Closer Look at Weak Label Learning for Audio Events

Audio content analysis in terms of sound events is an important research...
research
06/21/2021

Affinity Mixup for Weakly Supervised Sound Event Detection

The weakly supervised sound event detection problem is the task of predi...
research
03/27/2020

Voice activity detection in the wild via weakly supervised sound event detection

Traditional supervised voice activity detection (VAD) methods work well ...
research
03/08/2023

SoftMatch Distance: A Novel Distance for Weakly-Supervised Trend Change Detection in Bi-Temporal Images

General change detection (GCD) and semantic change detection (SCD) are c...
research
03/23/2021

Joint Weakly Supervised AT and AED Using Deep Feature Distillation and Adaptive Focal Loss

A good joint training framework is very helpful to improve the performan...

Please sign up or login with your details

Forgot password? Click here to reset