Domestic sound event detection by shift consistency mean-teacher training and adversarial domain adaptation

08/17/2022
by   Fang-Ching Chen, et al.
0

Semi-supervised learning and domain adaptation techniques have drawn increasing attention in the field of domestic sound event detection thanks to the availability of large amounts of unlabeled data and the relative ease to generate synthetic strongly-labeled data. In a previous work, several semi-supervised learning strategies were designed to boost the performance of a mean-teacher model. Namely, these strategies include shift consistency training (SCT), interpolation consistency training (ICT), and pseudo-labeling. However, adversarial domain adaptation (ADA) did not seem to improve the event detection accuracy further when we attempt to compensate for the domain gap between synthetic and real data. In this research, we empirically found that ICT tends to pull apart the distributions of synthetic and real data in t-SNE plots. Therefore, ICT is abandoned while SCT, in contrast, is applied to train both the student and the teacher models. With these modifications, the system successfully integrates with an ADA network, and we achieve 47.2 score on the DCASE 2020 task 4 dataset, which is 2.1 reported in the previous work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/20/2020

Grasping Detection Network with Uncertainty Estimation for Confidence-Driven Semi-Supervised Domain Adaptation

Data-efficient domain adaptation with only a few labelled data is desire...
research
04/25/2019

Exploring Object Relation in Mean Teacher for Cross-Domain Detection

Rendering synthetic data (e.g., 3D CAD-rendered images) to generate anno...
research
10/21/2021

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Sound event detection (SED), as a core module of acoustic environmental ...
research
08/06/2021

From Synthetic to Real: Image Dehazing Collaborating with Unlabeled Real Data

Single image dehazing is a challenging task, for which the domain shift ...
research
07/13/2022

Wakeword Detection under Distribution Shifts

We propose a novel approach for semi-supervised learning (SSL) designed ...
research
04/16/2022

Pushing the Performance Limit of Scene Text Recognizer without Human Annotation

Scene text recognition (STR) attracts much attention over the years beca...
research
03/27/2021

From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation

Animal pose estimation is an important field that has received increasin...

Please sign up or login with your details

Forgot password? Click here to reset