Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

09/04/2019
by   Paul Primus, et al.
0

Distribution mismatches between the data seen at training and at application time remain a major challenge in all application areas of machine learning. We study this problem in the context of machine listening (Task 1b of the DCASE 2019 Challenge). We propose a novel approach to learn domain-invariant classifiers in an end-to-end fashion by enforcing equal hidden layer representations for domain-parallel samples, i.e. time-aligned recordings from different recording devices. No classification labels are needed for our domain adaptation (DA) method, which makes the data collection process cheaper.

READ FULL TEXT

page 2

page 3

research
05/25/2021

Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices

Machine learning algorithms, when trained on audio recordings from a lim...
research
06/20/2023

On Frequency-Wise Normalizations for Better Recording Device Generalization in Audio Spectrogram Transformers

Varying conditions between the data seen at training and at application ...
research
10/26/2021

Towards Audio Domain Adaptation for Acoustic Scene Classification using Disentanglement Learning

The deployment of machine listening algorithms in real-life applications...
research
06/13/2023

Domain Information Control at Inference Time for Acoustic Scene Classification

Domain shift is considered a challenge in machine learning as it causes ...
research
10/18/2021

Adversarial Domain Adaptation with Paired Examples for Acoustic Scene Classification on Different Recording Devices

In classification tasks, the classification accuracy diminishes when the...
research
03/26/2021

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier

Domain mismatch is a noteworthy issue in acoustic event detection tasks,...
research
06/21/2021

Speech prosody and remote experiments: a technical report

The aim of this paper is twofold. First, we present a review of differen...

Please sign up or login with your details

Forgot password? Click here to reset