Device-Robust Acoustic Scene Classification via Impulse Response Augmentation

05/12/2023
by   Tobias Morocutti, et al.

The ability to generalize to a wide range of recording devices is a crucial performance factor for audio classification models. The characteristics of different types of microphones introduce distributional shifts in the digitized audio signals due to their varying frequency responses. If this domain shift is not taken into account during training, the model's performance could degrade severely when it is applied to signals recorded by unseen devices. In particular, training a model on audio signals recorded with a small number of different microphones can make generalization to unseen devices difficult. To tackle this problem, we convolve audio signals in the training set with pre-recorded device impulse responses (DIRs) to artificially increase the diversity of recording devices. We systematically study the effect of DIR augmentation on the task of Acoustic Scene Classification using CNNs and Audio Spectrogram Transformers. The results show that DIR augmentation in isolation performs similarly to the state-of-the-art method Freq-MixStyle. However, we also show that DIR augmentation and Freq-MixStyle are complementary, achieving a new state-of-the-art performance on signals recorded by devices unseen during training.
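The core operation described above, convolving a training waveform with a pre-recorded device impulse response, can be sketched as follows. This is a minimal illustration and not the authors' implementation. The function names `apply_dir` and `dir_augment`, the application probability `p`, the peak rescaling step, the 32 kHz sampling rate, and the synthetic decaying-noise impulse responses are all assumptions made for the example. In practice the bank would hold measured DIRs.

```python
import numpy as np


def apply_dir(waveform, impulse_response):
    """Convolve a mono waveform with one device impulse response (DIR)."""
    # Full linear convolution, truncated back to the input length.
    augmented = np.convolve(waveform, impulse_response, mode="full")[: len(waveform)]
    # Assumption: rescale to the original peak so the augmentation changes
    # the spectral coloring but not the overall level.
    peak = np.max(np.abs(augmented))
    if peak > 0:
        augmented = augmented * (np.max(np.abs(waveform)) / peak)
    return augmented


def dir_augment(waveform, dir_bank, p, rng):
    """With probability p (hypothetical parameter), apply a randomly chosen DIR."""
    if rng.random() < p:
        idx = rng.integers(len(dir_bank))
        return apply_dir(waveform, dir_bank[idx])
    return waveform


rng = np.random.default_rng(0)

# Stand-in for one second of training audio at an assumed 32 kHz rate.
waveform = rng.standard_normal(32000)

# Synthetic stand-ins for measured DIRs: short, exponentially decaying noise.
decay = np.exp(-np.arange(64) / 8.0)
dir_bank = [rng.standard_normal(64) * decay for _ in range(3)]

augmented = dir_augment(waveform, dir_bank, p=1.0, rng=rng)
```

The augmented waveform keeps the length of the input, so it can replace the original in a training batch before spectrogram extraction. Setting `p` below 1 leaves part of the training data in its original device condition.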


research
06/13/2023

Domain Information Control at Inference Time for Acoustic Scene Classification

Domain shift is considered a challenge in machine learning as it causes ...
research
11/18/2020

CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification

Acoustic Scene Classification (ASC) aims to classify the environment in ...
research
06/20/2023

On Frequency-Wise Normalizations for Better Recording Device Generalization in Audio Spectrogram Transformers

Varying conditions between the data seen at training and at application ...
research
12/09/2021

On The Effect Of Coding Artifacts On Acoustic Scene Classification

Previous DCASE challenges contributed to an increase in the performance ...
research
05/25/2021

Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices

Machine learning algorithms, when trained on audio recordings from a lim...
research
07/19/2021

Over-Parameterization and Generalization in Audio Classification

Convolutional Neural Networks (CNNs) have been dominating classification...
research
07/16/2020

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

In this technical report, we present a joint effort of four groups, name...
