Over-Parameterization and Generalization in Audio Classification

07/19/2021
by   Khaled Koutini, et al.
4

Convolutional Neural Networks (CNNs) have been dominating classification tasks in various domains, such as machine vision, machine listening, and natural language processing. In machine listening, while generally exhibiting very good generalization capabilities, CNNs are sensitive to the specific audio recording device used, which has been recognized as a substantial problem in the acoustic scene classification (DCASE) community. In this study, we investigate the relationship between over-parameterization of acoustic scene classification models, and their resulting generalization abilities. Specifically, we test scaling CNNs in width and depth, under different conditions. Our results indicate that increasing width improves generalization to unseen devices, even without an increase in the number of parameters.

READ FULL TEXT

page 2

page 4

page 6

page 7

page 8

research
06/13/2023

Domain Information Control at Inference Time for Acoustic Scene Classification

Domain shift is considered a challenge in machine learning as it causes ...
research
11/18/2020

CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification

Acoustic Scene Classification (ASC) aims to classify the environment in ...
research
05/12/2023

Device-Robust Acoustic Scene Classification via Impulse Response Augmentation

The ability to generalize to a wide range of recording devices is a cruc...
research
06/24/2022

Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification

While using two-dimensional convolutional neural networks (2D-CNNs) in i...
research
06/20/2023

On Frequency-Wise Normalizations for Better Recording Device Generalization in Audio Spectrogram Transformers

Varying conditions between the data seen at training and at application ...
research
10/14/2019

Acoustic Scene Classification Based on a Large-margin Factorized CNN

In this paper, we present an acoustic scene classification framework bas...
research
05/26/2020

Is deeper better? It depends on locality of relevant features

It has been recognized that a heavily overparameterized artificial neura...

Please sign up or login with your details

Forgot password? Click here to reset