The bag-of-frames approach: a not so sufficient model for urban soundscapes

12/11/2014
by   Mathieu Lagrange, et al.
0

The "bag-of-frames" approach (BOF), which encodes audio signals as the long-term statistical distribution of short-term spectral features, is commonly regarded as an effective and sufficient way to represent environmental sound recordings (soundscapes) since its introduction in an influential 2007 article. The present paper describes a concep-tual replication of this seminal article using several new soundscape datasets, with results strongly questioning the adequacy of the BOF approach for the task. We show that the good accuracy originally re-ported with BOF likely result from a particularly thankful dataset with low within-class variability, and that for more realistic datasets, BOF in fact does not perform significantly better than a mere one-point av-erage of the signal's features. Soundscape modeling, therefore, may not be the closed case it was once thought to be. Progress, we ar-gue, could lie in reconsidering the problem of considering individual acoustical events within each soundscape.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2018

Speaker Recognition using Deep Belief Networks

Short time spectral features such as mel frequency cepstral coefficients...
research
07/14/2021

Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection

Active speaker detection (ASD) seeks to detect who is speaking in a visu...
research
04/10/2019

Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events

In this paper, we propose a new strategy for acoustic scene classificati...
research
11/02/2017

Audio Set classification with attention model: A probabilistic perspective

This paper investigate the classification of the Audio Set dataset. Audi...
research
11/03/2022

Convolution channel separation and frequency sub-bands aggregation for music genre classification

In music, short-term features such as pitch and tempo constitute long-te...
research
12/11/2021

Overview of The MediaEval 2021 Predicting Media Memorability Task

This paper describes the MediaEval 2021 Predicting Media Memorabilitytas...

Please sign up or login with your details

Forgot password? Click here to reset