A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification

06/19/2018
by Eduardo Fonseca, et al.

Acoustic Scene Classification systems have traditionally relied on hand-crafted audio features that are fed to a classifier. The current trend is to adopt data-driven techniques, e.g., deep learning, where audio representations are learned from data. In this paper, we propose a system that consists of a simple fusion of one method of each type: a deep learning approach, where log-scaled mel-spectrograms are input to a convolutional neural network, and a feature engineering approach, where a collection of hand-crafted features is input to a gradient boosting machine. We first show that the two methods provide complementary information to some extent. Then, we use a simple late fusion strategy to combine them. We report the classification accuracy of each method individually and of the combined system on the TUT Acoustic Scenes 2017 dataset. The proposed fused system outperforms each of the individual methods and attains a classification accuracy of 72.8% on the evaluation set, improving on the baseline system by 11.8%.
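
The abstract does not state which late fusion rule is used, so the following is only a minimal sketch of one common choice: a weighted average of the per-class probabilities produced by the two models for the same clip. The function names, the `weight_a` parameter, and the example probabilities are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def late_fusion(probs_a: Sequence[float],
                probs_b: Sequence[float],
                weight_a: float = 0.5) -> List[float]:
    """Fuse two per-class probability vectors by weighted averaging.

    Assumption: both models output probabilities over the same classes
    in the same order. This is an illustrative rule, not necessarily
    the one used by the authors.
    """
    if len(probs_a) != len(probs_b):
        raise ValueError("both models must score the same set of classes")
    if not 0.0 <= weight_a <= 1.0:
        raise ValueError("weight_a must lie in [0, 1]")
    return [weight_a * a + (1.0 - weight_a) * b
            for a, b in zip(probs_a, probs_b)]


def predict(probs: Sequence[float]) -> int:
    """Return the index of the most probable class."""
    return max(range(len(probs)), key=lambda i: probs[i])


# Hypothetical outputs for one audio clip over three scene classes.
cnn_probs = [0.50, 0.40, 0.10]  # e.g. CNN on log-scaled mel-spectrograms
gbm_probs = [0.20, 0.70, 0.10]  # e.g. gradient boosting on hand-crafted features

fused = late_fusion(cnn_probs, gbm_probs)
print(predict(cnn_probs), predict(gbm_probs), predict(fused))
```

In this toy case the two models disagree, and the fused decision follows the more confident one. With real models, `weight_a` would typically be tuned on a held-out development set.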

