Robust Audio Event Recognition with 1-Max Pooling Convolutional Neural Networks

04/21/2016
by   Lars Hertel, et al.
0

We present in this paper a simple, yet efficient convolutional neural network (CNN) architecture for robust audio event recognition. Opposing to deep CNN architectures with multiple convolutional and pooling layers topped up with multiple fully connected layers, the proposed network consists of only three layers: convolutional, pooling, and softmax layer. Two further features distinguish it from the deep architectures that have been proposed for the task: varying-size convolutional filters at the convolutional layer and 1-max pooling scheme at the pooling layer. In intuition, the network tends to select the most discriminative features from the whole audio signals for recognition. Our proposed CNN not only shows state-of-the-art performance on the standard task of robust audio event recognition but also outperforms other deep architectures up to 4.5 to 76.3

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2016

Classifying Variable-Length Audio Files with All-Convolutional Networks and Masked Global Pooling

We trained a deep all-convolutional neural network with masked global po...
research
08/16/2020

Adaptive Signal Variances: CNN Initialization Through Modern Architectures

Deep convolutional neural networks (CNN) have achieved the unwavering co...
research
12/21/2014

Striving for Simplicity: The All Convolutional Net

Most modern convolutional neural networks (CNNs) used for object recogni...
research
04/17/2015

Color Constancy Using CNNs

In this work we describe a Convolutional Neural Network (CNN) to accurat...
research
03/10/2023

Enhancing the success rates by performing pooling decisions adjacent to the output layer

Learning classification tasks of (2^nx2^n) inputs typically consist of ≤...
research
07/07/2012

Object Recognition with Multi-Scale Pyramidal Pooling Networks

We present a Multi-Scale Pyramidal Pooling Network, featuring a novel py...
research
02/09/2017

Effective face landmark localization via single deep network

In this paper, we propose a novel face alignment method using single dee...

Please sign up or login with your details

Forgot password? Click here to reset