End-to-End Auditory Object Recognition via Inception Nucleus

05/25/2020
by Mohammad K. Ebrahimpour, et al.

Machine learning approaches to auditory object recognition are traditionally based on engineered features such as those derived from the spectrum or cepstrum. More recently, end-to-end classification systems for image and auditory recognition have been developed that learn features jointly with classification, resulting in improved classification accuracy. In this paper, we propose a novel end-to-end deep neural network that maps raw waveform inputs to sound class labels. Our network includes an "inception nucleus" that optimizes the size of convolutional filters on the fly, which dramatically reduces engineering effort. Classification results compared favorably against current state-of-the-art approaches, besting them by 10.4 percentage points on the UrbanSound8K dataset. Analyses of learned representations revealed that filters in the earlier hidden layers learned wavelet-like transforms to extract features that were informative for classification.
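The abstract does not give the layer details of the inception nucleus, so the following is only a minimal sketch of the general idea it describes: several 1-D convolutions with different filter widths run in parallel over the raw waveform, and their outputs are stacked so that later layers can weight whichever width is most useful. The function names, the kernel widths (4, 8, 16), the two filters per branch, and the random (untrained) filter values are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def conv1d_same(signal, kernel):
    """Cross-correlate a 1-D signal with a kernel; output length equals input length."""
    # np.convolve flips the kernel, so flip it first to get cross-correlation.
    return np.convolve(signal, kernel[::-1], mode="same")


def inception_nucleus(signal, filter_banks):
    """Hypothetical multi-width block: apply each bank in parallel, then stack channels.

    filter_banks: list of arrays, each of shape (num_filters, kernel_width).
    Returns an array of shape (total_filters, len(signal)).
    """
    branches = []
    for bank in filter_banks:
        out = np.stack([conv1d_same(signal, k) for k in bank])
        branches.append(np.maximum(out, 0.0))  # ReLU non-linearity (assumed)
    return np.concatenate(branches, axis=0)


rng = np.random.default_rng(0)
waveform = rng.standard_normal(1600)  # stand-in for a raw audio clip
kernel_widths = (4, 8, 16)            # assumed widths, not the paper's values
filter_banks = [0.1 * rng.standard_normal((2, w)) for w in kernel_widths]

features = inception_nucleus(waveform, filter_banks)
print(features.shape)  # (6, 1600): 3 branches x 2 filters, same length as input
```

In a trained network the filter values would be learned by backpropagation together with the classifier, which is what lets the model, rather than the engineer, settle on the effective filter size.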

research
03/25/2018

Learning Environmental Sounds with Multi-scale Convolutional Neural Network

Deep learning has dramatically improved the performance of sounds recogn...
research
06/27/2018

Deep Steganalysis: End-to-End Learning with Supervisory Information beyond Class Labels

Recently, deep learning has shown its power in steganalysis. However, th...
research
05/25/2020

InfantNet: A Deep Neural Network for Analyzing Infant Vocalizations

Acoustic analyses of infant vocalizations are valuable for research on s...
research
01/28/2016

Towards the Design of an End-to-End Automated System for Image and Video-based Recognition

Over many decades, researchers working in object recognition have longed...
research
04/03/2019

End-to-end Binaural Sound Localisation from the Raw Waveform

A novel end-to-end binaural sound localisation approach is proposed whic...
research
06/08/2008

Fast Wavelet-Based Visual Classification

We investigate a biologically motivated approach to fast visual classifi...
research
06/09/2020

End-to-end User Recognition using Touchscreen Biometrics

We study the touchscreen data as behavioural biometrics. The goal was to...
