Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes

11/04/2017
by   Anurag Kumar, et al.
0

In this work we propose approaches to effectively transfer knowledge from weakly labeled web audio data. We first describe a convolutional neural network (CNN) based framework for sound event detection and classification using weakly labeled audio data. Our model trains efficiently from audios of variable lengths which; hence, it is well suited for transfer learning. We then propose methods to learn representations using this model which can be effectively used for solving the target task. We study both transductive and inductive transfer learning tasks, showing the effectiveness of our methods for both domain and task adaptation. We show that even off-the-shelf representations using the proposed CNN model generalizes well enough to reach human level accuracy on ESC-50 sound events dataset. We further use them for acoustic scene classification task and once again show that our proposed approaches suits well for this task as well. Moreover, we show that our methods are helpful in capturing semantic meanings and relations as well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2018

Learning Sound Events From Webly Labeled Data

In the last couple of years, weakly labeled learning for sound events ha...
research
10/01/2017

Large-scale weakly supervised audio classification using gated convolutional neural network

In this paper, we present a gated convolutional neural network and a tem...
research
03/15/2023

Transfer Learning Based Diagnosis and Analysis of Lung Sound Aberrations

With the development of computer -systems that can collect and analyze e...
research
06/30/2020

A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

An important problem in machine auditory perception is to recognize and ...
research
09/21/2020

Detecting Acoustic Events Using Convolutional Macaron Net

In this paper, we propose to address the issue of the lack of strongly l...
research
10/23/2017

Listening to the World Improves Speech Command Recognition

We study transfer learning in convolutional network architectures applie...
research
06/28/2023

Improving Primate Sounds Classification using Binary Presorting for Deep Learning

In the field of wildlife observation and conservation, approaches involv...

Please sign up or login with your details

Forgot password? Click here to reset