Deep Learning for Audio Transcription on Low-Resource Datasets

07/10/2018
by   Veronica Morfi, et al.
0

In training a deep learning system to perform audio transcription, two practical problems may arise. Firstly, most datasets are weakly labelled, having only a list of events present in each recording without any temporal information for training. Secondly, deep neural networks need a very large amount of labelled training data to achieve good quality performance, yet in practice it is difficult to collect enough samples for most classes of interest. In this paper, we propose factorising the final task of audio transcription into multiple intermediate tasks in order to improve the training performance when dealing with this kind of low-resource datasets. We evaluate three data-efficient approaches of training a stacked convolutional and recurrent neural network for the intermediate tasks. Our results show that different methods of training have different advantages and disadvantages.

READ FULL TEXT
research
07/10/2018

Deep Learning on Low-Resource Datasets

In training a deep learning system to perform audio transcription, two p...
research
07/17/2018

Data-Efficient Weakly Supervised Learning for Low-Resource Audio Event Detection Using Deep Learning

We propose a method to perform audio event detection under the common co...
research
11/04/2017

Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

In recent years, neural networks have proven to be effective in Chinese ...
research
07/10/2021

Variational Information Bottleneck for Effective Low-resource Audio Classification

Large-scale deep neural networks (DNNs) such as convolutional neural net...
research
11/04/2018

Handwriting Recognition in Low-resource Scripts using Adversarial Learning

Handwritten Word Recognition and Spotting is a challenging field dealing...
research
02/18/2022

Predicting Sex and Stroke Success – Computer-aided Player Grunt Analysis in Tennis Matches

Professional athletes increasingly use automated analysis of meta- and s...
research
04/24/2017

k-FFNN: A priori knowledge infused Feed-forward Neural Networks

Recurrent neural network (RNN) are being extensively used over feed-forw...

Please sign up or login with your details

Forgot password? Click here to reset