Audio segmentation based on melodic style with hand-crafted features and with convolutional neural networks

07/30/2018
by   Amruta Vidwans, et al.
0

We investigate methods for the automatic labeling of the taan section, a prominent structural component of the Hindustani Khayal vocal concert. The taan contains improvised raga-based melody rendered in the highly distinctive style of rapid pitch and energy modulations of the voice. We propose computational features that capture these specific high-level characteristics of the singing voice in the polyphonic context. The extracted local features are used to achieve classification at the frame level via a trained multilayer perceptron (MLP) network, followed by grouping and segmentation based on novelty detection. We report high accuracies with reference to musician annotated taan sections across artists and concerts. We also compare the performance obtained by the compact specialized features with frame-level classification via a convolutional neural network (CNN) operating directly on audio spectrogram patches for the same task. While the relatively simple architecture we experiment with does not quite attain the classification accuracy of the hand-crafted features, it provides for a performance well above chance with interesting insights about the ability of the network to learn discriminative features effectively from labeled data.

READ FULL TEXT

page 2

page 6

research
05/05/2019

Multivariate Time Series Classification using Dilated Convolutional Neural Network

Multivariate time series classification is a high value and well-known p...
research
01/28/2017

Treelogy: A Novel Tree Classifier Utilizing Deep and Hand-crafted Representations

We propose a novel tree classification system called Treelogy, that fuse...
research
06/29/2017

Audio Spectrogram Representations for Processing with Convolutional Neural Networks

One of the decisions that arise when designing a neural network for any ...
research
06/19/2018

A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification

In the past, Acoustic Scene Classification systems have been based on ha...
research
11/16/2022

Structural Segmentation and Labeling of Tabla Solo Performances

Tabla is a North Indian percussion instrument used as an accompaniment a...
research
12/19/2018

Multitask Painting Categorization by Deep Multibranch Neural Network

In this work we propose a new deep multibranch neural network to solve t...
research
11/02/2018

Convolutional Neural Networks for Epileptic Seizure Prediction

Epilepsy is the most common neurological disorder and an accurate foreca...

Please sign up or login with your details

Forgot password? Click here to reset