Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms

07/24/2021
by   Xuan Shi, et al.
0

Timbre representations of musical instruments, essential for diverse applications such as musical audio synthesis and separation, might be learned as bottleneck features from an instrumental recognition model. Given the similarities between speaker recognition and musical instrument recognition, in this paper, we investigate how to adapt successful speaker recognition algorithms to musical instrument recognition to learn meaningful instrumental timbre representations. To address the mismatch between musical audio and models devised for speech, we introduce a group of trainable filters to generate proper acoustic features from input raw waveforms, making it easier for a model to be optimized in an input-agnostic and end-to-end manner. Through experiments on both the NSynth and RWC databases in both musical instrument closed-set identification and open-set verification scenarios, the modified speaker recognition model was capable of generating discriminative embeddings for instrument and instrument-family identities. We further conducted extensive experiments to characterize the encoded information in learned timbre embeddings.

READ FULL TEXT

page 1

page 4

page 6

page 7

research
05/03/2021

Deep Neural Network for Musical Instrument Recognition using MFCCs

The task of efficient automatic music classification is of vital importa...
research
03/30/2018

Conditional End-to-End Audio Transforms

We present an end-to-end method for transforming audio from one style to...
research
11/30/2019

Predominant Musical Instrument Classification based on Spectral Features

This work aims to examine one of the cornerstone problems of Musical Ins...
research
08/08/2021

Deep Single Shot Musical Instrument Identification using Scalograms

Musical Instrument Identification has for long had a reputation of being...
research
01/21/2021

Soloist: Generating Mixed-Initiative Tutorials from Existing Guitar Instructional Videos Through Audio Processing

Learning musical instruments using online instructional videos has becom...
research
02/13/2021

Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms

Sound Event Detection and Audio Classification tasks are traditionally a...
research
07/11/2023

Musical Excellence of Mridangam: an introductory review

This is an introductory review of Musical Excellence of Mridangam by Dr....

Please sign up or login with your details

Forgot password? Click here to reset