Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data

04/09/2020
by Xulong Zhang, et al.

Models for music artist classification have usually operated in the frequency domain, where the input audio samples are first processed by a spectral transformation. The WaveNet architecture was originally designed for speech and music generation. In this paper, we propose an end-to-end architecture in the time domain for this task. We introduce a WaveNet classifier that models features directly from the raw audio waveform. The WaveNet takes the waveform as input, and several subsequent downsampling layers discriminate which artist the input belongs to. In addition, the proposed method is applied to singer identification. The best-performing model obtains an average F1 score of 0.854 on the Artist20 benchmark dataset, which is a significant improvement over related works. To show the effectiveness of the feature learning in the proposed method, the bottleneck layer of the model is visualized.
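The pipeline described in the abstract (raw waveform in, dilated causal convolutions, downsampling, then a class decision) can be illustrated with a toy sketch in plain Python. This is not the authors' model: the abstract gives no layer counts, kernel sizes, or activation details, so the function names, the `tanh` nonlinearity, the average pooling, the single scalar feature, and all weights below are assumptions chosen only to make the data flow concrete.

```python
import math


def causal_dilated_conv(x, weights, dilation):
    """1-D causal convolution: y[t] = sum_k weights[k] * x[t - k * dilation].

    Samples before the start of the signal are treated as zero, so y[t]
    never depends on future samples.
    """
    y = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation
            if idx >= 0:
                acc += w * x[idx]
        y.append(acc)
    return y


def receptive_field(kernel_size, dilations):
    """Number of input samples seen by one output of the stacked layers."""
    return 1 + (kernel_size - 1) * sum(dilations)


def average_pool(x, factor):
    """Downsample by averaging non-overlapping windows of `factor` samples."""
    return [sum(x[i:i + factor]) / factor
            for i in range(0, len(x) - factor + 1, factor)]


def classify(waveform, conv_weights, dilations, pool_factor, class_weights):
    """Toy classifier (hypothetical): conv stack -> pooling -> linear scores."""
    h = waveform
    for w, d in zip(conv_weights, dilations):
        h = [math.tanh(v) for v in causal_dilated_conv(h, w, d)]
    h = average_pool(h, pool_factor)
    feature = sum(h) / len(h)  # global average as a single scalar feature
    scores = [cw * feature + cb for cw, cb in class_weights]
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical usage: two "artists", one convolution layer, made-up weights.
label = classify([0.5] * 8, [[1.0, 0.0]], [1], 2, [(1.0, 0.0), (-1.0, 0.0)])
```

Doubling the dilation at each layer is what lets a WaveNet-style stack cover long stretches of audio with few layers: with kernel size 2 and dilations 1, 2, 4, 8, `receptive_field` gives 16 samples from only four layers.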


research
04/08/2019

Audio Classification of Bit-Representation Waveform

This paper investigates waveform representation for audio signal classif...
research
10/27/2019

Transferring neural speech waveform synthesizers to musical instrument sounds generation

Recent neural waveform synthesizers such as WaveNet, WaveGlow, and the n...
research
01/14/2019

Music Artist Classification with Convolutional Recurrent Neural Networks

Previous attempts at music artist classification use frame-level audio f...
research
02/09/2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

In recent years, there has been an increased popularity in image and spe...
research
05/24/2022

Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features

Metaverse is an interactive world that combines reality and virtuality, ...
research
02/20/2022

It's Raw! Audio Generation with State-Space Models

Developing architectures suitable for modeling raw audio is a challengin...
research
07/20/2023

Progressive distillation diffusion for raw music generation

This paper aims to apply a new deep learning approach to the task of gen...
