PrimaDNN': A Characteristics-aware DNN Customization for Singing Technique Detection

06/25/2023
by Yuya Yamamoto, et al.

Professional vocalists modulate their voice timbre or pitch to make their vocal performance more expressive. Such fluctuations are called singing techniques. Automatic detection of singing techniques from audio tracks can help us understand how each singer expresses a performance, yet it is difficult because of the wide variety of singing techniques. A deep neural network (DNN) model can handle such variety; however, taking the characteristics of the data into account may further improve the performance of singing technique detection. In this paper, we propose PrimaDNN, a CRNN model with characteristics-oriented improvements. The features of the model are: 1) an input feature representation based on auxiliary pitch information and multi-resolution mel spectrograms, and 2) a convolution module based on squeeze-and-excitation (SENet) and instance normalization. In J-POP singing technique detection, PrimaDNN achieved the best result of 44.9, and we found that the contribution of each component varies depending on the type of singing technique.
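To make the two building blocks named in the abstract more concrete, the following is a minimal pure-Python sketch of instance normalization and a squeeze-and-excitation gate applied to a small multi-channel feature map. It is not the authors' implementation: the function names (`instance_norm`, `dense`, `squeeze_excite`), the nested-list layout `[channel][freq][time]`, and the weight shapes are all assumptions chosen for illustration, and the abstract does not say how PrimaDNN orders or combines these operations.

```python
import math


def instance_norm(x, eps=1e-5):
    """Normalize each channel of ONE example to zero mean and unit variance.

    x is a list of channels; each channel is a 2-D list [freq][time].
    (Hypothetical layout, chosen only for this sketch.)
    """
    out = []
    for ch in x:
        vals = [v for row in ch for v in row]
        mean = sum(vals) / len(vals)
        var = sum((v - mean) ** 2 for v in vals) / len(vals)
        scale = 1.0 / math.sqrt(var + eps)
        out.append([[(v - mean) * scale for v in row] for row in ch])
    return out


def dense(vec, weights, bias):
    """Fully connected layer: one row of weights per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def squeeze_excite(x, w1, b1, w2, b2):
    """Squeeze-and-excitation: rescale each channel by a learned gate.

    w1/b1 map C channels to a smaller hidden size; w2/b2 map back to C.
    Returns the rescaled feature map and the per-channel gates.
    """
    # Squeeze: global average over frequency and time for each channel.
    squeezed = [sum(v for row in ch for v in row) / sum(len(row) for row in ch)
                for ch in x]
    # Excitation: bottleneck with ReLU, then sigmoid gates in (0, 1).
    hidden = [max(0.0, h) for h in dense(squeezed, w1, b1)]
    gates = [1.0 / (1.0 + math.exp(-g)) for g in dense(hidden, w2, b2)]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [[[v * g for v in row] for row in ch] for ch, g in zip(x, gates)]
    return scaled, gates
```

For a two-channel input, `w1` would have shape 1×2 and `w2` shape 2×1 when the bottleneck has a single hidden unit. Note that instance normalization drives each channel's mean to zero, so in a real network a convolution and a nonlinearity would normally sit between the normalization and the squeeze step; otherwise the squeezed vector carries no information.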