Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks

09/09/2020
by   Helena Cuesta, et al.
0

This paper addresses the extraction of multiple F0 values from polyphonic and a cappella vocal performances using convolutional neural networks (CNNs). We address the major challenges of ensemble singing, i.e., all melodic sources are vocals and singers sing in harmony. We build upon an existing architecture to produce a pitch salience function of the input signal, where the harmonic constant-Q transform (HCQT) and its associated phase differentials are used as an input representation. The pitch salience function is subsequently thresholded to obtain a multiple F0 estimation output. For training, we build a dataset that comprises several multi-track datasets of vocal quartets with F0 annotations. This work proposes and evaluates a set of CNNs for this task in diverse scenarios and data configurations, including recordings with additional reverb. Our models outperform a state-of-the-art method intended for the same music genre when evaluated with an increased F0 resolution, as well as a general-purpose method for multi-F0 estimation. We conclude with a discussion on future research directions.

READ FULL TEXT

page 1

page 3

research
08/05/2015

Single and Multiple Illuminant Estimation Using Convolutional Neural Networks

In this paper we present a method for the estimation of the color of the...
research
07/15/2020

An Ensemble of Convolutional Neural Networks for Audio Classification

In this paper, ensembles of classifiers that exploit several data augmen...
research
05/14/2021

Post-processing Multi-Model Medium-Term Precipitation Forecasts Using Convolutional Neural Networks

The goal of this study was to improve the post-processing of precipitati...
research
07/12/2017

Score-informed syllable segmentation for a cappella singing voice with convolutional neural networks

This paper introduces a new score-informed method for the segmentation o...
research
12/07/2018

Harmonic Networks: Integrating Spectral Information into CNNs

Convolutional neural networks (CNNs) learn filters in order to capture l...
research
05/08/2016

Chained Predictions Using Convolutional Neural Networks

In this paper, we present an adaptation of the sequence-to-sequence mode...
research
04/23/2018

ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

In this paper, we address a relatively new task: prediction of ASR perfo...

Please sign up or login with your details

Forgot password? Click here to reset