CREPE: A Convolutional Representation for Pitch Estimation

02/17/2018
by Jong Wook Kim, et al.

The task of estimating the fundamental frequency of a monophonic sound recording, also known as pitch tracking, is central to audio processing, with applications in speech processing and music information retrieval. To date, the best-performing techniques, such as the pYIN algorithm, are based on a combination of DSP pipelines and heuristics. While such techniques perform very well on average, there remain many cases in which they fail to estimate the pitch correctly. In this paper, we propose a data-driven pitch tracking algorithm, CREPE, which is based on a deep convolutional neural network that operates directly on the time-domain waveform. We show that the proposed model produces state-of-the-art results, performing as well as or better than pYIN. Furthermore, we evaluate the model's generalizability in terms of noise robustness. A pre-trained version of CREPE is made freely available as an open-source Python module for easy application.
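To make the pitch tracking task concrete, the following is a minimal sketch of a heuristic estimator that picks the strongest autocorrelation peak in a single frame of a time-domain waveform. It is neither CREPE nor pYIN; it only illustrates the kind of DSP baseline the abstract contrasts with a learned model. The function name `estimate_f0`, the search range `fmin`/`fmax`, and the 16 kHz test tone are assumptions made for illustration.

```python
import math


def estimate_f0(frame, sample_rate, fmin=50.0, fmax=1000.0):
    """Return a rough f0 estimate in Hz for one frame, or None if no peak is found.

    Hypothetical helper: it scans integer lags and keeps the one with the
    largest (unnormalized) autocorrelation, so resolution is limited to
    sample_rate / lag and no voicing decision is made.
    """
    min_lag = max(1, int(sample_rate / fmax))
    max_lag = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag, best_score = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation: fewer terms at larger lags, which
        # favors the first period over its integer multiples.
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else None


# Usage: a synthetic 250 Hz sine sampled at 16 kHz (assumed test signal).
sample_rate = 16000
frame = [math.sin(2 * math.pi * 250.0 * n / sample_rate) for n in range(1024)]
print(estimate_f0(frame, sample_rate))
```

A heuristic like this is prone to octave errors and degrades quickly in noise, which is the kind of failure case that motivates a data-driven approach operating directly on the waveform.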
