Metrical-accent Aware Vocal Onset Detection in Polyphonic Audio

07/19/2017
by   Georgi Dzhambazov, et al.
0

The goal of this study is the automatic detection of onsets of the singing voice in polyphonic audio recordings. Starting with a hypothesis that the knowledge of the current position in a metrical cycle (i.e. metrical accent) can improve the accuracy of vocal note onset detection, we propose a novel probabilistic model to jointly track beats and vocal note onsets. The proposed model extends a state of the art model for beat and meter tracking, in which a-priori probability of a note at a specific metrical accent interacts with the probability of observing a vocal note onset. We carry out an evaluation on a varied collection of multi-instrument datasets from two music traditions (English popular music and Turkish makam) with different types of metrical cycles and singing styles. Results confirm that the proposed model reasonably improves vocal note onset detection accuracy compared to a baseline model that does not take metrical position into account.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2023

A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription

Note-level automatic music transcription is one of the most representati...
research
11/05/2020

From Note-Level to Chord-Level Neural Network Models for Voice Separation in Symbolic Music

Music is often experienced as a progression of concurrent streams of not...
research
05/26/2023

A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation

Recently, multi-instrument music generation has become a hot topic. Diff...
research
10/05/2020

High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times

Automatic music transcription (AMT) is the task of transcribing audio re...
research
10/20/2020

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy

Most of the state-of-the-art automatic music transcription (AMT) models ...
research
07/01/2017

An Augmented Lagrangian Method for Piano Transcription using Equal Loudness Thresholding and LSTM-based Decoding

A central goal in automatic music transcription is to detect individual ...
research
03/11/2021

Topological Data Analysis of Korean Music in Jeongganbo: A Cycle Structure

Jeongganbo is a unique music representation invented by Sejong the Great...

Please sign up or login with your details

Forgot password? Click here to reset