Genre-conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music

04/07/2022
by Xiaoxue Gao, et al.

Lyrics transcription of polyphonic music is challenging not only because the singing vocals are corrupted by the background music, but also because the background music and the singing style vary across music genres, such as pop, metal, and hip hop, and these variations affect the intelligibility of the lyrics in different ways. In this work, we propose to transcribe the lyrics of polyphonic music using a novel genre-conditioned network. The proposed network adopts pre-trained model parameters and inserts genre adapters between its layers to capture genre peculiarities for lyrics-genre pairs, so that only lightweight genre-specific parameters need to be trained. Our experiments show that the proposed genre-conditioned network outperforms existing lyrics transcription systems.
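
The idea of keeping pre-trained layers fixed and adding small genre-specific adapters between them can be illustrated with a minimal sketch. The abstract does not specify the adapter design, so everything here is an assumption made for illustration: the residual bottleneck form, the layer sizes, the zero-initialised up-projection, and the names `Adapter` and `GenreConditionedEncoder` are hypothetical and are not taken from the paper.

```python
import random

# Genre labels taken from the abstract's examples; the real label set is an assumption.
GENRES = ("pop", "metal", "hiphop")


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def relu(x):
    return [max(0.0, v) for v in x]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class Adapter:
    """Hypothetical residual bottleneck adapter: x + W_up * relu(W_down * x)."""

    def __init__(self, dim, bottleneck, rng):
        self.down = rand_matrix(bottleneck, dim, rng)
        # Zero-initialised up-projection, so the adapter starts as the identity.
        self.up = [[0.0] * bottleneck for _ in range(dim)]

    def __call__(self, x):
        h = relu(matvec(self.down, x))
        return [a + b for a, b in zip(x, matvec(self.up, h))]

    def num_params(self):
        return sum(len(r) for r in self.down) + sum(len(r) for r in self.up)


class GenreConditionedEncoder:
    """Shared frozen layers with one small adapter per (layer, genre)."""

    def __init__(self, dim=8, n_layers=2, bottleneck=2, seed=0):
        rng = random.Random(seed)
        # Stand-ins for pre-trained weights; these would stay frozen in training.
        self.frozen_layers = [rand_matrix(dim, dim, rng) for _ in range(n_layers)]
        # Only these genre-specific parameters would be updated.
        self.adapters = {
            g: [Adapter(dim, bottleneck, rng) for _ in range(n_layers)]
            for g in GENRES
        }

    def forward(self, x, genre):
        for W, adapter in zip(self.frozen_layers, self.adapters[genre]):
            x = relu(matvec(W, x))   # shared pre-trained layer
            x = adapter(x)           # genre-specific correction
        return x

    def trainable_params(self, genre):
        return sum(a.num_params() for a in self.adapters[genre])

    def frozen_params(self):
        return sum(len(row) for W in self.frozen_layers for row in W)
```

In this toy configuration each genre adds 64 adapter parameters on top of 128 shared ones; in a realistic acoustic model the shared part would be far larger, which is what makes the per-genre parameters comparatively lightweight.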

research
09/23/2019

Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background Music Help?

Background music affects lyrics intelligibility of singing vocals in a m...
research
09/23/2019

Automatic Lyrics Transcription in Polyphonic Music: Does Background Music Help?

Background music affects lyrics intelligibility of singing vocals in a m...
research
06/25/2019

Acoustic Modeling for Automatic Lyrics-to-Audio Alignment

Automatic lyrics to polyphonic audio alignment is a challenging task not...
research
07/15/2022

PoLyScriber: Integrated Training of Extractor and Lyrics Transcriber for Lyrics Transcription in Polyphonic Music

Lyrics transcription of polyphonic music is challenging as the backgroun...
research
11/15/2022

Music Instrument Classification Reprogrammed

The performance of approaches to Music Instrument Classification, a popu...
research
01/28/2022

Dual Learning Music Composition and Dance Choreography

Music and dance have always co-existed as pillars of human activities, c...
research
10/11/2022

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability

In this paper we propose a novel generative approach, DiffRoll, to tackl...
