Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music

03/25/2022
by Sangeun Kum, et al.

Lack of large-scale note-level labeled data is the major obstacle to singing transcription from polyphonic music. We address this by using pseudo labels that vocal pitch estimation models produce for unlabeled data. The proposed method first converts the frame-level pseudo labels to note-level labels through pitch and rhythm quantization steps. It then further improves label quality through self-training in a teacher-student framework. To validate the method, we run experiments that vary three factors: the vocal pitch estimation model used as the pseudo-label generator (two models), the teacher-student setup (two setups), and the number of self-training iterations. The results show that the proposed method can effectively leverage large-scale unlabeled audio data and that self-training with the noisy student model helps to improve performance. Finally, we show that the model trained with only unlabeled data performs comparably to previous work, and that the model trained with additional labeled data achieves higher accuracy than the model trained with only labeled data.
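As a rough illustration of the frame-to-note conversion described above, the following Python sketch rounds each voiced frame of an F0 track to the nearest semitone (pitch quantization) and merges runs of equal pitch into notes. The abstract does not specify the rhythm quantization procedure, so the sketch substitutes a crude stand-in that simply discards segments shorter than a minimum number of frames. The function names, the 10 ms hop size, and the `min_frames` threshold are all assumptions made for illustration, not the authors' implementation.

```python
import math


def hz_to_midi(f_hz):
    """Convert a frequency in Hz to a fractional MIDI pitch number (A4 = 440 Hz = 69)."""
    return 69.0 + 12.0 * math.log2(f_hz / 440.0)


def frames_to_notes(f0_hz, hop_sec=0.01, min_frames=5):
    """Group a frame-level F0 track into (onset_sec, offset_sec, midi_pitch) notes.

    Assumptions (not from the paper): frames with f0 <= 0 are unvoiced, and
    segments shorter than `min_frames` are dropped as a stand-in for the
    rhythm quantization step.
    """
    # Pitch quantization: round every voiced frame to the nearest semitone.
    pitches = [round(hz_to_midi(f)) if f > 0 else None for f in f0_hz]

    notes = []
    start = 0
    for i in range(1, len(pitches) + 1):
        # Close the current segment at the end of the track or on a pitch change.
        if i == len(pitches) or pitches[i] != pitches[start]:
            is_voiced = pitches[start] is not None
            if is_voiced and (i - start) >= min_frames:
                notes.append((start * hop_sec, i * hop_sec, pitches[start]))
            start = i
    return notes


if __name__ == "__main__":
    # Two unvoiced frames, seven frames near A4, one unvoiced frame, two frames near C5.
    track = [0.0, 0.0] + [440.0] * 5 + [442.0] * 2 + [0.0] + [523.25] * 2
    print(frames_to_notes(track))
```

In a self-training setup of the kind the abstract describes, note sequences produced this way would serve as training targets for the student model; that training loop is outside the scope of this sketch.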

research
08/14/2020

Semi-supervised learning using teacher-student models for vocal melody extraction

The lack of labeled data is a major obstacle in many music information r...
research
06/03/2021

Noisy student-teacher training for robust keyword spotting

We propose self-training with noisy student-teacher approach for streami...
research
03/02/2021

Pseudo-labeling for Scalable 3D Object Detection

To safely deploy autonomous vehicles, onboard perception systems must wo...
research
01/06/2022

Self-Training Vision Language BERTs with a Unified Conditional Model

Natural language BERTs are trained with language corpus in a self-superv...
research
02/05/2022

LST: Lexicon-Guided Self-Training for Few-Shot Text Classification

Self-training provides an effective means of using an extremely small am...
research
03/16/2022

Domain Adaptive Hand Keypoint and Pixel Localization in the Wild

We aim to improve the performance of regressing hand keypoints and segme...
research
11/24/2019

DeepMimic: Mentor-Student Unlabeled Data Based Training

In this paper, we present a deep neural network (DNN) training approach ...
