DeepMimic: Mentor-Student Unlabeled Data Based Training

11/24/2019
by   Itay Mosafi, et al.
0

In this paper, we present a deep neural network (DNN) training approach called the "DeepMimic" training method. Enormous amounts of data are available nowadays for training usage. Yet, only a tiny portion of these data is manually labeled, whereas almost all of the data are unlabeled. The training approach presented utilizes, in a most simplified manner, the unlabeled data to the fullest, in order to achieve remarkable (classification) results. Our DeepMimic method uses a small portion of labeled data and a large amount of unlabeled data for the training process, as expected in a real-world scenario. It consists of a mentor model and a student model. Employing a mentor model trained on a small portion of the labeled data and then feeding it only with unlabeled data, we show how to obtain a (simplified) student model that reaches the same accuracy and loss as the mentor model, on the same test set, without using any of the original data labels in the training of the student model. Our experiments demonstrate that even on challenging classification tasks the student network architecture can be simplified significantly with a minor influence on the performance, i.e., we need not even know the original network architecture of the mentor. In addition, the time required for training the student model to reach the mentor's performance level is shorter, as a result of a simplified architecture and more available data. The proposed method highlights the disadvantages of regular supervised training and demonstrates the benefits of a less traditional training approach.

READ FULL TEXT
research
08/14/2020

Semi-supervised learning using teacher-student models for vocal melody extraction

The lack of labeled data is a major obstacle in many music information r...
research
12/09/2019

Stealing Knowledge from Protected Deep Neural Networks Using Composite Unlabeled Data

As state-of-the-art deep neural networks are deployed at the core of mor...
research
09/25/2019

Teacher-Student Learning Paradigm for Tri-training: An Efficient Method for Unlabeled Data Exploitation

Given that labeled data is expensive to obtain in real-world scenarios, ...
research
03/25/2022

Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music

Lack of large-scale note-level labeled data is the major obstacle to sin...
research
01/17/2022

Distillation from heterogeneous unlabeled collections

Compressing deep networks is essential to expand their range of applicat...
research
09/05/2019

FraudJudger: Real-World Data Oriented Fraud Detection on Digital Payment Platforms

Automated fraud behaviors detection on electronic payment platforms is a...
research
01/13/2019

Gradient Regularized Budgeted Boosting

As machine learning transitions increasingly towards real world applicat...

Please sign up or login with your details

Forgot password? Click here to reset