Bringing Giant Neural Networks Down to Earth with Unlabeled Data

07/13/2019
by Yehui Tang, et al.

Compressing giant neural networks has attracted much attention because of its many applications on edge devices such as cellphones. During compression, one of the most important steps is retraining the pre-trained model on the original training dataset. In practice, however, security, privacy, or commercial concerns often mean that only a fraction of the training data is made available, which makes retraining infeasible. To address this issue, this paper proposes using unlabeled data, which can be cheaper to acquire. Specifically, we use the unlabeled data to mimic the classification characteristics of the giant network, so that its original capacity is preserved. However, a dataset bias exists between the labeled and unlabeled data, and it disturbs the mimicking to some extent. We therefore correct this bias with an adversarial loss that aligns the distributions of their low-level feature representations. We further provide a theoretical discussion of how the unlabeled data help compressed networks generalize better. Experimental results demonstrate that the unlabeled data can significantly improve the performance of the compressed networks.
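The two ingredients named in the abstract, mimicking the giant network on unlabeled data and an adversarial loss that aligns feature distributions, can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the abstract does not give the exact loss forms, so the KL-divergence mimic loss, the binary cross-entropy discriminator loss, the `temperature` parameter, and all function names below are assumptions chosen for illustration.

```python
import numpy as np

EPS = 1e-12  # avoids log(0); an illustrative choice


def softmax(logits, temperature=1.0):
    """Row-wise softmax with an optional temperature (assumed, not from the paper)."""
    z = np.asarray(logits, dtype=float) / temperature
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def mimic_loss(teacher_logits, student_logits, temperature=1.0):
    """Mean KL(teacher || student) over a batch of unlabeled inputs.

    Assumed form of "mimicking the classification characteristics":
    the compressed network is pushed toward the giant network's
    output distribution, so no ground-truth labels are needed.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = np.sum(p * (np.log(p + EPS) - np.log(q + EPS)), axis=1)
    return float(np.mean(kl))


def discriminator_loss(prob_labeled, prob_unlabeled):
    """Binary cross-entropy of a hypothetical domain discriminator.

    prob_* are the discriminator's predicted probabilities that a
    low-level feature vector came from the labeled dataset. In an
    adversarial setup the feature extractor is trained to make this
    loss large, i.e. to make the two feature distributions
    indistinguishable.
    """
    prob_labeled = np.asarray(prob_labeled, dtype=float)
    prob_unlabeled = np.asarray(prob_unlabeled, dtype=float)
    loss_labeled = -np.mean(np.log(prob_labeled + EPS))
    loss_unlabeled = -np.mean(np.log(1.0 - prob_unlabeled + EPS))
    return float(loss_labeled + loss_unlabeled)
```

In this sketch, `mimic_loss` is zero when the compressed network reproduces the giant network's outputs exactly, and `discriminator_loss` equals 2 ln 2 when the discriminator outputs 0.5 for every input, the point at which it cannot tell labeled from unlabeled features.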


