Rocket Launching: A Universal and Efficient Framework for Training Well-performing Light Net

08/14/2017
by   Guorui Zhou, et al.
0

Models applied on real time response task, like click-through rate (CTR) prediction model, require high accuracy and rigorous response time. Therefore, top-performing deep models of high depth and complexity are not well suited for these applications with the limitations on the inference time. In order to further improve the neural networks' performance given the time and computational limitations, we propose an approach that exploits a cumbersome net to help train the lightweight net for prediction. We dub the whole process rocket launching, where the cumbersome booster net is used to guide the learning of the target light net throughout the whole training process. We analyze different loss functions aiming at pushing the light net to behave similarly to the booster net, and adopt the loss with best performance in our experiments. We use one technique called gradient block to improve the performance of the light net and booster net further. Experiments on benchmark datasets and real-life industrial advertisement data present that our light model can get performance only previously achievable with more complex models.

READ FULL TEXT
research
07/20/2017

An All-in-One Network for Dehazing and Beyond

This paper proposes an image dehazing model built with a convolutional n...
research
06/27/2021

Immuno-mimetic Deep Neural Networks (Immuno-Net)

Biomimetics has played a key role in the evolution of artificial neural ...
research
08/24/2020

LCA-Net: Light Convolutional Autoencoder for Image Dehazing

Image dehazing is a crucial image pre-processing task aimed at removing ...
research
03/01/2021

Self-supervised Low Light Image Enhancement and Denoising

This paper proposes a self-supervised low light image enhancement method...
research
09/18/2023

Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Automated assessment of speech intelligibility in hearing aid (HA) devic...
research
08/26/2023

Autonomous Underwater Robotic System for Aquaculture Applications

Aquaculture is a thriving food-producing sector producing over half of t...
research
07/03/2019

Slim-CNN: A Light-Weight CNN for Face Attribute Prediction

We introduce a computationally-efficient CNN micro-architecture Slim Mod...

Please sign up or login with your details

Forgot password? Click here to reset