AutoQB: AutoML for Network Quantization and Binarization on Mobile Devices

02/15/2019
by   Qian Lou, et al.

In this paper, we propose a hierarchical deep reinforcement learning (DRL)-based AutoML framework, AutoQB, to automatically explore the design space of channel-level network quantization and binarization for hardware-friendly deep learning on mobile devices. Compared to prior DDPG-based quantization techniques on various CNN models, AutoQB achieves the same inference accuracy with ∼79% less computing overhead, or improves inference accuracy by ∼2% at the same computing cost.
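To make the search concrete, below is a minimal sketch of a channel-level bit-width search of the kind the abstract describes. The reward shape (accuracy minus a compute-cost penalty), the λ coefficient, the candidate bit-widths, and all function and variable names are illustrative assumptions, not the paper's actual formulation; a real AutoQB-style agent would use a DDPG policy rather than the random search used here as a stand-in.

```python
# Sketch of a per-channel bit-width search with an accuracy/cost reward.
# All names, constants, and the mocked evaluation are assumptions for
# illustration only, not the method from the paper.
import random

BIT_CHOICES = [1, 2, 4, 8]   # candidate bit-widths per channel (1 = binarized)
NUM_CHANNELS = 16            # channels in the layer being searched
LAMBDA_COST = 0.5            # accuracy/compute trade-off weight (assumed)


def evaluate(policy):
    """Stand-in for quantizing the model with `policy` and measuring it.

    Returns (accuracy, normalized_compute_cost); both are mocked so the
    sketch runs without a real model or dataset.
    """
    cost = sum(policy) / (8 * len(policy))                  # fraction of full 8-bit cost
    accuracy = 0.6 + 0.4 * cost - random.uniform(0, 0.05)   # toy accuracy model
    return accuracy, cost


def reward(accuracy, cost):
    # Higher accuracy is rewarded, higher compute cost is penalized.
    return accuracy - LAMBDA_COST * cost


best_policy, best_reward = None, float("-inf")
for episode in range(200):
    # A DDPG agent would emit continuous actions rounded to bit-widths;
    # random sampling stands in for the agent in this sketch.
    policy = [random.choice(BIT_CHOICES) for _ in range(NUM_CHANNELS)]
    acc, cost = evaluate(policy)
    r = reward(acc, cost)
    if r > best_reward:
        best_policy, best_reward = policy, r

print("best per-channel bit-widths:", best_policy)
print("best reward:", round(best_reward, 3))
```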


