Resource-Efficient Neural Networks for Embedded Systems

01/07/2020
by   Wolfgang Roth, et al.
21

While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. The development of such approaches is among the major challenges in current machine learning research and key to ensure a smooth transition of machine learning technology from a scientific environment with virtually unlimited computing resources into every day's applications. In this article, we provide an overview of the current state of the art of machine learning techniques facilitating these real-world requirements. In particular, we focus on deep neural networks (DNNs), the predominant machine learning models of the past decade. We give a comprehensive overview of the vast literature that can be mainly split into three non-mutually exclusive categories: (i) quantized neural networks, (ii) network pruning, and (iii) structural efficiency. These techniques can be applied during training or as post-processing, and they are widely used to reduce the computational demands in terms of memory footprint, inference speed, and energy efficiency. We substantiate our discussion with experiments on well-known benchmark data sets to showcase the difficulty of finding good trade-offs between resource-efficiency and predictive performance.

READ FULL TEXT
research
12/05/2018

Efficient and Robust Machine Learning for Real-World Systems

While machine learning is traditionally a resource intensive task, embed...
research
07/08/2023

Towards Efficient In-memory Computing Hardware for Quantized Neural Networks: State-of-the-art, Open Challenges and Perspectives

The amount of data processed in the cloud, the development of Internet-o...
research
05/20/2022

Deployment of Energy-Efficient Deep Learning Models on Cortex-M based Microcontrollers using Deep Compression

Large Deep Neural Networks (DNNs) are the backbone of today's artificial...
research
07/22/2020

Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement

While machine learning techniques are traditionally resource intensive, ...
research
03/17/2023

Operating critical machine learning models in resource constrained regimes

The accelerated development of machine learning methods, primarily deep ...
research
04/17/2023

Energy Efficiency Considerations for Popular AI Benchmarks

Advances in artificial intelligence need to become more resource-aware a...
research
06/24/2022

How to train accurate BNNs for embedded systems?

A key enabler of deploying convolutional neural networks on resource-con...

Please sign up or login with your details

Forgot password? Click here to reset