A System-Level Solution for Low-Power Object Detection

09/24/2019
by Fanrong Li, et al.

Object detection has made impressive progress in recent years with the help of deep learning. However, state-of-the-art algorithms are both computation- and memory-intensive. Although many lightweight networks have been developed to trade off accuracy against efficiency, making them practical on an embedded device remains a challenge. In this paper, we present a system-level solution for efficient object detection on a heterogeneous embedded device. The detection network is quantized to low bit-widths, which allows an efficient implementation with shift operators. To make the most of the benefits of low-bit quantization, we design a dedicated accelerator in programmable logic. Inside the accelerator, a hybrid dataflow is exploited according to the heterogeneous properties of different convolutional layers. We adopt a straightforward but resource-friendly column-prior tiling strategy to map the computation-intensive convolutional layers to the accelerator, which can support arbitrary feature sizes. Other operations are performed on the low-power CPU cores, and the entire system is executed in a pipelined manner. As a case study, we evaluate our object detection system on a real-world surveillance video with an input size of 512x512. The system achieves an inference speed of 18 fps at a power cost of 6.9 W (with display), with an mAP of 66.4 verified on the PASCAL VOC 2012 dataset.
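The abstract does not spell out the quantization scheme, so the following is only a minimal sketch of why low-bit quantization can pair well with shift operators. It assumes weights are rounded to signed powers of two, which is one common choice and not necessarily the authors' method. The names `quantize_pow2`, `shift_mul`, and `shift_dot`, and the exponent range, are hypothetical.

```python
import math

def quantize_pow2(w, min_exp=-6, max_exp=0):
    """Round a weight to sign * 2**exp (assumed scheme, not from the paper).

    Returns (sign, exp); a zero weight is encoded as (0, 0).
    The exponent range [min_exp, max_exp] is an illustrative assumption.
    """
    if w == 0:
        return 0, 0
    sign = 1 if w > 0 else -1
    exp = round(math.log2(abs(w)))          # nearest exponent in the log domain
    exp = max(min_exp, min(max_exp, exp))   # clip to the representable range
    return sign, exp

def shift_mul(x, sign, exp):
    """Multiply a non-negative integer activation by sign * 2**exp with shifts."""
    if sign == 0:
        return 0
    y = x << exp if exp >= 0 else x >> -exp  # right shift truncates low bits
    return y if sign > 0 else -y

def shift_dot(activations, weights):
    """Dot product in which every multiplication is replaced by a shift."""
    total = 0
    for x, w in zip(activations, weights):
        sign, exp = quantize_pow2(w)
        total += shift_mul(x, sign, exp)
    return total

if __name__ == "__main__":
    acts = [64, 32, 16]          # e.g. integer activations after ReLU
    wts = [0.5, -0.25, 0.3]      # 0.3 is rounded to 2**-2 = 0.25
    print(shift_dot(acts, wts))  # shift-only result
    print(sum(a * w for a, w in zip(acts, wts)))  # floating-point reference
```

In hardware, each such shift replaces a multiplier, which is the kind of saving a dedicated accelerator in programmable logic can exploit; the accuracy impact depends on the actual quantization method and training procedure, which this sketch does not model.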

