Mobile Video Object Detection with Temporally-Aware Feature Maps

11/17/2017
by Mason Liu, et al.

This paper introduces an online model for object detection in videos designed to run in real time on low-powered mobile and embedded devices. Our approach combines fast single-image object detection with convolutional long short-term memory (LSTM) layers to create an interweaved recurrent-convolutional architecture. Additionally, we propose an efficient Bottleneck-LSTM layer that significantly reduces computational cost compared to regular LSTMs. Our network achieves temporal awareness by using Bottleneck-LSTMs to refine and propagate feature maps across frames. This approach is substantially faster than existing detection methods in video, outperforming the fastest single-frame models in model size and computational cost while attaining accuracy comparable to much more expensive single-frame models on the ImageNet VID 2015 dataset. Our model reaches a real-time inference speed of up to 15 FPS on a mobile CPU.
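
The idea of refining and propagating feature maps across frames with a recurrent convolutional layer can be illustrated with a small NumPy sketch. This is not the layer defined in the full text. It is a generic convolutional LSTM step preceded by a bottleneck convolution, written under assumptions that the abstract does not state: the names `conv2d_same`, `bottleneck_lstm_step`, `init_params`, and `run_video` are hypothetical, and the 3x3 kernels, the tanh bottleneck activation, the gate ordering, and the channel counts are illustrative choices.

```python
import numpy as np


def conv2d_same(x, w):
    """Stride-1 convolution with zero 'same' padding.

    x: (H, W, C_in) feature map; w: (k, k, C_in, C_out) kernel, k odd.
    """
    k = w.shape[0]
    p = k // 2
    xp = np.pad(x, ((p, p), (p, p), (0, 0)))
    H, W, _ = x.shape
    out = np.zeros((H, W, w.shape[3]))
    for i in range(k):
        for j in range(k):
            # Each kernel tap is a per-pixel matrix product over channels.
            out += xp[i:i + H, j:j + W, :] @ w[i, j]
    return out


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def init_params(m, n, k=3, seed=0):
    """m input channels, n state channels (both illustrative)."""
    rng = np.random.default_rng(seed)
    return {
        "w_b": rng.normal(0.0, 0.1, (k, k, m + n, n)),  # bottleneck conv
        "w_g": rng.normal(0.0, 0.1, (k, k, n, 4 * n)),  # all four gates
    }


def bottleneck_lstm_step(x, h, c, params):
    """One recurrent step: fuse x with the previous state, then gate."""
    # Assumed bottleneck: squeeze [x, h] down to n channels first.
    b = np.tanh(conv2d_same(np.concatenate([x, h], axis=-1), params["w_b"]))
    # Gates are computed from the n-channel bottleneck map only.
    i, f, o, g = np.split(conv2d_same(b, params["w_g"]), 4, axis=-1)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    h_new = sigmoid(o) * np.tanh(c_new)
    return h_new, c_new


def run_video(frame_features, params):
    """Propagate state across per-frame feature maps; return refined maps."""
    H, W, _ = frame_features[0].shape
    n = params["w_b"].shape[3]
    h = np.zeros((H, W, n))
    c = np.zeros((H, W, n))
    refined = []
    for x in frame_features:
        h, c = bottleneck_lstm_step(x, h, c, params)
        refined.append(h)
    return refined
```

In this sketch the four gates read an n-channel map instead of the (m+n)-channel concatenation of input and state, which is one plausible way a bottleneck could lower the cost of the gate convolutions. Feeding the same feature map twice yields different outputs at the two steps, because the hidden and cell states carry information forward from the earlier frame.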


