Modality-Buffet for Real-Time Object Detection

11/17/2020
by   Nicolai Dorka, et al.
9

Real-time object detection in videos using lightweight hardware is a crucial component of many robotic tasks. Detectors using different modalities and with varying computational complexities offer different trade-offs. One option is to have a very lightweight model that can predict from all modalities at once for each frame. However, in some situations (e.g., in static scenes) it might be better to have a more complex but more accurate model and to extrapolate from previous predictions for the frames coming in at processing time. We formulate this task as a sequential decision making problem and use reinforcement learning (RL) to generate a policy that decides from the RGB input which detector out of a portfolio of different object detectors to take for the next prediction. The objective of the RL agent is to maximize the accuracy of the predictions per image. We evaluate the approach on the Waymo Open Dataset and show that it exceeds the performance of each single detector.

READ FULL TEXT

page 1

page 3

page 4

research
10/28/2022

ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy

This paper analyzes the effects of dynamically varying video contents an...
research
03/28/2019

ThunderNet: Towards Real-time Generic Object Detection

Real-time generic object detection on mobile platforms is a crucial but ...
research
03/22/2022

FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics

Detection-driven real-time video analytics require continuous detection ...
research
08/22/2018

A Survey of Modern Object Detection Literature using Deep Learning

Object detection is the identification of an object in the image along w...
research
08/29/2019

Minimum Delay Object Detection From Video

We consider the problem of detecting objects, as they come into view, fr...
research
11/16/2020

Online Monitoring of Object Detection Performance Post-Deployment

Post-deployment, an object detector is expected to operate at a similar ...
research
06/06/2022

Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles

Object detection is a difficult downstream task in computer vision. For ...

Please sign up or login with your details

Forgot password? Click here to reset