DFA: Dynamic Feature Aggregation for Efficient Video Object Detection

10/02/2022
by   Yiming Cui, et al.
0

Video object detection is a fundamental yet challenging task in computer vision. One practical solution is to take advantage of temporal information from the video and apply feature aggregation to enhance the object features in each frame. Though effective, those existing methods always suffer from low inference speeds because they use a fixed number of frames for feature aggregation regardless of the input frame. Therefore, this paper aims to improve the inference speed of the current feature aggregation-based video object detectors while maintaining their performance. To achieve this goal, we propose a vanilla dynamic aggregation module that adaptively selects the frames for feature enhancement. Then, we extend the vanilla dynamic aggregation module to a more effective and reconfigurable deformable version. Finally, we introduce inplace distillation loss to improve the representations of objects aggregated with fewer frames. Extensive experimental results validate the effectiveness and efficiency of our proposed methods: On the ImageNet VID benchmark, integrated with our proposed methods, FGFA and SELSA can improve the inference speed by 31 performance on accuracy.

READ FULL TEXT

page 2

page 6

page 12

research
03/15/2023

FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors

Video object detection needs to solve feature degradation situations tha...
research
10/23/2020

Object-aware Feature Aggregation for Video Object Detection

We present an Object-aware Feature Aggregation (OFA) module for video ob...
research
12/16/2017

Impression Network for Video Object Detection

Video object detection is more challenging compared to image object dete...
research
08/12/2021

TF-Blender: Temporal Feature Blender for Video Object Detection

Video objection detection is a challenging task because isolated video f...
research
10/05/2022

Spatio-Temporal Learnable Proposals for End-to-End Video Object Detection

This paper presents the novel idea of generating object proposals by lev...
research
08/12/2017

Kill Two Birds With One Stone: Boosting Both Object Detection Accuracy and Speed With adaptive Patch-of-Interest Composition

Object detection is an important yet challenging task in video understan...
research
08/20/2022

YOLOV: Making Still Image Object Detectors Great at Video Object Detection

Video object detection (VID) is challenging because of the high variatio...

Please sign up or login with your details

Forgot password? Click here to reset