QueryProp: Object Query Propagation for High-Performance Video Object Detection

07/22/2022
by Fei He, et al.

Video object detection is an important yet challenging topic in computer vision. Traditional methods mainly focus on designing image-level or box-level feature propagation strategies to exploit temporal information. This paper argues that a more effective and efficient feature propagation framework can improve video object detectors in both accuracy and speed. To this end, it studies object-level feature propagation and proposes an object query propagation (QueryProp) framework for high-performance video object detection. QueryProp contains two propagation strategies: 1) query propagation from sparse key frames to dense non-key frames, which reduces redundant computation on non-key frames; and 2) query propagation from previous key frames to the current key frame, which improves feature representation through temporal context modeling. To further facilitate query propagation, an adaptive propagation gate is designed to achieve flexible key frame selection. We conduct extensive experiments on the ImageNet VID dataset. QueryProp achieves accuracy comparable to state-of-the-art methods and strikes a decent accuracy/speed trade-off. Code is available at https://github.com/hf1995/QueryProp.
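The control flow the abstract describes can be sketched roughly as follows. This is a toy illustration, not the authors' implementation: `run_video`, `heavy_detect`, `light_refine`, `gate`, and `max_gap` are hypothetical names, frames and queries are plain floats rather than image tensors and learned embeddings, and the gate is a fixed threshold where the paper's gate is adaptive. The sketch only shows where the two propagation paths and the gate would sit in a per-frame loop.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence

Queries = List[float]  # toy stand-in for a set of object query vectors


@dataclass
class FrameResult:
    index: int
    is_key: bool
    queries: Queries


def run_video(
    frames: Sequence[float],
    heavy_detect: Callable[[float, List[Queries]], Queries],
    light_refine: Callable[[float, Queries], Queries],
    gate: Callable[[float, Queries], bool],
    max_gap: int = 10,  # assumption: hard cap on frames between key frames
) -> List[FrameResult]:
    results: List[FrameResult] = []
    key_queries: Optional[Queries] = None
    memory: List[Queries] = []  # queries kept from previous key frames
    since_key = 0
    for i, frame in enumerate(frames):
        need_key = (
            key_queries is None
            or since_key >= max_gap
            or gate(frame, key_queries)
        )
        if need_key:
            # Strategy 2: previous key-frame queries give temporal context.
            key_queries = heavy_detect(frame, memory)
            memory.append(key_queries)
            since_key = 0
            results.append(FrameResult(i, True, key_queries))
        else:
            # Strategy 1: reuse key-frame queries with a cheap refinement.
            since_key += 1
            results.append(FrameResult(i, False, light_refine(frame, key_queries)))
    return results


# Hypothetical toy components, for illustration only.
def heavy_detect(frame: float, memory: List[Queries]) -> Queries:
    if not memory:
        return [frame]
    context = sum(q[0] for q in memory) / len(memory)
    return [0.9 * frame + 0.1 * context]


def light_refine(frame: float, queries: Queries) -> Queries:
    return [0.9 * q + 0.1 * frame for q in queries]


def gate(frame: float, queries: Queries) -> bool:
    # Request a new key frame when the content drifts far from the queries.
    return abs(frame - queries[0]) > 1.0


frames = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
results = run_video(frames, heavy_detect, light_refine, gate)
key_flags = [r.is_key for r in results]
```

In this toy run the expensive path is taken only on the first frame and at the abrupt change in content; the remaining frames reuse the propagated queries, which is the source of the speed gain the abstract claims for non-key frames.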

research
04/16/2018

Towards High Performance Video Object Detection for Mobiles

Despite the recent success of video object detection on Desktop GPUs, it...
research
03/25/2021

Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation

Video object detection is a fundamental problem in computer vision and h...
research
08/20/2022

YOLOV: Making Still Image Object Detectors Great at Video Object Detection

Video object detection (VID) is challenging because of the high variatio...
research
04/16/2018

Optimizing Video Object Detection via a Scale-Time Lattice

High-performance object detection relies on expensive convolutional netw...
research
08/18/2023

SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Camera-based 3D object detection in BEV (Bird's Eye View) space has draw...
research
10/18/2022

Decoupling Features in Hierarchical Propagation for Video Object Segmentation

This paper focuses on developing a more effective method of hierarchical...
research
09/08/2021

Temporal RoI Align for Video Object Recognition

Video object detection is challenging in the presence of appearance dete...
