Object-aware Feature Aggregation for Video Object Detection

10/23/2020
by Qichuan Geng, et al.

We present an Object-aware Feature Aggregation (OFA) module for video object detection (VID). Our approach is motivated by the intriguing property that video-level object-aware knowledge can be employed as a powerful semantic prior to help object recognition. As a consequence, augmenting features with such prior knowledge can effectively improve classification and localization performance. To give features access to more content from the whole video, we first capture the object-aware knowledge of proposals and then incorporate it with the well-established pair-wise contexts. Extensive experiments on the ImageNet VID dataset demonstrate the effectiveness of object-aware knowledge, with superior performance of 83.93 and 86.09 mAP. When further equipped with Sequence DIoU NMS, we obtain the best-reported mAP of 85.07 and 86.88. The code will be released after acceptance.
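The abstract does not define Sequence DIoU NMS, so the sequence-level procedure is not reproduced here. As a rough illustration of the quantity its name refers to, the sketch below computes the standard Distance-IoU (DIoU) between two boxes: plain IoU minus the squared centre distance divided by the squared diagonal of the smallest enclosing box. The function name `diou` and the `(x1, y1, x2, y2)` box format are assumptions made for this example, not details taken from the paper.

```python
def diou(box_a, box_b):
    """Distance-IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative helper only; the name and box format are assumptions.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Plain IoU.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0

    # Squared distance between the two box centres.
    dx = (ax1 + ax2) / 2 - (bx1 + bx2) / 2
    dy = (ay1 + ay2) / 2 - (by1 + by2) / 2
    center_dist_sq = dx ** 2 + dy ** 2

    # Squared diagonal of the smallest box enclosing both inputs.
    enclose_w = max(ax2, bx2) - min(ax1, bx1)
    enclose_h = max(ay2, by2) - min(ay1, by1)
    diag_sq = enclose_w ** 2 + enclose_h ** 2

    if diag_sq == 0:
        return iou
    return iou - center_dist_sq / diag_sq
```

Unlike plain IoU, this value keeps decreasing as non-overlapping boxes move further apart, which is why DIoU-based suppression can treat distant boxes differently from nearby ones even when neither pair overlaps.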


research
10/02/2022

DFA: Dynamic Feature Aggregation for Efficient Video Object Detection

Video object detection is a fundamental yet challenging task in computer...
research
03/15/2023

FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors

Video object detection needs to solve feature degradation situations tha...
research
07/22/2023

Topology-Preserving Automatic Labeling of Coronary Arteries via Anatomy-aware Connection Classifier

Automatic labeling of coronary arteries is an essential task in the prac...
research
03/25/2019

Looking Fast and Slow: Memory-Guided Mobile Video Object Detection

With a single eye fixation lasting a fraction of a second, the human vis...
research
09/22/2014

1-HKUST: Object Detection in ILSVRC 2014

The Imagenet Large Scale Visual Recognition Challenge (ILSVRC) is the on...
research
08/26/2019

Relation Distillation Networks for Video Object Detection

It has been well recognized that modeling object-to-object relations wou...
research
07/15/2019

Sequence Level Semantics Aggregation for Video Object Detection

Video object detection (VID) has been a rising research direction in ...
