Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach

05/18/2021
by   Ryota Yoshihashi, et al.
14

Detecting tiny objects in a high-resolution video is challenging because the visual information is little and unreliable. Specifically, the challenge includes very low resolution of the objects, MPEG artifacts due to compression and a large searching area with many hard negatives. Tracking is equally difficult because of the unreliable appearance, and the unreliable motion estimation. Luckily, we found that by combining this two challenging tasks together, there will be mutual benefits. Following the idea, in this paper, we present a neural network model called the Recurrent Correlational Network, where detection and tracking are jointly performed over a multi-frame representation learned through a single, trainable, and end-to-end network. The framework exploits a convolutional long short-term memory network for learning informative appearance changes for detection, while the learned representation is shared in tracking for enhancing its performance. In experiments with datasets containing images of scenes with small flying objects, such as birds and unmanned aerial vehicles, the proposed method yielded consistent improvements in detection performance over deep single-frame detectors and existing motion-based detectors. Furthermore, our network performs as well as state-of-the-art generic object trackers when it was evaluated as a tracker on a bird image dataset.

READ FULL TEXT

page 2

page 4

page 5

page 9

page 12

page 13

page 16

research
09/14/2017

Learning Multi-frame Visual Representation for Joint Detection and Tracking of Small Objects

Deep convolutional and recurrent neural networks have delivered signific...
research
04/12/2021

Localization-Based Tracking

End-to-end production of object tracklets from high resolution video in ...
research
11/13/2018

Detect or Track: Towards Cost-Effective Video Object Detection/Tracking

State-of-the-art object detectors and trackers are developing fast. Trac...
research
03/19/2021

TDIOT: Target-driven Inference for Deep Video Object Tracking

Recent tracking-by-detection approaches use deep object detectors as tar...
research
10/19/2020

Multiple Pedestrians and Vehicles Tracking in Aerial Imagery: A Comprehensive Study

In this paper, we address various challenges in multi-pedestrian and veh...
research
12/22/2020

Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net

In this paper we propose a novel deep neural network that is able to joi...
research
08/20/2019

An End-to-end Video Text Detector with Online Tracking

Video text detection is considered as one of the most difficult tasks in...

Please sign up or login with your details

Forgot password? Click here to reset