Deep Motion Features for Visual Tracking

12/20/2016
by   Susanna Gladh, et al.
0

Robust visual tracking is a challenging computer vision problem, with many real-world applications. Most existing approaches employ hand-crafted appearance features, such as HOG or Color Names. Recently, deep RGB features extracted from convolutional neural networks have been successfully applied for tracking. Despite their success, these features only capture appearance information. On the other hand, motion cues provide discriminative and complementary information that can improve tracking performance. Contrary to visual tracking, deep motion features have been successfully applied for action recognition and video classification tasks. Typically, the motion features are learned by training a CNN on optical flow images extracted from large amounts of labeled videos. This paper presents an investigation of the impact of deep motion features in a tracking-by-detection framework. We further show that hand-crafted, deep RGB, and deep motion features contain complementary information. To the best of our knowledge, we are the first to propose fusing appearance information with deep motion features for visual tracking. Comprehensive experiments clearly suggest that our fusion approach with deep motion features outperforms standard methods relying on appearance information alone.

READ FULL TEXT

page 1

page 3

research
12/09/2016

ActionFlowNet: Learning Motion Representation for Action Recognition

Even with the recent advances in convolutional neural networks (CNN) in ...
research
11/21/2020

Exploring the multimodal information from video content using deep learning features of appearance, audio and action for video recommendation

Following the popularisation of media streaming, a number of video strea...
research
10/27/2022

Deep Latent Mixture Model for Recommendation

Recent advances in neural networks have been successfully applied to man...
research
10/02/2016

Plug-and-Play CNN for Crowd Motion Analysis: An Application in Abnormal Event Detection

Most of the crowd abnormal event detection methods rely on complex hand-...
research
10/06/2015

Learning Deep Representations of Appearance and Motion for Anomalous Event Detection

We present a novel unsupervised deep learning framework for anomalous ev...
research
03/19/2016

Deep Shading: Convolutional Neural Networks for Screen-Space Shading

In computer vision, convolutional neural networks (CNNs) have recently a...
research
06/12/2020

Multiple-Vehicle Tracking in the Highway Using Appearance Model and Visual Object Tracking

In recent decades, due to the groundbreaking improvements in machine vis...

Please sign up or login with your details

Forgot password? Click here to reset