Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

06/06/2016
by   Lin Wu, et al.
0

In this paper, we present an end-to-end approach to simultaneously learn spatio-temporal features and corresponding similarity metric for video-based person re-identification. Given the video sequence of a person, features from each frame that are extracted from all levels of a deep convolutional network can preserve a higher spatial resolution from which we can model finer motion patterns. These low-level visual percepts are leveraged into a variant of recurrent model to characterize the temporal variation between time-steps. Features from all time-steps are then summarized using temporal pooling to produce an overall feature representation for the complete sequence. The deep convolutional network, recurrent layer, and the temporal pooling are jointly trained to extract comparable hidden-unit representations from input pair of time series to compute their corresponding similarity value. The proposed framework combines time series modeling and metric learning to jointly learn relevant features and a good similarity measure between time sequences of person. Experiments demonstrate that our approach achieves the state-of-the-art performance for video-based person re-identification on iLIDS-VID and PRID 2011, the two primary public datasets for this purpose.

READ FULL TEXT

page 5

page 7

research
08/03/2018

Where-and-When to Look: Deep Siamese Attention Networks for Video-based Person Re-identification

Video-based person re-identification (re-id) is a central application in...
research
11/19/2015

Delving Deeper into Convolutional Networks for Learning Video Representations

We propose an approach to learn spatio-temporal features in videos from ...
research
09/04/2019

Deep Convolutional Networks in System Identification

Recent developments within deep learning are relevant for nonlinear syst...
research
01/27/2016

PersonNet: Person Re-identification with Deep Convolutional Neural Networks

In this paper, we propose a deep end-to-end neu- ral network to simultan...
research
11/15/2017

A Correlation Based Feature Representation for First-Person Activity Recognition

In this paper, a simple yet efficient feature encoding for first-person ...
research
07/25/2021

Spatio-Temporal Representation Factorization for Video-based Person Re-Identification

Despite much recent progress in video-based person re-identification (re...
research
03/30/2018

Efficient and Deep Person Re-Identification using Multi-Level Similarity

Person Re-Identification (ReID) requires comparing two images of person ...

Please sign up or login with your details

Forgot password? Click here to reset