Not 3D Re-ID: a Simple Single Stream 2D Convolution for Robust Video Re-identification

08/14/2020
by   Toby P. Breckon, et al.
0

Video-based person re-identification has received increasing attention recently, as it plays an important role within surveillance video analysis. Video-based Re-ID is an expansion of earlier image-based re-identification methods by learning features from a video via multiple image frames for each person. Most contemporary video Re-ID methods utilise complex CNNbased network architectures using 3D convolution or multibranch networks to extract spatial-temporal video features. By contrast, in this paper, we illustrate superior performance from a simple single stream 2D convolution network leveraging the ResNet50-IBN architecture to extract frame-level features followed by temporal attention for clip level features. These clip level features can be generalised to extract video level features by averaging without any significant additional cost. Our approach uses best video Re-ID practice and transfer learning between datasets to outperform existing state-of-the-art approaches on the MARS, PRID2011 and iLIDS-VID datasets with 89:62 MARS, without reliance on complex and memory intensive 3D convolutions or multi-stream networks architectures as found in other contemporary work. Conversely, our work shows that global features extracted by the 2D convolution network are a sufficient representation for robust state of the art video Re-ID.

READ FULL TEXT

page 3

page 7

research
11/19/2018

Multi-scale 3D Convolution Network for Video Based Person Re-Identification

This paper proposes a two-stream convolution network to extract spatial ...
research
05/05/2019

Intra-clip Aggregation for Video Person Re-identification

Video-based person re-id has drawn much attention in recent years due to...
research
12/24/2019

Ordered or Orderless: A Revisit for Video based Person Re-Identification

Is recurrent network really necessary for learning a good visual represe...
research
08/03/2017

Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification

Person Re-Identification (person re-id) is a crucial task as its applica...
research
10/15/2020

Integrating Coarse Granularity Part-level Features with Supervised Global-level Features for Person Re-identification

Holistic person re-identification (Re-ID) and partial person re-identifi...
research
06/19/2020

A Symbolic Temporal Pooling method for Video-based Person Re-Identification

In video-based person re-identification, both the spatial and temporal f...
research
08/05/2019

Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification

Video-based person re-identification (Re-ID) aims at matching video sequ...

Please sign up or login with your details

Forgot password? Click here to reset