Set Augmented Triplet Loss for Video Person Re-Identification

11/02/2020
by Pengfei Fang, et al.
Modern video person re-identification (re-ID) models are often trained with a metric learning approach, supervised by a triplet loss. The triplet loss used in video re-ID is usually computed on so-called clip features, each aggregated from a few frame features. In this paper, we propose to model the video clip as a set and instead study the distance between sets in the corresponding triplet loss. In contrast to the distance between clip representations, the distance between clip sets considers the pairwise similarity of each element (i.e., frame representation) between two sets. This allows the network to directly optimize the feature representation at the frame level. Apart from the commonly used set distance metrics (e.g., ordinary distance and Hausdorff distance), we further propose a hybrid distance metric tailored to the set-aware triplet loss. We also propose a hard positive set construction strategy that uses the class prototypes learned in a batch. The proposed method achieves state-of-the-art results on several standard benchmarks.
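
As a rough illustration of the set-based triplet loss described in the abstract, the sketch below compares clips as sets of frame features using two of the set distances it names. It is a minimal sketch under stated assumptions, not the authors' implementation: "ordinary distance" is assumed to mean the smallest pairwise distance between two sets, the Hausdorff distance is the standard symmetric form, and the function names, the Euclidean frame distance, and the margin value are hypothetical. The hybrid metric and the hard positive set construction are not specified in the abstract, so they are not sketched.

```python
import math

def euclidean(u, v):
    """Euclidean distance between two frame feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def pairwise(set_a, set_b):
    """Distances between every frame in set_a and every frame in set_b."""
    return [[euclidean(u, v) for v in set_b] for u in set_a]

def ordinary_distance(set_a, set_b):
    """Smallest pairwise distance between the two sets (assumed definition)."""
    return min(min(row) for row in pairwise(set_a, set_b))

def hausdorff_distance(set_a, set_b):
    """Symmetric Hausdorff distance: the largest nearest-neighbour gap."""
    d = pairwise(set_a, set_b)
    a_to_b = max(min(row) for row in d)
    b_to_a = max(min(row[j] for row in d) for j in range(len(set_b)))
    return max(a_to_b, b_to_a)

def set_triplet_loss(anchor, positive, negative, set_distance, margin=0.3):
    """Hinge triplet loss with clips compared as sets (margin is an assumption)."""
    gap = set_distance(anchor, positive) - set_distance(anchor, negative)
    return max(0.0, gap + margin)

# Toy clips: each is a set of 2-D frame features (real features are much larger).
anchor = [(0.0, 0.0), (1.0, 0.0)]
positive = [(0.0, 1.0), (1.0, 1.0)]
negative = [(3.0, 0.0), (5.0, 0.0)]

loss_ordinary = set_triplet_loss(anchor, positive, negative, ordinary_distance)
loss_hausdorff = set_triplet_loss(anchor, positive, negative, hausdorff_distance)
```

Because both distances are computed from the frame-to-frame distance matrix, a gradient of the loss would flow to individual frame features rather than to a single aggregated clip feature, which is the property the abstract highlights.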

research
07/30/2018

Hard-Aware Point-to-Set Deep Metric for Person Re-identification

Person re-identification (re-ID) is a highly challenging task due to lar...
research
07/25/2017

Deep Feature Learning via Structured Graph Laplacian Embedding for Person Re-Identification

Learning the distance metric between pairs of examples is of great impor...
research
03/21/2017

No Fuss Distance Metric Learning using Proxies

We address the problem of distance metric learning (DML), defined as lea...
research
01/09/2019

Individual common dolphin identification via metric embedding learning

Photo-identification (photo-id) of dolphin individuals is a commonly use...
research
10/28/2019

Accurate and Scalable Version Identification Using Musically-Motivated Embeddings

The version identification (VI) task deals with the automatic detection ...
research
06/19/2020

A Symbolic Temporal Pooling method for Video-based Person Re-Identification

In video-based person re-identification, both the spatial and temporal f...
research
04/13/2023

Leveraging triplet loss for unsupervised action segmentation

In this paper, we propose a novel fully unsupervised framework that lear...
