Hard negative examples are hard, but useful

07/24/2020
by Hong Xuan, et al.
Triplet loss is an extremely common approach to distance metric learning. Representations of images from the same class are optimized to be mapped closer together in an embedding space than representations of images from different classes. Much work on triplet losses focuses on selecting the most useful triplets of images to consider, with strategies that select dissimilar examples from the same class or similar examples from different classes. The consensus of previous research is that optimizing with the hardest negative examples leads to bad training behavior. That's a problem – these hardest negatives are literally the cases where the distance metric fails to capture semantic similarity. In this paper, we characterize the space of triplets and derive why hard negatives make triplet loss training fail. We offer a simple fix to the loss function and show that, with this fix, optimizing with hard negative examples becomes feasible. This leads to more generalizable features and to image retrieval results that outperform the state of the art on datasets with high intra-class variance.
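
To make the setup concrete, here is a minimal sketch of a standard margin-based triplet loss that pairs each anchor-positive pair with the hardest negative, meaning the closest example from a different class. It is an illustration under stated assumptions, not the method from the paper: the function name `hardest_negative_triplet_loss`, the margin of 0.2, the L2 normalization, and the use of Euclidean distance are all choices made for this example, and the fix the abstract mentions is not reproduced because the abstract does not spell it out.

```python
import numpy as np


def hardest_negative_triplet_loss(embeddings, labels, margin=0.2):
    """Mean triplet loss over all anchor-positive pairs in a batch.

    Hypothetical helper for illustration only. For each anchor, the
    negative is the closest example with a different label.
    """
    embeddings = np.asarray(embeddings, dtype=float)
    labels = np.asarray(labels)

    # Assumption: embeddings are L2-normalized before comparing them.
    emb = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)

    # Pairwise Euclidean distances, shape (n, n).
    diff = emb[:, None, :] - emb[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    # same[i, j] is True when examples i and j share a class label.
    same = labels[:, None] == labels[None, :]

    losses = []
    n = len(labels)
    for a in range(n):
        neg_mask = ~same[a]
        if not neg_mask.any():
            continue  # no negatives for this anchor in the batch
        hardest_neg = dist[a][neg_mask].min()
        for p in range(n):
            if p != a and same[a, p]:
                # Penalize when the positive is not closer than the
                # hardest negative by at least the margin.
                losses.append(max(0.0, dist[a, p] - hardest_neg + margin))

    return float(np.mean(losses)) if losses else 0.0


if __name__ == "__main__":
    batch = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]])
    batch_labels = np.array([0, 0, 1])
    print(hardest_negative_triplet_loss(batch, batch_labels))
```

In the toy batch, the third example has a different label from the first but sits at the same point in the embedding space, which is the kind of hardest negative the abstract describes: a case where the distance metric fails to capture semantic similarity.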
