Noise-Resistant Deep Metric Learning with Probabilistic Instance Filtering

08/03/2021
by Chang Liu, et al.

Noisy labels are common in real-world data and degrade the performance of deep neural networks. Cleaning data manually is labour-intensive and time-consuming. Previous research mostly focuses on making classification models robust to noisy labels, while the robustness of deep metric learning (DML) against noisy labels remains less well explored. In this paper, we bridge this important gap by proposing the Probabilistic Ranking-based Instance Selection with Memory (PRISM) approach for DML. PRISM calculates the probability that a label is clean and filters out potentially noisy samples. Specifically, we propose three methods to calculate this probability: 1) the Average Similarity Method (AvgSim), which calculates the average similarity between potentially noisy data and clean data; 2) the Proxy Similarity Method (ProxySim), which replaces the centers maintained by AvgSim with the proxies trained by a proxy-based method; and 3) von Mises-Fisher Distribution Similarity (vMF-Sim), which estimates a von Mises-Fisher distribution for each data class. With such a design, the proposed approach can deal with challenging DML situations in which the majority of the samples are noisy. Extensive experiments on both synthetic and real-world noisy datasets show that the proposed approach achieves up to 8.37% higher Precision@1 than the best performing state-of-the-art baseline approaches, within reasonable training time.
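To make the AvgSim idea more concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes a memory of embeddings believed to be clean, scores each incoming sample by its average cosine similarity to every class, and turns those scores into a probability that the given label is clean. The softmax with a `temperature` parameter, the quantile-based cut-off driven by an assumed `noise_rate`, and the function names `clean_probability` and `filter_noisy` are all illustrative assumptions that do not come from the abstract.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def clean_probability(embeddings, labels, memory_embeddings, memory_labels,
                      num_classes, temperature=0.1):
    """Hypothetical AvgSim-style score: probability that each label is clean.

    Assumption: the probability is a softmax over the average cosine
    similarity between a sample and the memory features of each class.
    """
    emb = l2_normalize(np.asarray(embeddings, dtype=float))
    mem = l2_normalize(np.asarray(memory_embeddings, dtype=float))
    memory_labels = np.asarray(memory_labels)
    labels = np.asarray(labels)

    # The mean of dot products with unit vectors equals the dot product with
    # their mean, so one center per class is enough. Classes with no memory
    # entries keep a zero center (similarity 0), a simplification of this sketch.
    centers = np.zeros((num_classes, emb.shape[1]))
    for c in range(num_classes):
        members = mem[memory_labels == c]
        if len(members) > 0:
            centers[c] = members.mean(axis=0)

    logits = (emb @ centers.T) / temperature
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    probs /= probs.sum(axis=1, keepdims=True)
    return probs[np.arange(len(labels)), labels]


def filter_noisy(p_clean, noise_rate):
    """Keep samples at or above the `noise_rate` quantile (assumed rule)."""
    threshold = np.quantile(p_clean, noise_rate)
    return p_clean >= threshold


# Toy usage: two well-separated classes; the third sample is mislabelled.
memory = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
memory_y = [0, 0, 1, 1]
batch = [[1.0, 0.1], [0.1, 1.0], [1.0, 0.05]]
batch_y = [0, 1, 1]

p = clean_probability(batch, batch_y, memory, memory_y, num_classes=2)
keep = filter_noisy(p, noise_rate=1 / 3)
print(p, keep)
```

In this toy setup the mislabelled third sample receives a much lower clean probability than the other two and is the one dropped. ProxySim would, per the abstract, swap the averaged centers for learned proxies; that variant is not shown here.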


research
03/30/2021

Noise-resistant Deep Metric Learning with Ranking-based Instance Selection

The existence of noisy labels in real-world data negatively impacts the ...
research
06/20/2023

MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels

Despite deep learning has achieved great success, it often relies on a l...
research
10/29/2021

Adaptive Hierarchical Similarity Metric Learning with Noisy Labels

Deep Metric Learning (DML) plays a critical role in various machine lear...
research
12/08/2022

Leveraging Unlabeled Data to Track Memorization

Deep neural networks may easily memorize noisy labels present in real-wo...
research
07/29/2021

Learning with Noisy Labels for Robust Point Cloud Segmentation

Point cloud segmentation is a fundamental task in 3D. Despite recent pro...
research
12/01/2022

Noisy Label Detection for Speaker Recognition

The success of deep neural networks requires both high annotation qualit...
research
07/08/2022

A Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning

Proxy-based Deep Metric Learning (DML) learns deep representations by em...
