Analyzing and Improving Representations with the Soft Nearest Neighbor Loss

02/05/2019
by   Nicholas Frosst, et al.
14

We explore and expand the Soft Nearest Neighbor Loss to measure the entanglement of class manifolds in representation space: i.e., how close pairs of points from the same class are relative to pairs of points from different classes. We demonstrate several use cases of the loss. As an analytical tool, it provides insights into the evolution of class similarity structures during learning. Surprisingly, we find that maximizing the entanglement of representations of different classes in the hidden layers is beneficial for discrimination in the final layer, possibly because it encourages representations to identify class-independent similarity structures. Maximizing the soft nearest neighbor loss in the hidden layers leads not only to improved generalization but also to better-calibrated estimates of uncertainty on outlier data. Data that is not from the training distribution can be recognized by observing that in the hidden layers, it has fewer than the normal number of neighbors from the predicted class.

READ FULL TEXT

page 4

page 8

page 13

page 14

page 15

page 16

research
06/05/2020

Improving k-Means Clustering Performance with Disentangled Internal Representations

Deep clustering algorithms combine representation learning and clusterin...
research
07/23/2021

Text Classification and Clustering with Annealing Soft Nearest Neighbor Loss

We define disentanglement as how far class-different data points from ea...
research
08/20/2019

K-Nearest Neighbor Approximation Via the Friend-of-a-Friend Principle

Suppose V is an n-element set where for each x ∈ V, the elements of V ∖{...
research
03/09/2023

Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalisation

To achieve near-zero training error in a classification problem, the lay...
research
02/04/2023

Reducing Nearest Neighbor Training Sets Optimally and Exactly

In nearest-neighbor classification, a training set P of points in ℝ^d wi...
research
10/25/2017

Anatomical labeling of brain CT scan anomalies using multi-context nearest neighbor relation networks

This work is an endeavor to develop a deep learning methodology for auto...
research
11/20/2014

Maximum Likelihood Directed Enumeration Method in Piecewise-Regular Object Recognition

We explore the problems of classification of composite object (images, s...

Please sign up or login with your details

Forgot password? Click here to reset