A Self-Encoder for Learning Nearest Neighbors

06/25/2023
by   Armand Boschin, et al.
0

We present the self-encoder, a neural network trained to guess the identity of each data sample. Despite its simplicity, it learns a very useful representation of data, in a self-supervised way. Specifically, the self-encoder learns to distribute the data samples in the embedding space so that they are linearly separable from one another. This induces a geometry where two samples are close in the embedding space when they are not easy to differentiate. The self-encoder can then be combined with a nearest-neighbor classifier or regressor for any subsequent supervised task. Unlike regular nearest neighbors, the predictions resulting from this encoding of data are invariant to any scaling of features, making any preprocessing like min-max scaling not necessary. The experiments show the efficiency of the approach, especially on heterogeneous data mixing numerical features and categorical features.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2020

Deep Nearest Neighbor Anomaly Detection

Nearest neighbors is a successful and long-standing technique for anomal...
research
11/01/2019

Generalization through Memorization: Nearest Neighbor Language Models

We introduce kNN-LMs, which extend a pre-trained neural language model (...
research
11/13/2018

Dynamic Feature Scaling for K-Nearest Neighbor Algorithm

Nearest Neighbors Algorithm is a Lazy Learning Algorithm, in which the a...
research
05/28/2023

k-NNN: Nearest Neighbors of Neighbors for Anomaly Detection

Anomaly detection aims at identifying images that deviate significantly ...
research
06/17/2022

Intra-Instance VICReg: Bag of Self-Supervised Image Patch Embedding

Recently, self-supervised learning (SSL) has achieved tremendous empiric...
research
10/18/2021

TLDR: Twin Learning for Dimensionality Reduction

Dimensionality reduction methods are unsupervised approaches which learn...
research
05/13/2022

Toward a Geometrical Understanding of Self-supervised Contrastive Learning

Self-supervised learning (SSL) is currently one of the premier technique...

Please sign up or login with your details

Forgot password? Click here to reset