Universal consistency of Wasserstein k-NN classifier

09/10/2020
by   Donlapark Ponnoprat, et al.
33

The Wasserstein distance provides a notion of dissimilarities between probability measures, which has recent applications in learning of structured data with varying size such as images and text documents. In this work, we analyze the k-nearest neighbor classifier (k-NN) under the Wasserstein distance and establish the universal consistency on families of distributions. Using previous known results on the consistency of the k-NN classifier on infinite dimensional metric spaces, it suffices to show that the families is a countable union of finite dimensional components. As a result, we are able to prove universal consistency of k-NN on spaces of finitely supported measures, the space of finite wavelet series and the spaces of Gaussian measures with commuting covariance matrices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2020

Universal consistency of the k-NN rule in metric spaces and Nagata dimension

The k nearest neighbour learning rule (under the uniform distance tie br...
research
10/19/2011

Is the k-NN classifier in high dimensions affected by the curse of dimensionality?

There is an increasing body of evidence suggesting that exact nearest ne...
research
05/26/2023

Universal consistency of the k-NN rule in metric spaces and Nagata dimension. II

We continue to investigate the k nearest neighbour learning rule in sepa...
research
06/17/2008

Supervised functional classification: A theoretical remark and some comparisons

The problem of supervised classification (or discrimination) with functi...
research
10/01/2020

Universal consistency and rates of convergence of multiclass prototype algorithms in metric spaces

We study universal consistency and convergence rates of simple nearest-n...
research
11/29/2015

k-Nearest Neighbour Classification of Datasets with a Family of Distances

The k-nearest neighbour (k-NN) classifier is one of the oldest and most ...
research
07/25/2018

Topics in Random Matrices and Statistical Machine Learning

This thesis consists of two independent parts: random matrices, which fo...

Please sign up or login with your details

Forgot password? Click here to reset