Consistency of archetypal analysis

10/16/2020
by   Braxton Osting, et al.
0

Archetypal analysis is an unsupervised learning method that uses a convex polytope to summarize multivariate data. For fixed k, the method finds a convex polytope with k vertices, called archetype points, such that the polytope is contained in the convex hull of the data and the mean squared distance between the data and the polytope is minimal. In this paper, we prove a consistency result that shows if the data is independently sampled from a probability measure with bounded support, then the archetype points converge to a solution of the continuum version of the problem, of which we identify and establish several properties. We also obtain the convergence rate of the optimal objective values under appropriate assumptions on the distribution. If the data is independently sampled from a distribution with unbounded support, we also prove a consistency result for a modified method that penalizes the dispersion of the archetype points. Our analysis is supported by detailed computational experiments of the archetype points for data sampled from the uniform distribution in a disk, the normal distribution, an annular distribution, and a Gaussian mixture model.

READ FULL TEXT

page 25

page 26

research
10/25/2022

Wasserstein Archetypal Analysis

Archetypal analysis is an unsupervised machine learning method that summ...
research
11/20/2018

Convergence rate of optimal quantization grids and application to empirical measure

We study the convergence rate of optimal quantization for a probability ...
research
04/05/2021

Which Sampling Densities are Suitable for Spectral Clustering on Unbounded Domains?

We consider a random geometric graph with vertices sampled from a probab...
research
04/26/2022

Convergence of neural networks to Gaussian mixture distribution

We give a proof that, under relatively mild conditions, fully-connected ...
research
08/18/2017

Consistency of Dirichlet Partitions

A Dirichlet k-partition of a domain U ⊆R^d is a collection of k pairwise...
research
09/01/2019

Gaussian mixture model decomposition of multivariate signals

We propose a greedy variational method for decomposing a non-negative mu...
research
08/12/2021

Probabilistic methods for approximate archetypal analysis

Archetypal analysis is an unsupervised learning method for exploratory d...

Please sign up or login with your details

Forgot password? Click here to reset