The Flag Median and FlagIRLS

03/08/2022
by   Nathan Mankovich, et al.

Finding prototypes (e.g., mean and median) for a dataset is central to a number of common machine learning algorithms. Subspaces have been shown to provide useful, robust representations for datasets of images, videos and more. Since subspaces correspond to points on a Grassmann manifold, one is led to consider the idea of a subspace prototype for a Grassmann-valued dataset. A number of different subspace prototypes have been described, but some are computationally expensive to calculate, while others are affected by outliers and produce highly imperfect clusterings on noisy data. This work proposes a new subspace prototype, the flag median, and introduces the FlagIRLS algorithm for its calculation. We provide evidence that the flag median is robust to outliers and can be used effectively in algorithms like Linde-Buzo-Gray (LBG) to produce improved clusterings on Grassmannians. Numerical experiments include a synthetic dataset, the MNIST handwritten digits dataset, the Mind's Eye video dataset and the UCF YouTube action dataset. The flag median is compared to the other leading prototypes on the Grassmannian, namely the ℓ_2-median and the flag mean. We find that FlagIRLS converges to the flag median in 4 iterations on a synthetic dataset. We also see that, on the Mind's Eye dataset, Grassmannian LBG with a codebook size of 20 using the flag median produces at least a 10% improvement in cluster purity over Grassmannian LBG using the flag mean or the ℓ_2-median.
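To make the idea of a median-type subspace prototype concrete, here is a minimal sketch of an iteratively reweighted least squares (IRLS) loop on the Grassmannian. It is an illustration under stated assumptions, not the FlagIRLS algorithm as specified in the full text: it assumes the objective is the sum of chordal distances to the data subspaces, that each subspace is given as a matrix with orthonormal columns, and that each weighted subproblem is solved by a truncated SVD. The function names, the iteration count and the `eps` safeguard are all hypothetical choices.

```python
import numpy as np


def chordal_distance(X, Y):
    """Chordal distance between span(X) and span(Y).

    X and Y are assumed to have orthonormal columns. The squared
    Frobenius norm of X^T Y is the sum of squared cosines of the
    principal angles between the two subspaces.
    """
    k = min(X.shape[1], Y.shape[1])
    cos_sq = np.linalg.norm(X.T @ Y, "fro") ** 2
    return np.sqrt(max(k - cos_sq, 0.0))


def weighted_mean_subspace(subspaces, weights, k):
    """Top-k left singular vectors of the weighted, stacked bases.

    This maximizes sum_i w_i * ||Y^T X_i||_F^2 over orthonormal Y,
    i.e. it minimizes the weighted sum of squared chordal distances.
    """
    stacked = np.hstack([np.sqrt(w) * X for X, w in zip(subspaces, weights)])
    U, _, _ = np.linalg.svd(stacked, full_matrices=False)
    return U[:, :k]


def irls_median_sketch(subspaces, k, n_iters=20, eps=1e-8):
    """IRLS sketch for a median-type prototype (assumed objective:
    sum of chordal distances). Starts from the unweighted mean."""
    Y = weighted_mean_subspace(subspaces, np.ones(len(subspaces)), k)
    for _ in range(n_iters):
        dists = np.array([chordal_distance(X, Y) for X in subspaces])
        # Reweight: far-away subspaces (likely outliers) get small weights.
        weights = 1.0 / np.maximum(dists, eps)
        Y = weighted_mean_subspace(subspaces, weights, k)
    return Y


def total_distance(subspaces, Y):
    """Sum of chordal distances from Y to every data subspace."""
    return sum(chordal_distance(X, Y) for X in subspaces)
```

The reweighting step is what gives a median its robustness in this sketch: a subspace far from the current estimate contributes with weight inversely proportional to its distance, so it pulls the next estimate less than it would in an unweighted mean.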

