Optimal Projections in the Distance-Based Statistical Methods

11/07/2019
by   Chuanping Yu, et al.
0

This paper introduces a new way to calculate distance-based statistics, particularly when the data are multivariate. The main idea is to pre-calculate the optimal projection directions given the variable dimension, and to project multidimensional variables onto these pre-specified projection directions; by subsequently utilizing the fast algorithm that is developed in Huo and Székely [2016] for the univariate variables, the computational complexity can be improved from O(m^2) to O(n m ·(m)), where n is the number of projection directions and m is the sample size. When n ≪ m/log(m), computational savings can be achieved. The key challenge is how to find the optimal pre-specified projection directions. This can be obtained by minimizing the worse-case difference between the true distance and the approximated distance, which can be formulated as a nonconvex optimization problem in a general setting. In this paper, we show that the exact solution of the nonconvex optimization problem can be derived in two special cases: the dimension of the data is equal to either 2 or the number of projection directions. In the generic settings, we propose an algorithm to find some approximate solutions. Simulations confirm the advantage of our method, in comparison with the pure Monte Carlo approach, in which the directions are randomly selected rather than pre-calculated.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/15/2020

Approximate computation of projection depths

Data depth is a concept in multivariate statistics that measures the cen...
research
10/14/2019

Uniform convergence rates for the approximated halfspace and projection depth

The computational complexity of some depths that satisfy the projection ...
research
02/24/2023

Wasserstein Projection Pursuit of Non-Gaussian Signals

We consider the general dimensionality reduction problem of locating in ...
research
05/07/2015

Fast Spectral Unmixing based on Dykstra's Alternating Projection

This paper presents a fast spectral unmixing algorithm based on Dykstra'...
research
07/13/2018

Non-Gaussian Component Analysis using Entropy Methods

Non-Gaussian component analysis (NGCA) is a problem in multidimensional ...
research
04/16/2016

Evolutionary Projection Selection for Radon Barcodes

Recently, Radon transformation has been used to generate barcodes for ta...
research
03/24/2003

Differential Methods in Catadioptric Sensor Design with Applications to Panoramic Imaging

We discuss design techniques for catadioptric sensors that realize given...

Please sign up or login with your details

Forgot password? Click here to reset