Heavy-Tailed Analogues of the Covariance Matrix for ICA

02/22/2017
by   Joseph Anderson, et al.
0

Independent Component Analysis (ICA) is the problem of learning a square matrix A, given samples of X=AS, where S is a random vector with independent coordinates. Most existing algorithms are provably efficient only when each S_i has finite and moderately valued fourth moment. However, there are practical applications where this assumption need not be true, such as speech and finance. Algorithms have been proposed for heavy-tailed ICA, but they are not practical, using random walks and the full power of the ellipsoid algorithm multiple times. The main contributions of this paper are: (1) A practical algorithm for heavy-tailed ICA that we call HTICA. We provide theoretical guarantees and show that it outperforms other algorithms in some heavy-tailed regimes, both on real and synthetic data. Like the current state-of-the-art, the new algorithm is based on the centroid body (a first moment analogue of the covariance matrix). Unlike the state-of-the-art, our algorithm is practically efficient. To achieve this, we use explicit analytic representations of the centroid body, which bypasses the use of the ellipsoid method and random walks. (2) We study how heavy tails affect different ICA algorithms, including HTICA. Somewhat surprisingly, we show that some algorithms that use the covariance matrix or higher moments can successfully solve a range of ICA instances with infinite second moment. We study this theoretically and experimentally, with both synthetic and real-world heavy-tailed data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/02/2015

Heavy-tailed Independent Component Analysis

Independent component analysis (ICA) is the problem of efficiently recov...
research
11/05/2018

User-Friendly Covariance Estimation for Heavy-Tailed Distributions

We propose user-friendly covariance matrix estimators that are robust ag...
research
04/11/2020

Covariance Estimation for Matrix-valued Data

Covariance estimation for matrix-valued data has received an increasing ...
research
02/14/2018

ICA based on Split Generalized Gaussian

Independent Component Analysis (ICA) - one of the basic tools in data an...
research
04/15/2018

Approximating the covariance ellipsoid

We explore ways in which the covariance ellipsoid B={v ∈R^d : E <X,v>^2...
research
11/05/2018

User-Friendly Covariance Estimation for Heavy-Tailed Distributions: A Survey and Recent Results

We offer a survey of selected recent results on covariance estimation fo...
research
08/13/2019

L2P: An Algorithm for Estimating Heavy-tailed Outcomes

Many real-world prediction tasks have outcome (a.k.a. target or response...

Please sign up or login with your details

Forgot password? Click here to reset