Manifold Fitting: An Invitation to Statistics

04/16/2023
by   Zhigang Yao, et al.
0

While classical statistics has addressed observations that are real numbers or elements of a real vector space, at present many statistical problems of high interest in the sciences address the analysis of data that consist of more complex objects, taking values in spaces that are naturally not (Euclidean) vector spaces but which still feature some geometric structure. Manifold fitting is a long-standing problem, and has finally been addressed in recent years by Fefferman et al. (2020, 2021a). We develop a method with a theory guarantee that fits a d-dimensional underlying manifold from noisy observations sampled in the ambient space ℝ^D. The new approach uses geometric structures to obtain the manifold estimator in the form of image sets via a two-step mapping approach. We prove that, under certain mild assumptions and with a sample size N=𝒪(σ^(-d+3)), these estimators are true d-dimensional smooth manifolds whose estimation error, as measured by the Hausdorff distance, is bounded by 𝒪(σ^2log(1/σ)) with high probability. Compared with the existing approaches proposed in Fefferman et al. (2018, 2021b); Genovese et al. (2014); Yao and Xia (2019), our method exhibits superior efficiency while attaining very low error rates with a significantly reduced sample size, which scales polynomially in σ^-1 and exponentially in d. Extensive simulations are performed to validate our theoretical results. Our findings are relevant to various fields involving high-dimensional data in statistics and machine learning. Furthermore, our method opens up new avenues for existing non-Euclidean statistical methods in the sense that it has the potential to unify them to analyze data on manifolds in the ambience space domain.

READ FULL TEXT
research
10/12/2018

Topological Inference of Manifolds with Boundary

Given a set of data points sampled from some underlying space, there are...
research
09/23/2019

Manifold Fitting under Unbounded Noise

There has been an emerging trend in non-Euclidean dimension reduction of...
research
09/30/2019

Manifold Fitting in Ambient Space

Modern data sets in many applications no longer comprise samples of real...
research
01/13/2021

Multiscale regression on unknown manifolds

We consider the regression problem of estimating functions on ℝ^D but su...
research
05/11/2021

Non-Parametric Estimation of Manifolds from Noisy Data

A common observation in data-driven applications is that high dimensiona...
research
10/07/2019

Increasing Expressivity of a Hyperspherical VAE

Learning suitable latent representations for observed, high-dimensional ...
research
04/30/2019

Active Manifolds: A non-linear analogue to Active Subspaces

We present an approach to analyze C^1(R^m) functions that addresses limi...

Please sign up or login with your details

Forgot password? Click here to reset