Principal component analysis in Bayes spaces for sparsely sampled density functions

09/20/2023
by   Lisa Steyer, et al.
This paper presents a novel approach to functional principal component analysis (FPCA) in Bayes spaces for the setting where densities are the objects of analysis, but only a few individual samples from each density are observed. We use the observed data directly to account for all sources of uncertainty, instead of relying on prior estimation of the underlying densities in a two-step approach, which can be inaccurate if only small or heterogeneous numbers of samples per density are available. To account for the constrained nature of densities, we base our approach on Bayes spaces, which extend the Aitchison geometry for compositional data to density functions. For modeling, we exploit the isometric isomorphism, given by the centered log-ratio transformation, between the Bayes space and 𝕃_0^2, the subspace of 𝕃^2 whose elements integrate to zero. As only discrete draws from each density are observed, we treat the underlying functional densities as latent variables within a maximum likelihood framework and employ a Monte Carlo Expectation Maximization (MCEM) algorithm for model estimation. The resulting estimates are useful for exploratory analyses of density data, for dimension reduction in subsequent analyses, and for improved preprocessing of sparsely sampled density data compared to existing methods. The proposed method is applied to analyze the distribution of maximum daily temperatures in Berlin during the summer months over the last 70 years, as well as the distribution of rental prices in the districts of Munich.
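To make the centered log-ratio (clr) transformation mentioned in the abstract concrete, the following is a minimal numerical sketch, not the authors' implementation. It assumes a strictly positive density on a compact interval with the Lebesgue reference measure, discretized on a grid and integrated with the trapezoidal rule; the function names `trapezoid`, `clr`, and `clr_inverse` and the example density are hypothetical choices made for illustration.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoidal-rule integral of the values y over the grid x."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)

def clr(density, grid):
    """Centered log-ratio transform: log f minus the mean of log f over the interval.

    Assumes `density` is strictly positive on `grid` (illustrative assumption).
    """
    log_f = np.log(density)
    length = grid[-1] - grid[0]
    return log_f - trapezoid(log_f, grid) / length

def clr_inverse(h, grid):
    """Map a function back to a density: exponentiate, then normalize to integrate to one."""
    unnormalized = np.exp(h)
    return unnormalized / trapezoid(unnormalized, grid)

# Hypothetical example density: a normal-shaped bump restricted to [0, 1].
grid = np.linspace(0.0, 1.0, 501)
bump = np.exp(-0.5 * ((grid - 0.4) / 0.2) ** 2)
f = bump / trapezoid(bump, grid)

h = clr(f, grid)
clr_integral = trapezoid(h, grid)   # numerically zero: h satisfies the integration-to-zero constraint
f_back = clr_inverse(h, grid)       # recovers f up to floating-point error
```

In this sketch the transformed function `h` is unconstrained apart from integrating to zero, which is why ordinary 𝕃^2 tools such as principal component analysis can be applied to it before mapping results back to densities with `clr_inverse`.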

research
12/17/2019

Changing reference measure in Bayes spaces with applications to functional data analysis

Probability density functions (PDFs) can be understood as continuous com...
research
09/10/2021

Principal component analysis for high-dimensional compositional data

Dimension reduction for high-dimensional compositional data plays an imp...
research
11/09/2022

Spline Estimation of Functional Principal Components via Manifold Conjugate Gradient Algorithm

Functional principal component analysis has become the most important di...
research
10/24/2010

Local Component Analysis for Nonparametric Bayes Classifier

The decision boundaries of Bayes classifier are optimal because they lea...
research
04/04/2014

Understanding Machine-learned Density Functionals

Kernel ridge regression is used to approximate the kinetic energy of non...
research
03/05/2021

Density ratio model with data-adaptive basis function

In many applications, we collect independent samples from interconnected...
research
12/22/2011

Finding Density Functionals with Machine Learning

Machine learning is used to approximate density functionals. For the mod...
