Multiple factor analysis of distributional data

04/19/2018
by   Rosanna Verde, et al.
0

In the framework of Symbolic Data Analysis (SDA), distribution-variables are a particular case of multi-valued variables: each unit is represented by a set of distributions (e.g. histograms, density functions or quantile functions), one for each variable. Factor analysis (FA) methods are primary exploratory tools for dimension reduction and visualization. In the present work, we use Multiple Factor Analysis (MFA) approach for the analysis of data described by distributional variables. Each distributional variable induces a set new numeric variable related to the quantiles of each distribution. We call these new variables as quantile variables and the set of quantile variables related to a distributional one is a block in the MFA approach. Thus, MFA is performed on juxtaposed tables of quantile variables. We show that the criterion decomposed in the analysis is an approximation of the variability based on a suitable metrics between distributions: the squared L_2 Wasserstein distance. Applications on simulated and real distributional data corroborate the method. The interpretation of the results on the factorial planes is performed by new interpretative tools that are related to the several characteristics of the distributions (location, scale and shape).

READ FULL TEXT
research
10/14/2020

Discriminant Analysis of Distributional Data viaFractional Programming

We address classification of distributional data, where units are descri...
research
05/02/2016

Fuzzy clustering of distribution-valued data using adaptive L2 Wasserstein distances

Distributional (or distribution-valued) data are a new type of data aris...
research
05/26/2023

Distributional Reinforcement Learning with Dual Expectile-Quantile Regression

Successful applications of distributional reinforcement learning with qu...
research
10/11/2019

Analytical Quantile Solution for the S-distribution, Random Number Generation and Statistical Data Modeling

The selection of a specific statistical distribution is seldom a simple ...
research
02/22/2021

Distributional data analysis via quantile functions and its application to modelling digital biomarkers of gait in Alzheimer's Disease

With the advent of continuous health monitoring via wearable devices, us...
research
11/09/2020

An Embedded Model Estimator for Non-Stationary Random Functions using Multiple Secondary Variables

An algorithm for non-stationary spatial modelling using multiple seconda...
research
10/14/2011

Bayesian Group Factor Analysis

We introduce a factor analysis model that summarizes the dependencies be...

Please sign up or login with your details

Forgot password? Click here to reset