Robust Principal Component Analysis for Compositional Tables

04/11/2019
by Julie Rendlová, et al.

A data table arranged according to two factors can often be considered a compositional table. An example is the number of unemployed people, split according to gender and age class. When such a table is analyzed as a composition, the relevant information consists of the ratios between its cells. This is particularly useful when several compositional tables are analyzed jointly and their absolute numbers lie in very different ranges, e.g. when unemployment data from different countries are considered. Within the framework of the logratio methodology, compositional tables can be decomposed into independent and interactive parts, and orthonormal coordinates can be assigned to these parts. However, these coordinates usually require some prior knowledge about the data, and they are not easy to handle when exploring the relationships between the given factors. Here we propose a special choice of coordinates with a direct relation to centered logratio (clr) coefficients, which are particularly useful for an interpretation in terms of the original cells of the tables. With these coordinates, robust principal component analysis (PCA) is performed for dimension reduction, which makes it possible to investigate the relationships between the factors. The link between orthonormal coordinates and clr coefficients makes it possible to apply robust PCA, which would otherwise suffer from the singularity of clr coefficients.
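
To make the terms in the abstract concrete, the following is a minimal sketch (not the authors' implementation) of clr coefficients for a single table and of the usual logratio split into an independent and an interaction part, written here as a double centering of the clr coefficients. The function names `clr_table` and `decompose_clr`, the example counts, and the simulated sample are assumptions made for illustration only. The last lines indicate why clr coefficients are singular: they sum to zero within each table, so their covariance matrix across tables cannot have full rank.

```python
import numpy as np

def clr_table(x):
    """Centered logratio (clr) coefficients of a strictly positive I x J table."""
    logx = np.log(np.asarray(x, dtype=float))
    # Subtracting the mean log equals dividing by the geometric mean of all cells.
    return logx - logx.mean()

def decompose_clr(x):
    """Split clr coefficients into an independent and an interaction part.

    Assumed formulation: the independent part is the sum of the row and
    column means of the clr coefficients; the interaction part is the rest.
    """
    c = clr_table(x)
    row_effect = c.mean(axis=1, keepdims=True)  # one value per row
    col_effect = c.mean(axis=0, keepdims=True)  # one value per column
    independent = row_effect + col_effect
    interaction = c - independent
    return independent, interaction

# Hypothetical 2 x 3 table (e.g. gender by age class); the counts are made up.
table = np.array([[120.0, 340.0, 90.0],
                  [150.0, 300.0, 60.0]])

clr = clr_table(table)
ind, inter = decompose_clr(table)

# Only ratios matter: rescaling the whole table leaves clr unchanged.
clr_rescaled = clr_table(10.0 * table)

# Singularity: clr coefficients of each table sum to zero, so the covariance
# matrix of vectorized clr tables has rank at most I*J - 1.
rng = np.random.default_rng(0)
sample = np.exp(rng.normal(size=(50, 2, 3)))  # 50 simulated positive tables
clr_sample = np.array([clr_table(t).ravel() for t in sample])
cov = np.cov(clr_sample, rowvar=False)
rank = np.linalg.matrix_rank(cov)
```

Because the covariance of clr coefficients is rank deficient, robust estimators that need a nonsingular covariance matrix cannot be applied to them directly, which is the motivation the abstract gives for working in orthonormal coordinates and mapping the results back to clr coefficients.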

research
07/01/2020

Independent Component Analysis for Compositional Data

Compositional data represent a specific family of multivariate data, whe...
research
12/29/2021

Compositional Data Regression in Insurance with Exponential Family PCA

Compositional data are multivariate observations that carry only relativ...
research
01/25/2022

Compositional Cubes: A New Concept for Multi-factorial Compositions

Compositional data are commonly known as multivariate observations carry...
research
06/04/2018

MacroPCA: An all-in-one PCA method allowing for missing values as well as cellwise and rowwise outliers

Multivariate data are typically represented by a rectangular matrix (tab...
research
01/23/2019

Incremental Principal Component Analysis Exact implementation and continuity corrections

This paper describes some applications of an incremental implementation ...
research
09/11/2020

TCA and TLRA: A comparison on contingency tables and compositional data

There are two popular general approaches for the analysis and visualizat...
