Investigation of robustness and numerical stability of multiple regression and PCA in modeling world development data

07/30/2022
by   Chen Ye Gan, et al.
0

Popular methods for modeling data both labelled and unlabeled, multiple regression and PCA has been used in research for a vast number of datasets. In this investigation, we attempt to push the limits of these two methods by running a fit on world development data, a set notorious for its complexity and high dimensionality. We assess the robustness and numerical stability of both methods using their matrix condition number and ability to capture variance in the dataset. The result indicates poor performance from both methods from a numerical standpoint, yet certain qualitative insights can still be captured.

READ FULL TEXT
research
04/07/2018

Principal Component Analysis: A Natural Approach to Data Exploration

Principal component analysis (PCA) is often used for analysing data in t...
research
10/25/2017

DPCA: Dimensionality Reduction for Discriminative Analytics of Multiple Large-Scale Datasets

Principal component analysis (PCA) has well-documented merits for data e...
research
02/22/2023

On the efficiency-loss free ordering-robustness of product-PCA

This article studies the robustness of the eigenvalue ordering, an impor...
research
08/07/2023

Numerical stability analysis of shock-capturing methods for strong shocks II: high-order finite-volume schemes

The shock instability problem commonly arises in flow simulations involv...
research
12/23/2019

2DR1-PCA and 2DL1-PCA: two variant 2DPCA algorithms based on none L2 norm

In this paper, two novel methods: 2DR1-PCA and 2DL1-PCA are proposed for...
research
09/02/2019

Clustering of count data through a mixture of multinomial PCA

Count data is becoming more and more ubiquitous in a wide range of appli...

Please sign up or login with your details

Forgot password? Click here to reset