A Geometric Perspective on the Power of Principal Component Association Tests in Multiple Phenotype Studies

10/28/2017
by   Zhonghua Liu, et al.
0

Joint analysis of multiple phenotypes can increase statistical power in genetic association studies. Principal component analysis, as a popular dimension reduction method, especially when the number of phenotypes is high-dimensional, has been proposed to analyze multiple correlated phenotypes. It has been empirically observed that the first PC, which summarizes the largest amount of variance, can be less powerful than higher order PCs and other commonly used methods in detecting genetic association signals. In this paper, we investigate the properties of PCA-based multiple phenotype analysis from a geometric perspective by introducing a novel concept called principal angle. A particular PC is powerful if its principal angle is 0^o and is powerless if its principal angle is 90^o. Without prior knowledge about the true principal angle, each PC can be powerless. We propose linear, non-linear and data-adaptive omnibus tests by combining PCs. We show that the omnibus PC test is robust and powerful in a wide range of scenarios. We study the properties of the proposed methods using power analysis and eigen-analysis. The subtle differences and close connections between these combined PC methods are illustrated graphically in terms of their rejection boundaries. Our proposed tests have convex acceptance regions and hence are admissible. The p-values for the proposed tests can be efficiently calculated analytically and the proposed tests have been implemented in a publicly available R package MPAT. We conduct simulation studies in both low and high dimensional settings with various signal vectors and correlation structures. We apply the proposed tests to the joint analysis of metabolic syndrome related phenotypes with data sets collected from four international consortia to demonstrate the effectiveness of the proposed combined PC testing procedures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2020

On Robust Probabilistic Principal Component Analysis using Multivariate t-Distributions

Principal Component Analysis (PCA) is a common multivariate statistical ...
research
09/22/2022

PC Adjusted Testing for Low Dimensional Parameters

In this paper we consider the effect of high dimensional Principal Compo...
research
04/03/2022

Robust PCA for High Dimensional Data based on Characteristic Transformation

In this paper, we propose a novel robust Principal Component Analysis (P...
research
03/16/2019

Spherical Principal Component Analysis

Principal Component Analysis (PCA) is one of the most important methods ...
research
03/09/2022

High Dimensional Statistical Analysis and its Application to ALMA Map of NGC 253

In astronomy, if we denote the dimension of data as d and the number of ...
research
03/26/2022

Principal Structure Identification: Fast Disentanglement of Multi-source Dataset

Analysis of multi-source data, where data on the same objects are collec...
research
05/01/2020

Simultaneous Non-Gaussian Component Analysis (SING) for Data Integration in Neuroimaging

As advances in technology allow the acquisition of complementary informa...

Please sign up or login with your details

Forgot password? Click here to reset