Correlated Components Analysis --- Extracting Reliable Dimensions in Multivariate Data

01/26/2018
by   Lucas C. Parra, et al.
0

How does one find data dimensions that are reliably expressed across repetitions? For example, in neuroscience one may want to identify combinations of brain signals that are reliably activated across multiple trials or subjects. For a clinical assessment with multiple ratings, one may want to identify an aggregate score that is reliably reproduced across raters. The approach proposed here --- "correlated components analysis" --- is to identify components that maximally correlate between repetitions (e.g. trials, subjects, raters). This can be expressed as the maximization of the ratio of between-repetition to within-repetition covariance, resulting in a generalized eigenvalue problem. We show that covariances can be computed efficiently without explicitly considering all pairs of repetitions, that the result is equivalent to multi-class linear discriminant analysis for unbiased signals, and that the approach also maximize reliability, defined as the mean divided by the deviation across repetitions. We also extend the method to non-linear components using kernels, discuss regularization to improve numerical stability, present parametric and non-parametric tests to establish statistical significance, and provide code.

READ FULL TEXT

page 18

page 23

research
10/28/2015

Universal Dependency Analysis

Most data is multi-dimensional. Discovering whether any subset of dimens...
research
06/27/2023

Multivariate Rank-Based Analysis of Multiple Endpoints in Clinical Trials: A Global Test Approach

Clinical trials often involve the assessment of multiple endpoints to co...
research
07/14/2021

Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests

While previous distribution shift detection approaches can identify if a...
research
12/21/2017

Robust Detection of Covariate-Treatment Interactions in Clinical Trials

Detection of interactions between treatment effects and patient descript...
research
01/12/2020

Notes on Exact Power Calculations for t Tests and Analysis of Covariance

Tang derived the exact power formulae for t tests and analysis of covari...
research
07/29/2022

Hypothesis tests for multiple responses regression: effect of probiotics on addiction and binge eating disorder

Clinical trials are common in medical research where multiple non-Gaussi...
research
01/12/2021

Statistical analysis of periodic data in neuroscience

Many experimental paradigms in neuroscience involve driving the nervous ...

Please sign up or login with your details

Forgot password? Click here to reset