Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets

03/29/2022
by   Vishnu Suresh Lokhande, et al.
0

Pooling multiple neuroimaging datasets across institutions often enables improvements in statistical power when evaluating associations (e.g., between risk factors and disease outcomes) that may otherwise be too weak to detect. When there is only a single source of variability (e.g., different scanners), domain adaptation and matching the distributions of representations may suffice in many scenarios. But in the presence of more than one nuisance variable which concurrently influence the measurements, pooling datasets poses unique challenges, e.g., variations in the data can come from both the acquisition method as well as the demographics of participants (gender, age). Invariant representation learning, by itself, is ill-suited to fully model the data generation process. In this paper, we show how bringing recent results on equivariant representation learning (for studying symmetries in neural networks) instantiated on structured spaces together with simple use of classical results on causal inference provides an effective practical solution. In particular, we demonstrate how our model allows dealing with more than one nuisance variable under some assumptions and can enable analysis of pooled scientific datasets in scenarios that would otherwise entail removing a large portion of the samples.

READ FULL TEXT

page 2

page 7

page 12

research
06/05/2021

Conditional Contrastive Learning: Removing Undesirable Information in Self-Supervised Representations

Self-supervised learning is a form of unsupervised learning that leverag...
research
07/05/2017

Wasserstein Distance Guided Representation Learning for Domain Adaptation

Domain adaptation aims at generalizing a high-performance learner on a t...
research
09/18/2020

Chemical Property Prediction Under Experimental Biases

The ability to predict the chemical properties of compounds is crucial i...
research
07/02/2023

Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity

This paper presents a novel approach that leverages domain variability t...
research
12/15/2016

A Survey of Inductive Biases for Factorial Representation-Learning

With the resurgence of interest in neural networks, representation learn...
research
09/02/2017

When can Multi-Site Datasets be Pooled for Regression? Hypothesis Tests, ℓ_2-consistency and Neuroscience Applications

Many studies in biomedical and health sciences involve small sample size...
research
12/14/2021

Measuring Equity: Funnel Representation Measurement

We present a methodology to measure the gender representation for online...

Please sign up or login with your details

Forgot password? Click here to reset