Embracing the Disharmony in Heterogeneous Medical Data

by   Rongguang Wang, et al.

Heterogeneity in medical imaging data is often tackled, in the context of machine learning, using domain invariance, i.e. deriving models that are robust to domain shifts, which can be both within domain (e.g. demographics) and across domains (e.g. scanner/protocol characteristics). However this approach can be detrimental to performance because it necessitates averaging across intra-class variability and reduces discriminatory power of learned models, in order to achieve better intra- and inter-domain generalization. This paper instead embraces the heterogeneity and treats it as a multi-task learning problem to explicitly adapt trained classifiers to both inter-site and intra-site heterogeneity. We demonstrate that the error of a base classifier on challenging 3D brain magnetic resonance imaging (MRI) datasets can be reduced by 2-3 times, in certain tasks, by adapting to the specific demographics of the patients, and different acquisition protocols. Learning the characteristics of domain shifts is achieved via auxiliary learning tasks leveraging commonly available data and variables, e.g. demographics. In our experiments, we use gender classification and age regression as auxiliary tasks helping the network weights trained on a source site adapt to data from a target site; we show that this approach improves classification accuracy by 5-30 datasets on the main classification tasks, e.g. disease classification.



page 1

page 2

page 3

page 4


Harmonization with Flow-based Causal Inference

Heterogeneity in medical data, e.g., from data collected at different si...

Deriving reproducible biomarkers from multi-site resting-state data: An Autism-based example

Resting-state functional Magnetic Resonance Imaging (R-fMRI) holds the p...

Image-level Harmonization of Multi-Site Data using Image-and-Spatial Transformer Networks

We investigate the use of image-and-spatial transformer networks (ISTNs)...

Medical Image Harmonization Using Deep Learning Based Canonical Mapping: Toward Robust and Generalizable Learning in Imaging

Conventional and deep learning-based methods have shown great potential ...

Hierarchical Bayesian Regression for Multi-Site Normative Modeling of Neuroimaging Data

Clinical neuroimaging has recently witnessed explosive growth in data av...

Machine Learning Models Are Not Necessarily Biased When Constructed Properly: Evidence from Neuroimaging Studies

Despite the great promise that machine learning has offered in many fiel...

Large-Margin Classification with Multiple Decision Rules

Binary classification is a common statistical learning problem in which ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.