Classification of Major Depressive Disorder via Multi-Site Weighted LASSO Model

by   Dajiang Zhu, et al.

Large-scale collaborative analysis of brain imaging data, in psychiatry and neu-rology, offers a new source of statistical power to discover features that boost ac-curacy in disease classification, differential diagnosis, and outcome prediction. However, due to data privacy regulations or limited accessibility to large datasets across the world, it is challenging to efficiently integrate distributed information. Here we propose a novel classification framework through multi-site weighted LASSO: each site performs an iterative weighted LASSO for feature selection separately. Within each iteration, the classification result and the selected features are collected to update the weighting parameters for each feature. This new weight is used to guide the LASSO process at the next iteration. Only the fea-tures that help to improve the classification accuracy are preserved. In tests on da-ta from five sites (299 patients with major depressive disorder (MDD) and 258 normal controls), our method boosted classification accuracy for MDD by 4.9 result shows the potential of the proposed new strategy as an ef-fective and practical collaborative platform for machine learning on large scale distributed imaging and biobank data.



There are no comments yet.


page 1

page 2

page 3

page 4


Large-scale Feature Selection of Risk Genetic Factors for Alzheimer's Disease via Distributed Group Lasso Regression

Genome-wide association studies (GWAS) have achieved great success in th...

Embedding Feature Selection for Large-scale Hierarchical Classification

Large-scale Hierarchical Classification (HC) involves datasets consistin...

Sparse Network Modeling

There have been many attempts to identify high-dimensional network featu...

Stable Feature Selection from Brain sMRI

Neuroimage analysis usually involves learning thousands or even millions...

Embracing the Disharmony in Heterogeneous Medical Data

Heterogeneity in medical imaging data is often tackled, in the context o...

Machine-Learning-Driven New Geologic Discoveries at Mars Rover Landing Sites: Jezero and NE Syrtis

A hierarchical Bayesian classifier is trained at pixel scale with spectr...

Pseudo-domains in imaging data improve prediction of future disease status in multi-center studies

In multi-center randomized clinical trials imaging data can be diverse d...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.