Communication-Efficient Integrative Regression in High-Dimensions

12/26/2019
by   Subha Maity, et al.
0

We consider the task of meta-analysis in high-dimensional settings in which the data sources we wish to integrate are similar but non-identical. To borrow strength across such heterogeneous data sources, we introduce a global parameter that addresses several identification issues. We also propose a one-shot estimator of the global parameter that preserves the anonymity of the data sources and converges at a rate that depends on the size of the combined dataset. Finally, we demonstrate the benefits of our approach on a large-scale drug treatment dataset involving several different cancer cell lines.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

05/21/2020

Large-scale comparison of bibliographic data sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic

We present a large-scale comparison of five multidisciplinary bibliograp...
08/28/2021

A robust fusion-extraction procedure with summary statistics in the presence of biased sources

Information from various data sources is increasingly available nowadays...
09/16/2018

Semantic Interoperability Middleware Architecture for Heterogeneous Environmental Data Sources

Data heterogeneity hampers the effort to integrate and infer knowledge f...
11/30/2020

Joint integrative analysis of multiple data sources with correlated vector outcomes

We propose a distributed quadratic inference function framework to joint...
04/29/2019

Competitive Statistical Estimation with Strategic Data Sources

In recent years, data has played an increasingly important role in the e...
03/31/2021

DIVERSE: bayesian Data IntegratiVE learning for precise drug ResponSE prediction

Detecting predictive biomarkers from multi-omics data is important for p...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.