Robust Inference for Federated Meta-Learning

01/02/2023
by   Zijian Guo, et al.
0

Synthesizing information from multiple data sources is critical to ensure knowledge generalizability. Integrative analysis of multi-source data is challenging due to the heterogeneity across sources and data-sharing constraints due to privacy concerns. In this paper, we consider a general robust inference framework for federated meta-learning of data from multiple sites, enabling statistical inference for the prevailing model, defined as the one matching the majority of the sites. Statistical inference for the prevailing model is challenging since it requires a data-adaptive mechanism to select eligible sites and subsequently account for the selection uncertainty. We propose a novel sampling method to address the additional variation arising from the selection. Our devised CI construction does not require sites to share individual-level data and is shown to be valid without requiring the selection of eligible sites to be error-free. The proposed robust inference for federated meta-learning (RIFL) methodology is broadly applicable and illustrated with three inference problems: aggregation of parametric models, high-dimensional prediction models, and inference for average treatment effects. We use RIFL to perform federated learning of mortality risk for patients hospitalized with COVID-19 using real-world EHR data from 16 healthcare centers representing 275 hospitals across four countries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2019

Differential Privacy-enabled Federated Learning for Sensitive Health Data

Leveraging real-world health data for machine learning tasks requires ad...
research
03/10/2021

A Tree-based Federated Learning Approach for Personalized Treatment Effect Estimation from Heterogeneous Data Sources

Federated learning is an appealing framework for analyzing sensitive dat...
research
01/30/2021

On Data Efficiency of Meta-learning

Meta-learning has enabled learning statistical models that can be quickl...
research
02/16/2021

Scaling Neuroscience Research using Federated Learning

The amount of biomedical data continues to grow rapidly. However, the ab...
research
04/02/2022

Collaborative causal inference with a distributed data-sharing management

Data sharing barriers are paramount challenges arising from multicenter ...
research
01/31/2023

Distributed sequential federated learning

The analysis of data stored in multiple sites has become more popular, r...
research
02/07/2020

Meta-Analysis of Generalized Additive Models in Neuroimaging Studies

Analyzing data from multiple neuroimaging studies has great potential in...

Please sign up or login with your details

Forgot password? Click here to reset