Federated Survival Forests

02/06/2023
by   Alberto Archetti, et al.
0

Survival analysis is a subfield of statistics concerned with modeling the occurrence time of a particular event of interest for a population. Survival analysis found widespread applications in healthcare, engineering, and social sciences. However, real-world applications involve survival datasets that are distributed, incomplete, censored, and confidential. In this context, federated learning can tremendously improve the performance of survival analysis applications. Federated learning provides a set of privacy-preserving techniques to jointly train machine learning models on multiple datasets without compromising user privacy, leading to a better generalization performance. Despite the widespread development of federated learning in recent AI research, only a few studies focus on federated survival analysis. In this work, we present a novel federated algorithm for survival analysis based on one of the most successful survival models, the random survival forest. We call the proposed method Federated Survival Forest (FedSurF). With a single communication round, FedSurF obtains a discriminative power comparable to deep-learning-based federated models trained over hundreds of federated iterations. Moreover, FedSurF retains all the advantages of random forests, namely low computational cost and natural handling of missing values and incomplete datasets. These advantages are especially desirable in real-world federated environments with multiple small datasets stored on devices with low computational capabilities. Numerical experiments compare FedSurF with state-of-the-art survival models in federated networks, showing how FedSurF outperforms deep-learning-based federated algorithms in realistic environments with non-identically distributed data.

READ FULL TEXT
research
08/04/2023

Scaling Survival Analysis in Healthcare with Federated Survival Forests: A Comparative Study on Heart Failure and Breast Cancer Genomics

Survival analysis is a fundamental tool in medicine, modeling the time u...
research
01/28/2023

Heterogeneous Datasets for Federated Survival Analysis Simulation

Survival analysis studies time-modeling techniques for an event of inter...
research
07/12/2022

FedPseudo: Pseudo value-based Deep Learning Models for Federated Survival Analysis

Survival analysis, time-to-event analysis, is an important problem in he...
research
06/16/2020

Federated Survival Analysis with Discrete-Time Cox Models

Building machine learning models from decentralized datasets located in ...
research
01/26/2022

An Efficient and Robust System for Vertically Federated Random Forest

As there is a growing interest in utilizing data across multiple resourc...
research
09/21/2020

Federated Learning for Computational Pathology on Gigapixel Whole Slide Images

Deep Learning-based computational pathology algorithms have demonstrated...
research
06/25/2021

Subgraph Federated Learning with Missing Neighbor Generation

Graphs have been widely used in data mining and machine learning due to ...

Please sign up or login with your details

Forgot password? Click here to reset