Prediction approaches for partly missing multi-omics covariate data: A literature review and an empirical comparison study

02/08/2023
by Roman Hornung, et al.

As the availability of omics data has increased in recent years, more multi-omics data have been generated, that is, high-dimensional molecular data consisting of several types, such as genomic, transcriptomic, or proteomic data, all obtained from the same patients. Such data lend themselves to being used as covariates in automatic outcome prediction because each omics type may contribute unique information, possibly improving predictions compared to using only one omics data type. Frequently, however, not all omics data types are available for all patients, both in the training data and in the test data, that is, the data to which the automatic prediction rules are applied. We refer to this type of data as block-wise missing multi-omics data. First, we provide a literature review of existing prediction methods applicable to such data. Subsequently, using a collection of 13 publicly available multi-omics data sets, we compare the predictive performance of several of these approaches for different block-wise missingness patterns. Finally, we discuss the results of this empirical comparison study and draw some tentative conclusions.
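
To make the notion of block-wise missingness concrete, the following is a minimal sketch on made-up data. The block names, block sizes, missingness pattern, and the helper `block_availability` are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

# Hypothetical toy layout (an assumption for illustration):
# 6 patients, 3 omics blocks with 4, 3, and 2 features.
blocks = {
    "genomic": slice(0, 4),
    "transcriptomic": slice(4, 7),
    "proteomic": slice(7, 9),
}

rng = np.random.default_rng(0)
X = rng.normal(size=(6, 9))

# Block-wise missingness: an entire omics type is absent for a patient,
# rather than single scattered values.
X[0:2, blocks["proteomic"]] = np.nan
X[4:6, blocks["transcriptomic"]] = np.nan


def block_availability(X, blocks):
    """Return a (patients x blocks) boolean matrix.

    An entry is True if the patient has no missing value in that block.
    """
    return np.column_stack(
        [~np.isnan(X[:, cols]).any(axis=1) for cols in blocks.values()]
    )


avail = block_availability(X, blocks)

# Patients for whom every omics block is observed.
complete_cases = np.flatnonzero(avail.all(axis=1))
```

In this toy pattern only the patients in `complete_cases` have all three blocks. A complete-case analysis would train on those patients alone and discard the partially observed ones.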

research
08/07/2018

Generalized Integrative Principal Component Analysis for Multi-Type Data with Block-Wise Missing Structure

High-dimensional multi-source data are encountered in many fields. Despi...
research
12/31/2019

Prediction in the Presence of Missing Covariates

In many applied fields incomplete covariate vectors are commonly encount...
research
03/07/2020

Large-scale benchmark study of survival prediction methods using multi-omics data

Multi-omics data, that is, datasets containing different types of high-d...
research
03/03/2022

Doubly Robust Calibration of Prediction Sets under Covariate Shift

Conformal prediction has received tremendous attention in recent years a...
research
05/27/2023

Automatic Roof Type Classification Through Machine Learning for Regional Wind Risk Assessment

Roof type is one of the most critical building characteristics for wind ...
research
06/29/2020

GLYFE: Review and Benchmark of Personalized Glucose Predictive Models in Type-1 Diabetes

Due to the sensitive nature of diabetes-related data, preventing them fr...
research
05/24/2017

An experimental study of graph-based semi-supervised classification with additional node information

The volume of data generated by internet and social networks is increasi...
