Empowering individual trait prediction using interactions

01/25/2019
by   Damian Gola, et al.
0

One component of precision medicine is to construct prediction models with their predictive ability as high as possible, e.g. to enable individual risk prediction. In genetic epidemiology, complex diseases have a polygenic basis and a common assumption is that biological and genetic features affect the outcome under consideration via interactions. In the case of omics data, the use of standard approaches such as generalized linear models may be suboptimal and machine learning methods are appealing to make individual predictions. However, most of these algorithms focus mostly on main or marginal effects of the single features in a dataset. On the other hand, the detection of interacting features is an active area of research in the realm of genetic epidemiology. One big class of algorithms to detect interacting features is based on the multifactor dimensionality reduction (MDR). Here, we extend the model-based MDR (MB-MDR), a powerful extension of the original MDR algorithm, to enable interaction empowered individual prediction. Using a comprehensive simulation study we show that our new algorithm can use information hidden in interactions more efficiently than two other state-of-the-art algorithms, namely the Random Forest and Elastic Net, and clearly outperforms these if interactions are present. The performance of these algorithms is comparable if no interactions are present. Further, we show that our new algorithm is applicable to real data by comparing the performance of the three algorithms on a dataset of rheumatoid arthritis cases and healthy controls. As our new algorithm is not only applicable to biological/genetic data but to all datasets with discrete features, it may have practical implications in other applications as well, and we made our method available as an R package.

READ FULL TEXT
research
11/19/2021

SNPs Filtered by Allele Frequency Improve the Prediction of Hypertension Subtypes

Hypertension is the leading global cause of cardiovascular disease and p...
research
09/02/2016

A case study of algorithm selection for the traveling thief problem

Many real-world problems are composed of several interacting components....
research
02/16/2018

WHInter: A Working set algorithm for High-dimensional sparse second order Interaction models

Learning sparse linear models with two-way interactions is desirable in ...
research
11/26/2019

Random Forest as a Tumour Genetic Marker Extractor

Finding tumour genetic markers is essential to biomedicine due to their ...
research
08/29/2023

STEC: See-Through Transformer-based Encoder for CTR Prediction

Click-Through Rate (CTR) prediction holds a pivotal place in online adve...
research
10/16/2018

Refining interaction search through signed iterative Random Forests

Advances in supervised learning have enabled accurate prediction in biol...
research
12/20/2017

On the Relation of External and Internal Feature Interactions: A Case Study

Detecting feature interactions is imperative for accurately predicting p...

Please sign up or login with your details

Forgot password? Click here to reset