Extending Models Via Gradient Boosting: An Application to Mendelian Models

05/13/2021
by   Theodore Huang, et al.
13

Improving existing widely-adopted prediction models is often a more efficient and robust way towards progress than training new models from scratch. Existing models may (a) incorporate complex mechanistic knowledge, (b) leverage proprietary information and, (c) have surmounted barriers to adoption. Compared to model training, model improvement and modification receive little attention. In this paper we propose a general approach to model improvement: we combine gradient boosting with any previously developed model to improve model performance while retaining important existing characteristics. To exemplify, we consider the context of Mendelian models, which estimate the probability of carrying genetic mutations that confer susceptibility to disease by using family pedigrees and health histories of family members. Via simulations we show that integration of gradient boosting with an existing Mendelian model can produce an improved model that outperforms both that model and the model built using gradient boosting alone. We illustrate the approach on genetic testing data from the USC-Stanford Cancer Genetics Hereditary Cancer Panel (HCP) study.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2021

PanelPRO: a general framework for multi-gene, multi-cancer Mendelian risk prediction models

Risk evaluation to identify individuals who are at greater risk of cance...
research
12/27/2020

Effective Email Spam Detection System using Extreme Gradient Boosting

The popularity, cost-effectiveness and ease of information exchange that...
research
10/25/2019

Boosting heritability: estimating the genetic component of phenotypic variation with multiple sample splitting

Heritability is a central measure in genetics quantifying how much of th...
research
10/25/2020

PanelPRO: A R package for multi-syndrome, multi-gene risk modeling for individuals with a family history of cancer

Identifying individuals who are at high risk of cancer due to inherited ...
research
07/26/2019

BGADAM: Boosting based Genetic-Evolutionary ADAM for Convolutional Neural Network Optimization

Among various optimization algorithms, ADAM can achieve outstanding perf...
research
05/09/2022

Wavelet-Based Hybrid Machine Learning Model for Out-of-distribution Internet Traffic Prediction

Efficient prediction of internet traffic is essential for ensuring proac...
research
07/11/2020

Feature Interactions in XGBoost

In this paper, we investigate how feature interactions can be identified...

Please sign up or login with your details

Forgot password? Click here to reset