Personalized Prognostic Models for Oncology: A Machine Learning Approach

06/22/2016
by   David Dooling, et al.
0

We have applied a little-known data transformation to subsets of the Surveillance, Epidemiology, and End Results (SEER) publically available data of the National Cancer Institute (NCI) to make it suitable input to standard machine learning classifiers. This transformation properly treats the right-censored data in the SEER data and the resulting Random Forest and Multi-Layer Perceptron models predict full survival curves. Treating the 6, 12, and 60 months points of the resulting survival curves as 3 binary classifiers, the 18 resulting classifiers have AUC values ranging from .765 to .885. Further evidence that the models have generalized well from the training data is provided by the extremely high levels of agreement between the random forest and neural network models predictions on the 6, 12, and 60 month binary classifiers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2017

Some variations on Random Survival Forest with application to Cancer Research

Random survival forest can be extremely time consuming for large data se...
research
01/05/2018

Tree based classification of tabla strokes

The paper attempts to validate the effectiveness of tree classifiers to ...
research
01/01/2019

A weighted random survival forest

A weighted random survival forest is presented in the paper. It can be r...
research
03/16/2020

A Numerical Transform of Random Forest Regressors corrects Systematically-Biased Predictions

Over the past decade, random forest models have become widely used as a ...
research
01/27/2021

Predicting Participation in Cancer Screening Programs with Machine Learning

In this paper, we present machine learning models based on random forest...
research
05/15/2019

Survival of the Fittest in PlayerUnknown BattleGround

The goal of this paper was to predict the placement in the multiplayer g...
research
10/13/2020

Similarity Based Stratified Splitting: an approach to train better classifiers

We propose a Similarity-Based Stratified Splitting (SBSS) technique, whi...

Please sign up or login with your details

Forgot password? Click here to reset