Towards Understanding the Survival of Patients with High-Grade Gastroenteropancreatic Neuroendocrine Neoplasms: An Investigation of Ensemble Feature Selection in the Prediction

02/20/2023
by Anna Jenul, et al.

Determining the most informative features for predicting the overall survival of patients diagnosed with high-grade gastroenteropancreatic neuroendocrine neoplasms is crucial for improving individual treatment plans and for advancing the biological understanding of the disease. Recently developed ensemble feature selectors such as the Repeated Elastic Net Technique for Feature Selection (RENT) and the User-Guided Bayesian Framework for Feature Selection (UBayFS) allow the user to identify such features in datasets with low sample sizes. While RENT is purely data-driven, UBayFS can integrate a priori expert knowledge into the feature selection process. In this work, we compare both feature selectors on a dataset comprising 63 patients and 134 features from multiple sources, including basic patient characteristics, baseline blood values, tumor histology, imaging, and treatment information. Our experiments involve data-driven and expert-driven setups, as well as combinations of both. We use findings from the clinical literature as a source of expert knowledge. Our results demonstrate that both feature selectors allow accurate predictions and that expert knowledge has a stabilizing effect on the selected feature set, while its impact on predictive performance is limited. The features WHO Performance Status, Albumin, Platelets, Ki-67, Tumor Morphology, Total MTV, Total TLG, and SUVmax are the most stable and predictive features in our study.
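
To make the idea of ensemble feature selection more concrete, the following is a minimal sketch and not the RENT or UBayFS implementation. It assumes a deliberately simple elementary selector (ranking features by absolute Pearson correlation with the target) in place of the elastic net models used by RENT, and it folds expert knowledge in as pseudo-counts, which is only loosely inspired by the prior weights in UBayFS. All function names, parameters, and the synthetic data are hypothetical and chosen for illustration.

```python
import random
from statistics import mean


def abs_correlation(xs, ys):
    """Absolute Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / (sxx * syy) ** 0.5


def ensemble_select(X, y, n_models=50, top_k=1, subsample=0.75,
                    prior=None, cutoff=0.5, seed=0):
    """Repeat a simple selector on random subsamples and keep stable features.

    Assumption: the elementary selector is a correlation ranking, a stand-in
    for the elastic net models of RENT. `prior` holds hypothetical expert
    pseudo-counts per feature (0 means no prior knowledge).
    """
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    prior = prior or [0.0] * p
    counts = [0] * p
    size = max(2, int(subsample * n))
    for _ in range(n_models):
        idx = rng.sample(range(n), size)  # subsample without replacement
        ys = [y[i] for i in idx]
        scores = [abs_correlation([X[i][j] for i in idx], ys) for j in range(p)]
        ranked = sorted(range(p), key=lambda j: scores[j], reverse=True)
        for j in ranked[:top_k]:
            counts[j] += 1
    # A prior pseudo-count a is treated as a extra models that chose the feature.
    freq = [(counts[j] + prior[j]) / (n_models + prior[j]) for j in range(p)]
    selected = [j for j in range(p) if freq[j] >= cutoff]
    return selected, freq


def make_toy_data(n=60, p=5, seed=1):
    """Synthetic data: only feature 0 drives the target; the rest are noise."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
    y = [3.0 * row[0] + 0.1 * rng.gauss(0, 1) for row in X]
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    selected, freq = ensemble_select(X, y)
    print("selected:", selected)
    print("selection frequencies:", [round(f, 2) for f in freq])
```

The selection frequency across repeated subsamples is what gives the ensemble its stability: a feature is kept only if many elementary models agree on it. Passing a non-zero `prior` entry raises the frequency of a feature that experts consider relevant, which mirrors, in a simplified way, the stabilizing role of expert knowledge described above.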


