A Methodology for Customizing Clinical Tests for Esophageal Cancer based on Patient Preferences

10/06/2016
by   Asis Roy, et al.
0

Tests for Esophageal cancer can be expensive, uncomfortable and can have side effects. For many patients, we can predict non-existence of disease with 100 certainty, just using demographics, lifestyle, and medical history information. Our objective is to devise a general methodology for customizing tests using user preferences so that expensive or uncomfortable tests can be avoided. We propose to use classifiers trained from electronic health records (EHR) for selection of tests. The key idea is to design classifiers with 100 normal rates, possibly at the cost higher false abnormals. We compare Naive Bayes classification (NB), Random Forests (RF), Support Vector Machines (SVM) and Logistic Regression (LR), and find kernel Logistic regression to be most suitable for the task. We propose an algorithm for finding the best probability threshold for kernel LR, based on test set accuracy. Using the proposed algorithm, we describe schemes for selecting tests, which appear as features in the automatic classification algorithm, using preferences on costs and discomfort of the users. We test our methodology with EHRs collected for more than 3000 patients, as a part of project carried out by a reputed hospital in Mumbai, India. Kernel SVM and kernel LR with a polynomial kernel of degree 3, yields an accuracy of 99.8 using only clinical tests. We demonstrate our test selection algorithm using two case studies, one using cost of clinical tests, and other using "discomfort" values for clinical tests. We compute the test sets corresponding to the lowest false abnormals for each criterion described above, using exhaustive enumeration of 15 clinical tests. The sets turn out to different, substantiating our claim that one can customize test sets based on user preferences.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/03/2018

Predicting Chronic Disease Hospitalizations from Electronic Health Records: An Interpretable Classification Approach

Urban living in modern large cities has significant adverse effects on h...
research
11/28/2020

A Role for Prior Knowledge in Statistical Classification of the Transition from MCI to Alzheimer's Disease

The transition from mild cognitive impairment (MCI) to Alzheimer's disea...
research
04/09/2022

Lupus nephritis diagnosis using enhanced moth flame algorithm with support vector machines

Systemic lupus erythematosus is a chronic autoimmune disease that affect...
research
07/21/2017

A Data-Driven Approach to Pre-Operative Evaluation of Lung Cancer Patients

Lung cancer is the number one cause of cancer deaths. Many early stage l...
research
10/31/2021

Predicting Cancer Using Supervised Machine Learning: Mesothelioma

Background: Pleural Mesothelioma (PM) is an unusual, belligerent tumor t...

Please sign up or login with your details

Forgot password? Click here to reset