Multilevel Weighted Support Vector Machine for Classification on Healthcare Data with Missing Values

04/07/2016
by   Talayeh Razzaghi, et al.
0

This work is motivated by the needs of predictive analytics on healthcare data as represented by Electronic Medical Records. Such data is invariably problematic: noisy, with missing entries, with imbalance in classes of interests, leading to serious bias in predictive modeling. Since standard data mining methods often produce poor performance measures, we argue for development of specialized techniques of data-preprocessing and classification. In this paper, we propose a new method to simultaneously classify large datasets and reduce the effects of missing values. It is based on a multilevel framework of the cost-sensitive SVM and the expected maximization imputation method for missing values, which relies on iterated regression analyses. We compare classification results of multilevel SVM-based algorithms on public benchmark datasets with imbalanced classes and missing values as well as real data in health applications, and show that our multilevel SVM-based method produces fast, and more accurate and robust classification results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2015

Fast Imbalanced Classification of Healthcare Data with Missing Values

In medical domain, data features often contain missing values. This can ...
research
03/01/2018

Interval-based Prediction Uncertainty Bound Computation in Learning with Missing Values

The problem of machine learning with missing values is common in many ar...
research
10/19/2021

Multilevel Stochastic Optimization for Imputation in Massive Medical Data Records

Exploration and analysis of massive datasets has recently generated incr...
research
07/24/2017

Engineering multilevel support vector machines

The computational complexity of solving nonlinear support vector machine...
research
10/22/2018

MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare

Deep learning models exhibit state-of-the-art performance for many predi...
research
09/12/2023

Missing Data Imputation and Multilevel Conditional Autoregressive Modeling of Spatial End-Stage Renal Disease Incidence

End-stage renal disease has many adverse complications associated with i...
research
04/22/2021

MeSIN: Multilevel Selective and Interactive Network for Medication Recommendation

Recommending medications for patients using electronic health records (E...

Please sign up or login with your details

Forgot password? Click here to reset