Machine Learning Performance Analysis to Predict Stroke Based on Imbalanced Medical Dataset

11/14/2022
by   Yuru Jing, et al.
0

Cerebral stroke, the second most substantial cause of death universally, has been a primary public health concern over the last few years. With the help of machine learning techniques, early detection of various stroke alerts is accessible, which can efficiently prevent or diminish the stroke. Medical dataset, however, are frequently unbalanced in their class label, with a tendency to poorly predict minority classes. In this paper, the potential risk factors for stroke are investigated. Moreover, four distinctive approaches are applied to improve the classification of the minority class in the imbalanced stroke dataset, which are the ensemble weight voting classifier, the Synthetic Minority Over-sampling Technique (SMOTE), Principal Component Analysis with K-Means Clustering (PCA-Kmeans), Focal Loss with the Deep Neural Network (DNN) and compare their performance. Through the analysis results, SMOTE and PCA-Kmeans with DNN-Focal Loss work best for the limited size of a large severe imbalanced dataset,which is 2-4 times outperform Kaggle work.

READ FULL TEXT
research
10/29/2019

Predicting Louisiana Public High School Dropout through Imbalanced Learning Techniques

This study is motivated by the magnitude of the problem of Louisiana hig...
research
10/27/2018

Hull Form Optimization with Principal Component Analysis and Deep Neural Network

Designing and modifying complex hull forms for optimal vessel performanc...
research
02/11/2018

PCA-Based Missing Information Imputation for Real-Time Crash Likelihood Prediction Under Imbalanced Data

The real-time crash likelihood prediction has been an important research...
research
03/01/2022

A predictive analytics approach for stroke prediction using machine learning and neural networks

The negative impact of stroke in society has led to concerted efforts to...
research
06/04/2022

Interpretable Models Capable of Handling Systematic Missingness in Imbalanced Classes and Heterogeneous Datasets

Application of interpretable machine learning techniques on medical data...
research
06/03/2022

Impact of the composition of feature extraction and class sampling in medicare fraud detection

With healthcare being critical aspect, health insurance has become an im...

Please sign up or login with your details

Forgot password? Click here to reset