Identifying Stroke Indicators Using Rough Sets

10/19/2021
by   Muhammad Salman Pathan, et al.
0

Stroke is widely considered as the second most common cause of mortality. The adverse consequences of stroke have led to global interest and work for improving the management and diagnosis of stroke. Various techniques for data mining have been used globally for accurate prediction of occurrence of stroke based on the risk factors that are associated with the electronic health care records (EHRs) of the patients. In particular, EHRs routinely contain several thousands of features and most of them are redundant and irrelevant that need to be discarded to enhance the prediction accuracy. The choice of feature-selection methods can help in improving the prediction accuracy of the model and efficient data management of the archived input features. In this paper, we systematically analyze the various features in EHR records for the detection of stroke. We propose a novel rough-set based technique for ranking the importance of the various EHR records in detecting stroke. Unlike the conventional rough-set techniques, our proposed technique can be applied on any dataset that comprises binary feature sets. We evaluated our proposed method in a publicly available dataset of EHR, and concluded that age, average glucose level, heart disease, and hypertension were the most essential attributes for detecting stroke in patients. Furthermore, we benchmarked the proposed technique with other popular feature-selection techniques. We obtained the best performance in ranking the importance of individual features in detecting stroke.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 7

page 8

page 9

page 11

research
03/01/2022

A predictive analytics approach for stroke prediction using machine learning and neural networks

The negative impact of stroke in society has led to concerted efforts to...
research
01/10/2021

Curvature-based Feature Selection with Application in Classifying Electronic Health Records

Electronic Health Records (EHRs) are widely applied in healthcare facili...
research
12/29/2021

An Efficient and Accurate Rough Set for Feature Selection, Classification and Knowledge Representation

This paper present a strong data mining method based on rough set, which...
research
03/13/2019

Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection

Phishing as one of the most well-known cybercrime activities is a decept...
research
05/21/2021

An Explainable Classification Model for Chronic Kidney Disease Patients

Currently, Chronic Kidney Disease (CKD) is experiencing a globally incre...
research
06/18/2022

Tree-Guided Rare Feature Selection and Logic Aggregation with Electronic Health Records Data

Statistical learning with a large number of rare binary features is comm...
research
12/22/2019

Hierarchical Target-Attentive Diagnosis Prediction in Heterogeneous Information Networks

We introduce HTAD, a novel model for diagnosis prediction using Electron...

Please sign up or login with your details

Forgot password? Click here to reset