CRDT: Correlation Ratio Based Decision Tree Model for Healthcare Data Mining

09/24/2015
by Smita Roy, et al.

The rapid growth of healthcare data has motivated us to investigate robust and scalable models for data mining. For classification problems, the Information Gain (IG) based Decision Tree is a popular choice. However, depending on the nature of the dataset, an IG based Decision Tree may not always perform well, because it prefers attributes with a larger number of distinct values as the splitting attribute. Healthcare datasets generally have many attributes, and each attribute generally has many distinct values. In this paper, we focus on this characteristic of the datasets while analysing the performance of our proposed approach, a variant of the Decision Tree model that uses the concept of Correlation Ratio (CR). Unlike the IG based approach, the CR based approach has no bias towards attributes with a larger number of distinct values. We have applied our model to several benchmark healthcare datasets to show the effectiveness of the proposed technique.
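The bias described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the CR formula, so the code assumes the standard statistical correlation ratio (eta squared: between-class variance of a numeric attribute divided by its total variance), and the toy data and the names `record_id` and `marker` are hypothetical. It compares how IG and this assumed CR score an identifier-like attribute with all-distinct values against an attribute that actually tracks the class.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(values, labels):
    """Entropy reduction from splitting on each distinct attribute value."""
    n = len(labels)
    groups = defaultdict(list)
    for v, y in zip(values, labels):
        groups[v].append(y)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def correlation_ratio(values, labels):
    """Assumed definition: eta squared of a numeric attribute grouped by class."""
    n = len(values)
    mean = sum(values) / n
    total = sum((v - mean) ** 2 for v in values)
    if total == 0:
        return 0.0  # a constant attribute carries no information
    groups = defaultdict(list)
    for v, y in zip(values, labels):
        groups[y].append(v)
    between = sum(len(g) * (sum(g) / len(g) - mean) ** 2 for g in groups.values())
    return between / total


# Hypothetical toy data: 8 records, two classes.
labels = [0, 1, 0, 1, 0, 1, 0, 1]
record_id = [1, 2, 3, 4, 5, 6, 7, 8]  # all distinct, unrelated to the class
marker = [1, 3, 1, 3, 2, 3, 1, 2]     # three distinct values, tracks the class

for name, attr in [("record_id", record_id), ("marker", marker)]:
    print(name, information_gain(attr, labels), correlation_ratio(attr, labels))
```

On this toy data, IG ranks the identifier-like attribute above the informative one because every split partition is trivially pure, while the assumed CR measure ranks them the other way round. Whether the paper's CR criterion behaves identically depends on its exact definition, which is given in the full text.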

research
08/20/2012

Performance Tuning Of J48 Algorithm For Prediction Of Soil Fertility

Data mining involves the systematic analysis of large data sets, and dat...
research
11/17/2012

Cost-sensitive C4.5 with post-pruning and competition

Decision tree is an effective classification approach in data mining and...
research
11/02/2010

Significance of Classification Techniques in Prediction of Learning Disabilities

The aim of this study is to show the importance of two classification te...
research
03/04/2021

Sales Prediction Model Using Classification Decision Tree Approach For Small Medium Enterprise Based on Indonesian E-Commerce Data

The growth of internet users in Indonesia gives an impact on many aspect...
research
04/21/2019

Integrating Association Rules with Decision Trees in Object-Relational Databases

Research has provided evidence that associative classification produces ...
research
06/16/2016

ACDC: α-Carving Decision Chain for Risk Stratification

In many healthcare settings, intuitive decision rules for risk stratific...
research
09/11/2019

LazyBum: Decision tree learning using lazy propositionalization

Propositionalization is the process of summarizing relational data into ...
