Direct Nonparametric Predictive Inference Classification Trees

Classification is the task of assigning a new instance to one of a set of predefined categories based on the attributes of the instance. A classification tree is one of the most commonly used techniques in the area of classification. In this paper, we introduce a novel classification tree algorithm which we call Direct Nonparametric Predictive Inference (D-NPI) classification algorithm. The D-NPI algorithm is completely based on the Nonparametric Predictive Inference (NPI) approach, and it does not use any other assumption or information. The NPI is a statistical methodology which learns from data in the absence of prior knowledge and uses only few modelling assumptions, enabled by the use of lower and upper probabilities to quantify uncertainty. Due to the predictive nature of NPI, it is well suited for classification, as the nature of classification is explicitly predictive as well. The D-NPI algorithm uses a new split criterion called Correct Indication (CI). The CI is about the informativity that the attribute variables will indicate, hence, if the attribute is very informative, it gives high lower and upper probabilities for CI. The CI reports the strength of the evidence that the attribute variables will indicate, based on the data. The CI is completely based on the NPI, and it does not use any additional concepts such as entropy. The performance of the D-NPI classification algorithm is tested against several classification algorithms using classification accuracy, in-sample accuracy and tree size on different datasets from the UCI machine learning repository. The experimental results indicate that the D-NPI classification algorithm performs well in terms of classification accuracy and in-sample accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/20/2015

The ABACOC Algorithm: a Novel Approach for Nonparametric Classification of Data Streams

Stream mining poses unique challenges to machine learning: predictive mo...
research
09/28/2020

A new network-base high-level data classification methodology (Quipus) by modeling attribute-attribute interactions

High-level classification algorithms focus on the interactions between i...
research
10/30/2017

Rough extreme learning machine: a new classification method based on uncertainty measure

Extreme learning machine (ELM) is a new single hidden layer feedback neu...
research
07/07/2020

The ordering of future observations from multiple groups

There are many situations where comparison of different groups is of gre...
research
12/24/2020

Predicting Seminal Quality with the Dominance-Based Rough Sets Approach

The paper relies on the clinical data of a previously published study. W...
research
09/14/2020

New complex network building methodology for High Level Classification based on attribute-attribute interaction

High-level classification algorithms focus on the interactions between i...
research
09/11/2020

DART: Data Addition and Removal Trees

How can we update data for a machine learning model after it has already...

Please sign up or login with your details

Forgot password? Click here to reset