Bayes Classification using an approximation to the Joint Probability Distribution of the Attributes

05/29/2022
by   Patrick Hosein, et al.
0

The Naive-Bayes classifier is widely used due to its simplicity, speed and accuracy. However this approach fails when, for at least one attribute value in a test sample, there are no corresponding training samples with that attribute value. This is known as the zero frequency problem and is typically addressed using Laplace Smoothing. However, Laplace Smoothing does not take into account the statistical characteristics of the neighbourhood of the attribute values of the test sample. Gaussian Naive Bayes addresses this but the resulting Gaussian model is formed from global information. We instead propose an approach that estimates conditional probabilities using information in the neighbourhood of the test sample. In this case we no longer need to make the assumption of independence of attribute values and hence consider the joint probability distribution conditioned on the given class which means our approach (unlike the Gaussian and Laplace approaches) takes into consideration dependencies among the attribute values. We illustrate the performance of the proposed approach on a wide range of datasets taken from the University of California at Irvine (UCI) Machine Learning Repository. We also include results for the k-NN classifier and demonstrate that the proposed approach is simple, robust and outperforms standard approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2012

Locally Weighted Naive Bayes

Despite its simplicity, the naive Bayes classifier has surprised machine...
research
01/14/2019

Precision Annealing Monte Carlo Methods for Statistical Data Assimilation: Metropolis-Hastings Procedures

Statistical Data Assimilation (SDA) is the transfer of information from ...
research
07/31/2023

An Efficient Shapley Value Computation for the Naive Bayes Classifier

Variable selection or importance measurement of input variables to a mac...
research
10/12/2017

An Improved Naive Bayes Classifier-based Noise Detection Technique for Classifying User Phone Call Behavior

The presence of noisy instances in mobile phone data is a fundamental is...
research
02/24/2022

A general framework for adaptive two-index fusion attribute weighted naive Bayes

Naive Bayes(NB) is one of the essential algorithms in data mining. Howev...
research
12/21/2014

Locally Weighted Learning for Naive Bayes Classifier

As a consequence of the strong and usually violated conditional independ...
research
02/16/2021

Numerically more stable computation of the p-values for the two-sample Kolmogorov-Smirnov test

The two-sample Kolmogorov-Smirnov test is a widely used statistical test...

Please sign up or login with your details

Forgot password? Click here to reset