An evaluation of Naive Bayesian anti-spam filtering

06/07/2000
by   Ion Androutsopoulos, et al.
0

It has recently been argued that a Naive Bayesian classifier can be used to filter unsolicited bulk e-mail ("spam"). We conduct a thorough evaluation of this proposal on a corpus that we make publicly available, contributing towards standard benchmarks. At the same time we investigate the effect of attribute-set size, training-corpus size, lemmatization, and stop-lists on the filter's performance, issues that had not been previously explored. After introducing appropriate cost-sensitive evaluation measures, we reach the conclusion that additional safety nets are needed for the Naive Bayesian anti-spam filter to be viable in practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2012

Reasoning about Bayesian Network Classifiers

Bayesian network classifiers are used in many fields, and one common cla...
research
01/19/2018

Evaluating Predictive Models of Student Success: Closing the Methodological Gap

Model evaluation -- the process of making inferences about the performan...
research
12/12/2019

Investigating the effectiveness of web adblockers

We investigate adblocking filters and the extent to which websites and a...
research
08/11/2020

Revisiting Low Resource Status of Indian Languages in Machine Translation

Indian language machine translation performance is hampered due to the l...
research
01/23/2013

A Bayesian Network Classifier that Combines a Finite Mixture Model and a Naive Bayes Model

In this paper we present a new Bayesian network model for classification...
research
07/07/2000

Naive Bayes and Exemplar-Based approaches to Word Sense Disambiguation Revisited

This paper describes an experimental comparison between two standard sup...

Please sign up or login with your details

Forgot password? Click here to reset