The Mean and Median Criterion for Automatic Kernel Bandwidth Selection for Support Vector Data Description

08/16/2017
by   Arin Chaudhuri, et al.
0

Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice for the kernel function. The Gaussian kernel has a bandwidth parameter, whose value is important for good results. A small bandwidth leads to overfitting, and the resulting SVDD classifier overestimates the number of anomalies. A large bandwidth leads to underfitting, and the classifier fails to detect many anomalies. In this paper we present a new automatic, unsupervised method for selecting the Gaussian kernel bandwidth. The selected value can be computed quickly, and it is competitive with existing bandwidth selection methods.

READ FULL TEXT

page 5

page 6

research
11/15/2018

The Trace Criterion for Kernel Bandwidth Selection for Support Vector Data Description

Support vector data description (SVDD) is a popular anomaly detection te...
research
02/17/2016

Peak Criterion for Choosing Gaussian Kernel Bandwidth in Support Vector Data Description

Support Vector Data Description (SVDD) is a machine-learning technique u...
research
03/08/2018

A New Bandwidth Selection Criterion for Analyzing Hyperspectral Data Using SVDD

This paper presents a method for hyperspectral image classification usin...
research
06/16/2016

Sampling Method for Fast Training of Support Vector Data Description

Support Vector Data Description (SVDD) is a popular outlier detection te...
research
10/31/2016

Kernel Bandwidth Selection for SVDD: Peak Criterion Approach for Large Data

Support Vector Data Description (SVDD) provides a useful approach to con...
research
12/17/2021

Gaussian RBF Centered Kernel Alignment (CKA) in the Large Bandwidth Limit

We prove that Centered Kernel Alignment (CKA) based on a Gaussian RBF ke...
research
09/12/2007

Bandwidth selection for kernel estimation in mixed multi-dimensional spaces

Kernel estimation techniques, such as mean shift, suffer from one major ...

Please sign up or login with your details

Forgot password? Click here to reset