A New Distance Measure for Non-Identical Data with Application to Image Classification

Distance measures are part and parcel of many computer vision algorithms. The underlying assumption in all existing distance measures is that feature elements are independent and identically distributed. However, in real-world settings, data generally originate from heterogeneous sources even if they do possess a common data-generating mechanism. Since these sources are not identically distributed by necessity, the assumption of identical distribution is inappropriate. Here, we use statistical analysis to show that feature elements of local image descriptors are indeed non-identically distributed. To test the effect of omitting the unified distribution assumption, we created a new distance measure called the Poisson-Binomial Radius (PBR). PBR is a bin-to-bin distance which accounts for the dispersion of bin-to-bin information. PBR's performance was evaluated on twelve benchmark data sets covering six different classification and recognition applications: texture, material, leaf, scene, ear biometrics and category-level image classification. Results from these experiments demonstrate that PBR outperforms state-of-the-art distance measures for most of the data sets and achieves comparable performance on the rest, suggesting that accounting for different distributions in distance measures can improve performance in classification and recognition tasks.

READ FULL TEXT

page 14

page 16

page 20

research
07/15/2020

A cellular automata approach to local patterns for texture recognition

Texture recognition is one of the most important tasks in computer visio...
research
12/04/2014

Image Data Compression for Covariance and Histogram Descriptors

Covariance and histogram image descriptors provide an effective way to c...
research
09/25/2020

Adjusted Measures for Feature Selection Stability for Data Sets with Similar Features

For data sets with similar features, for example highly correlated featu...
research
04/30/2021

Ranking the information content of distance measures

Real-world data typically contain a large number of features that are of...
research
12/09/2021

A Note on Comparison of F-measures

We comment on a recent TKDE paper "Linear Approximation of F-measure for...
research
04/19/2015

Visual Recognition Using Directional Distribution Distance

In computer vision, an entity such as an image or video is often represe...
research
02/04/2019

Distances between Data Sets Based on Summary Statistics

The concepts of similarity and distance are crucial in data mining. We c...

Please sign up or login with your details

Forgot password? Click here to reset