Neural Neighborhood Encoding for Classification

08/19/2020
by   Kaushik Sinha, et al.
0

Inspired by the fruit-fly olfactory circuit, the Fly Bloom Filter [Dasgupta et al., 2018] is able to efficiently summarize the data with a single pass and has been used for novelty detection. We propose a new classifier (for binary and multi-class classification) that effectively encodes the different local neighborhoods for each class with a per-class Fly Bloom Filter. The inference on test data requires an efficient FlyHash [Dasgupta, et al., 2017] operation followed by a high-dimensional, but sparse, dot product with the per-class Bloom Filters. The learning is trivially parallelizable. On the theoretical side, we establish conditions under which the prediction of our proposed classifier on any test example agrees with the prediction of the nearest neighbor classifier with high probability. We extensively evaluate our proposed scheme with over 50 data sets of varied data dimensionality to demonstrate that the predictive performance of our proposed neuroscience inspired classifier is competitive the the nearest-neighbor classifiers and other single-pass classifiers.

READ FULL TEXT
research
06/22/2022

Nearest Neighbor Classification based on Imbalanced Data: A Statistical Approach

In a classification problem, where the competing classes are not of comp...
research
12/14/2021

Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

The mathematical formalization of a neurological mechanism in the olfact...
research
02/08/2019

Nearest Neighbor Classifier based on Generalized Inter-point Distances for HDLSS Data

In high dimension, low sample size (HDLSS) settings, Euclidean distance ...
research
03/17/2022

Nearest Neighbor Classifier with Margin Penalty for Active Learning

As deep learning becomes the mainstream in the field of natural language...
research
02/07/2019

Land Use Classification Using Multi-neighborhood LBPs

In this paper we propose the use of multiple local binary patterns(LBPs)...
research
12/03/2020

Approximate kNN Classification for Biomedical Data

We are in the era where the Big Data analytics has changed the way of in...
research
01/31/2016

DOLDA - a regularized supervised topic model for high-dimensional multi-class regression

Generating user interpretable multi-class predictions in data rich envir...

Please sign up or login with your details

Forgot password? Click here to reset