Candidates v.s. Noises Estimation for Large Multi-Class Classification Problem

11/02/2017
by   Lei Han, et al.
0

This paper proposes a method for multi-class classification problems, where the number of classes K is large. The method, referred to as Candidates v.s. Noises Estimation (CANE), selects a small subset of candidate classes and samples the remaining classes. We show that CANE is always consistent and computationally efficient. Moreover, the resulting estimator has low statistical variance approaching that of the maximum likelihood estimator, when the observed label belongs to the selected candidates with high probability. In practice, we use a tree structure with leaves as classes to promote fast beam search for candidate selection. We also apply the CANE method to estimate word probabilities in neural language models. Experiments show that CANE achieves better prediction accuracy over the Noise-Contrastive Estimation (NCE), its variants and a number of the state-of-the-art tree classifiers, while it gains significant speedup compared to the standard O(K) methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2012

Hierarchical Maximum Margin Learning for Multi-Class Classification

Due to myriads of classes, designing accurate and efficient classifiers ...
research
06/19/2019

Efficient Algorithms for Set-Valued Prediction in Multi-Class Classification

In cases of uncertainty, a multi-class classifier preferably returns a s...
research
08/24/2018

Analysis of Noise Contrastive Estimation from the Perspective of Asymptotic Variance

There are many models, often called unnormalized models, whose normalizi...
research
06/07/2018

Large scale classification in deep neural network with Label Mapping

In recent years, deep neural network is widely used in machine learning....
research
12/27/2017

Extrapolating Expected Accuracies for Large Multi-Class Problems

The difficulty of multi-class classification generally increases with th...
research
09/26/2014

Beyond Maximum Likelihood: from Theory to Practice

Maximum likelihood is the most widely used statistical estimation techni...
research
07/04/2017

The Candidate Multi-Cut for Cell Segmentation

Two successful approaches for the segmentation of biomedical images are ...

Please sign up or login with your details

Forgot password? Click here to reset