Aggressive Sampling for Multi-class to Binary Reduction with Applications to Text Classification

01/23/2017
by   Bikash Joshi, et al.
0

We address the problem of multi-class classification in the case where the number of classes is very large. We propose a double sampling strategy on top of a multi-class to binary reduction strategy, which transforms the original multi-class problem into a binary classification problem over pairs of examples. The aim of the sampling strategy is to overcome the curse of long-tailed class distributions exhibited in majority of large-scale multi-class classification problems and to reduce the number of pairs of examples in the expanded data. We show that this strategy does not alter the consistency of the empirical risk minimization principle defined over the double sample reduction. Experiments are carried out on DMOZ and Wikipedia collections with 10,000 to 100,000 classes where we show the efficiency of the proposed approach in terms of training and prediction time, memory consumption, and predictive performance with respect to state-of-the-art approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2014

Multi-borders classification

The number of possible methods of generalizing binary classification to ...
research
03/25/2020

Adversarial Multi-Binary Neural Network for Multi-class Classification

Multi-class text classification is one of the key problems in machine le...
research
11/24/2018

MEMOIR: Multi-class Extreme Classification with Inexact Margin

Multi-class classification with a very large number of classes, or extre...
research
02/06/2020

Quantification of Differential Information using Matrix Pencil

Any traditional classification problem in general involves modelling ind...
research
09/14/2021

Predicting Loss Risks for B2B Tendering Processes

Sellers and executives who maintain a bidding pipeline of sales engageme...
research
06/15/2016

Logarithmic Time One-Against-Some

We create a new online reduction of multiclass classification to binary ...
research
08/29/2011

Datum-Wise Classification: A Sequential Approach to Sparsity

We propose a novel classification technique whose aim is to select an ap...

Please sign up or login with your details

Forgot password? Click here to reset