Speeding-up One-vs-All Training for Extreme Classification via Smart Initialization

09/27/2021
by   Erik Schultheis, et al.
0

In this paper we show that a simple, data dependent way of setting the initial vector can be used to substantially speed up the training of linear one-versus-all (OVA) classifiers in extreme multi-label classification (XMC). We discuss the problem of choosing the initial weights from the perspective of three goals. We want to start in a region of weight space a) with low loss value, b) that is favourable for second-order optimization, and c) where the conjugate-gradient (CG) calculations can be performed quickly. For margin losses, such an initialization is achieved by selecting the initial vector such that it separates the mean of all positive (relevant for a label) instances from the mean of all negatives – two quantities that can be calculated quickly for the highly imbalanced binary problems occurring in XMC. We demonstrate a speedup of ≈ 3× for training with squared hinge loss on a variety of XMC datasets. This comes in part from the reduced number of iterations that need to be performed due to starting closer to the solution, and in part from an implicit negative mining effect that allows to ignore easy negatives in the CG step. Because of the convex nature of the optimization problem, the speedup is achieved without any degradation in classification accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2020

Extreme Gradient Boosted Multi-label Trees for Dynamic Classifier Chains

Classifier chains is a key technique in multi-label classification, sinc...
research
09/29/2020

Asymmetric Loss For Multi-Label Classification

Pictures of everyday life are inherently multi-label in nature. Hence, m...
research
03/05/2018

Adversarial Extreme Multi-label Classification

The goal in extreme multi-label classification is to learn a classifier ...
research
06/12/2020

Online Metric Learning for Multi-Label Classification

Existing research into online multi-label classification, such as online...
research
06/22/2021

Gradient-based Label Binning in Multi-label Classification

In multi-label classification, where a single example may be associated ...
research
03/28/2019

Classification of sparse binary vectors

In this work we consider a problem of multi-label classification, where ...

Please sign up or login with your details

Forgot password? Click here to reset