Accelerating Extreme Classification via Adaptive Feature Agglomeration

05/28/2019
by   Ankit Jalan, et al.
0

Extreme classification seeks to assign each data point, the most relevant labels from a universe of a million or more labels. This task is faced with the dual challenge of high precision and scalability, with millisecond level prediction times being a benchmark. We propose DEFRAG, an adaptive feature agglomeration technique to accelerate extreme classification algorithms. Despite past works on feature clustering and selection, DEFRAG distinguishes itself in being able to scale to millions of features, and is especially beneficial when feature sets are sparse, which is typical of recommendation and multi-label datasets. The method comes with provable performance guarantees and performs efficient task-driven agglomeration to reduce feature dimensionalities by an order of magnitude or more. Experiments show that DEFRAG can not only reduce training and prediction times of several leading extreme classification algorithms by as much as 40 address the problem of missing features, as well as offer superior coverage on rare labels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/01/2021

DECAF: Deep Extreme Classification with Label Features

Extreme multi-label classification (XML) involves tagging a data point w...
research
07/31/2021

ECLARE: Extreme Classification with Label Graph Correlations

Deep extreme classification (XC) seeks to train deep architectures that ...
research
12/17/2019

An Embarrassingly Simple Baseline for eXtreme Multi-label Prediction

The goal of eXtreme Multi-label Learning (XML) is to design and learn a ...
research
05/07/2019

A Modular Deep Learning Approach for Extreme Multi-label Text Classification

Extreme multi-label classification (XMC) aims to assign to an instance t...
research
07/26/2022

On Missing Labels, Long-tails and Propensities in Extreme Multi-label Classification

The propensity model introduced by Jain et al. 2016 has become a standar...
research
12/03/2020

A Study on the Autoregressive and non-Autoregressive Multi-label Learning

Extreme classification tasks are multi-label tasks with an extremely lar...
research
06/04/2021

Accelerating Inference for Sparse Extreme Multi-Label Ranking Trees

Tree-based models underpin many modern semantic search engines and recom...

Please sign up or login with your details

Forgot password? Click here to reset