On Missing Labels, Long-tails and Propensities in Extreme Multi-label Classification

07/26/2022
by   Erik Schultheis, et al.
16

The propensity model introduced by Jain et al. 2016 has become a standard approach for dealing with missing and long-tail labels in extreme multi-label classification (XMLC). In this paper, we critically revise this approach showing that despite its theoretical soundness, its application in contemporary XMLC works is debatable. We exhaustively discuss the flaws of the propensity-based approach, and present several recipes, some of them related to solutions used in search engines and recommender systems, that we believe constitute promising alternatives to be followed in XMLC.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/31/2016

A High Speed Multi-label Classifier based on Extreme Learning Machines

In this paper a high speed neural network classifier based on extreme le...
research
05/18/2020

Interaction Matching for Long-Tail Multi-Label Classification

We present an elegant and effective approach for addressing limitations ...
research
06/04/2021

Accelerating Inference for Sparse Extreme Multi-Label Ranking Trees

Tree-based models underpin many modern semantic search engines and recom...
research
03/05/2021

Stratified Sampling for Extreme Multi-Label Data

Extreme multi-label classification (XML) is becoming increasingly releva...
research
10/20/2021

Propensity-scored Probabilistic Label Trees

Extreme multi-label classification (XMLC) refers to the task of tagging ...
research
05/28/2019

Accelerating Extreme Classification via Adaptive Feature Agglomeration

Extreme classification seeks to assign each data point, the most relevan...
research
09/23/2021

Unbiased Loss Functions for Multilabel Classification with Missing Labels

This paper considers binary and multilabel classification problems in a ...

Please sign up or login with your details

Forgot password? Click here to reset