Propensity-scored Probabilistic Label Trees

10/20/2021
by Marek Wydmuch, et al.

Extreme multi-label classification (XMLC) refers to the task of tagging instances with small subsets of relevant labels coming from an extremely large set of all possible labels. Recently, XMLC has been widely applied to diverse web applications such as automatic content labeling, online advertising, or recommendation systems. In such environments, label distribution is often highly imbalanced, consisting mostly of very rare tail labels, and relevant labels can be missing. As a remedy to these problems, the propensity model has been introduced and applied within several XMLC algorithms. In this work, we focus on the problem of optimal predictions under this model for probabilistic label trees, a popular approach for XMLC problems. We introduce an inference procedure, based on the A*-search algorithm, that efficiently finds the optimal solution, assuming that all probabilities and propensities are known. We demonstrate the attractiveness of this approach in a wide empirical study on popular XMLC benchmark datasets.
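
To make the idea of A*-style inference over a label tree more concrete, the following is a minimal sketch and not the authors' implementation. It assumes each tree node carries a known conditional probability, each leaf (label) carries a known inverse propensity 1/q_j, and the goal is to return the k labels with the largest path probability multiplied by 1/q_j. The names `Node`, `annotate`, and `top_k` are hypothetical, as is the choice of upper bound: the path probability so far times the largest inverse propensity among the leaves below the node. Because conditional probabilities never exceed 1, this bound never underestimates the score of any leaf in the subtree, so the first leaves popped from the queue are the best ones.

```python
import heapq
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    prob: float                      # assumed known: P(node relevant | parent relevant, x)
    label: Optional[int] = None      # set for leaves only
    inv_propensity: float = 1.0      # assumed known: 1 / q_j, meaningful for leaves
    children: List["Node"] = field(default_factory=list)
    max_inv_propensity: float = 1.0  # filled in by annotate()


def annotate(node: Node) -> float:
    """Store, at every node, the largest 1/q_j among the leaves below it."""
    if node.children:
        node.max_inv_propensity = max(annotate(c) for c in node.children)
    else:
        node.max_inv_propensity = node.inv_propensity
    return node.max_inv_propensity


def top_k(root: Node, k: int) -> List[Tuple[int, float]]:
    """Best-first search; returns (label, score) pairs in decreasing score order."""
    annotate(root)
    counter = 0  # unique tie-breaker so Node objects are never compared
    # heapq is a min-heap, so the upper bound is stored negated.
    heap = [(-root.prob * root.max_inv_propensity, counter, root.prob, root)]
    results: List[Tuple[int, float]] = []
    while heap and len(results) < k:
        neg_bound, _, path_prob, node = heapq.heappop(heap)
        if not node.children:
            # For a leaf the bound equals its exact propensity-scored value.
            results.append((node.label, -neg_bound))
            continue
        for child in node.children:
            counter += 1
            p = path_prob * child.prob
            heapq.heappush(heap, (-p * child.max_inv_propensity, counter, p, child))
    return results


# Toy tree with made-up numbers: label 2 is rare (large 1/q_j) and sits in an
# unlikely branch, yet it has the highest propensity-scored value.
branch_a = Node(0.9, children=[Node(0.8, label=0, inv_propensity=1.0),
                               Node(0.3, label=1, inv_propensity=1.5)])
branch_b = Node(0.2, children=[Node(0.9, label=2, inv_propensity=10.0),
                               Node(0.5, label=3, inv_propensity=2.0)])
root = Node(1.0, children=[branch_a, branch_b])
result = top_k(root, 2)
```

In this toy tree the search expands the low-probability branch first because its bound (0.2 × 10) exceeds that of the high-probability branch (0.9 × 1.5), which illustrates how propensity scoring can shift predictions toward tail labels.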

