A Study on the Autoregressive and non-Autoregressive Multi-label Learning

12/03/2020
by   Elham J. Barezi, et al.
16

Extreme classification tasks are multi-label tasks with an extremely large number of labels (tags). These tasks are hard because the label space is usually (i) very large, e.g. thousands or millions of labels, (ii) very sparse, i.e. very few labels apply to each input document, and (iii) highly correlated, meaning that the existence of one label changes the likelihood of predicting all other labels. In this work, we propose a self-attention based variational encoder-model to extract the label-label and label-feature dependencies jointly and to predict labels for a given input. In more detail, we propose a non-autoregressive latent variable model and compare it to a strong autoregressive baseline that predicts a label based on all previously generated labels. Our model can therefore be used to predict all labels in parallel while still including both label-label and label-feature dependencies through latent variables, and compares favourably to the autoregressive baseline. We apply our models to four standard extreme classification natural language data sets, and one news videos dataset for automated label detection from a lexicon of semantic concepts. Experimental results show that although the autoregressive models, where use a given order of the labels for chain-order label prediction, work great for the small scale labels or the prediction of the highly ranked label, but our non-autoregressive model surpasses them by around 2 we need to predict more labels, or the dataset has a larger number of the labels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2022

OTSeq2Set: An Optimal Transport Enhanced Sequence-to-Set Model for Extreme Multi-label Text Classification

Extreme multi-label text classification (XMTC) is the task of finding th...
research
04/11/2019

Ranking-Based Autoencoder for Extreme Multi-label Classification

Extreme Multi-label classification (XML) is an important yet challenging...
research
09/09/2019

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Many tasks in natural language processing can be viewed as multi-label c...
research
05/12/2022

Open Vocabulary Extreme Classification Using Generative Models

The extreme multi-label classification (XMC) task aims at tagging conten...
research
04/04/2018

Online Multi-Label Classification: A Label Compression Method

Many modern applications deal with multi-label data, such as functional ...
research
09/16/2017

Subset Labeled LDA for Large-Scale Multi-Label Classification

Labeled Latent Dirichlet Allocation (LLDA) is an extension of the standa...
research
05/28/2019

Accelerating Extreme Classification via Adaptive Feature Agglomeration

Extreme classification seeks to assign each data point, the most relevan...

Please sign up or login with your details

Forgot password? Click here to reset