Subset Labeled LDA for Large-Scale Multi-Label Classification

09/16/2017
by   Yannis Papanikolaou, et al.
0

Labeled Latent Dirichlet Allocation (LLDA) is an extension of the standard unsupervised Latent Dirichlet Allocation (LDA) algorithm, to address multi-label learning tasks. Previous work has shown it to perform in par with other state-of-the-art multi-label methods. Nonetheless, with increasing label sets sizes LLDA encounters scalability issues. In this work, we introduce Subset LLDA, a simple variant of the standard LLDA algorithm, that not only can effectively scale up to problems with hundreds of thousands of labels but also improves over the LLDA state-of-the-art. We conduct extensive experiments on eight data sets, with label sets sizes ranging from hundreds to hundreds of thousands, comparing our proposed algorithm with the previously proposed LLDA algorithms (Prior--LDA, Dep--LDA), as well as the state of the art in extreme multi-label classification. The results show a steady advantage of our method over the other LLDA algorithms and competitive results compared to the extreme multi-label classification algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2019

HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification

Extreme multi-label text classification (XMTC) addresses the problem of ...
research
04/08/2020

Saliency-based Weighted Multi-label Linear Discriminant Analysis

In this paper, we propose a new variant of Linear Discriminant Analysis ...
research
09/08/2016

DiSMEC - Distributed Sparse Machines for Extreme Multi-label Classification

Extreme multi-label classification refers to supervised multi-label lear...
research
10/14/2016

Kernel Alignment Inspired Linear Discriminant Analysis

Kernel alignment measures the degree of similarity between two kernels. ...
research
05/08/2015

Dense Distributions from Sparse Samples: Improved Gibbs Sampling Parameter Estimators for LDA

We introduce a novel approach for estimating Latent Dirichlet Allocation...
research
06/18/2015

A hybrid algorithm for Bayesian network structure learning with application to multi-label learning

We present a novel hybrid algorithm for Bayesian network structure learn...
research
12/03/2020

A Study on the Autoregressive and non-Autoregressive Multi-label Learning

Extreme classification tasks are multi-label tasks with an extremely lar...

Please sign up or login with your details

Forgot password? Click here to reset