Multilabel Classification by Hierarchical Partitioning and Data-dependent Grouping

06/24/2020
by   Shashanka Ubaru, et al.

In modern multilabel classification problems, each data instance belongs to a small number of classes drawn from a large set of classes. In other words, these problems involve learning very sparse binary label vectors. Moreover, in large-scale problems, the labels typically have a certain (unknown) hierarchy. In this paper we exploit the sparsity of the label vectors and their hierarchical structure to embed them in a low-dimensional space using label groupings. We then solve the classification problem in this much lower-dimensional space and recover labels in the original space using an appropriately defined lifting. Our method builds on the work of (Ubaru & Mazumdar, 2017), where the idea of group testing was also explored for multilabel classification. We first present a novel data-dependent grouping approach, in which the group construction is based on a low-rank Nonnegative Matrix Factorization (NMF) of the label matrix of the training instances. Using recent results, this construction also allows us to develop a fast prediction algorithm whose runtime is logarithmic in the number of labels. We then present a hierarchical partitioning approach that exploits the label hierarchy in large-scale problems to divide the large label space into smaller sub-problems, which can then be solved independently via the grouping approach. Numerical results on many benchmark datasets illustrate that, compared to other popular methods, our proposed methods achieve competitive accuracy with significantly lower computational costs.
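
The grouping and lifting steps described in the abstract can be illustrated with a minimal sketch. It assumes a toy group matrix in which each label belongs to a distinct pair of groups; this is not the NMF-based, data-dependent construction of the paper, and the names `pair_groups`, `compress`, and `lift` are hypothetical. In a full pipeline, one binary classifier per group would predict the compressed vector from the features; here it is computed exactly from the true labels.

```python
from itertools import combinations

import numpy as np


def pair_groups(num_groups):
    """Toy group matrix A (num_groups x num_labels): each label is assigned
    to a distinct pair of groups. A[i, j] = 1 means label j is in group i.
    Assumption for illustration only, not the paper's NMF construction."""
    pairs = list(combinations(range(num_groups), 2))
    A = np.zeros((num_groups, len(pairs)), dtype=np.int64)
    for label, (g1, g2) in enumerate(pairs):
        A[g1, label] = 1
        A[g2, label] = 1
    return A


def compress(A, y):
    """Boolean OR reduction: a group is active if it contains an active label."""
    return (A @ y > 0).astype(np.int64)


def lift(A, z, max_misses=0):
    """Lift group predictions back to label space: predict label j if at most
    `max_misses` of the groups containing j are inactive."""
    misses = A.T @ (1 - z)          # inactive groups per label
    covered = A.sum(axis=0) > 0     # ignore labels that sit in no group
    return ((misses <= max_misses) & covered).astype(np.int64)


# Six labels are embedded into four groups.
A = pair_groups(4)
y = np.zeros(A.shape[1], dtype=np.int64)
y[2] = 1                            # a very sparse label vector
z = compress(A, y)                  # low-dimensional target
y_hat = lift(A, z)                  # recovered labels
```

With exact group predictions, the lifting never misses a true label that belongs to at least one group; whether it also avoids false positives depends on the group matrix and on how sparse the label vector is. The `max_misses` parameter is one simple way to tolerate a few misclassified groups.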

research
12/24/2018

Group Preserving Label Embedding for Multi-Label Classification

Multi-label learning is concerned with the classification of data with m...
research
07/23/2019

Collaborative Filtering and Multi-Label Classification with Matrix Factorization

Machine learning techniques for Recommendation System (RS) and Classific...
research
12/22/2014

On Learning Vector Representations in Hierarchical Label Spaces

An important problem in multi-label classification is to capture label p...
research
06/18/2016

An Efficient Large-scale Semi-supervised Multi-label Classifier Capable of Handling Missing labels

Multi-label classification has received considerable interest in recent ...
research
07/23/2018

Hierarchical Classification using Binary Data

In classification problems, especially those that categorize data into a...
research
06/28/2018

Beyond One-hot Encoding: lower dimensional target embedding

Target encoding plays a central role when learning Convolutional Neural ...
research
07/22/2020

Angle-based hierarchical classification using exact label embedding

Hierarchical classification problems are commonly seen in practice. Howe...
