Ranking-Based Autoencoder for Extreme Multi-label Classification

04/11/2019
by Bingyu Wang, et al.

Extreme multi-label classification (XML) is an important yet challenging machine learning task that assigns to each instance its most relevant labels from an extremely large label collection, where the numbers of labels, features, and instances can reach the thousands or millions. XML is increasingly in demand in the Internet industry as business scale and scope grow and data accumulates. Extremely large label collections raise challenges such as computational complexity, inter-label dependency, and noisy labeling. Many methods, based on different mathematical formulations, have been proposed to tackle these challenges. In this paper, we propose a deep learning XML method that applies a word-vector-based self-attention followed by a ranking-based autoencoder architecture. The proposed method has three major advantages: 1) the autoencoder simultaneously considers inter-label dependencies and feature-label dependencies by projecting labels and features onto a common embedding space; 2) the ranking loss not only improves training efficiency and accuracy but can also be extended to handle noisily labeled data; 3) the efficient attention mechanism improves the feature representation by highlighting feature importance. Experimental results on benchmark datasets show that the proposed method is competitive with state-of-the-art methods.
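To make the idea of scoring labels in a common embedding space with a ranking loss more concrete, here is a minimal NumPy sketch. It is not the paper's architecture or loss: the abstract does not give the formulation, so the pairwise hinge loss, the linear projections `W_feat` and `W_label`, the toy dimensions, and the function name `pairwise_hinge_ranking_loss` are all assumptions chosen for illustration.

```python
import numpy as np

def pairwise_hinge_ranking_loss(scores, labels, margin=1.0):
    """Mean hinge penalty over (relevant, irrelevant) label pairs of one instance.

    Generic pairwise ranking loss, assumed here for illustration only.
    """
    pos = scores[labels == 1]   # scores of relevant labels
    neg = scores[labels == 0]   # scores of irrelevant labels
    if pos.size == 0 or neg.size == 0:
        return 0.0              # no pairs to rank
    # Each relevant label should outscore each irrelevant one by `margin`.
    violations = margin - (pos[:, None] - neg[None, :])
    return float(np.maximum(violations, 0.0).mean())

rng = np.random.default_rng(0)
n_features, n_labels, dim = 6, 5, 3           # toy sizes (hypothetical)
W_feat = rng.normal(size=(n_features, dim))   # hypothetical feature projection
W_label = rng.normal(size=(n_labels, dim))    # hypothetical label embeddings

x = rng.normal(size=n_features)               # one instance's feature vector
y = np.array([1, 0, 0, 1, 0])                 # its binary label vector

z = x @ W_feat                                # instance in the shared space
scores = W_label @ z                          # one relevance score per label
loss = pairwise_hinge_ranking_loss(scores, y)
print(f"ranking loss: {loss:.4f}")
```

The loss is zero only when every relevant label outscores every irrelevant label by at least the margin, so minimizing it pushes relevant labels toward the top of the ranking rather than fitting each label independently.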

research
01/03/2021

Multi-label Ranking: Mining Multi-label and Label Ranking Data

We survey multi-label ranking tasks, specifically multi-label classifica...
research
01/09/2021

LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Extreme Multi-label text Classification (XMC) is a task of finding the m...
research
12/03/2020

A Study on the Autoregressive and non-Autoregressive Multi-label Learning

Extreme classification tasks are multi-label tasks with an extremely lar...
research
02/24/2016

Feature ranking for multi-label classification using Markov Networks

We propose a simple and efficient method for ranking features in multi-l...
research
12/17/2019

An Embarrassingly Simple Baseline for eXtreme Multi-label Prediction

The goal of eXtreme Multi-label Learning (XML) is to design and learn a ...
research
05/07/2019

A Modular Deep Learning Approach for Extreme Multi-label Text Classification

Extreme multi-label classification (XMC) aims to assign to an instance t...
research
07/06/2022

A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation

In this paper, we study the partial multi-label (PML) image classificati...
