A Survey on Extreme Multi-label Learning

10/08/2022
by Tong Wei, et al.

Multi-label learning has attracted significant attention from both academia and industry in recent decades. Although existing multi-label learning algorithms have achieved good performance in various tasks, they implicitly assume that the target label space is not huge, which can be restrictive in real-world scenarios. Moreover, it is infeasible to directly adapt them to an extremely large label space because of the compute and memory overhead. Therefore, eXtreme Multi-label Learning (XML) is becoming an important task, and many effective approaches have been proposed. To fully understand XML, we conduct a survey study in this paper. We first clarify a formal definition for XML from the perspective of supervised learning. Then, based on different model architectures and challenges of the problem, we provide a thorough discussion of the advantages and disadvantages of each category of methods. For the benefit of conducting empirical studies, we collect abundant resources regarding XML, including code implementations and useful tools. Lastly, we propose possible research directions in XML, such as new evaluation metrics, the tail label problem, and weakly supervised XML.
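
To make the memory overhead mentioned in the abstract concrete, here is a rough back-of-the-envelope sketch. It assumes a dense one-vs-all linear model (one weight vector per label) stored as 32-bit floats; the label and feature counts are illustrative assumptions, not figures from the paper, and the function name is hypothetical.

```python
def dense_ova_memory_gib(num_labels: int, num_features: int,
                         bytes_per_weight: int = 4) -> float:
    """Memory (GiB) for a dense one-vs-all linear model.

    Assumes one weight per (label, feature) pair and no sparsity,
    which is a simplifying assumption for illustration only.
    """
    return num_labels * num_features * bytes_per_weight / 2**30


# Illustrative sizes (assumed): a conventional multi-label task
# versus an extreme one with a million labels.
small = dense_ova_memory_gib(num_labels=100, num_features=100_000)
extreme = dense_ova_memory_gib(num_labels=1_000_000, num_features=100_000)

print(f"100 labels:       {small:.3f} GiB")
print(f"1,000,000 labels: {extreme:.1f} GiB")
```

Because the cost grows linearly with the number of labels, a model that fits comfortably in memory for a hundred labels needs hundreds of gibibytes under these assumptions once the label space reaches the millions.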

research
01/03/2021

Multi-label Ranking: Mining Multi-label and Label Ranking Data

We survey multi-label ranking tasks, specifically multi-label classifica...
research
11/23/2020

The Emerging Trends of Multi-Label Learning

Exabytes of data are generated daily by humans, leading to the growing n...
research
06/08/2022

Large Loss Matters in Weakly Supervised Multi-Label Classification

Weakly supervised multi-label classification (WSML) task, which is to le...
research
07/03/2019

Towards Interpretable Deep Extreme Multi-label Learning

Many Machine Learning algorithms, such as deep neural networks, have lon...
research
11/09/2020

A Survey of Label-noise Representation Learning: Past, Present and Future

Classical machine learning implicitly assumes that labels of the trainin...
research
04/12/2023

Evaluation of ChatGPT Model for Vulnerability Detection

In this technical report, we evaluated the performance of the ChatGPT an...
research
02/10/2018

Tips, guidelines and tools for managing multi-label datasets: the mldr.datasets R package and the Cometa data repository

New proposals in the field of multi-label learning algorithms have been ...
