Unsupervised Multi-label Dataset Generation from Web Data

05/12/2020
by   Carlos Roig, et al.
0

This paper presents a system towards the generation of multi-label datasets from web data in an unsupervised manner. To achieve this objective, this work comprises two main contributions, namely: a) the generation of a low-noise unsupervised single-label dataset from web-data, and b) the augmentation of labels in such dataset (from single label to multi label). The generation of a single-label dataset uses an unsupervised noise reduction phase (clustering and selection of clusters using anchors) obtaining a 85 images. An unsupervised label augmentation process is then performed to assign new labels to the images in the dataset using the class activation maps and the uncertainty associated with each class. This process is applied to the dataset generated in this paper and a public dataset (Places365) achieving a 9.5 27 the presented system can robustly enrich the initial dataset.

READ FULL TEXT

page 1

page 5

page 6

research
09/25/2021

Integrating Unsupervised Clustering and Label-specific Oversampling to Tackle Imbalanced Multi-label Data

There is often a mixture of very frequent labels and very infrequent lab...
research
02/10/2018

Tips, guidelines and tools for managing multi-label datasets: the mldr.datasets R package and the Cometa data repository

New proposals in the field of multi-label learning algorithms have been ...
research
04/20/2020

Unsupervised Person Re-identification via Multi-label Classification

The challenge of unsupervised person re-identification (ReID) lies in le...
research
04/17/2020

Incorporating Multiple Cluster Centers for Multi-Label Learning

Multi-label learning deals with the problem that each instance is associ...
research
08/15/2021

SCIDA: Self-Correction Integrated Domain Adaptation from Single- to Multi-label Aerial Images

Most publicly available datasets for image classification are with singl...
research
03/13/2017

A Localisation-Segmentation Approach for Multi-label Annotation of Lumbar Vertebrae using Deep Nets

Multi-class segmentation of vertebrae is a non-trivial task mainly due t...
research
11/09/2020

Multi-label Causal Variable Discovery: Learning Common Causal Variables and Label-specific Causal Variables

Causal variables in Markov boundary (MB) have been widely applied in ext...

Please sign up or login with your details

Forgot password? Click here to reset