CLCIFAR: CIFAR-Derived Benchmark Datasets with Human Annotated Complementary Labels

05/15/2023
by   Hsiu-Hsuan Wang, et al.
0

As a weakly-supervised learning paradigm, complementary label learning (CLL) aims to learn a multi-class classifier from only complementary labels, classes to which an instance does not belong. Despite various studies have addressed how to learn from CLL, those methods typically rely on some distributional assumptions on the complementary labels, and are benchmarked only on some synthetic datasets. It remains unclear how the noise or bias arising from the human annotation process would affect those CLL algorithms. To fill the gap, we design a protocol to collect complementary labels annotated by human. Two datasets, CLCIFAR10 and CLCIFAR20, based on CIFAR10 and CIFAR100, respectively, are collected. We analyzed the empirical transition matrices of the collected datasets, and observed that they are noisy and biased. We then performed extensive benchmark experiments on the collected datasets with various CLL algorithms to validate whether the existing algorithms can learn from the real-world complementary datasets. The dataset can be accessed with the following link: https://github.com/ntucllab/complementary_cifar.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/15/2023

Enhancing Label Sharing Efficiency in Complementary-Label Learning with Label Augmentation

Complementary-label Learning (CLL) is a form of weakly supervised learni...
research
11/27/2017

Learning with Biased Complementary Labels

In this paper we study the classification problem in which we have acces...
research
09/20/2022

Reduction from Complementary-Label Learning to Probability Estimates

Complementary-Label Learning (CLL) is a weakly-supervised learning probl...
research
11/19/2022

Complementary Labels Learning with Augmented Classes

Complementary Labels Learning (CLL) arises in many real-world tasks such...
research
05/17/2023

Complementary Classifier Induced Partial Label Learning

In partial label learning (PLL), each training sample is associated with...
research
06/13/2023

Rank-Aware Negative Training for Semi-Supervised Text Classification

Semi-supervised text classification-based paradigms (SSTC) typically emp...
research
08/16/2020

Training CNN Classifiers for Semantic Segmentation using Partially Annotated Images: with Application on Human Thigh and Calf MRI

Objective: Medical image datasets with pixel-level labels tend to have a...

Please sign up or login with your details

Forgot password? Click here to reset