Revisiting Vicinal Risk Minimization for Partially Supervised Multi-Label Classification Under Data Scarcity

04/19/2022
by   Nanqing Dong, et al.
14

Due to the high human cost of annotation, it is non-trivial to curate a large-scale medical dataset that is fully labeled for all classes of interest. Instead, it would be convenient to collect multiple small partially labeled datasets from different matching sources, where the medical images may have only been annotated for a subset of classes of interest. This paper offers an empirical understanding of an under-explored problem, namely partially supervised multi-label classification (PSMLC), where a multi-label classifier is trained with only partially labeled medical images. In contrast to the fully supervised counterpart, the partial supervision caused by medical data scarcity has non-trivial negative impacts on the model performance. A potential remedy could be augmenting the partial labels. Though vicinal risk minimization (VRM) has been a promising solution to improve the generalization ability of the model, its application to PSMLC remains an open question. To bridge the methodological gap, we provide the first VRM-based solution to PSMLC. The empirical results also provide insights into future research directions on partially supervised learning under data scarcity.

READ FULL TEXT

page 1

page 2

page 6

research
06/30/2022

Learning Underrepresented Classes from Decentralized Partially Labeled Medical Images

Using decentralized data for federated training is one promising emergin...
research
09/06/2021

Rethinking Crowdsourcing Annotation: Partial Annotation with Salient Labels for Multi-Label Image Classification

Annotated images are required for both supervised model training and eva...
research
06/18/2022

Deep Compatible Learning for Partially-Supervised Medical Image Segmentation

Partially-supervised learning can be challenging for segmentation due to...
research
04/04/2023

Bridging the Gap between Model Explanations in Partially Annotated Multi-label Classification

Due to the expensive costs of collecting labels in multi-label classific...
research
07/18/2023

Accuracy versus time frontiers of semi-supervised and self-supervised learning on medical images

For many applications of classifiers to medical images, a trustworthy la...
research
09/25/2021

Data, Assemble: Leveraging Multiple Datasets with Heterogeneous and Partial Labels

The success of deep learning relies heavily on large datasets with exten...
research
09/18/2019

A Classification Framework for Stablecoin Designs

Stablecoins promise to bridge fiat currencies with the world of cryptocu...

Please sign up or login with your details

Forgot password? Click here to reset