Collaborative Learning of Semi-Supervised Clustering and Classification for Labeling Uncurated Data

03/09/2020
by   Sara Mousavi, et al.
0

Domain-specific image collections present potential value in various areas of science and business but are often not curated nor have any way to readily extract relevant content. To employ contemporary supervised image analysis methods on such image data, they must first be cleaned and organized, and then manually labeled for the nomenclature employed in the specific domain, which is a time consuming and expensive endeavor. To address this issue, we designed and implemented the Plud system. Plud provides an iterative semi-supervised workflow to minimize the effort spent by an expert and handles realistic large collections of images. We believe it can support labeling datasets regardless of their size and type. Plud is an iterative sequence of unsupervised clustering, human assistance, and supervised classification. With each iteration 1) the labeled dataset grows, 2) the generality of the classification method and its accuracy increases, and 3) manual effort is reduced. We evaluated the effectiveness of our system, by applying it on over a million images documenting human decomposition. In our experiment comparing manual labeling with labeling conducted with the support of Plud, we found that it reduces the time needed to label data and produces highly accurate models for this new domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2020

Deep Categorization with Semi-Supervised Self-Organizing Maps

Nowadays, with the advance of technology, there is an increasing amount ...
research
02/24/2022

SLRNet: Semi-Supervised Semantic Segmentation Via Label Reuse for Human Decomposition Images

Semantic segmentation is a challenging computer vision task demanding a ...
research
10/25/2021

Generalized Multi-Task Learning from Substantially Unlabeled Multi-Source Medical Image Data

Deep learning-based models, when trained in a fully-supervised manner, c...
research
02/07/2022

SUD: Supervision by Denoising for Medical Image Segmentation

Training a fully convolutional network for semantic segmentation typical...
research
12/21/2022

Land Cover and Land Use Detection using Semi-Supervised Learning

Semi-supervised learning (SSL) has made significant strides in the field...
research
04/24/2023

Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning

Impressive advances in acquisition and sharing technologies have made th...
research
10/16/2020

Automated Iterative Training of Convolutional Neural Networks for Tree Skeleton Segmentation

Training of convolutional neural networks for semantic segmentation requ...

Please sign up or login with your details

Forgot password? Click here to reset