Optimizing the Procedure of CT Segmentation Labeling

03/24/2023
by Yaroslav Zharov, et al.

In Computed Tomography, machine learning is often used for automated data processing. However, increasing model complexity is accompanied by increasingly large volume datasets, which in turn increases the cost of model training. Unlike most work, which mitigates this cost by advancing model architectures and training algorithms, we consider the annotation procedure and its effect on model performance. We assume that the three main virtues of a good dataset collected for model training are label quality, diversity, and completeness. We compare the effects of these virtues on model performance using open medical CT datasets and conclude that quality is more important than diversity early in labeling, and that diversity, in turn, is more important than completeness. Based on this conclusion and additional experiments, we propose a labeling procedure for the segmentation of tomographic images that minimizes the effort spent on labeling while maximizing model performance.

