Evaluation of Medical Image Segmentation Models for Uncertain, Small or Empty Reference Annotations

09/26/2022
by   Sophie Ostmeier, et al.
0

Performance metrics for medical image segmentation models are used to measure agreement between the reference annotation and the prediction. A common set of metrics is used in the development of such models to make results more comparable. However, there is a mismatch between the distributions in public data sets and cases encountered in clinical practice. Many common metrics fail to measure the impact of this mismatch, especially for clinical data sets containing uncertain, small or empty reference annotation. Thus, models may not be validated for clinically meaningful agreement by such metrics. Dimensions of evaluating clinical value include independence from reference annotation volume size, consideration of uncertainty of reference annotations, reward of volumetric and/or location agreement and reward of correct classification of empty reference annotations. Unlike common public data sets, our in-house data set is more representative. It contains uncertain, small or empty reference annotations. We examine publicly available metrics on the predictions of a deep learning framework in order to identify for which settings common metrics provide clinical meaningful results. We compare to a public benchmark data set without uncertain, small or empty reference annotations. The code will be published.

READ FULL TEXT

page 2

page 12

research
01/31/2020

Automatic lung segmentation in routine imaging is a data diversity problem, not a methodology problem

Automated segmentation of anatomical structures is a crucial step in man...
research
12/14/2020

D-LEMA: Deep Learning Ensembles from Multiple Annotations – Application to Skin Lesion Segmentation

Medical image segmentation annotations suffer from inter/intra-observer ...
research
08/21/2021

Systematic Clinical Evaluation of A Deep Learning Method for Medical Image Segmentation: Radiosurgery Application

We systematically evaluate a Deep Learning (DL) method in a 3D medical i...
research
09/26/2021

Using Soft Labels to Model Uncertainty in Medical Image Segmentation

Medical image segmentation is inherently uncertain. For a given image, t...
research
10/31/2022

Rethinking Generalization: The Impact of Annotation Style on Medical Image Segmentation

Generalization is an important attribute of machine learning models, par...
research
09/08/2021

AgreementLearning: An End-to-End Framework for Learning with Multiple Annotators without Groundtruth

The annotation of domain experts is important for some medical applicati...
research
06/19/2020

Classifier uncertainty: evidence, potential impact, and probabilistic treatment

Classifiers are often tested on relatively small data sets, which should...

Please sign up or login with your details

Forgot password? Click here to reset