Optimizing the Dice Score and Jaccard Index for Medical Image Segmentation: Theory Practice

11/05/2019
by   Jeroen Bertels, et al.
10

The Dice score and Jaccard index are commonly used metrics for the evaluation of segmentation tasks in medical imaging. Convolutional neural networks trained for image segmentation tasks are usually optimized for (weighted) cross-entropy. This introduces an adverse discrepancy between the learning optimization objective (the loss) and the end target metric. Recent works in computer vision have proposed soft surrogates to alleviate this discrepancy and directly optimize the desired metric, either through relaxations (soft-Dice, soft-Jaccard) or submodular optimization (Lovász-softmax). The aim of this study is two-fold. First, we investigate the theoretical differences in a risk minimization framework and question the existence of a weighted cross-entropy loss with weights theoretically optimized to surrogate Dice or Jaccard. Second, we empirically investigate the behavior of the aforementioned loss functions w.r.t. evaluation with Dice score and Jaccard index on five medical segmentation tasks. Through the application of relative approximation bounds, we show that all surrogates are equivalent up to a multiplicative factor, and that no optimal weighting of cross-entropy exists to approximate Dice or Jaccard measures. We validate these findings empirically and show that, while it is important to opt for one of the target metric surrogates rather than a cross-entropy-based loss, the choice of the surrogate does not make a statistical difference on a wide range of medical segmentation tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2020

Optimization for Medical Image Segmentation: Theory and Practice when evaluating with Dice Score or Jaccard Index

In many medical imaging and classical computer vision tasks, the Dice sc...
research
07/07/2021

Comparing ML based Segmentation Models on Jet Fire Radiation Zone

Risk assessment is relevant in any workplace, however there is a degree ...
research
04/08/2023

Marginal Thresholding in Noisy Image Segmentation

This work presents a study on label noise in medical image segmentation ...
research
03/13/2021

Simpson's Bias in NLP Training

In most machine learning tasks, we evaluate a model M on a given data po...
research
04/09/2023

On the dice loss gradient and the ways to mimic it

In the past few years, in the context of fully-supervised semantic segme...
research
11/07/2019

Dice Loss for Data-imbalanced NLP Tasks

Many NLP tasks such as tagging and machine reading comprehension are fac...

Please sign up or login with your details

Forgot password? Click here to reset