Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images

09/22/2021
by Zhenzhen Wang, et al.

Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithm development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing only coarse, approximate annotations is much easier and less costly, and it alleviates pathologists' workload. In this paper, we study the problem of refining these approximate annotations in digital pathology to obtain more accurate ones. Some previous works have explored obtaining machine learning models from these inaccurate annotations, but few of them tackle the refinement problem, in which the mislabeled regions must be explicitly identified and corrected, and all of them require a number of training samples that is often very large. We present a method, named Label Cleaning Multiple Instance Learning (LC-MIL), to refine coarse annotations on a single WSI without the need for external training data. Patches cropped from a WSI with inaccurate labels are processed jointly within a MIL framework, and a deep-attention mechanism is leveraged to discriminate mislabeled instances, mitigating their impact on the predictive model and refining the segmentation. Our experiments on a heterogeneous WSI set with breast cancer lymph node metastasis, liver cancer, and colorectal cancer samples show that LC-MIL significantly refines the coarse annotations, outperforming state-of-the-art alternatives, even while learning from a single slide. These results demonstrate that LC-MIL is a promising, lightweight tool to provide fine-grained annotations from coarsely annotated pathology sets.
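To make the idea of processing patches jointly with attention more concrete, the following is a minimal sketch of one widely used form of attention-based MIL pooling, in which each instance embedding receives a softmax weight and the bag embedding is the weighted sum. It is not the authors' implementation: the abstract does not specify the architecture or how bags are built, so the function names (`attention_pool`, `sample_bags`), the parameter shapes, the random features, and the rule that a bag inherits the coarse label of the group it was sampled from are all assumptions made for illustration.

```python
import numpy as np

def attention_pool(H, V, w):
    """Attention-weighted pooling over one bag.

    H: (K, D) instance embeddings, V: (L, D) and w: (L,) attention parameters.
    Returns the (K,) attention weights and the (D,) bag embedding.
    """
    scores = np.tanh(H @ V.T) @ w            # one unnormalized score per instance
    scores = scores - scores.max()           # shift for numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()  # softmax over instances
    bag = weights @ H                        # weighted sum of instance embeddings
    return weights, bag

def sample_bags(features, coarse_labels, bag_size, n_bags, rng):
    """Hypothetical bag construction: sample patches that share a coarse label.

    Each bag is labeled with the coarse label of the group it was drawn from,
    so a bag may contain some mislabeled patches.
    """
    bags = []
    for label in (0, 1):
        idx = np.flatnonzero(coarse_labels == label)
        for _ in range(n_bags):
            chosen = rng.choice(idx, size=bag_size, replace=True)
            bags.append((features[chosen], label))
    return bags

rng = np.random.default_rng(0)
features = rng.normal(size=(200, 8))                  # stand-in patch embeddings
coarse_labels = (rng.random(200) < 0.5).astype(int)   # stand-in coarse labels
V = rng.normal(size=(4, 8))                           # untrained attention parameters
w = rng.normal(size=4)

bags = sample_bags(features, coarse_labels, bag_size=10, n_bags=3, rng=rng)
weights, bag = attention_pool(bags[0][0], V, w)
```

In a trained model of this kind, instances that disagree with the bag label would be expected to receive low attention weights, which is the intuition behind using attention to flag mislabeled patches. Here the parameters are random, so the weights only illustrate the shapes and the normalization.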


