PanNuke Dataset Extension, Insights and Baselines

03/24/2020
by   Jevgenij Gamper, et al.
9

The emerging area of computational pathology (CPath) is ripe ground for the application of deep learning (DL) methods to healthcare due to the sheer volume of raw pixel data in whole-slide images (WSIs) of cancerous tissue slides, generally of the order of $100K{\times}80K$ pixels. However, it is imperative for the DL algorithms relying on nuclei-level details to be able to cope with data from `the clinical wild', which tends to be quite challenging. We study, and extend recently released PanNuke dataset consisting of more than 200,000 nuclei categorized into 5 clinically important classes for the challenging tasks of detecting, segmenting and classifying nuclei in WSIs \footnote{Download dataset here \href{https://jgamper.github.io/PanNukeDataset}{https://jgamper.github.io/PanNukeDataset}} \cite{gamper_pannuke:_2019}. Previous pan-cancer datasets consisted of only up to 9 different tissues and up to 21,000 unlabeled nuclei \cite{kumar2019multi} and just over 24,000 labeled nuclei with segmentation masks \cite{graham_hover-net:_2019}. PanNuke consists of 19 different tissue types from over 20,000 WSIs that have been semi-automatically annotated and quality controlled by clinical pathologists, leading to a dataset with statistics similar to `the clinical wild' and with minimal selection bias. We study the performance of segmentation and classification models when applied to the proposed dataset and demonstrate the application of models trained on PanNuke to whole-slide images. We provide comprehensive statistics about the dataset and outline recommendations and research directions to address the limitations of existing DL tools when applied to real-world CPath applications.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 10

page 11

page 12

research
11/01/2021

PointNu-Net: Simultaneous Multi-tissue Histology Nuclei Segmentation and Classification in the Clinical Wild

Automatic nuclei segmentation and classification plays a vital role in d...
research
01/02/2021

CryoNuSeg: A Dataset for Nuclei Instance Segmentation of Cryosectioned H E-Stained Histological Images

Nuclei instance segmentation plays an important role in the analysis of ...
research
09/07/2023

Evaluating Deep Learning-based Melanoma Classification using Immunohistochemistry and Routine Histology: A Three Center Study

Pathologists routinely use immunohistochemical (IHC)-stained tissue slid...
research
06/27/2023

CellViT: Vision Transformers for Precise Cell Segmentation and Classification

Nuclei detection and segmentation in hematoxylin and eosin-stained (H ...
research
09/13/2022

SFS-A68: a dataset for the segmentation of space functions in apartment buildings

Analyzing building models for usable area, building safety, or energy an...
research
12/17/2021

Towards Launching AI Algorithms for Cellular Pathology into Clinical Pharmaceutical Orbits

Computational Pathology (CPath) is an emerging field concerned with the ...
research
08/17/2018

Epithelium segmentation using deep learning in H&E-stained prostate specimens with immunohistochemistry as reference standard

Prostate cancer (PCa) is graded by pathologists by examining the archite...

Please sign up or login with your details

Forgot password? Click here to reset