ACROBAT – a multi-stain breast cancer histological whole-slide-image data set from routine diagnostics for computational pathology

11/24/2022
by   Philippe Weitz, et al.
0

The analysis of FFPE tissue sections stained with haematoxylin and eosin (H E) or immunohistochemistry (IHC) is an essential part of the pathologic assessment of surgically resected breast cancer specimens. IHC staining has been broadly adopted into diagnostic guidelines and routine workflows to manually assess status and scoring of several established biomarkers, including ER, PGR, HER2 and KI67. However, this is a task that can also be facilitated by computational pathology image analysis methods. The research in computational pathology has recently made numerous substantial advances, often based on publicly available whole slide image (WSI) data sets. However, the field is still considerably limited by the sparsity of public data sets. In particular, there are no large, high quality publicly available data sets with WSIs of matching IHC and H E-stained tissue sections. Here, we publish the currently largest publicly available data set of WSIs of tissue sections from surgical resection specimens from female primary breast cancer patients with matched WSIs of corresponding H E and IHC-stained tissue, consisting of 4,212 WSIs from 1,153 patients. The primary purpose of the data set was to facilitate the ACROBAT WSI registration challenge, aiming at accurately aligning H E and IHC images. For research in the area of image registration, automatic quantitative feedback on registration algorithm performance remains available through the ACROBAT challenge website, based on more than 37,000 manually annotated landmark pairs from 13 annotators. Beyond registration, this data set has the potential to enable many different avenues of computational pathology research, including stain-guided learning, virtual staining, unsupervised pre-training, artefact detection and stain-independent models.

READ FULL TEXT
research
08/25/2022

A Two Step Approach for Whole Slide Image Registration

Multi-stain whole-slide-image (WSI) registration is an active field of r...
research
03/12/2023

Increasing the usefulness of already existing annotations through WSI registration

Computational pathology methods have the potential to improve access to ...
research
06/14/2022

The Open Kidney Ultrasound Data Set

Ultrasound use is because of its low cost, non-ionizing, and non-invasiv...
research
11/26/2018

A Framework for Implementing Machine Learning on Omics Data

The potential benefits of applying machine learning methods to -omics da...
research
08/03/2023

Predicting Ki67, ER, PR, and HER2 Statuses from H E-stained Breast Cancer Images

Despite the advances in machine learning and digital pathology, it is no...
research
11/21/2014

Assessment of algorithms for mitosis detection in breast cancer histopathology images

The proliferative activity of breast tumors, which is routinely estimate...
research
01/25/2018

A Benchmark and Evaluation of Non-Rigid Structure from Motion

Non-Rigid structure from motion (NRSfM), is a long standing and central ...

Please sign up or login with your details

Forgot password? Click here to reset