BRACS: A Dataset for BReAst Carcinoma Subtyping in H E Histology Images

11/08/2021
by   Nadia Brancati, et al.
14

Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women with cancer. Recent advancements in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by the pathologists is cumbersome, time-consuming, and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems have empowered the rapid digitization of pathology slides, and enabled to develop digital workflows. These advances further enable to leverage Artificial Intelligence (AI) to assist, automate, and augment pathological diagnosis. But the AI techniques, especially Deep Learning (DL), require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as, data-acquisition level constrains, time-consuming and expensive annotations, and anonymization of private information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin Eosin (H E)-stained images to facilitate the characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs), and 4539 Regions of Interest (ROIs) extracted from the WSIs. Each WSI, and respective ROIs, are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI- and ROI-level. Further, by including the understudied atypical lesions, BRACS offers an unique opportunity for leveraging AI to better understand their characteristics.

READ FULL TEXT

page 1

page 3

page 4

page 8

research
08/21/2022

Masked Video Modeling with Correlation-aware Contrastive Learning for Breast Cancer Diagnosis in Ultrasound

Breast cancer is one of the leading causes of cancer deaths in women. As...
research
10/09/2019

Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network

Gastric cancer is one of the most common cancers, which ranks third amon...
research
05/20/2023

Technical outlier detection via convolutional variational autoencoder for the ADMANI breast mammogram dataset

The ADMANI datasets (annotated digital mammograms and associated non-ima...
research
03/20/2022

Breast Cancer Induced Bone Osteolysis Prediction Using Temporal Variational Auto-Encoders

Objective and Impact Statement. We adopt a deep learning model for bone ...
research
08/22/2023

A Preliminary Investigation into Search and Matching for Tumour Discrimination in WHO Breast Taxonomy Using Deep Networks

Breast cancer is one of the most common cancers affecting women worldwid...
research
01/17/2023

Multicenter automatic detection of invasive carcinoma on breast whole slide images

Breast cancer is one of the most prevalent cancers worldwide and patholo...
research
11/20/2019

Pan-Cancer Diagnostic Consensus Through Searching Archival Histopathology Images Using Artificial Intelligence

The emergence of digital pathology has opened new horizons for histopath...

Please sign up or login with your details

Forgot password? Click here to reset