Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney

08/04/2021
by   Willy Kuo, et al.
13

The performance of machine learning algorithms used for the segmentation of 3D biomedical images lags behind that of the algorithms employed in the classification of 2D photos. This may be explained by the comparative lack of high-volume, high-quality training datasets, which require state-of-the art imaging facilities, domain experts for annotation and large computational and personal resources to create. The HR-Kidney dataset presented in this work bridges this gap by providing 1.7 TB of artefact-corrected synchrotron radiation-based X-ray phase-contrast microtomography images of whole mouse kidneys and validated segmentations of 33 729 glomeruli, which represents a 1-2 orders of magnitude increase over currently available biomedical datasets. The dataset further contains the underlying raw data, classical segmentations of renal vasculature and uriniferous tubules, as well as true 3D manual annotations. By removing limits currently imposed by small training datasets, the provided data open up the possibility for disruptions in machine learning for biomedical image analysis.

READ FULL TEXT

page 3

page 5

page 6

page 9

page 24

page 26

page 28

research
10/27/2021

MedMNIST v2: A Large-Scale Lightweight Benchmark for 2D and 3D Biomedical Image Classification

We introduce MedMNIST v2, a large-scale MNIST-like dataset collection of...
research
12/09/2020

AIDE: Annotation-efficient deep learning for automatic medical image segmentation

Accurate image segmentation is crucial for medical imaging applications....
research
03/02/2023

Large-Scale Domain-Specific Pretraining for Biomedical Vision-Language Processing

Contrastive pretraining on parallel image-text data has attained great s...
research
09/14/2018

Supervised Machine Learning for Extractive Query Based Summarisation of Biomedical Data

The automation of text summarisation of biomedical publications is a pre...
research
04/18/2019

Examining the Capability of GANs to Replace Real Biomedical Images in Classification Models Training

In this paper, we explore the possibility of generating artificial biome...
research
12/10/2019

OpenBioLink: A benchmarking framework for large-scale biomedical link prediction

SUMMARY: Recently, novel machine-learning algorithms have shown potentia...
research
11/23/2021

ADTOF: A large dataset of non-synthetic music for automatic drum transcription

The state-of-the-art methods for drum transcription in the presence of m...

Please sign up or login with your details

Forgot password? Click here to reset