ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification

01/21/2022
by Jan Cychnerski, et al.

The article presents ERS, a new comprehensive multi-label image dataset from flexible endoscopy, colonoscopy and capsule endoscopy. The collection is labeled according to the full medical specification 'Minimum Standard Terminology 3.0' (MST 3.0), which describes all possible findings in the gastrointestinal tract (104 possible labels), extended with 19 additional labels useful in common machine learning applications. The dataset contains around 6,000 precisely labeled and 115,000 approximately labeled frames from endoscopy videos, 3,600 precise and 22,600 approximate segmentation masks, and 1.23 million unlabeled frames from flexible and capsule endoscopy videos. The labeled data cover almost the entire MST 3.0 standard. The data come from 1,520 videos of 1,135 patients. Additionally, the paper proposes and describes four example experiments on gastrointestinal image classification performed using the dataset. The results indicate that the dataset is useful and flexible for training and testing machine learning algorithms in endoscopic data analysis.
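To make the multi-label setup concrete, the following minimal sketch shows one common way to encode a frame's labels as a multi-hot vector over the 104 MST 3.0 labels plus the 19 additional labels. The abstract does not state how ERS stores its labels, so the index assignment and the `multi_hot` helper are hypothetical and serve only as an illustration.

```python
# Label counts taken from the abstract.
NUM_MST_LABELS = 104      # MST 3.0 findings
NUM_EXTRA_LABELS = 19     # additional labels for machine learning use
NUM_LABELS = NUM_MST_LABELS + NUM_EXTRA_LABELS  # 123 labels in total


def multi_hot(label_indices, num_labels=NUM_LABELS):
    """Return a list of 0/1 values with ones at the given label indices.

    The mapping from label names to indices is an assumption of this
    sketch; the dataset's real label format may differ.
    """
    vector = [0] * num_labels
    for index in label_indices:
        if not 0 <= index < num_labels:
            raise ValueError(f"label index {index} is out of range")
        vector[index] = 1
    return vector


# A frame carrying two labels at once (indices chosen arbitrarily).
frame_labels = multi_hot([3, 110])
print(len(frame_labels), sum(frame_labels))
```

A vector of this form can be paired with a per-label binary loss, which is a typical choice when one frame may show several findings at the same time.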


