Rail-5k: a Real-World Dataset for Rail Surface Defects Detection

06/28/2021
by Zihao Zhang, et al.

This paper presents the Rail-5k dataset for benchmarking the performance of visual algorithms in a real-world application scenario: rail surface defect detection. We collected over 5k high-quality images from railways across China and, with the help of railway experts, annotated 1100 of them to identify the 13 most common types of rail defects. The dataset supports two settings, each with its own challenges. The first is the fully-supervised setting, which uses the 1k+ labeled images for training; the fine-grained nature and long-tailed distribution of the defect classes make it hard for visual algorithms to tackle. The second is the semi-supervised learning setting, facilitated by the 4k unlabeled images; these images are uncurated and may contain image corruptions and domain shift relative to the labeled images, which previous semi-supervised learning methods cannot easily handle. We believe our dataset could be a valuable benchmark for evaluating the robustness and reliability of visual algorithms.

research
06/02/2021

The Semi-Supervised iNaturalist Challenge at the FGVC8 Workshop

Semi-iNat is a challenging dataset for semi-supervised classification wi...
research
02/06/2021

Open-World Semi-Supervised Learning

Supervised and semi-supervised learning methods have been traditionally ...
research
09/25/2020

Semi-Supervised Image Deraining using Gaussian Processes

Recent CNN-based methods for image deraining have achieved excellent per...
research
04/01/2021

A Realistic Evaluation of Semi-Supervised Learning for Fine-Grained Classification

We evaluate the effectiveness of semi-supervised learning (SSL) on a rea...
research
03/30/2021

3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding

The ability to understand the ways to interact with objects from visual ...
research
08/20/2022

A Visual Analytics Framework for Composing a Hierarchical Classification for Medieval Illuminations

Annotated data is a requirement for applying supervised machine learning...
research
10/15/2020

Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance Suite, Dataset Analysis and Multi-task Network Study

The development of semi-supervised learning techniques is essential to e...
