Quantity beats quality for semantic segmentation of corrosion in images

06/30/2018
by   Will Nash, et al.
0

Dataset creation is typically one of the first steps when applying Artificial Intelligence methods to a new task; and the real world performance of models hinges on the quality and quantity of data available. Producing an image dataset for semantic segmentation is resource intensive, particularly for specialist subjects where class segmentation is not able to be effectively farmed out. The benefit of producing a large, but poorly labelled, dataset versus a small, expertly segmented dataset for semantic segmentation is an open question. Here we show that a large, noisy dataset outperforms a small, expertly segmented dataset for training a Fully Convolutional Network model for semantic segmentation of corrosion in images. A large dataset of 250 images with segmentations labelled by undergraduates and a second dataset of just 10 images, with segmentations labelled by subject matter experts were produced. The mean Intersection over Union and micro F-score metrics were compared after training for 50,000 epochs. This work is illustrative for researchers setting out to develop deep learning models for detection and location of specialist features.

READ FULL TEXT

page 3

page 4

page 5

page 6

research
02/25/2021

Reducing Labelled Data Requirement for Pneumonia Segmentation using Image Augmentations

Deep learning semantic segmentation algorithms can localise abnormalitie...
research
05/10/2019

Semantic Segmentation of Seismic Images

Almost all work to understand Earth's subsurface on a large scale relies...
research
12/10/2021

The Large Labelled Logo Dataset (L3D): A Multipurpose and Hand-Labelled Continuously Growing Dataset

In this work, we present the Large Labelled Logo Dataset (L3D), a multip...
research
08/14/2018

Treepedia 2.0: Applying Deep Learning for Large-scale Quantification of Urban Tree Cover

Recent advances in deep learning have made it possible to quantify urban...
research
08/30/2023

CongNaMul: A Dataset for Advanced Image Processing of Soybean Sprouts

We present 'CongNaMul', a comprehensive dataset designed for various tas...
research
09/01/2023

dacl10k: Benchmark for Semantic Bridge Damage Segmentation

Reliably identifying reinforced concrete defects (RCDs)plays a crucial r...
research
06/24/2018

Dilated Temporal Fully-Convolutional Network for Semantic Segmentation of Motion Capture Data

Semantic segmentation of motion capture sequences plays a key part in ma...

Please sign up or login with your details

Forgot password? Click here to reset