DeepAI AI Chat
Log In Sign Up

Parallel-beam X-ray CT datasets of apples with internal defects and label balancing for machine learning

by   Sophia Bethany Coban, et al.

We present three parallel-beam tomographic datasets of 94 apples with internal defects along with defect label files. The datasets are prepared for development and testing of data-driven, learning-based image reconstruction, segmentation and post-processing methods. The three versions are a noiseless simulation; simulation with added Gaussian noise, and with scattering noise. The datasets are based on real 3D X-ray CT data and their subsequent volume reconstructions. The ground truth images, based on the volume reconstructions, are also available through this project. Apples contain various defects, which naturally introduce a label bias. We tackle this by formulating the bias as an optimization problem. In addition, we demonstrate solving this problem with two methods: a simple heuristic algorithm and through mixed integer quadratic programming. This ensures the datasets can be split into test, training or validation subsets with the label bias eliminated. Therefore the datasets can be used for image reconstruction, segmentation, automatic defect detection, and testing the effects of (as well as applying new methodologies for removing) label bias in machine learning.


page 1

page 15

page 16

page 17

page 18

page 19

page 20

page 21


A Cone-Beam X-Ray CT Data Collection Designed for Machine Learning

Unlike previous works, this open data collection consists of X-ray cone-...

A New Weighting Scheme for Fan-beam and Circle Cone-beam CT Reconstructions

In this paper, we first present an arc based algorithm for fan-beam comp...

Feature reconstruction from incomplete tomographic data without detour

In this paper, we consider the problem of feature reconstruction from in...

3D Image Reconstruction from X-Ray Measurements with Overlap

3D image reconstruction from a set of X-ray projections is an important ...

Image Reconstruction: From Sparsity to Data-adaptive Methods and Machine Learning

The field of image reconstruction has undergone four waves of methods. T...

Adorym: A multi-platform generic x-ray image reconstruction framework based on automatic differentiation

We describe and demonstrate an optimization-based x-ray image reconstruc...

Manifold learning-based feature extraction for structural defect reconstruction

Data-driven quantitative defect reconstructions using ultrasonic guided ...

Code Repositories


Includes scripts supporting the AppleCT Dataset paper.

view repo