EduceLab-Scrolls: Verifiable Recovery of Text from Herculaneum Papyri using X-ray CT

04/04/2023
by   Stephen Parsons, et al.
0

We present a complete software pipeline for revealing the hidden texts of the Herculaneum papyri using X-ray CT images. This enhanced virtual unwrapping pipeline combines machine learning with a novel geometric framework linking 3D and 2D images. We also present EduceLab-Scrolls, a comprehensive open dataset representing two decades of research effort on this problem. EduceLab-Scrolls contains a set of volumetric X-ray CT images of both small fragments and intact, rolled scrolls. The dataset also contains 2D image labels that are used in the supervised training of an ink detection model. Labeling is enabled by aligning spectral photography of scroll fragments with X-ray CT images of the same fragments, thus creating a machine-learnable mapping between image spaces and modalities. This alignment permits supervised learning for the detection of "invisible" carbon ink in X-ray CT, a task that is "impossible" even for human expert labelers. To our knowledge, this is the first aligned dataset of its kind and is the largest dataset ever released in the heritage domain. Our method is capable of revealing accurate lines of text on scroll fragments with known ground truth. Revealed text is verified using visual confirmation, quantitative image metrics, and scholarly review. EduceLab-Scrolls has also enabled the discovery, for the first time, of hidden texts from the Herculaneum papyri, which we present here. We anticipate that the EduceLab-Scrolls dataset will generate more textual discovery as research continues.

READ FULL TEXT

page 3

page 5

page 9

research
05/12/2019

A Cone-Beam X-Ray CT Data Collection Designed for Machine Learning

Unlike previous works, this open data collection consists of X-ray cone-...
research
01/28/2022

A tomographic workflow to enable deep learning for X-ray based foreign object detection

Detection of unwanted (`foreign') objects within products is a common pr...
research
03/20/2020

Bone Structures Extraction and Enhancement in Chest Radiographs via CNN Trained on Synthetic Data

In this paper, we present a deep learning-based image processing techniq...
research
05/31/2023

MSKdeX: Musculoskeletal (MSK) decomposition from an X-ray image for fine-grained estimation of lean muscle mass and muscle volume

Musculoskeletal diseases such as sarcopenia and osteoporosis are major o...
research
04/29/2021

The entire network structure of Crossmodal Transformer

Since the mapping relationship between definitized intra-interventional ...
research
03/24/2021

Semi-Supervised Learning for Bone Mineral Density Estimation in Hip X-ray Images

Bone mineral density (BMD) is a clinically critical indicator of osteopo...

Please sign up or login with your details

Forgot password? Click here to reset