EXSCLAIM! – An automated pipeline for the construction of labeled materials imaging datasets from literature

03/19/2021
by   Eric Schwenker, et al.
0

Due to recent improvements in image resolution and acquisition speed, materials microscopy is experiencing an explosion of published imaging data. The standard publication format, while sufficient for traditional data ingestion scenarios where a select number of images can be critically examined and curated manually, is not conducive to large-scale data aggregation or analysis, hindering data sharing and reuse. Most images in publications are presented as components of a larger figure with their explicit context buried in the main body or caption text, so even if aggregated, collections of images with weak or no digitized contextual labels have limited value. To solve the problem of curating labeled microscopy data from literature, this work introduces the EXSCLAIM! Python toolkit for the automatic EXtraction, Separation, and Caption-based natural Language Annotation of IMages from scientific literature. We highlight the methodology behind the construction of EXSCLAIM! and demonstrate its ability to extract and label open-source scientific images at high volume.

READ FULL TEXT

page 3

page 5

page 6

page 8

page 11

research
09/27/2021

Text to Insight: Accelerating Organic Materials Knowledge Extraction via Deep Learning

Scientific literature is one of the most significant resources for shari...
research
08/30/2016

New Methods to Improve Large-Scale Microscopy Image Analysis with Prior Knowledge and Uncertainty

Multidimensional imaging techniques provide powerful ways to examine var...
research
09/20/2022

Deep learning at the edge enables real-time streaming ptychographic imaging

Coherent microscopy techniques provide an unparalleled multi-scale view ...
research
01/05/2021

Looking Through Glass: Knowledge Discovery from Materials Science Literature using Natural Language Processing

Most of the knowledge in materials science literature is in the form of ...
research
03/25/2022

Self-supervised machine learning model for analysis of nanowire morphologies from transmission electron microscopy images

In the field of soft materials, microscopy is the first and often only a...
research
10/20/2021

Development of an Ontology for an Integrated Image Analysis Platform to enable Global Sharing of Microscopy Imaging Data

Imaging data is one of the most important fundamentals in the current li...
research
03/07/2023

Organelle-specific segmentation, spatial analysis, and visualization of volume electron microscopy datasets

Volume electron microscopy is the method of choice for the in-situ inter...

Please sign up or login with your details

Forgot password? Click here to reset