Toward a Flexible Metadata Pipeline for Fish Specimen Images

11/18/2022
by   Dom Jebbia, et al.
0

Flexible metadata pipelines are crucial for supporting the FAIR data principles. Despite this need, researchers seldom report their approaches for identifying metadata standards and protocols that support optimal flexibility. This paper reports on an initiative targeting the development of a flexible metadata pipeline for a collection containing over 300,000 digital fish specimen images, harvested from multiple data repositories and fish collections. The images and their associated metadata are being used for AI-related scientific research involving automated species identification, segmentation and trait extraction. The paper provides contextual background, followed by the presentation of a four-phased approach involving: 1. Assessment of the Problem, 2. Investigation of Solutions, 3. Implementation, and 4. Refinement. The work is part of the NSF Harnessing the Data Revolution, Biology Guided Neural Networks (NSF/HDR-BGNN) project and the HDR Imageomics Institute. An RDF graph prototype pipeline is presented, followed by a discussion of research implications and conclusion summarizing the results.

READ FULL TEXT
research
11/06/2021

FAIR Metadata: A Community-driven Vocabulary Application

FAIR metadata is critical to supporting FAIR data overall. Transparency,...
research
09/13/2021

Project Pipeline: Preservation, Persistence, and Performance

Preservation pipelines demonstrate extended value when digitized content...
research
09/28/2022

The Role of Metadata in Non-Fungible Tokens: Marketplace Analysis and Collection Organization

An explosion of interest in Non-Fungible Tokens (NFTs) has led to the em...
research
09/20/2018

Specimens as research objects: reconciliation across distributed repositories to enable metadata propagation

Botanical specimens are shared as long-term consultable research objects...
research
08/14/2021

Packaging research artefacts with RO-Crate

An increasing number of researchers support reproducibility by including...
research
02/27/2020

Dataset Search In Biodiversity Research: Do Metadata In Data Repositories Reflect Scholarly Information Needs?

The increasing amount of research data provides the opportunity to link ...

Please sign up or login with your details

Forgot password? Click here to reset