Specimens as research objects: reconciliation across distributed repositories to enable metadata propagation

09/20/2018
by   Nicky Nicolson, et al.
0

Botanical specimens are shared as long-term consultable research objects in a global network of specimen repositories. Multiple specimens are generated from a shared field collection event; generated specimens are then managed individually in separate repositories and independently augmented with research and management metadata which could be propagated to their duplicate peers. Establishing a data-derived network for metadata propagation will enable the reconciliation of closely related specimens which are currently dispersed, unconnected and managed independently. Following a data mining exercise applied to an aggregated dataset of 19,827,998 specimen records from 292 separate specimen repositories, 36 in duplication relationships, allowing the propagation of metadata among the participants in these relationships, totalling: 93,044 type citations, 1,121,865 georeferences, 1,097,168 images and 2,191,179 scientific name determinations. The results enable the creation of networks to identify which repositories could work in collaboration. Some classes of annotation (particularly those regarding scientific name determinations) represent units of scientific work: appropriate management of this data would allow the accumulation of scholarly credit to individual researchers: potential further work in this area is discussed.

READ FULL TEXT

page 7

page 8

research
03/19/2019

Aligning Biomedical Metadata with Ontologies Using Clustering and Embeddings

The metadata about scientific experiments published in online repositori...
research
05/16/2019

The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific Experiments

The Center for Expanded Data Annotation and Retrieval (CEDAR) aims to re...
research
11/18/2022

Toward a Flexible Metadata Pipeline for Fish Specimen Images

Flexible metadata pipelines are crucial for supporting the FAIR data pri...
research
10/16/2019

Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Automated classification of metadata of research data by their disciplin...
research
07/16/2021

DoReMi: First glance at a universal OMR dataset

The main challenges of Optical Music Recognition (OMR) come from the nat...
research
11/08/2019

Towards an Open and Scalable Music Metadata Layer

One of the significant issues in the music supply chain today is the lac...

Please sign up or login with your details

Forgot password? Click here to reset