GAP: Enhancing Semantic Interoperability of Genomic Datasets and Provenance Through Nanopublications

11/16/2021
by Matheus Feijoó, et al.

Although publishing datasets in scientific repositories is now a broadly recognised practice, these repositories increasingly suffer from semantic problems. In particular, they present obstacles to data reuse by machine-actionable processes, especially in biological repositories, which hampers the reproducibility of scientific experiments. The GenBank database is one example of these shortcomings. To address these issues, we propose GAP, an innovative data model that enriches the semantic meaning of data. The model converges related approaches: data provenance, semantic interoperability, the FAIR principles, and nanopublications. As a proof of concept, our experiments include a prototype that scrapes genomic data and traces them to nanopublications. For this, (meta)data are stored in a three-level nanopublication (nanopub) data model. The first level relates to a target organism and specifies data in terms of biological taxonomy. The second level, the central part of our contribution, focuses on the biological strains of the target; the strains express deciphered (meta)data about the genetic variations of the genomic material. The third level stores the (meta)data of related scientific papers. By incorporating these associated approaches to store genomic data, we expect the proposed model to offer greater data storage flexibility and more extensive interoperability with other data sources.
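The three-level layout described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it only assumes the general nanopublication structure of an assertion graph, a provenance graph, and a publication-info graph, and every URI, predicate name (`ex:taxonRank`, `ex:strainOf`, `ex:describes`), date, and helper function below is a hypothetical placeholder chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

Triple = Tuple[str, str, str]
EX = "http://example.org/gap/"  # hypothetical namespace, not taken from the paper


@dataclass
class Nanopub:
    uri: str
    assertion: List[Triple]   # the claim itself
    provenance: List[Triple]  # where the assertion came from
    pubinfo: List[Triple]     # metadata about the nanopublication


def make_nanopub(local_id: str, assertion: List[Triple],
                 source: str, created: str) -> Nanopub:
    """Wrap an assertion with minimal provenance and publication info."""
    uri = EX + "np/" + local_id
    return Nanopub(
        uri=uri,
        assertion=assertion,
        provenance=[(uri + "#assertion", "prov:wasDerivedFrom", source)],
        pubinfo=[(uri, "dcterms:created", created)],
    )


def to_trig(np: Nanopub) -> str:
    """Render the three graphs as simplified TriG-like text (prefixes omitted)."""
    def term(t: str) -> str:
        # Full URIs get angle brackets; prefixed names and literals pass through.
        return f"<{t}>" if t.startswith("http") else t

    parts = []
    for name in ("assertion", "provenance", "pubinfo"):
        body = "\n".join(
            f"  {term(s)} {term(p)} {term(o)} ." for s, p, o in getattr(np, name)
        )
        parts.append(f"<{np.uri}#{name}> {{\n{body}\n}}")
    return "\n".join(parts)


# Placeholder identifiers for the three levels and the scraped source record.
organism_uri = EX + "organism/1"
strain_uri = EX + "strain/1"
paper_uri = EX + "paper/1"
source = EX + "source-record/1"
created = '"2000-01-01"'  # placeholder date

# Level 1: target organism, described in terms of taxonomy.
organism = make_nanopub(
    "organism-1", [(organism_uri, "ex:taxonRank", '"species"')], source, created)
# Level 2: a strain of the target organism.
strain = make_nanopub(
    "strain-1", [(strain_uri, "ex:strainOf", organism_uri)], source, created)
# Level 3: a related scientific paper, linked to the strain.
paper = make_nanopub(
    "paper-1", [(paper_uri, "ex:describes", strain_uri)], source, created)

if __name__ == "__main__":
    for np in (organism, strain, paper):
        print(to_trig(np), end="\n\n")
```

In this sketch the levels are connected only through the URIs in their assertions: the strain points to its organism and the paper points to the strain, so each nanopublication can be published and cited independently while remaining traceable to the others.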

research
04/21/2022

Why we should respect analysis results as data

The development and approval of new treatments generates large volumes o...
research
06/28/2023

OpenCitations Meta

OpenCitations Meta is a new database that contains bibliographic metadat...
research
09/19/2022

F*** workflows: when parts of FAIR are missing

The FAIR principles for scientific data (Findable, Accessible, Interoper...
research
01/29/2018

Evaluating approaches for supervised semantic labeling

Relational data sources are still one of the most popular ways to store ...
research
01/19/2022

A Practical Approach of Actions for FAIRification Workflows

Since their proposal in 2016, the FAIR principles have been largely disc...
research
06/05/2019

Enhancing interoperable datasets with virtual links

To achieve semantic interoperability, numerous data standards, ontologie...
research
11/23/2022

FAIRification of MLC data

The multi-label classification (MLC) task has increasingly been receivin...
