Making Metadata More FAIR Using Large Language Models

07/24/2023
by   Sowmya S Sundaram, et al.
0

With the global increase in experimental data artifacts, harnessing them in a unified fashion leads to a major stumbling block - bad metadata. To bridge this gap, this work presents a Natural Language Processing (NLP) informed application, called FAIRMetaText, that compares metadata. Specifically, FAIRMetaText analyzes the natural language descriptions of metadata and provides a mathematical similarity measure between two terms. This measure can then be utilized for analyzing varied metadata, by suggesting terms for compliance or grouping similar terms for identification of replaceable terms. The efficacy of the algorithm is presented qualitatively and quantitatively on publicly available research artifacts and demonstrates large gains across metadata related tasks through an in-depth study of a wide variety of Large Language Models (LLMs). This software can drastically reduce the human effort in sifting through various natural language metadata while employing several experimental datasets on the same topic.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/28/2023

Comparative Analysis of CHATGPT and the evolution of language models

Interest in Large Language Models (LLMs) has increased drastically since...
research
11/18/2022

Metadata Might Make Language Models Better

This paper discusses the benefits of including metadata when training la...
research
10/22/2022

LMPriors: Pre-Trained Language Models as Task-Specific Priors

Particularly in low-data regimes, an outstanding challenge in machine le...
research
11/10/2021

Multimodal Approach for Metadata Extraction from German Scientific Publications

Nowadays, metadata information is often given by the authors themselves ...
research
06/05/2019

Survey on Publicly Available Sinhala Natural Language Processing Tools and Research

Sinhala is the native language of the Sinhalese people who make up the l...
research
07/11/2023

GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts

In the context of the rapid development of large language models, we hav...
research
09/08/2023

Matching Table Metadata with Business Glossaries Using Large Language Models

Enterprises often own large collections of structured data in the form o...

Please sign up or login with your details

Forgot password? Click here to reset