Building astroBERT, a language model for Astronomy Astrophysics

12/01/2021
by   Felix Grezes, et al.
0

The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and more) without further clarification from the user. At ADS, we are applying modern machine learning and natural language processing techniques to our dataset of recent astronomy publications to train astroBERT, a deeply contextual language model based on research at Google. Using astroBERT, we aim to enrich the ADS dataset and improve its discoverability, and in particular we are developing our own named entity recognition tool. We present here our preliminary results and lessons learned.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2023

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

In spite of the excellent strides made by end-to-end (E2E) models in spe...
research
04/06/2023

Using LSTM and GRU With a New Dataset for Named Entity Recognition in the Arabic Language

Named entity recognition (NER) is a natural language processing task (NL...
research
01/22/2020

Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization

Contextualized embeddings use unsupervised language model pretraining to...
research
08/02/2019

DELTA: A DEep learning based Language Technology plAtform

In this paper we present DELTA, a deep learning based language technolog...
research
08/11/2021

Extracting Semantics from Maintenance Records

Rapid progress in natural language processing has led to its utilization...
research
10/22/2020

Method of noun phrase detection in Ukrainian texts

Introduction. The area of natural language processing considers AI-compl...
research
03/20/2023

NASA Science Mission Directorate Knowledge Graph Discovery

The size of the National Aeronautics and Space Administration (NASA) Sci...

Please sign up or login with your details

Forgot password? Click here to reset