Towards Machine-assisted Meta-Studies: The Hubble Constant

01/31/2019
by   Tom Crossland, et al.
0

We present an approach for automatic extraction of measured values from the astrophysical literature, using the Hubble constant for our pilot study. Our rules-based model -- a classical technique in natural language processing -- has successfully extracted 298 measurements of the Hubble constant, with uncertainties, from the 208,541 available arXiv astrophysics papers. We have also created an artificial neural network classifier to identify papers which report novel measurements. This classifier is applied to the available arXiv data, and is demonstrated to work well in identifying papers which are reporting new measurements. From the analysis of our results we find that reporting measurements with uncertainties and the correct units is critical information to identify novel measurements in free text. Our results correctly highlight the current tension for measurements of the Hubble constant and recover the 3.5σ discrepancy -- demonstrating that the tool presented in this paper is useful for meta-studies of astrophysical measurements from a large number of publications, and showing the potential to generalise this technique to other areas.

READ FULL TEXT
research
07/02/2021

A Systematic Literature Review of Empiricism and Norms of Reporting in Computing Education Research Literature

Computing Education Research (CER) is critical for supporting the increa...
research
05/23/2019

Shades of Dark Uncertainty and Consensus Value for the Newtonian Constant of Gravitation

The Newtonian constant of gravitation, G, stands out in the landscape of...
research
04/09/2018

Towards Reproducible Research: Automatic Classification of Empirical Requirements Engineering Papers

Research must be reproducible in order to make an impact on science and ...
research
01/04/2023

MessageNet: Message Classification using Natural Language Processing and Meta-data

In this paper we propose a new Deep Learning (DL) approach for message c...
research
05/31/2020

NLP Scholar: An Interactive Visual Explorer for Natural Language Processing Literature

As part of the NLP Scholar project, we created a single unified dataset ...
research
04/05/2019

A topological data analysis based classification method for multiple measurements

Machine learning models for repeated measurements are limited. Using top...
research
09/06/2019

Show Your Work: Improved Reporting of Experimental Results

Research in natural language processing proceeds, in part, by demonstrat...

Please sign up or login with your details

Forgot password? Click here to reset