Discovering Mathematical Objects of Interest – A Study of Mathematical Notations

02/07/2020
by Andre Greiner-Petter, et al.

Mathematical notation, i.e., the writing system used to communicate concepts in mathematics, encodes valuable information for a variety of information search and retrieval systems. Yet, mathematical notations remain mostly unutilized by today's systems. In this paper, we present the first in-depth study on the distributions of mathematical notation in two large scientific corpora: the open access arXiv (2.5B mathematical objects) and zbMATH, the mathematical reviewing service for pure and applied mathematics (61M mathematical objects). Our study lays a foundation for future research projects on mathematical information retrieval for large scientific corpora. Further, we demonstrate the relevance of our results to a variety of use cases, for example, assisting semantic extraction systems, improving scientific search engines, and facilitating specialized math recommendation systems. The contributions of our presented research are as follows: (1) we present the first distributional analysis of mathematical formulae on arXiv and zbMATH; (2) we retrieve relevant mathematical objects for given textual search queries (e.g., linking P_n^(α, β)(x) with 'Jacobi polynomial'); (3) we extend zbMATH's search engine by providing relevant mathematical formulae; and (4) we exemplify the applicability of the results by presenting auto-completion for math inputs as the first contribution to math recommendation systems. To expedite future research projects, we have made available our source code and data.
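
To make the auto-completion use case in contribution (4) more concrete, the following is a minimal sketch of frequency-ranked prefix completion over formula strings. It is not the authors' implementation: the toy corpus, the function names `build_index` and `complete`, and the choice of plain string-prefix matching are all assumptions made for illustration. The abstract does not describe how the actual system matches or ranks candidates.

```python
from collections import Counter

# Hypothetical toy corpus of LaTeX formulae (illustrative only, not data from the paper).
CORPUS = [
    r"P_n^{(\alpha,\beta)}(x)",
    r"P_n^{(\alpha,\beta)}(x)",
    r"P_n(x)",
    r"\Gamma(z)",
    r"P_n^{(\alpha,\beta)}(x)",
    r"P_n(x)",
]


def build_index(formulae):
    """Count how often each formula string occurs in the corpus."""
    return Counter(formulae)


def complete(prefix, index, k=3):
    """Return up to k formulae that start with `prefix`, most frequent first."""
    matches = [(f, c) for f, c in index.items() if f.startswith(prefix)]
    # Sort by descending frequency, then alphabetically to break ties deterministically.
    matches.sort(key=lambda fc: (-fc[1], fc[0]))
    return [f for f, _ in matches[:k]]


index = build_index(CORPUS)
suggestions = complete("P_n", index)
```

In this sketch, typing `P_n` ranks the Jacobi polynomial notation ahead of `P_n(x)` simply because it occurs more often in the toy corpus. A real system would presumably operate on parsed formula structure rather than raw strings.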

research
04/13/2018

Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context

Mathematical formulae represent complex semantic information in a concis...
research
11/30/2020

Automatic Mathematical Information Retrieval to Perform Translations up to Computer Algebra Systems

In mathematics, LaTeX is the de facto standard to prepare documents, e.g...
research
03/20/2020

Mathematical Formulae in Wikimedia Projects 2020

This poster summarizes our contributions to Wikimedia's processing pipel...
research
02/12/2020

The Space of Mathematical Software Systems – A Survey of Paradigmatic Systems

Mathematical software systems are becoming more and more important in pu...
research
03/21/2016

Interoperability in the OpenDreamKit Project: The Math-in-the-Middle Approach

OpenDreamKit --- "Open Digital Research Environment Toolkit for the Adva...
research
12/04/2020

ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open?

The zbMATH database contains more than 4 million bibliographic entries. ...
research
07/13/2023

Parmesan: mathematical concept extraction for education

Mathematics is a highly specialized domain with its own unique set of ch...
