Towards Formula Translation using Recursive Neural Networks

11/10/2018
by   Felix Petersen, et al.
0

While it has become common to perform automated translations on natural language, performing translations between different representations of mathematical formulae has thus far not been possible. We implemented the first translator for mathematical formulae based on recursive neural networks. We chose recursive neural networks because mathematical formulae inherently include a structural encoding. In our implementation, we developed new techniques and topologies for recursive tree-to-tree neural networks based on multi-variate multi-valued Long Short-Term Memory cells. We propose a novel approach for mini-batch training that utilizes clustering and tree traversal. We evaluate our translator and analyze the behavior of our proposed topologies and techniques based on a translation from generic LaTeX to the semantic LaTeX notation. We use the semantic LaTeX notation from the Digital Library for Mathematical Formulae and the Digital Repository for Mathematical Formulae at the National Institute for Standards and Technology. We find that a simple heuristics-based clustering algorithm outperforms the conventional clustering algorithms on the task of clustering binary trees of mathematical formulae with respect to their topology. Furthermore, we find a mask for the loss function, which can prevent the neural network from finding a local minimum of the loss function. Given our preliminary results, a complete translation from formula to formula is not yet possible. However, we achieved a prediction accuracy of 47.05 when ignoring the predicted position. Concluding, our work advances the field of recursive neural networks by improving the training speed and quality of training. In the future, we will work towards a complete translation allowing a machine-interpretation of LaTeX formulae.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2015

Long Short-Term Memory Over Tree Structures

The chain-structured long short-term memory (LSTM) has showed to be effe...
research
05/25/2023

Neural Machine Translation for Mathematical Formulae

We tackle the problem of neural machine translation of mathematical form...
research
09/17/2021

Semantic Preserving Bijective Mappings of Mathematical Formulae between Document Preparation Systems and Computer Algebra Systems

Document preparation systems like LaTeX offer the ability to render math...
research
12/02/2013

Bidirectional Recursive Neural Networks for Token-Level Labeling with Structure

Recently, deep architectures, such as recurrent and recursive neural net...
research
02/04/2020

On the use of recurrent neural networks for predictions of turbulent flows

In this paper, the prediction capabilities of recurrent neural networks ...

Please sign up or login with your details

Forgot password? Click here to reset