Towards a Mathematics Formalisation Assistant using Large Language Models

11/14/2022
by   Ayush Agrawal, et al.

Mathematics formalisation is the task of translating mathematics (i.e., definitions, theorem statements, proofs) written in natural language, as found in books and papers, into a formal language that can then be checked for correctness by a program. It is a thriving activity today; however, formalisation remains cumbersome. In this paper, we explore the abilities of a large language model (Codex) to help with formalisation in the Lean theorem prover. We find that with careful input-dependent prompt selection and postprocessing, Codex is able to formalise short mathematical statements at undergraduate level with nearly 75% accuracy on 120 theorem statements. For proofs, quantitative analysis is infeasible, so we undertake a detailed case study. We choose a diverse set of 13 theorems at undergraduate level with proofs that fit in two to three paragraphs. We show that with a new prompting strategy Codex can formalise these natural-language proofs, with at least one out of twelve Codex completions being easy to repair into a complete proof. This is surprising, as essentially no aligned data exists for formalised mathematics, particularly for proofs. These results suggest that large language models are a promising avenue towards fully or partially automating formalisation.
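To make the task concrete, the following is a minimal illustrative sketch of what formalising a short statement looks like. It is not taken from the paper: the informal statement, the theorem name `add_comm_sketch`, and the choice of Lean 4 syntax with only the core library are assumptions made for illustration, and the paper's own experiments may use a different Lean version and library.

```lean
-- Informal statement: "Addition of natural numbers is commutative."
-- Formal statement, followed by a proof term that Lean can check.
-- `add_comm_sketch` is a hypothetical name; `Nat.add_comm` is a lemma from Lean 4's core library.
theorem add_comm_sketch (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```

The first line of the theorem is the formalised statement, which is the part evaluated in the statement-level experiments; the term after `:=` is the proof, which Lean checks mechanically.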


research
05/25/2022

Autoformalization with Large Language Models

Autoformalization is the process of automatically translating from natur...
research
01/05/2023

Towards Autoformalization of Mathematics and Code Correctness: Experiments with Elementary Proofs

The ever-growing complexity of mathematical proofs makes their manual ve...
research
05/20/2023

Experimental results from applying GPT-4 to an unpublished formal language

Can large language models be used to complete mathematical tasks that ar...
research
03/12/2023

Improving the Diproche CNL through autoformalization via GPT-3

The Diproche system is an automated proof checker for texts written in a...
research
02/03/2022

Formal Mathematics Statement Curriculum Learning

We explore the use of expert iteration in the context of language modeli...
research
04/25/2018

Cheap Non-standard Analysis and Computability

Non standard Analysis is an area of Mathematics dealing with notions of ...
research
01/31/2023

Mathematical Capabilities of ChatGPT

We investigate the mathematical capabilities of ChatGPT by testing it on...
