ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics

02/24/2023
by   Zhangir Azerbayev, et al.
0

We introduce ProofNet, a benchmark for autoformalization and formal proving of undergraduate-level mathematics. The ProofNet benchmarks consists of 371 examples, each consisting of a formal theorem statement in Lean 3, a natural language theorem statement, and a natural language proof. The problems are primarily drawn from popular undergraduate pure mathematics textbooks and cover topics such as real and complex analysis, linear algebra, abstract algebra, and topology. We intend for ProofNet to be a challenging benchmark that will drive progress in autoformalization and automatic theorem proving. We report baseline results on statement autoformalization via in-context learning. Moreover, we introduce two novel statement autoformalization methods: prompt retrieval and distilled backtranslation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/31/2021

MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics

We present miniF2F, a dataset of formal Olympiad-level mathematics probl...
research
09/30/2019

CSPLib: Twenty Years On

In 1999, we introduced CSPLib, a benchmark library for the constraints c...
research
09/18/2012

Theorem Proving in Large Formal Mathematics as an Emerging AI Field

In the recent years, we have linked a large corpus of formal mathematics...
research
02/12/2020

Using Automated Theorem Provers for Mistake Diagnosis in the Didactics of Mathematics

The Diproche system, an automated proof checker for natural language pro...
research
11/29/2022

Peano: Learning Formal Mathematical Reasoning

General mathematical reasoning is computationally undecidable, but human...
research
09/27/2022

Structure in Theorem Proving: Analyzing and Improving the Isabelle Archive of Formal Proofs

The Isabelle Archive of Formal Proofs has grown to a significant size in...
research
02/03/2022

Formal Mathematics Statement Curriculum Learning

We explore the use of expert iteration in the context of language modeli...

Please sign up or login with your details

Forgot password? Click here to reset