Learning to Reason in Large Theories without Imitation

05/25/2019
by Kshitij Bansal, et al.

Automated theorem proving in large theories can be learned via reinforcement learning over an indefinitely growing action space. To select actions, one performs nearest-neighbor lookups in the knowledge base to find premises to apply. Here we address the exploration problem for reinforcement learning in this space. Approaches that sample actions uniformly, such as the epsilon-greedy strategy, do not scale to this scenario, because most actions lead to dead ends and unsuccessful proofs, which are not useful for training our models. In this paper, we compare approaches that select premises using randomly initialized similarity measures and mix them with the proposals of the learned model. We evaluate these on the HOList benchmark for tactic-based higher-order theorem proving. We implement an automated theorem prover named DeepHOL-Zero that does not use any human proofs and show that our improved exploration method manages to expand the training set continuously. DeepHOL-Zero outperforms the best theorem prover trained by imitation learning alone.
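
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea: some premises come from nearest-neighbor lookup under a learned similarity, and the rest from a randomly initialized similarity. It is not DeepHOL-Zero's code. The dot-product scoring, the random linear projection, and all names (`select_premises`, `explore_fraction`, `random_projection`) are assumptions made for illustration.

```python
import random


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def top_k(scores, k):
    """Indices of the k highest scores, best first."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]


def random_projection(dim, seed):
    """A fixed random matrix; stands in for a randomly initialized similarity."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(dim)]


def project(matrix, vec):
    """Apply the matrix to the vector."""
    return [dot(row, vec) for row in matrix]


def select_premises(goal_vec, premise_vecs, proj, k, explore_fraction):
    """Pick k premise indices, mixing learned and random similarity rankings."""
    n_explore = round(k * explore_fraction)
    n_model = k - n_explore

    # Learned similarity (assumed here to be a plain dot product of embeddings).
    model_scores = [dot(goal_vec, p) for p in premise_vecs]
    chosen = top_k(model_scores, n_model)

    # Random similarity: dot product after a fixed random projection.
    goal_r = project(proj, goal_vec)
    random_scores = [dot(goal_r, project(proj, p)) for p in premise_vecs]

    # Fill the remaining slots from the random ranking, skipping duplicates.
    for i in top_k(random_scores, len(premise_vecs)):
        if len(chosen) >= k:
            break
        if i not in chosen:
            chosen.append(i)
    return chosen


# Toy knowledge base of six premise embeddings (hypothetical values).
premises = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [0.5, 0.5, 0.0],
    [-1.0, 0.0, 0.0],
]
goal = [1.0, 0.0, 0.0]
proj = random_projection(dim=3, seed=0)
picked = select_premises(goal, premises, proj, k=4, explore_fraction=0.5)
greedy = select_premises(goal, premises, proj, k=4, explore_fraction=0.0)
```

With `explore_fraction=0.0` the selection reduces to plain nearest-neighbor lookup under the learned similarity. Larger values hand more slots to the random ranking, which is structured rather than uniform and so differs from epsilon-greedy sampling.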


research
04/05/2019

HOList: An Environment for Machine Learning of Higher-Order Theorem Proving

We present an environment, benchmark, and deep learning driven automated...
research
04/05/2019

HOList: An Environment for Machine Learning of Higher-Order Theorem Proving (extended version)

We present an environment, benchmark, and deep learning driven automated...
research
05/30/2019

Towards Finding Longer Proofs

We present a reinforcement learning (RL) based guidance system for autom...
research
10/29/2020

How do Offline Measures for Exploration in Reinforcement Learning behave?

Sufficient exploration is paramount for the success of a reinforcement l...
research
02/02/2020

An Experimental Study of Formula Embeddings for Automated Theorem Proving in First-Order Logic

Automated theorem proving in first-order logic is an active research are...
research
03/01/2017

HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving

Large computer-understandable proofs consist of millions of intermediate...
research
09/07/2021

Conjectures, Tests and Proofs: An Overview of Theory Exploration

A key component of mathematical reasoning is the ability to formulate in...
