Value-based Search in Execution Space for Mapping Instructions to Programs

11/02/2018
by   Dor Muhlgay, et al.
0

Training models to map natural language instructions to programs given target world supervision only requires searching for good programs at training time. Search is commonly done using beam search in the space of partial programs or program trees, but as the length of the instructions grows finding a good program becomes difficult. In this work, we propose a search algorithm that uses the target world state, known at training time, to train a critic network that predicts the expected reward of every search state. We then score search states on the beam by interpolating their expected reward with the likelihood of programs represented by the search state. Moreover, we search not in the space of programs but in a more compressed state of program executions, augmented with recent entities and actions. On the SCONE dataset, we show that our algorithm dramatically improves performance on all three domains compared to standard beam search and other baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2022

CrossBeam: Learning to Search in Bottom-Up Program Synthesis

Many approaches to program synthesis perform a search within an enormous...
research
04/05/2014

MTD(f), A Minimax Algorithm Faster Than NegaScout

MTD(f) is a new minimax search algorithm, simpler and more efficient tha...
research
05/12/2020

Semantic Scaffolds for Pseudocode-to-Code Generation

We propose a method for program generation based on semantic scaffolds, ...
research
05/25/2018

Situated Mapping of Sequential Instructions to Actions with Single-step Reward Observation

We propose a learning approach for mapping context-dependent sequential ...
research
06/12/2019

SPoC: Search-based Pseudocode to Code

We consider the task of mapping pseudocode to long programs that are fun...
research
01/21/2021

PyGlove: Symbolic Programming for Automated Machine Learning

Neural networks are sensitive to hyper-parameter and architecture choice...
research
09/09/2013

Structure Learning of Probabilistic Logic Programs by Searching the Clause Space

Learning probabilistic logic programming languages is receiving an incre...

Please sign up or login with your details

Forgot password? Click here to reset