Learning to Give Checkable Answers with Prover-Verifier Games

08/27/2021
by   Cem Anil, et al.
0

Our ability to know when to trust the decisions made by machine learning systems has not kept up with the staggering improvements in their performance, limiting their applicability in high-stakes domains. We introduce Prover-Verifier Games (PVGs), a game-theoretic framework to encourage learning agents to solve decision problems in a verifiable manner. The PVG consists of two learners with competing objectives: a trusted verifier network tries to choose the correct answer, and a more powerful but untrusted prover network attempts to persuade the verifier of a particular answer, regardless of its correctness. The goal is for a reliable justification protocol to emerge from this game. We analyze variants of the framework, including simultaneous and sequential games, and narrow the space down to a subset of games which provably have the desired equilibria. We develop instantiations of the PVG for two algorithmic tasks, and show that in practice, the verifier learns a robust decision rule that is able to receive useful and reliable information from an untrusted prover. Importantly, the protocol still works even when the verifier is frozen and the prover's messages are directly optimized to convince the verifier.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2022

Valid Utility Games with Information Sharing Constraints

The use of game theoretic methods for control in multiagent systems has ...
research
04/21/2021

The Road Less Travelled: Trying And Failing To Generate Walking Simulators

Automated game design is a rapidly growing area of research, yet many as...
research
07/11/2023

Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games

No-regret learners seek to minimize the difference between the loss they...
research
04/03/2016

An electronic-game framework for evaluating coevolutionary algorithms

One of the common artificial intelligence applications in electronic gam...
research
02/12/2015

Computing rational decisions in extensive games with limited foresight

We introduce a class of extensive form games where players might not be ...
research
04/27/2018

Approximating Nash Equilibria for Black-Box Games: A Bayesian Optimization Approach

Game theory has emerged as a powerful framework for modeling a large ran...

Please sign up or login with your details

Forgot password? Click here to reset