(When) Is Truth-telling Favored in AI Debate?

11/11/2019
by Vojtěch Kovařík, et al.

For some problems, humans may not be able to accurately judge the goodness of AI-proposed solutions. Irving et al. (2018) propose that in such cases, we may use a debate between two AI systems to amplify the problem-solving capabilities of a human judge. We introduce a mathematical framework that can model debates of this type and propose that the quality of debate designs should be measured by the accuracy of the most persuasive answer. We describe a simple instance of the debate framework called feature debate and analyze the degree to which such debates track the truth. We argue that despite being very simple, feature debates nonetheless capture many aspects of practical debates such as the incentives to confuse the judge or stall to prevent losing. We then outline how these models should be generalized to analyze a wider range of debate phenomena.
