AI Deception: A Survey of Examples, Risks, and Potential Solutions

08/28/2023
by   Peter S. Park, et al.
0

This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth. We first survey empirical examples of AI deception, discussing both special-use AI systems (including Meta's CICERO) built for specific competitive situations, and general-purpose AI systems (such as large language models). Next, we detail several risks from AI deception, such as fraud, election tampering, and losing control of AI systems. Finally, we outline several potential solutions to the problems posed by AI deception: first, regulatory frameworks should subject AI systems that are capable of deception to robust risk-assessment requirements; second, policymakers should implement bot-or-not laws; and finally, policymakers should prioritize the funding of relevant research, including tools to detect AI deception and to make AI systems less deceptive. Policymakers, researchers, and the broader public should work proactively to prevent AI deception from destabilizing the shared foundations of our society.

READ FULL TEXT

page 8

page 9

research
06/13/2022

X-Risk Analysis for AI Research

Artificial intelligence (AI) has the potential to greatly improve societ...
research
10/20/2020

Artificial Tikkun Olam: AI Can Be Our Best Friend in Building an Open Human-Computer Society

Technological advances of virtually every kind pose risks to society inc...
research
11/11/2019

(When) Is Truth-telling Favored in AI Debate?

For some problems, humans may not be able to accurately judge the goodne...
research
11/08/2019

AI Ethics for Systemic Issues: A Structural Approach

The debate on AI ethics largely focuses on technical improvements and st...
research
03/20/2023

Heterogeneity of AI-Induced Societal Harms and the Failure of Omnibus AI Laws

AI-induced societal harms mirror existing problems in domains where AI r...
research
11/26/2020

Overcoming Failures of Imagination in AI Infused System Development and Deployment

NeurIPS 2020 requested that research paper submissions include impact st...
research
12/01/2021

Collaborative AI Needs Stronger Assurances Driven by Risks

Collaborative AI systems (CAISs) aim at working together with humans in ...

Please sign up or login with your details

Forgot password? Click here to reset