Deception Abilities Emerged in Large Language Models

07/31/2023
by   Thilo Hagendorff, et al.
0

Large language models (LLMs) are currently at the forefront of intertwining artificial intelligence (AI) systems with human communication and everyday life. Thus, aligning them with human values is of great importance. However, given the steady increase in reasoning abilities, future LLMs are under suspicion of becoming able to deceive human operators and utilizing this ability to bypass monitoring efforts. As a prerequisite to this, LLMs need to possess a conceptual understanding of deception strategies. This study reveals that such strategies emerged in state-of-the-art LLMs, such as GPT-4, but were non-existent in earlier LLMs. We conduct a series of experiments showing that state-of-the-art LLMs are able to understand and induce false beliefs in other agents, that their performance in complex deception scenarios can be amplified utilizing chain-of-thought reasoning, and that eliciting Machiavellianism in LLMs can alter their propensity to deceive. In sum, revealing hitherto unknown machine behavior in LLMs, our study contributes to the nascent field of machine psychology.

READ FULL TEXT

page 6

page 8

research
07/26/2023

A Sentence is Worth a Thousand Pictures: Can Large Language Models Understand Human Language?

Artificial Intelligence applications show great potential for language-r...
research
06/13/2023

Human-Like Intuitive Behavior and Reasoning Biases Emerged in Language Models – and Disappeared in GPT-4

Large language models (LLMs) are currently at the forefront of intertwin...
research
08/10/2023

Metacognitive Prompting Improves Understanding in Large Language Models

In Large Language Models (LLMs), there have been consistent advancements...
research
08/03/2023

Large Language Model Displays Emergent Ability to Interpret Novel Literary Metaphors

Recent advances in the performance of large language models (LLMs) have ...
research
05/30/2023

Strategic Reasoning with Language Models

Strategic reasoning enables agents to cooperate, communicate, and compet...
research
03/24/2023

Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Large language models (LLMs) are currently at the forefront of intertwin...
research
09/01/2023

Large Language Models for Semantic Monitoring of Corporate Disclosures: A Case Study on Korea's Top 50 KOSPI Companies

In the rapidly advancing domain of artificial intelligence, state-of-the...

Please sign up or login with your details

Forgot password? Click here to reset