Multiparty Dynamics and Failure Modes for Machine Learning and Artificial Intelligence

10/16/2018
by David Manheim, et al.

Overoptimization failures in machine learning and artificial intelligence systems can involve specification gaming, reward hacking, fragility to distributional shifts, and Goodhart's or Campbell's law. These failure modes are an important challenge in building safe AI systems, and multi-agent systems have additional failure modes that are closely related. These failure modes for multi-agent systems are more complex, more problematic, and less well understood than the single-agent case. They are also already occurring, largely unnoticed. After motivating the discussion with examples from poker-playing AI, the paper explains why these failure modes are in some sense fundamental. Following this, the paper categorizes failure modes, provides definitions, and cites examples for each of: accidental steering, coordination failures, adversarial misalignment, input spoofing and filtering, and goal co-option or direct hacking. The paper then discusses ongoing and potential work on mitigation of these failure modes, and what to expect when these failures continue to proliferate.

research
10/16/2018

Overoptimization Failures and Specification Gaming in Multi-agent Systems

Overoptimization failures in machine learning and AI can involve specifi...
research
03/13/2018

Categorizing Variants of Goodhart's Law

There are several distinct failure modes for overoptimization of systems...
research
11/25/2019

Failure Modes in Machine Learning Systems

In the last two years, more than 200 papers have been written on how mac...
research
09/18/2022

Autonomous Task Planning for Heterogeneous Multi-Agent Systems

This paper presents a solution to the automatic task planning problem fo...
research
10/29/2021

UDIS: Unsupervised Discovery of Bias in Deep Visual Recognition Models

Deep learning models have been shown to learn spurious correlations from...
research
07/15/2019

Classification Schemas for Artificial Intelligence Failures

In this paper we examine historical failures of artificial intelligence ...
research
02/13/2020

The Conditional Entropy Bottleneck

Much of the field of Machine Learning exhibits a prominent set of failur...
