Modeling AGI Safety Frameworks with Causal Influence Diagrams

06/20/2019
by   Tom Everitt, et al.
2

Proposals for safe AGI systems are typically made at the level of frameworks, specifying how the components of the proposed system should be trained and interact with each other. In this paper, we model and compare the most promising AGI safety frameworks using causal influence diagrams. The diagrams show the optimization objective and causal assumptions of the framework. The unified representation permits easy comparison of frameworks and their assumptions. We hope that the diagrams will serve as an accessible and visual introduction to the main AGI safety frameworks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2023

Causal Diagrams for Structural Engineers

Causal diagrams are logic and graphical tools that depict assumptions ab...
research
08/17/2022

Discovering Agents

Causal models of agents have been used to analyse the safety aspects of ...
research
02/23/2022

A Complete Criterion for Value of Information in Soluble Influence Diagrams

Influence diagrams have recently been used to analyse the safety and fai...
research
02/06/2013

Myopic Value of Information in Influence Diagrams

We present a method for calculation of myopic value of information in in...
research
02/27/2013

A Decision-Based View of Causality

Most traditional models of uncertainty have focused on the associational...
research
02/26/2019

Understanding Agent Incentives using Causal Influence Diagrams, Part I: Single Action Settings

Agents are systems that optimize an objective function in an environment...
research
02/02/2021

Agent Incentives: A Causal Perspective

We present a framework for analysing agent incentives using causal influ...

Please sign up or login with your details

Forgot password? Click here to reset