Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety

01/09/2022
by Issa Rice, et al.

Several approaches exist for ensuring the safety of future Transformative Artificial Intelligence (TAI) or Artificial Superintelligence (ASI) systems, and proponents of each have made differing, contested claims about the importance or usefulness of their work, both in the near term and for future systems. Highly Reliable Agent Designs (HRAD) is one of the most controversial and ambitious of these approaches, championed by the Machine Intelligence Research Institute, among others, and various arguments have been made about whether and how it reduces risks from future AI systems. To reduce confusion in the debate about AI safety, we build on a previous discussion by Rice that collects and presents four central arguments used to justify HRAD as a path towards the safety of AI systems. We have titled the arguments (1) incidental utility, (2) deconfusion, (3) precise specification, and (4) prediction. Each makes different, partly conflicting claims about how future AI systems can be risky. We explain the assumptions and claims behind each argument, based on a review of published and informal literature along with consultation with experts who have stated positions on the topic. Finally, we briefly outline arguments against each approach and against the agenda overall.

