Robustness to fundamental uncertainty in AGI alignment

07/25/2018
by   G Gordon Worley III, et al.
0

The AGI alignment problem has a bimodal distribution of outcomes with most outcomes clustering around the poles of total success and existential, catastrophic failure. Consequently, attempts to solve AGI alignment should, all else equal, prefer false negatives (ignoring research programs that would have been successful) to false positives (pursuing research programs that will unexpectedly fail). Thus, we propose adopting a policy of responding to points of metaphysical and practical uncertainty associated with the alignment problem by limiting and choosing necessary assumptions to reduce the risk false positives. Herein we explore in detail some of the relevant points of uncertainty that AGI alignment research hinges on and consider how to reduce false positives in response to them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/12/2021

Using uncertainty estimation to reduce false positives in liver lesion detection

Despite the successes of deep learning techniques at detecting objects i...
research
03/04/2021

GAssert: A Fully Automated Tool to Improve Assertion Oracles

This demo presents the implementation and usage details of GASSERT, the ...
research
05/13/2021

Gradual Program Analysis for Null Pointers

Static analysis tools typically address the problem of excessive false p...
research
04/06/2022

Emphasis on the Minimization of False Negatives or False Positives in Binary Classification

The minimization of specific cases in binary classification, such as fal...
research
10/20/2017

Solving the "false positives" problem in fraud prediction

In this paper, we present an automated feature engineering based approac...
research
03/05/2020

An algorithm for reconstruction of triangle-free linear dynamic networks with verification of correctness

Reconstructing a network of dynamic systems from observational data is a...
research
12/08/2022

Simulation of Attacker Defender Interaction in a Noisy Security Game

In the cybersecurity setting, defenders are often at the mercy of their ...

Please sign up or login with your details

Forgot password? Click here to reset