Designing a Safe Autonomous Artificial Intelligence Agent based on Human Self-Regulation

01/05/2017
by Mark Muraven, et al.

There is a growing focus on how to design safe artificially intelligent (AI) agents. As systems become more complex, poorly specified goals or control mechanisms may cause AI agents to behave in unwanted and harmful ways. Thus, it is necessary to design AI agents that continue to follow their initial programming intentions as the program grows in complexity. How to specify these initial intentions has also been an obstacle to designing safe AI agents. Finally, AI agents need redundant safety mechanisms to ensure that programming errors do not cascade into major problems. Humans are autonomous intelligent agents that have avoided these problems, and the present manuscript argues that by understanding human self-regulation and goal setting, we may be better able to design safe AI agents. Some general principles of human self-regulation are outlined, and specific guidance for AI design is given.


research
04/06/2022

A Cognitive Framework for Delegation Between Error-Prone AI and Human Agents

With humans interacting with AI-based systems at an increasing rate, it ...
research
05/02/2018

AI safety via debate

To make AI systems broadly useful for challenging real-world tasks, we n...
research
05/29/2023

Towards a Unifying Model of Rationality in Multiagent Systems

Multiagent systems deployed in the real world need to cooperate with oth...
research
02/01/2014

Godseed: Benevolent or Malevolent?

It is hypothesized by some thinkers that benign looking AI objectives ma...
research
05/07/2020

A Proposal for Intelligent Agents with Episodic Memory

In the future we can expect that artificial intelligent agents, once dep...
research
03/09/2018

Institutional Metaphors for Designing Large-Scale Distributed AI versus AI Techniques for Running Institutions

Artificial Intelligence (AI) started out with an ambition to reproduce t...
research
04/21/2022

Creative Problem Solving in Artificially Intelligent Agents: A Survey and Framework

Creative Problem Solving (CPS) is a sub-area within Artificial Intellige...
