Demanding and Designing Aligned Cognitive Architectures

12/19/2021
by Koen Holtman, et al.

With AI systems becoming more powerful and pervasive, there is increasing debate about keeping their actions aligned with the broader goals and needs of humanity. This multi-disciplinary and multi-stakeholder debate must resolve many issues; here we examine three of them. The first issue is to clarify what demands stakeholders might usefully make on the designers of AI systems, useful because the technology exists to implement them. We make this technical topic more accessible by using the framing of cognitive architectures. The second issue is to move beyond an analytical framing that treats useful intelligence as being reward maximization only. To support this move, we define several AI cognitive architectures that combine reward maximization with other technical elements designed to improve alignment. The third issue is how stakeholders should calibrate their interactions with modern machine learning researchers. We consider how current fashions in machine learning create a narrative pull that participants in technical and policy discussions should be aware of, so that they can compensate for it. We identify several technically tractable but currently unfashionable options for improving AI alignment.


