Prototyping three key properties of specific curiosity in computational reinforcement learning

05/20/2022
by   Nadia M. Ady, et al.
0

Curiosity for machine agents has been a focus of intense research. The study of human and animal curiosity, particularly specific curiosity, has unearthed several properties that would offer important benefits for machine learners, but that have not yet been well-explored in machine intelligence. In this work, we introduce three of the most immediate of these properties – directedness, cessation when satisfied, and voluntary exposure – and show how they may be implemented together in a proof-of-concept reinforcement learning agent; further, we demonstrate how the properties manifest in the behaviour of this agent in a simple non-episodic grid-world environment that includes curiosity-inducing locations and induced targets of curiosity. As we would hope, the agent exhibits short-term directed behaviour while updating long-term preferences to adaptively seek out curiosity-inducing situations. This work therefore presents a novel view into how specific curiosity operates and in the future might be integrated into the behaviour of goal-seeking, decision-making agents in complex environments.

READ FULL TEXT

page 2

page 3

page 4

research
12/01/2022

Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have

Curiosity for machine agents has been a focus of lively research activit...
research
04/08/2020

Learning from Learners: Adapting Reinforcement Learning Agents to be Competitive in a Card Game

Learning how to adapt to complex and dynamic environments is one of the ...
research
07/20/2021

Learning Altruistic Behaviours in Reinforcement Learning without External Rewards

Can artificial agents learn to assist others in achieving their goals wi...
research
03/17/2022

The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents

Learned communication between agents is a powerful tool when approaching...
research
10/24/2022

Evaluating Long-Term Memory in 3D Mazes

Intelligent agents need to remember salient information to reason in par...
research
01/17/2022

Detecting danger in gridworlds using Gromov's Link Condition

Gridworlds have been long-utilised in AI research, particularly in reinf...

Please sign up or login with your details

Forgot password? Click here to reset