Learning Causal Overhypotheses through Exploration in Children and Computational Models

02/21/2022
by Eliza Kosoy, et al.

Despite recent progress in reinforcement learning (RL), RL algorithms for exploration remain an active area of research. Existing methods often focus on state-based metrics, which do not consider the underlying causal structure of the environment. While recent research has begun to explore RL environments for causal learning, these environments primarily leverage causal information through causal inference or induction rather than exploration. In contrast, human children, some of the most proficient explorers, have been shown to use causal information to great benefit. In this work, we introduce a novel RL environment designed with a controllable causal structure, which allows us to evaluate exploration strategies used by both agents and children in a unified environment. In addition, through experiments with both computational models and children, we demonstrate that there are significant differences between information-gain optimal RL exploration in causal environments and the exploration of children in the same environments. We conclude with a discussion of how these findings may inspire new directions of research into efficient exploration and disambiguation of causal structures for RL algorithms.


