World Discovery Models

02/20/2019
by   Mohammad Gheshlaghi Azar, et al.
0

As humans we are driven by a strong desire for seeking novelty in our world. Also upon observing a novel pattern we are capable of refining our understanding of the world based on the new information---humans can discover their world. The outstanding ability of the human mind for discovery has led to many breakthroughs in science, art and technology. Here we investigate the possibility of building an agent capable of discovering its world using the modern AI technology. In particular we introduce NDIGO, Neural Differential Information Gain Optimisation, a self-supervised discovery model that aims at seeking new information to construct a global view of its world from partial and noisy observations. Our experiments on some controlled 2-D navigation tasks show that NDIGO outperforms state-of-the-art information-seeking methods in terms of the quality of the learned representation. The improvement in performance is particularly significant in the presence of white or structured noise where other information-seeking methods follow the noise instead of discovering their world.

READ FULL TEXT

page 6

page 9

page 10

page 11

page 14

research
09/10/2023

FOLLOWUPQG: Towards Information-Seeking Follow-up Question Generation

Humans ask follow-up questions driven by curiosity, which reflects a cre...
research
12/27/2018

QRFA: A Data-Driven Model of Information-Seeking Dialogues

Understanding the structure of interaction processes helps us to improve...
research
07/15/2020

Active World Model Learning with Progress Curiosity

World models are self-supervised predictive models of how the world evol...
research
07/31/2023

HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution

The rise of large language models (LLMs) had a transformative impact on ...
research
05/25/2022

Towards More Realistic Generation of Information-Seeking Conversations

In this paper, we introduce a novel framework SimSeek (simulating inform...
research
05/21/2020

Novel Policy Seeking with Constrained Optimization

In this work, we address the problem of learning to seek novel policies ...
research
04/13/2023

Power-seeking can be probable and predictive for trained agents

Power-seeking behavior is a key source of risk from advanced AI, but our...

Please sign up or login with your details

Forgot password? Click here to reset