Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

05/12/2019
by Yuhang Song, et al.

Intrinsic rewards were introduced to simulate how human intelligence works, and they are usually evaluated by intrinsically-motivated play, i.e., playing games without extrinsic rewards while being evaluated with extrinsic rewards. However, none of the existing intrinsic reward approaches achieves human-level performance in this very challenging setting. In this work, we propose a novel megalomania-driven intrinsic reward (mega-reward) which, to our knowledge, is the first approach that achieves performance comparable to human level in intrinsically-motivated play. The intuition behind mega-reward comes from the observation that infants' intelligence develops as they try to gain more control over entities in an environment; therefore, mega-reward aims to maximize the control capabilities of agents over given entities in a given environment. To formalize mega-reward, a relational transition model is proposed to bridge the gap between direct and latent control. Experimental studies show that mega-reward (i) greatly outperforms all state-of-the-art intrinsic reward approaches, (ii) generally achieves the same level of performance as Ex-PPO and professional human-level scores, and (iii) also achieves superior performance when it is combined with extrinsic reward.
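The evaluation setting described above, intrinsically-motivated play, can be illustrated with a minimal sketch: the agent learns only from an intrinsic signal, while the extrinsic reward is recorded purely as a score. Everything in the block is an assumption made for illustration and is not taken from the paper: `ToyChain` is a hypothetical toy environment, `novelty_bonus` is a simple count-based placeholder that stands in for an intrinsic reward (it is not mega-reward and involves no relational transition model), and the update is a plain running-average rule rather than the authors' training procedure.

```python
import random
from collections import defaultdict


class ToyChain:
    """Hypothetical 1-D chain: extrinsic reward 1.0 on reaching the right end."""

    def __init__(self, length=5, max_steps=20):
        self.length = length
        self.max_steps = max_steps

    def reset(self):
        self.pos = 0
        self.t = 0
        return self.pos

    def step(self, action):
        # Action 1 moves right, any other action moves left (clamped to the chain).
        move = 1 if action == 1 else -1
        self.pos = max(0, min(self.length - 1, self.pos + move))
        self.t += 1
        reached_goal = self.pos == self.length - 1
        extrinsic = 1.0 if reached_goal else 0.0
        done = reached_goal or self.t >= self.max_steps
        return self.pos, extrinsic, done


def novelty_bonus(visit_counts, state):
    """Placeholder intrinsic reward (count-based novelty), NOT mega-reward."""
    visit_counts[state] += 1
    return 1.0 / visit_counts[state] ** 0.5


def run_episode(env, q_values, visit_counts, rng, epsilon=0.1, lr=0.5):
    """Learn from the intrinsic reward only; return the extrinsic score."""
    state = env.reset()
    extrinsic_return = 0.0
    done = False
    while not done:
        if rng.random() < epsilon:
            action = rng.randrange(2)
        else:
            action = max((0, 1), key=lambda a: q_values[(state, a)])
        next_state, extrinsic, done = env.step(action)
        intrinsic = novelty_bonus(visit_counts, next_state)
        # The learning update sees only the intrinsic signal.
        q_values[(state, action)] += lr * (intrinsic - q_values[(state, action)])
        # The extrinsic reward is logged for evaluation and never used to learn.
        extrinsic_return += extrinsic
        state = next_state
    return extrinsic_return


if __name__ == "__main__":
    rng = random.Random(0)
    env = ToyChain()
    q_values = defaultdict(float)
    visit_counts = defaultdict(int)
    scores = [run_episode(env, q_values, visit_counts, rng) for _ in range(20)]
    print("mean extrinsic score:", sum(scores) / len(scores))
```

The point of the sketch is the separation of signals: `extrinsic` never enters the update, so any extrinsic score the agent obtains is a by-product of its intrinsic objective, which is the property this evaluation protocol is meant to measure.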


