State Space Closure: Revisiting Endless Online Level Generation via Reinforcement Learning

12/06/2022
by   Ziqi Wang, et al.
0

In this paper we revisit endless online level generation with the recently proposed experience-driven procedural content generation via reinforcement learning (EDRL) framework, from an observation that EDRL tends to generate recurrent patterns. Inspired by this phenomenon, we formulate a notion of state space closure, which means that any state that may appear in an infinite-horizon online generation process can be found in a finite horizon. Through theoretical analysis we find that though state space closure arises a concern about diversity, it makes the EDRL trained on a finite-horizon generalised to the infinite-horizon scenario without deterioration of content quality. Moreover, we verify the quality and diversity of contents generated by EDRL via empirical studies on the widely used Super Mario Bros. benchmark. Experimental results reveal that the current EDRL approach's ability of generating diverse game levels is limited due to the state space closure, whereas it does not suffer from reward deterioration given a horizon longer than the one of training. Concluding our findings and analysis, we argue that future works in generating online diverse and high-quality contents via EDRL should address the issue of diversity on the premise of state space closure which ensures the quality.

READ FULL TEXT
research
06/30/2021

Experience-Driven PCG via Reinforcement Learning: A Super Mario Bros Study

We introduce a procedural content generation (PCG) framework at the inte...
research
07/13/2023

On the Effective Horizon of Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) algorithms often rely on (forward) ...
research
07/12/2022

Online Game Level Generation from Music

Game consists of multiple types of content, while the harmony of differe...
research
06/26/2019

A Tractable Algorithm For Finite-Horizon Continuous Reinforcement Learning

We consider the finite horizon continuous reinforcement learning problem...
research
03/18/2022

Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning

In this paper, we consider the infinite-horizon reach-avoid zero-sum gam...
research
07/12/2022

Unsupervised learning of observation functions in state-space models by nonparametric moment methods

We investigate the unsupervised learning of non-invertible observation f...
research
12/03/2018

Conscious enactive computation

This paper looks at recent debates in the enactivist literature on compu...

Please sign up or login with your details

Forgot password? Click here to reset