Jelly Bean World: A Testbed for Never-Ending Learning

Machine learning has shown growing success in recent years. However, current machine learning systems are highly specialized, trained for particular problems or domains, and typically on a single narrow dataset. Human learning, on the other hand, is highly general and adaptable. Never-ending learning is a machine learning paradigm that aims to bridge this gap, with the goal of encouraging researchers to design machine learning systems that can learn to perform a wider variety of inter-related tasks in more complex environments. To date, there is no environment or testbed to facilitate the development and evaluation of never-ending learning systems. To this end, we propose the Jelly Bean World testbed. The Jelly Bean World allows experimentation over two-dimensional grid worlds which are filled with items and in which agents can navigate. This testbed provides environments that are sufficiently complex and where more generally intelligent algorithms ought to perform better than current state-of-the-art reinforcement learning approaches. It does so by producing non-stationary environments and facilitating experimentation with multi-task, multi-agent, multi-modal, and curriculum learning settings. We hope that this new freely-available software will prompt new research and interest in the development and evaluation of never-ending learning systems and more broadly, general intelligence systems.


page 1

page 2

page 7

page 8

page 11

page 12

page 14

page 16


An investigation of licensing of datasets for machine learning based on the GQM model

Dataset licensing is currently an issue in the development of machine le...

Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

Many real-world tasks involve multiple agents with partial observability...

Continuous Coordination As a Realistic Scenario for Lifelong Learning

Current deep reinforcement learning (RL) algorithms are still highly tas...

A Testbed for Carbon-Aware Applications and Systems

To mitigate the growing carbon footprint of computing systems, there has...

Recursive Experts: An Efficient Optimal Mixture of Learning Systems in Dynamic Environments

Sequential learning systems are used in a wide variety of problems from ...

Is Writing Prompts Really Making Art?

In recent years Generative Machine Learning systems have advanced signif...

Myriad: a real-world testbed to bridge trajectory optimization and deep learning

We present Myriad, a testbed written in JAX for learning and planning in...

Please sign up or login with your details

Forgot password? Click here to reset