Polycraft World AI Lab (PAL): An Extensible Platform for Evaluating Artificial Intelligence Agents

01/27/2023
by Stephen A. Goss, et al.

As artificial intelligence research advances, the platforms used to evaluate AI agents need to adapt and grow to continue to challenge them. We present the Polycraft World AI Lab (PAL), a task simulator with an API based on the Minecraft mod Polycraft World. Our platform is built to allow AI agents with different architectures to easily interact with the Minecraft world and to train and be evaluated on multiple tasks. PAL enables flexible task creation and can manipulate any aspect of a task during an evaluation. All actions taken by AI agents and external actors (non-player characters, NPCs) in the open-world environment are logged to streamline evaluation. Here we present two custom tasks on the PAL platform, one focused on multi-step planning and one focused on navigation, and evaluations of agents solving them. In summary, we report a versatile and extensible AI evaluation platform with a low barrier to entry for AI researchers.


