Commander's Intent: A Dataset and Modeling Approach for Human-AI Task Specification in Strategic Play

08/17/2022
by   Pradyumna Tambwekar, et al.
0

Effective Human-AI teaming requires the ability to communicate the goals of the team and constraints under which you need the agent to operate. Providing the ability to specify the shared intent or operation criteria of the team can enable an AI agent to perform its primary function while still being able to cater to the specific desires of the current team. While significant work has been conducted to instruct an agent to perform a task, via language or demonstrations, prior work lacks a focus on building agents which can operate within the parameters specified by a team. Worse yet, there is a dearth of research pertaining to enabling humans to provide their specifications through unstructured, naturalist language. In this paper, we propose the use of goals and constraints as a scaffold to modulate and evaluate autonomous agents. We contribute to this field by presenting a novel dataset, and an associated data collection protocol, which maps language descriptions to goals and constraints corresponding to specific strategies developed by human participants for the board game Risk. Leveraging state-of-the-art language models and augmentation procedures, we develop a machine learning framework which can be used to identify goals and constraints from unstructured strategy descriptions. To empirically validate our approach we conduct a human-subjects study to establish a human-baseline for our dataset. Our results show that our machine learning architecture is better able to interpret unstructured language descriptions into strategy specifications than human raters tasked with performing the same machine translation task (F(1,272.53) = 17.025, p < 0.001).

READ FULL TEXT

page 8

page 15

research
09/13/2020

Pow-Wow: A Dataset and Study on Collaborative Communication in Pommerman

In multi-agent learning, agents must coordinate with each other in order...
research
12/21/2018

Human-AI Learning Performance in Multi-Armed Bandits

People frequently face challenging decision-making problems in which out...
research
03/07/2021

Adaptive Agent Architecture for Real-time Human-Agent Teaming

Teamwork is a set of interrelated reasoning, actions and behaviors of te...
research
08/04/2022

Creative Wand: A System to Study Effects of Communications in Co-Creative Settings

Recent neural generation systems have demonstrated the potential for pro...
research
07/05/2021

The MineRL BASALT Competition on Learning from Human Feedback

The last decade has seen a significant increase of interest in deep lear...
research
09/29/2021

Collaborative Storytelling with Human Actors and AI Narrators

Large language models can be used for collaborative storytelling. In thi...
research
10/28/2021

Human-Computer Interaction Glow Up: Examining Operational Trust and Intention Towards Mars Autonomous Systems

Tactful coordination on earth between hundreds of operators from diverse...

Please sign up or login with your details

Forgot password? Click here to reset