Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration

11/03/2022
by   Mesut Yang, et al.

AI agents designed to collaborate with people benefit from models that enable them to anticipate human behavior. However, realistic models tend to require vast amounts of human data, which is often hard to collect. A good prior or initialization could make training more data-efficient, but what makes for a good prior on human behavior? Our work leverages a very simple assumption: people generally act closer to optimal than to random chance. We show that using optimal behavior as a prior for human models makes these models vastly more data-efficient and able to generalize to new environments. Our intuition is that such a prior lets training spend scarce real-world data on capturing the subtle nuances of human suboptimality, rather than on the basics of how to do the task in the first place. We also show that using these improved human models often leads to better human-AI collaboration performance compared to using models based on real human data alone.
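The idea of an optimal-behavior prior can be illustrated with a toy sketch. This is not the authors' method: it is a minimal tabular example that assumes a small discrete state and action space, a known optimal action per state, and a Dirichlet prior whose pseudo-counts concentrate on that optimal action. The function name `fit_human_model` and the parameters `prior_strength` and `eps` are hypothetical, introduced only for illustration.

```python
import numpy as np

def fit_human_model(human_counts, optimal_actions, prior_strength=5.0, eps=0.1):
    """Posterior-mean action probabilities per state.

    Assumption (illustrative only): a Dirichlet prior per state that puts
    a fraction (1 - eps) of its pseudo-counts on the optimal action and
    spreads the remaining eps evenly over the other actions.
    Requires at least two actions.
    """
    n_states, n_actions = human_counts.shape
    # Pseudo-counts for non-optimal actions.
    prior = np.full((n_states, n_actions), prior_strength * eps / (n_actions - 1))
    # Pseudo-counts for the optimal action in each state.
    prior[np.arange(n_states), optimal_actions] = prior_strength * (1.0 - eps)
    # Observed human action counts update the prior.
    posterior = prior + human_counts
    return posterior / posterior.sum(axis=1, keepdims=True)

# Toy data: rows are states, columns are actions.
human_counts = np.array([
    [0, 0, 0],   # state 0: no human data collected
    [1, 9, 0],   # state 1: people mostly choose action 1
])
optimal_actions = np.array([2, 0])  # assumed optimal action per state

model = fit_human_model(human_counts, optimal_actions)
```

In this sketch, a state with no human data falls back to near-optimal behavior, while a state with enough human data reflects the observed suboptimal choice. That mirrors the intuition in the abstract: the prior covers the basics of the task, and the human data only needs to capture deviations from optimal play.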

research
04/22/2022

The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models

Models of human behavior for prediction and collaboration tend to fall i...
research
01/06/2023

Improving Human-AI Collaboration With Descriptions of AI Behavior

People work with AI systems to improve their decision making, but often ...
research
03/02/2023

Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games

We aim to understand how people assess human likeness in navigation prod...
research
04/12/2021

Building Mental Models through Preview of Autopilot Behaviors

Effective human-vehicle collaboration requires an appropriate understan...
research
03/06/2023

Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

Human models play a crucial role in human-robot interaction (HRI), enabl...
research
04/19/2023

Multipar-T: Multiparty-Transformer for Capturing Contingent Behaviors in Group Conversations

As we move closer to real-world AI systems, AI agents must be able to de...
research
12/15/2020

Indecision Modeling

AI systems are often used to make or contribute to important decisions i...
