Human-AI Learning Performance in Multi-Armed Bandits

12/21/2018
by   Ravi Pandya, et al.
0

People frequently face challenging decision-making problems in which outcomes are uncertain or unknown. Artificial intelligence (AI) algorithms exist that can outperform humans at learning such tasks. Thus, there is an opportunity for AI agents to assist people in learning these tasks more effectively. In this work, we use a multi-armed bandit as a controlled setting in which to explore this direction. We pair humans with a selection of agents and observe how well each human-agent team performs. We find that team performance can beat both human and agent performance in isolation. Interestingly, we also find that an agent's performance in isolation does not necessarily correlate with the human-agent team's performance. A drop in agent performance can lead to a disproportionately large drop in team performance, or in some settings can even improve team performance. Pairing a human with an agent that performs slightly better than them can make them perform much better, while pairing them with an agent that performs the same can make them them perform much worse. Further, our results suggest that people have different exploration strategies and might perform better with agents that match their strategy. Overall, optimizing human-agent team performance requires going beyond optimizing agent performance, to understanding how the agent's suggestions will influence human decision-making.

READ FULL TEXT

page 2

page 3

research
10/02/2021

Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams

When humans collaborate with each other, they often make decisions by ob...
research
01/08/2022

Modeling Human-AI Team Decision Making

AI and humans bring complementary skills to group deliberations. Modelin...
research
08/17/2022

Commander's Intent: A Dataset and Modeling Approach for Human-AI Task Specification in Strategic Play

Effective Human-AI teaming requires the ability to communicate the goals...
research
05/02/2023

ADVISE: AI-accelerated Design of Evidence Synthesis for Global Development

When designing evidence-based policies and programs, decision-makers mus...
research
01/26/2022

Probe-Based Interventions for Modifying Agent Behavior

Neural nets are powerful function approximators, but the behavior of a g...
research
03/02/2023

Compensating for Sensing Failures via Delegation in Human-AI Hybrid Systems

Given an increasing prevalence of intelligent systems capable of autonom...
research
01/21/2023

My Actions Speak Louder Than Your Words: When User Behavior Predicts Their Beliefs about Agents' Attributes

An implicit expectation of asking users to rate agents, such as an AI de...

Please sign up or login with your details

Forgot password? Click here to reset