Explaining Agent Behavior with Large Language Models

09/19/2023
by   Xijia Zhang, et al.
0

Intelligent agents such as robots are increasingly deployed in real-world, safety-critical settings. It is vital that these agents are able to explain the reasoning behind their decisions to human counterparts, however, their behavior is often produced by uninterpretable models such as deep neural networks. We propose an approach to generate natural language explanations for an agent's behavior based only on observations of states and actions, agnostic to the underlying model representation. We show how a compact representation of the agent's behavior can be learned and used to produce plausible explanations with minimal hallucination while affording user interaction with a pre-trained large language model. Through user studies and empirical experiments, we show that our approach generates explanations as helpful as those generated by a human domain expert while enabling beneficial interactions such as clarification and counterfactual queries.

READ FULL TEXT

page 3

page 4

research
05/22/2023

MaNtLE: Model-agnostic Natural Language Explainer

Understanding the internal reasoning behind the predictions of machine l...
research
11/01/2019

Generating Justifications for Norm-Related Agent Decisions

We present an approach to generating natural language justifications of ...
research
09/14/2020

An Argumentation-based Approach for Explaining Goal Selection in Intelligent Agents

During the first step of practical reasoning, i.e. deliberation or goals...
research
04/07/2023

Generative Agents: Interactive Simulacra of Human Behavior

Believable proxies of human behavior can empower interactive application...
research
07/31/2023

Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?

Coaxing out desired behavior from pretrained models, while avoiding unde...
research
03/19/2021

Semantic Contextual Reasoning to Provide Human Behavior

In recent years, the world has witnessed various primitives pertaining t...
research
09/27/2017

WHY: Natural Explanations from a Robot Navigator

Effective collaboration between a robot and a person requires natural co...

Please sign up or login with your details

Forgot password? Click here to reset