Differential Assessment of Black-Box AI Agents

03/24/2022
by Rashmeet Kaur Nayyar, et al.

Much of the research on learning symbolic models of AI agents focuses on agents with stationary models. This assumption fails to hold in settings where the agent's capabilities may change as a result of learning, adaptation, or other post-deployment modifications. Efficient assessment of agents in such settings is critical for learning the true capabilities of an AI system and for ensuring its safe usage. In this work, we propose a novel approach to differentially assess black-box AI agents that have drifted from their previously known models. As a starting point, we consider the fully observable and deterministic setting. We leverage sparse observations of the drifted agent's current behavior and knowledge of its initial model to generate an active querying policy that selectively queries the agent and computes an updated model of its functionality. Empirical evaluation shows that our approach is much more efficient than re-learning the agent model from scratch. We also show that the cost of differential assessment using our method is proportional to the amount of drift in the agent's functionality.
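The idea of assessing only what has drifted can be illustrated with a toy sketch. It is not the paper's algorithm: the STRIPS-like action encoding, the function names (`predict`, `flag_drifted`, `differential_assess`), the `probe_states` argument, and the example domain are all assumptions made for illustration. The sketch uses sparse observations to flag actions whose observed behavior contradicts the previously known model, then queries the agent only about those actions. It stops short of learning an updated model from the answers.

```python
from typing import Callable, Dict, FrozenSet, List, Set, Tuple

State = FrozenSet[str]
# Hypothetical STRIPS-like action: (preconditions, add effects, delete effects).
Action = Tuple[FrozenSet[str], FrozenSet[str], FrozenSet[str]]
Model = Dict[str, Action]
Observation = Tuple[State, str, State]  # (state, action name, next state)


def predict(model: Model, state: State, name: str) -> State:
    """Next state under the model; unchanged if the preconditions do not hold."""
    pre, add, delete = model[name]
    if not pre <= state:
        return state
    return (state - delete) | add


def flag_drifted(model: Model, observations: List[Observation]) -> Set[str]:
    """Actions with at least one observed transition the known model cannot explain."""
    return {name for s, name, s_next in observations
            if predict(model, s, name) != s_next}


def differential_assess(
    model: Model,
    observations: List[Observation],
    query_agent: Callable[[State, str], State],
    probe_states: List[State],
):
    """Query the agent only about flagged actions; return its answers and the query count."""
    evidence: Dict[str, List[Tuple[State, State]]] = {}
    queries = 0
    for name in sorted(flag_drifted(model, observations)):
        evidence[name] = []
        for s in probe_states:
            evidence[name].append((s, query_agent(s, name)))
            queries += 1
    return evidence, queries


# Assumed toy domain: the "drop" action has drifted and now also adds "dirty".
known: Model = {
    "pick": (frozenset({"handempty"}), frozenset({"holding"}), frozenset({"handempty"})),
    "drop": (frozenset({"holding"}), frozenset({"handempty"}), frozenset({"holding"})),
}
drifted: Model = dict(known)
drifted["drop"] = (frozenset({"holding"}), frozenset({"handempty", "dirty"}),
                   frozenset({"holding"}))


def agent(state: State, name: str) -> State:
    """Stand-in for the black-box agent: it answers queries using the drifted model."""
    return predict(drifted, state, name)


obs: List[Observation] = [
    (frozenset({"handempty"}), "pick", frozenset({"holding"})),
    (frozenset({"holding"}), "drop", frozenset({"handempty", "dirty"})),
]
probes: List[State] = [frozenset({"holding"}), frozenset({"handempty"})]
evidence, queries = differential_assess(known, obs, agent, probes)
```

In this sketch the number of queries is the number of flagged actions times the number of probe states, so it grows with the number of actions that appear to have changed. This only loosely mirrors the abstract's claim that assessment cost is proportional to the amount of drift.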


