Factorized Machine Self-Confidence for Decision-Making Agents

10/15/2018
by   Brett W Israelsen, et al.
0

Algorithmic assurances from advanced autonomous systems assist human users in understanding, trusting, and using such systems appropriately. Designing these systems with the capacity of assessing their own capabilities is one approach to creating an algorithmic assurance. The idea of `machine self-confidence' is introduced for autonomous systems. Using a factorization based framework for self-confidence assessment, one component of self-confidence, called `solver-quality', is discussed in the context of Markov decision processes for autonomous systems. Markov decision processes underlie much of the theory of reinforcement learning, and are commonly used for planning and decision making under uncertainty in robotics and autonomous systems. A `solver quality' metric is formally defined in the context of decision making algorithms based on Markov decision processes. A method for assessing solver quality is then derived, drawing inspiration from empirical hardness models. Finally, numerical experiments for an unmanned autonomous vehicle navigation problem under different solver, parameter, and environment conditions indicate that the self-confidence metric exhibits the desired properties. Discussion of results, and avenues for future investigation are included.

READ FULL TEXT
research
10/15/2018

Machine Self-Confidence in Autonomous Systems via Meta-Analysis of Decision Processes

Algorithmic assurances from advanced autonomous systems assist human use...
research
03/22/2022

A Factor-Based Framework for Decision-Making Competency Self-Assessment

We summarize our efforts to date in developing a framework for generatin...
research
06/05/2012

A Mixed Observability Markov Decision Process Model for Musical Pitch

Partially observable Markov decision processes have been widely used to ...
research
11/05/2021

Regular Decision Processes for Grid Worlds

Markov decision processes are typically used for sequential decision mak...
research
07/16/2022

ChronosPerseus: Randomized Point-based Value Iteration with Importance Sampling for POSMDPs

In reinforcement learning, agents have successfully used environments mo...
research
06/07/2020

Implications of Human Irrationality for Reinforcement Learning

Recent work in the behavioural sciences has begun to overturn the long-h...
research
09/20/2022

Adaptive and Collaborative Bathymetric Channel-Finding Approach for Multiple Autonomous Marine Vehicle

This paper reports an investigation into the problem of rapid identifica...

Please sign up or login with your details

Forgot password? Click here to reset