Adaptive Simulation-based Training of AI Decision-makers using Bayesian Optimization

03/27/2017
by   Brett W Israelsen, et al.
0

This work studies how an AI-controlled dog-fighting agent with tunable decision-making parameters can learn to optimize performance against an intelligent adversary, as measured by a stochastic objective function evaluated on simulated combat engagements. Gaussian process Bayesian optimization (GPBO) techniques are developed to automatically learn global Gaussian Process (GP) surrogate models, which provide statistical performance predictions in both explored and unexplored areas of the parameter space. This allows a learning engine to sample full-combat simulations at parameter values that are most likely to optimize performance and also provide highly informative data points for improving future predictions. However, standard GPBO methods do not provide a reliable surrogate model for the highly volatile objective functions found in aerial combat, and thus do not reliably identify global maxima. These issues are addressed by novel Repeat Sampling (RS) and Hybrid Repeat/Multi-point Sampling (HRMS) techniques. Simulation studies show that HRMS improves the accuracy of GP surrogate models, allowing AI decision-makers to more accurately predict performance and efficiently tune parameters.

READ FULL TEXT

page 37

page 39

research
12/13/2016

Towards Adaptive Training of Agent-based Sparring Partners for Fighter Pilots

A key requirement for the current generation of artificial decision-make...
research
12/13/2016

Hybrid Repeat/Multi-point Sampling for Highly Volatile Objective Functions

A key drawback of the current generation of artificial decision-makers i...
research
05/31/2023

A Study of Bayesian Neural Network Surrogates for Bayesian Optimization

Bayesian optimization is a highly efficient approach to optimizing objec...
research
07/18/2017

Robust Bayesian Optimization with Student-t Likelihood

Bayesian optimization has recently attracted the attention of the automa...
research
07/23/2018

Weak in the NEES?: Auto-tuning Kalman Filters with Bayesian Optimization

Kalman filters are routinely used for many data fusion applications incl...
research
06/11/2023

Additive Multi-Index Gaussian process modeling, with application to multi-physics surrogate modeling of the quark-gluon plasma

The Quark-Gluon Plasma (QGP) is a unique phase of nuclear matter, theori...
research
06/29/2019

Multi-objective multi-generation Gaussian process optimizer for design optimization

We present a multi-objective optimization algorithm that uses Gaussian p...

Please sign up or login with your details

Forgot password? Click here to reset