Better safe than sorry: Risky function exploitation through safe optimization

02/02/2016
by   Eric Schulz, et al.
0

Exploration-exploitation of functions, that is learning and optimizing a mapping between inputs and expected outputs, is ubiquitous to many real world situations. These situations sometimes require us to avoid certain outcomes at all cost, for example because they are poisonous, harmful, or otherwise dangerous. We test participants' behavior in scenarios in which they have to find the optimum of a function while at the same time avoid outputs below a certain threshold. In two experiments, we find that Safe-Optimization, a Gaussian Process-based exploration-exploitation algorithm, describes participants' behavior well and that participants seem to care firstly whether a point is safe and then try to pick the optimal point from all such safe points. This means that their trade-off between exploration and exploitation can be seen as an intelligent, approximate, and homeostasis-driven strategy.

READ FULL TEXT
research
05/15/2023

Mastering the exploration-exploitation trade-off in Bayesian Optimization

Gaussian Process based Bayesian Optimization is a well-known sample effi...
research
02/22/2018

Diverse Exploration for Fast and Safe Policy Improvement

We study an important yet under-addressed problem of quickly and safely ...
research
09/13/2019

ISL: Optimal Policy Learning With Optimal Exploration-Exploitation Trade-Off

Traditionally, off-policy learning algorithms (such as Q-learning) and e...
research
04/16/2023

Dynamic Exploration-Exploitation Trade-Off in Active Learning Regression with Bayesian Hierarchical Modeling

Active learning provides a framework to adaptively sample the most infor...
research
11/22/2021

Analysis of Exploration vs. Exploitation in Adaptive Information Sampling

Adaptive information sampling approaches enable efficient selection of m...
research
06/08/2018

Fidelity-based Probabilistic Q-learning for Control of Quantum Systems

The balance between exploration and exploitation is a key problem for re...
research
01/23/2021

Safe Learning and Optimization Techniques: Towards a Survey of the State of the Art

Safe learning and optimization deals with learning and optimization prob...

Please sign up or login with your details

Forgot password? Click here to reset