Non-local Optimization: Imposing Structure on Optimization Problems by Relaxation
In stochastic optimization, particularly in evolutionary computation and reinforcement learning, the optimization of a function f: Ω→ℝ is often addressed through optimizing a so-called relaxation θ∈Θ↦𝔼_θ(f) of f, where Θ resembles the parameters of a family of probability measures on Ω. We investigate the structure of such relaxations by means of measure theory and Fourier analysis, enabling us to shed light on the success of many associated stochastic optimization methods. The main structural traits we derive and that allow fast and reliable optimization of relaxations are the resemblance of optimal values of f, Lipschitzness of gradients, and convexity. We emphasize settings where f does not involve the latter structure, e.g., in the presence of (stochastic) disturbance.
READ FULL TEXT