Quantum Policy Iteration via Amplitude Estimation and Grover Search – Towards Quantum Advantage for Reinforcement Learning

06/09/2022
by   Simon Wiedemann, et al.
0

We present a full implementation and simulation of a novel quantum reinforcement learning (RL) method and mathematically prove a quantum advantage. Our approach shows in detail how to combine amplitude estimation and Grover search into a policy evaluation and improvement scheme. We first develop quantum policy evaluation (QPE) which is quadratically more efficient compared to an analogous classical Monte Carlo estimation and is based on a quantum mechanical realization of a finite Markov decision process (MDP). Building on QPE, we derive a quantum policy iteration that repeatedly improves an initial policy using Grover search until the optimum is reached. Finally, we present an implementation of our algorithm for a two-armed bandit MDP which we then simulate. The results confirm that QPE provides a quantum advantage in RL problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2021

Variational Quantum Reinforcement Learning via Evolutionary Optimization

Recent advance in classical reinforcement learning (RL) and quantum comp...
research
03/03/2022

Quantum Reinforcement Learning via Policy Iteration

Quantum computing has shown the potential to substantially speed up mach...
research
02/16/2023

Quantum Computing Provides Exponential Regret Improvement in Episodic Reinforcement Learning

In this paper, we investigate the problem of episodic reinforcement lear...
research
06/05/2018

Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning

Exogenous state variables and rewards can slow down reinforcement learni...
research
02/10/2022

Uncovering Instabilities in Variational-Quantum Deep Q-Networks

Deep Reinforcement Learning (RL) has considerably advanced over the past...
research
07/03/2022

Government Intervention in Catastrophe Insurance Markets: A Reinforcement Learning Approach

This paper designs a sequential repeated game of a micro-founded society...
research
01/31/2022

Reinforcement Learning with Heterogeneous Data: Estimation and Inference

Reinforcement Learning (RL) has the promise of providing data-driven sup...

Please sign up or login with your details

Forgot password? Click here to reset