K-spin Hamiltonian for quantum-resolvable Markov decision processes

04/13/2020
by   Eric B. Jones, et al.
0

The Markov decision process is the mathematical formalization underlying the modern field of reinforcement learning when transition and reward functions are unknown. We derive a pseudo-Boolean cost function that is equivalent to a K-spin Hamiltonian representation of the discrete, finite, discounted Markov decision process with infinite horizon. This K-spin Hamiltonian furnishes a starting point from which to solve for an optimal policy using heuristic quantum algorithms such as adiabatic quantum annealing and the quantum approximate optimization algorithm on near-term quantum hardware. In proving that the variational minimization of our Hamiltonian is equivalent to the Bellman optimality condition we establish an interesting analogy with classical field theory. Along with proof-of-concept calculations to corroborate our formulation by simulated and quantum annealing against classical Q-Learning, we analyze the scaling of physical resources required to solve our Hamiltonian on quantum hardware.

READ FULL TEXT
research
11/05/2019

A Note on Quantum Markov Models

The study of Markov models is central to control theory and machine lear...
research
06/05/2019

Quantum Algorithms for Solving Dynamic Programming Problems

We present quantum algorithms for solving finite-horizon and infinite-ho...
research
11/28/2019

A Data Driven Approach to Learning The Hamiltonian Matrix in Quantum Mechanics

We present a new machine learning technique which calculates a real-valu...
research
12/27/2019

Quantum Logic Gate Synthesis as a Markov Decision Process

Reinforcement learning has witnessed recent applications to a variety of...
research
10/25/2019

On the convergence of projective-simulation-based reinforcement learning in Markov decision processes

In recent years, the interest in leveraging quantum effects for enhancin...
research
11/17/2022

AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process

Reinforcement learning has recently been used to approach well-known NP-...
research
07/07/2022

Quantum Advantage in Variational Bayes Inference

Variational Bayes (VB) inference algorithm is used widely to estimate bo...

Please sign up or login with your details

Forgot password? Click here to reset