Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control

02/12/2018
by   Moritz August, et al.
0

In this work we introduce the application of black-box quantum control as an interesting rein- forcement learning problem to the machine learning community. We analyze the structure of the reinforcement learning problems arising in quantum physics and argue that agents parameterized by long short-term memory (LSTM) networks trained via stochastic policy gradients yield a general method to solving them. In this context we introduce a variant of the proximal policy optimization (PPO) algorithm called the memory proximal policy optimization (MPPO) which is based on this analysis. We then show how it can be applied to specific learning tasks and present results of nu- merical experiments showing that our method achieves state-of-the-art results for several learning tasks in quantum control with discrete and continouous control parameters.

READ FULL TEXT
research
08/29/2021

Photonic Quantum Policy Learning in OpenAI Gym

In recent years, near-term noisy intermediate scale quantum (NISQ) compu...
research
08/24/2018

Memory Time Span in LSTMs for Multi-Speaker Source Separation

With deep learning approaches becoming state-of-the-art in many speech (...
research
10/30/2019

Quantum Optical Experiments Modeled by Long Short-Term Memory

We demonstrate how machine learning is able to model experiments in quan...
research
02/20/2021

Decaying Clipping Range in Proximal Policy Optimization

Proximal Policy Optimization (PPO) is among the most widely used algorit...
research
03/20/2022

Variational Quantum Policy Gradients with an Application to Quantum Control

Quantum Machine Learning models are composed by Variational Quantum Circ...
research
10/05/2018

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Proximal Policy Optimization (PPO) is a highly popular model-free reinfo...
research
09/22/2020

Fast Black-Box Quantum State Preparation

Quantum state preparation is an important ingredient for other higher-le...

Please sign up or login with your details

Forgot password? Click here to reset