Scalable photonic reinforcement learning by time-division multiplexing of laser chaos

03/26/2018
by   Makoto Naruse, et al.
0

Reinforcement learning involves decision making in dynamic and uncertain environments and constitutes a crucial element of artificial intelligence. In our previous work, we experimentally demonstrated that the ultrafast chaotic oscillatory dynamics of lasers can be used to solve the two-armed bandit problem efficiently, which requires decision making concerning a class of difficult trade-offs called the exploration-exploitation dilemma. However, only two selections were employed in that research; thus, the scalability of the laser-chaos-based reinforcement learning should be clarified. In this study, we demonstrated a scalable, pipelined principle of resolving the multi-armed bandit problem by introducing time-division multiplexing of chaotically oscillated ultrafast time-series. The experimental demonstrations in which bandit problems with up to 64 arms were successfully solved are presented in this report. Detailed analyses are also provided that include performance comparisons among laser chaos signals generated in different physical conditions, which coincide with the diffusivity inherent in the time series. This study paves the way for ultrafast reinforcement learning by taking advantage of the ultrahigh bandwidths of light wave and practical enabling technologies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2017

Ultrafast photonic reinforcement learning based on laser chaos

Reinforcement learning involves decision making in dynamic and uncertain...
research
05/19/2022

Parallel bandit architecture based on laser chaos for reinforcement learning

Accelerating artificial intelligence by photonics is an active field of ...
research
03/30/2022

Theory of Acceleration of Decision Making by Correlated Times Sequences

Photonic accelerators have been intensively studied to provide enhanced ...
research
05/26/2020

Arm order recognition in multi-armed bandit problem with laser chaos time series

By exploiting ultrafast and irregular time series generated by lasers wi...
research
05/12/2022

Controlling chaotic itinerancy in laser dynamics for reinforcement learning

Photonic artificial intelligence has attracted considerable interest in ...
research
07/02/2021

Conflict-free collective stochastic decision making by orbital angular momentum entangled photons

In recent cross-disciplinary studies involving both optics and computing...
research
02/26/2016

Category theoretic foundation of single-photon-based decision making

Decision making is a vital function in the age of machine learning and a...

Please sign up or login with your details

Forgot password? Click here to reset