Acceleration Method for Learning Fine-Layered Optical Neural Networks

09/01/2021
by   Kazuo Aoyama, et al.
0

An optical neural network (ONN) is a promising system due to its high-speed and low-power operation. Its linear unit performs a multiplication of an input vector and a weight matrix in optical analog circuits. Among them, a circuit with a multiple-layered structure of programmable Mach-Zehnder interferometers (MZIs) can realize a specific class of unitary matrices with a limited number of MZIs as its weight matrix. The circuit is effective for balancing the number of programmable MZIs and ONN performance. However, it takes a lot of time to learn MZI parameters of the circuit with a conventional automatic differentiation (AD), which machine learning platforms are equipped with. To solve the time-consuming problem, we propose an acceleration method for learning MZI parameters. We create customized complex-valued derivatives for an MZI, exploiting Wirtinger derivatives and a chain rule. They are incorporated into our newly developed function module implemented in C++ to collectively calculate their values in a multi-layered structure. Our method is simple, fast, and versatile as well as compatible with the conventional AD. We demonstrate that our method works 20 times faster than the conventional AD when a pixel-by-pixel MNIST task is performed in a complex-valued recurrent neural network with an MZI-based hidden unit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2009

Practical approach to programmable analog circuits with memristors

We suggest an approach to use memristors (resistors with memory) in prog...
research
05/31/2023

M3ICRO: Machine Learning-Enabled Compact Photonic Tensor Core based on PRogrammable Multi-Operand Multimode Interference

Photonic computing shows promise for transformative advancements in mach...
research
09/13/2019

Electro-optical Neural Networks based on Time-stretch Method

In this paper, a novel architecture of electro-optical neural networks b...
research
05/18/2018

Knowledge Discovery from Layered Neural Networks based on Non-negative Task Decomposition

Interpretability has become an important issue in the machine learning f...
research
11/05/2016

Neuromorphic Silicon Photonic Networks

Photonic systems for high-performance information processing have attrac...
research
03/01/2017

Modular Representation of Layered Neural Networks

Layered neural networks have greatly improved the performance of various...

Please sign up or login with your details

Forgot password? Click here to reset