Building Blocks for a Complex-Valued Transformer Architecture

06/16/2023
by   Florian Eilers, et al.
0

Most deep learning pipelines are built on real-valued operations to deal with real-valued inputs such as images, speech or music signals. However, a lot of applications naturally make use of complex-valued signals or images, such as MRI or remote sensing. Additionally the Fourier transform of signals is complex-valued and has numerous applications. We aim to make deep learning directly applicable to these complex-valued signals without using projections into ℝ^2. Thus we add to the recent developments of complex-valued neural networks by presenting building blocks to transfer the transformer architecture to the complex domain. We present multiple versions of a complex-valued Scaled Dot-Product Attention mechanism as well as a complex-valued layer normalization. We test on a classification and a sequence generation task on the MusicNet dataset and show improved robustness to overfitting while maintaining on-par performance when compared to the real-valued transformer architecture.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2015

Learning Representations Using Complex-Valued Nets

Complex-valued neural networks (CVNNs) are an emerging field of research...
research
10/22/2019

Complex Transformer: A Framework for Modeling Complex-Valued Sequence

While deep learning has received a surge of interest in a variety of fie...
research
06/28/2023

Complex-valued Adaptive System Identification via Low-Rank Tensor Decomposition

Machine learning (ML) and tensor-based methods have been of significant ...
research
07/01/2017

Better than Real: Complex-valued Neural Nets for MRI Fingerprinting

The task of MRI fingerprinting is to identify tissue parameters from com...
research
07/28/2022

A Hybrid Complex-valued Neural Network Framework with Applications to Electroencephalogram (EEG)

In this article, we present a new EEG signal classification framework by...
research
10/18/2019

Surreal: Complex-Valued Deep Learning as Principled Transformations on a Rotational Lie Group

Complex-valued deep learning has attracted increasing attention in recen...
research
08/17/2022

Complex-Value Spatio-temporal Graph Convolutional Neural Networks and its Applications to Electric Power Systems AI

The effective representation, precessing, analysis, and visualization of...

Please sign up or login with your details

Forgot password? Click here to reset