Compressing deep quaternion neural networks with targeted regularization

07/26/2019
by   Riccardo Vecchi, et al.
0

In recent years, hyper-complex deep networks (e.g., quaternion-based) have received increasing interest with applications ranging from image reconstruction to 3D audio processing. Similarly to their real-valued counterparts, quaternion neural networks might require custom regularization strategies to avoid overfitting. In addition, for many real-world applications and embedded implementations there is the need of designing sufficiently compact networks, with as few weights and units as possible. However, the problem of how to regularize and/or sparsify quaternion-valued networks has not been properly addressed in the literature as of now. In this paper we show how to address both problems by designing targeted regularization strategies, able to minimize the number of connections and neurons of the network during training. To this end, we investigate two extensions of ℓ_1 and structured regularization to the quaternion domain. In our experimental evaluation, we show that these tailored strategies significantly outperform classical (real-valued) regularization strategies, resulting in small networks especially suitable for low-power and real-time applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/03/2020

Complex-Valued Convolutional Neural Networks for MRI Reconstruction

Many real-world signal sources are complex-valued, having real and imagi...
research
01/11/2023

Rethinking complex-valued deep neural networks for monaural speech enhancement

Despite multiple efforts made towards adopting complex-valued deep neura...
research
09/17/2020

Complex-Valued vs. Real-Valued Neural Networks for Classification Perspectives: An Example on Non-Circular Data

The contributions of this paper are twofold. First, we show the potentia...
research
12/13/2017

Deep Quaternion Networks

The field of deep learning has seen significant advancement in recent ye...
research
08/15/2021

Towards Understanding Theoretical Advantages of Complex-Reaction Networks

Complex-valued neural networks have attracted increasing attention in re...
research
02/29/2016

On Complex Valued Convolutional Neural Networks

Convolutional neural networks (CNNs) are the cutting edge model for supe...
research
09/04/2022

Recurrent Bilinear Optimization for Binary Neural Networks

Binary Neural Networks (BNNs) show great promise for real-world embedded...

Please sign up or login with your details

Forgot password? Click here to reset