Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks

06/11/2019
by   Yi Tay, et al.

Many state-of-the-art neural models for natural language processing (NLP) are heavily parameterized and thus memory inefficient. This paper proposes a series of lightweight and memory-efficient neural architectures for a potpourri of NLP tasks. To this end, our models exploit computation using Quaternion algebra and hypercomplex spaces, enabling not only expressive inter-component interactions but also a significantly (75%) reduced parameter size, owing to the fewer degrees of freedom in the Hamilton product. We propose Quaternion variants of models, giving rise to new architectures such as the Quaternion Attention Model and Quaternion Transformer. Extensive experiments on a battery of NLP tasks demonstrate the utility of the proposed Quaternion-inspired models, enabling up to a 75% reduction in parameter size without significant loss in performance.
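
The 75% figure can be illustrated with a small sketch. It is not the authors' implementation. It assumes the usual construction in which a real vector of size d is viewed as d/4 quaternions and a quaternion weight consists of four real component matrices that are shared across the four output components through the Hamilton product. The names `hamilton_product`, `quaternion_linear`, and `param_counts` are hypothetical and chosen only for this example.

```python
import numpy as np

def hamilton_product(q, p):
    """Hamilton product q * p of two quaternions stored as (r, i, j, k)."""
    a, b, c, d = q
    r, x, y, z = p
    return np.array([
        a * r - b * x - c * y - d * z,  # real part
        a * x + b * r + c * z - d * y,  # i part
        a * y - b * z + c * r + d * x,  # j part
        a * z + b * y - c * x + d * r,  # k part
    ])

def quaternion_linear(W, x):
    """Quaternion-valued linear map (assumed layout, no bias).

    W: array of shape (4, n_out, n_in), the r, i, j, k weight components.
    x: array of shape (4, n_in), the r, i, j, k input components.
    Returns an array of shape (4, n_out).
    """
    Wr, Wi, Wj, Wk = W
    xr, xi, xj, xk = x
    # Same sign pattern as hamilton_product, with matrix-vector products.
    return np.stack([
        Wr @ xr - Wi @ xi - Wj @ xj - Wk @ xk,
        Wr @ xi + Wi @ xr + Wj @ xk - Wk @ xj,
        Wr @ xj - Wi @ xk + Wj @ xr + Wk @ xi,
        Wr @ xk + Wi @ xj - Wj @ xi + Wk @ xr,
    ])

def param_counts(d_in, d_out):
    """Weight counts for a real layer vs. a quaternion layer of equal width."""
    real = d_in * d_out
    # Four component matrices, each of shape (d_out / 4, d_in / 4).
    quat = 4 * (d_in // 4) * (d_out // 4)
    return real, quat

real, quat = param_counts(512, 512)
print(real, quat, 1 - quat / real)
```

Under these assumptions, each of the four weight matrices is reused in all four output components, so a layer mapping 512 real features to 512 real features needs 4 × 128 × 128 weights instead of 512 × 512, which is one quarter of the original count.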

research
08/16/2019

An Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing

Multi-Task Learning (MTL) aims at boosting the overall performance of ea...
research
02/07/2017

Comparative Study of CNN and RNN for Natural Language Processing

Deep neural networks (DNN) have revolutionized the field of natural lang...
research
01/24/2020

Compressing Language Models using Doped Kronecker Products

Kronecker Products (KP) have been used to compress IoT RNN Applications ...
research
03/31/2023

BERTino: an Italian DistilBERT model

The recent introduction of Transformers language representation models a...
research
04/05/2021

Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modelling

As a well-established approach, factorization machine (FM) is capable of...
research
11/10/2017

Efficient Representation for Natural Language Processing via Kernelized Hashcodes

Kernel similarity functions have been successfully applied in classifica...
research
02/04/2020

Lightweight Convolutional Representations for On-Device Natural Language Processing

The increasing computational and memory complexities of deep neural netw...
