Grouped self-attention mechanism for a memory-efficient Transformer

10/02/2022
by   Bumjun Jung, et al.
0

Time-series data analysis is important because numerous real-world tasks such as forecasting weather, electricity consumption, and stock market involve predicting data that vary over time. Time-series data are generally recorded over a long period of observation with long sequences owing to their periodic characteristics and long-range dependencies over time. Thus, capturing long-range dependency is an important factor in time-series data forecasting. To solve these problems, we proposed two novel modules, Grouped Self-Attention (GSA) and Compressed Cross-Attention (CCA). With both modules, we achieved a computational space and time complexity of order O(l) with a sequence length l under small hyperparameter limitations, and can capture locality while considering global information. The results of experiments conducted on time-series datasets show that our proposed model efficiently exhibited reduced computational complexity and performance comparable to or better than existing methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2020

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Many real-world applications require the prediction of long sequence tim...
research
06/29/2019

Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting

Time series forecasting is an important problem across many domains, inc...
research
01/04/2023

Infomaxformer: Maximum Entropy Transformer for Long Time-Series Forecasting Problem

The Transformer architecture yields state-of-the-art results in many tas...
research
08/26/2023

Multivariate time series classification with dual attention network

One of the topics in machine learning that is becoming more and more rel...
research
11/13/2022

HigeNet: A Highly Efficient Modeling for Long Sequence Time Series Prediction in AIOps

Modern IT system operation demands the integration of system software an...
research
01/05/2023

Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution

Long-term time-series forecasting (LTTF) has become a pressing demand in...
research
03/01/2019

Dominant Dataset Selection Algorithms for Time-Series Data Based on Linear Transformation

With the explosive growth of time-series data, the scale of time-series ...

Please sign up or login with your details

Forgot password? Click here to reset