Approximate FPGA-based LSTMs under Computation Time Constraints

01/07/2018
by   Michalis Rizakis, et al.
0

Recurrent Neural Networks and in particular Long Short-Term Memory (LSTM) networks have demonstrated state-of-the-art accuracy in several emerging Artificial Intelligence tasks. However, the models are becoming increasingly demanding in terms of computational and memory load. Emerging latency-sensitive applications including mobile robots and autonomous vehicles often operate under stringent computation time constraints. In this paper, we address the challenge of deploying computationally demanding LSTMs at a constrained time budget by introducing an approximate computing scheme that combines iterative low-rank compression and pruning, along with a novel FPGA-based LSTM architecture. Combined in an end-to-end framework, the approximation method's parameters are optimised and the architecture is configured to address the problem of high-performance LSTM execution in time-constrained applications. Quantitative evaluation on a real-life image captioning application indicates that the proposed methods required up to 6.5x less time to achieve the same application-level accuracy compared to a baseline method, while achieving an average of 25x higher accuracy under the same computation time constraints.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/09/2016

Learning Compact Recurrent Neural Networks

Recurrent neural networks (RNNs), including long short-term memory (LSTM...
research
05/02/2019

Approximate LSTMs for Time-Constrained Inference: Enabling Fast Reaction in Self-Driving Cars

The need to recognise long-term dependencies in sequential data such as ...
research
05/23/2018

CNN+CNN: Convolutional Decoders for Image Captioning

Image captioning is a challenging task that combines the field of comput...
research
05/30/2018

Grow and Prune Compact, Fast, and Accurate LSTMs

Long short-term memory (LSTM) has been widely used for sequential data m...
research
09/02/2023

Accelerating LSTM-based High-Rate Dynamic System Models

In this paper, we evaluate the use of a trained Long Short-Term Memory (...
research
05/30/2018

Grow and Prune Compact, Fast, and AccurateLSTMs

Long short-term memory (LSTM) has been widely used for sequential data m...
research
10/27/2021

End-to-end LSTM based estimation of volcano event epicenter localization

In this paper, an end-to-end based LSTM scheme is proposed to address th...

Please sign up or login with your details

Forgot password? Click here to reset