Efficient Beam Tree Recursion

07/20/2023
by   Jishnu Ray Chowdhury, et al.
0

Beam Tree Recursive Neural Network (BT-RvNN) was recently proposed as a simple extension of Gumbel Tree RvNN and it was shown to achieve state-of-the-art length generalization performance in ListOps while maintaining comparable performance on other tasks. However, although not the worst in its kind, BT-RvNN can be still exorbitantly expensive in memory usage. In this paper, we identify the main bottleneck in BT-RvNN's memory usage to be the entanglement of the scorer function and the recursive cell function. We propose strategies to remove this bottleneck and further simplify its memory usage. Overall, our strategies not only reduce the memory usage of BT-RvNN by 10-16 times but also create a new state-of-the-art in ListOps while maintaining similar performance in other tasks. In addition, we also propose a strategy to utilize the induced latent-tree node representations produced by BT-RvNN to turn BT-RvNN from a sentence encoder of the form f:ℝ^n × d→ℝ^d into a sequence contextualizer of the form f:ℝ^n × d→ℝ^n × d. Thus, our proposals not only open up a path for further scalability of RvNNs but also standardize a way to use BT-RvNNs as another building block in the deep learning toolkit that can be easily stacked or interfaced with other popular models such as Transformers and Structured State Space models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2023

Beam Tree Recursive Cells

We propose Beam Tree Recursive Cell (BT-Cell) - a backpropagation-friend...
research
01/07/2017

Structural Attention Neural Networks for improved sentiment analysis

We introduce a tree-structured attention neural network for sentences an...
research
11/06/2018

Learning to Embed Sentences Using Attentive Recursive Trees

Sentence embedding is an effective feature representation for most deep ...
research
02/08/2023

Ordered Memory Baselines

Natural language semantics can be modeled using the phrase-structured mo...
research
10/09/2020

Recursive Top-Down Production for Sentence Generation with Latent Trees

We model the recursive production property of context-free grammars for ...
research
11/05/2019

Memory Augmented Recursive Neural Networks

Recursive neural networks have shown an impressive performance for model...
research
04/16/2019

Expediting TTS Synthesis with Adversarial Vocoding

Recent approaches in text-to-speech (TTS) synthesis employ neural networ...

Please sign up or login with your details

Forgot password? Click here to reset