Modeling Hierarchical Structures with Continuous Recursive Neural Networks

06/10/2021
by   Jishnu Ray Chowdhury, et al.
0

Recursive Neural Networks (RvNNs), which compose sequences according to their underlying hierarchical syntactic structure, have performed well in several natural language processing tasks compared to similar models without structural biases. However, traditional RvNNs are incapable of inducing the latent structure in a plain text sequence on their own. Several extensions have been proposed to overcome this limitation. Nevertheless, these extensions tend to rely on surrogate gradients or reinforcement learning at the cost of higher bias or variance. In this work, we propose Continuous Recursive Neural Network (CRvNN) as a backpropagation-friendly alternative to address the aforementioned limitations. This is done by incorporating a continuous relaxation to the induced structure. We demonstrate that CRvNN achieves strong performance in challenging synthetic tasks such as logical inference and ListOps. We also show that CRvNN performs comparably or better than prior latent structure models on real-world tasks such as sentiment analysis and natural language inference.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2023

Discrete Latent Structure in Neural Networks

Many types of data from fields including natural language processing, co...
research
01/03/2022

Learning with Latent Structures in Natural Language Processing: A Survey

While end-to-end learning with fully differentiable models has enabled t...
research
01/13/2013

Cutting Recursive Autoencoder Trees

Deep Learning models enjoy considerable success in Natural Language Proc...
research
05/31/2023

Beam Tree Recursive Cells

We propose Beam Tree Recursive Cell (BT-Cell) - a backpropagation-friend...
research
02/25/2019

Cooperative Learning of Disjoint Syntax and Semantics

There has been considerable attention devoted to models that learn to jo...
research
11/28/2021

FastTrees: Parallel Latent Tree-Induction for Faster Sequence Encoding

Inducing latent tree structures from sequential data is an emerging tren...

Please sign up or login with your details

Forgot password? Click here to reset