Unsupervised Latent Tree Induction with Deep Inside-Outside Recursive Autoencoders

04/03/2019
by   Andrew Drozdov, et al.
0

We introduce deep inside-outside recursive autoencoders (DIORA), a fully-unsupervised method for discovering syntax that simultaneously learns representations for constituents within the induced tree. Our approach predicts each word in an input sentence conditioned on the rest of the sentence and uses inside-outside dynamic programming to consider all possible binary trees over the sentence. At test time the CKY algorithm extracts the highest scoring parse. DIORA achieves a new state-of-the-art F1 in unsupervised binary constituency parsing (unlabeled) in two benchmark datasets, WSJ and MultiNLI.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2020

Unsupervised Parsing via Constituency Tests

We propose a method for unsupervised parsing based on the linguistic not...
research
10/09/2020

Recursive Top-Down Production for Sentence Generation with Latent Trees

We model the recursive production property of context-free grammars for ...
research
05/25/2017

Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs

We introduce a neural network that represents sentences by composing the...
research
03/14/2023

Do Transformers Parse while Predicting the Masked Word?

Pre-trained language models have been shown to encode linguistic structu...
research
01/27/2022

An improved judgement algorithm of point in-out convex polygons

This paper proposed a method to judge whether the point is inside or out...
research
08/09/2020

Fast and Accurate Neural CRF Constituency Parsing

Estimating probability distribution is one of the core issues in the NLP...
research
10/05/2021

Co-training an Unsupervised Constituency Parser with Weak Supervision

We introduce a method for unsupervised parsing that relies on bootstrapp...

Please sign up or login with your details

Forgot password? Click here to reset