Leveraging Newswire Treebanks for Parsing Conversational Data with Argument Scrambling

02/13/2019
by Riyaz Ahmad Bhat, et al.

We investigate the problem of parsing conversational data of morphologically rich languages such as Hindi, where argument scrambling occurs frequently. We evaluate a state-of-the-art non-linear transition-based parsing system on a new dataset containing 506 dependency trees for sentences from Bollywood (Hindi) movie scripts and Twitter posts of Hindi monolingual speakers. We show that a dependency parser trained on a newswire treebank is strongly biased towards the canonical structures and degrades when applied to conversational data. Inspired by Transformational Generative Grammar, we mitigate the sampling bias by generating all theoretically possible alternative word orders of a clause from the existing (kernel) structures in the treebank. Training our parser on canonical and transformed structures improves performance on conversational data by around 9% LAS over the baseline newswire parser.
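The word-order generation step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a clause is already split into argument chunks (each a head with its dependents), that the chunks move as units so the dependency relations stay unchanged, and that the verb stays clause-final. The function name `scrambled_orders` and the example clause are hypothetical.

```python
from itertools import permutations


def scrambled_orders(arguments, verb):
    """Return every ordering of the argument chunks, keeping the verb clause-final.

    Assumption: each chunk is a tuple of tokens that moves as one unit, so the
    dependency arcs inside and between chunks are the same in every ordering.
    """
    return [list(order) + [verb] for order in permutations(arguments)]


# Hypothetical kernel clause: "Ram ne Sita ko kitaab dii" (Ram gave Sita a book).
kernel_args = [("Ram", "ne"), ("Sita", "ko"), ("kitaab",)]
orders = scrambled_orders(kernel_args, ("dii",))

for order in orders:
    # Flatten the chunks back into a surface string for inspection.
    print(" ".join(token for chunk in order for token in chunk))
```

With three argument chunks this yields 3! = 6 orders, the first of which is the kernel order itself. In a real treebank setting each generated order would be paired with the original tree, with token indices renumbered to match the new positions.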
