Is POS Tagging Necessary or Even Helpful for Neural Dependency Parsing?

by   Yu Zhang, et al.

In the pre deep learning era, part-of-speech tags have been considered as indispensable ingredients for feature engineering in dependency parsing due to their important role in alleviating data sparseness of purely lexical features, and quite a few works focus on joint tagging and parsing models to avoid error propagation. In contrast, recent studies suggest that POS tagging becomes much less important or even useless for neural parsing, especially when using character-based word representations such as CharLSTM. Yet there still lacks a full and systematic investigation on this interesting issue, both empirically and linguistically. To answer this, we design four typical multi-task learning frameworks (i.e., Share-Loose, Share-Tight, Stack-Discrete, Stack-Hidden), for joint tagging and parsing based on the state-of-the-art biaffine parser. Considering that it is much cheaper to annotate POS tags than parse trees, we also investigate the utilization of large-scale heterogeneous POS-tag data. We conduct experiments on both English and Chinese datasets, and the results clearly show that POS tagging (both homogeneous and heterogeneous) can still significantly improve parsing performance when using the Stack-Hidden joint framework. We conduct detailed analysis and gain more insights from the linguistic aspect.



There are no comments yet.


page 1

page 2

page 3

page 4


A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing

We propose the first joint model for Vietnamese word segmentation, part-...

A Novel Neural Network Model for Joint POS Tagging and Graph-based Dependency Parsing

We present a novel neural network model that learns POS tagging and grap...

Better, Faster, Stronger Sequence Tagging Constituent Parsers

Sequence tagging models for constituent parsing are faster, but less acc...

Stack-propagation: Improved Representation Learning for Syntax

Traditional syntax models typically leverage part-of-speech (POS) inform...

On the Frailty of Universal POS Tags for Neural UD Parsers

We present an analysis on the effect UPOS accuracy has on parsing perfor...

What Taggers Fail to Learn, Parsers Need the Most

We present an error analysis of neural UPOS taggers to evaluate why usin...

Joint and conditional estimation of tagging and parsing models

This paper compares two different ways of estimating statistical languag...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.