Supertagging the Long Tail with Tree-Structured Decoding of Complex Categories

12/02/2020
by Jakob Prange, et al.

Although current CCG supertaggers achieve high accuracy on the standard WSJ test set, few systems make use of the categories' internal structure that will drive the syntactic derivation during parsing. The tagset is traditionally truncated, discarding the many rare and complex category types in the long tail. However, supertags are themselves trees. Rather than give up on rare tags, we investigate constructive models that account for their internal structure, including novel methods for tree-structured prediction. Our best tagger is capable of recovering a sizeable fraction of the long-tail supertags and even generates CCG categories that have never been seen in training, while approximating the prior state of the art in overall tag accuracy with fewer parameters. We further investigate how well different approaches generalize to out-of-domain evaluation sets.
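To make the observation that supertags are themselves trees concrete, here is a minimal Python sketch that parses a CCG category string such as (S\NP)/NP into a binary tree. It is only an illustration, not the tagger or decoder described in the paper. The names Atom, Functor, parse_category, and depth are hypothetical, and the parser assumes the common convention that unparenthesized slashes associate to the left.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Atom:
    """Leaf of a category tree, e.g. S, NP, or S[dcl]."""
    name: str


@dataclass(frozen=True)
class Functor:
    """Internal node: result category, slash direction, argument category."""
    result: "Category"
    slash: str  # "/" (argument to the right) or "\\" (argument to the left)
    arg: "Category"


Category = Union[Atom, Functor]


def parse_category(text: str) -> Category:
    """Parse a category string; slashes are assumed to be left-associative."""
    pos = 0

    def parse_operand() -> Category:
        nonlocal pos
        if pos < len(text) and text[pos] == "(":
            pos += 1  # consume "("
            node = parse_expr()
            if pos >= len(text) or text[pos] != ")":
                raise ValueError(f"missing ')' in {text!r}")
            pos += 1  # consume ")"
            return node
        start = pos
        while pos < len(text) and text[pos] not in "/\\()":
            pos += 1
        if start == pos:
            raise ValueError(f"expected an atomic category at index {pos} in {text!r}")
        return Atom(text[start:pos])

    def parse_expr() -> Category:
        nonlocal pos
        node = parse_operand()
        while pos < len(text) and text[pos] in "/\\":
            slash = text[pos]
            pos += 1
            node = Functor(node, slash, parse_operand())
        return node

    tree = parse_expr()
    if pos != len(text):
        raise ValueError(f"unexpected character at index {pos} in {text!r}")
    return tree


def depth(cat: Category) -> int:
    """Height of the category tree; atomic categories have depth 0."""
    if isinstance(cat, Atom):
        return 0
    return 1 + max(depth(cat.result), depth(cat.arg))


# A transitive-verb category: takes an NP to the right, then an NP to the left.
transitive = parse_category("(S\\NP)/NP")
```

Under this representation a rare, complex category is a deeper tree assembled from the same few atoms and slashes as the frequent ones. That shared structure is what a constructive, tree-structured predictor can exploit, and what a flat, truncated tagset discards.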


