ÆTHEL: Automatically Extracted Type-Logical Derivations for Dutch

12/29/2019
by Konstantinos Kogkalidis, et al.

We present ÆTHEL, a semantic compositionality dataset for written Dutch. ÆTHEL consists of two parts. First, it contains a lexicon of supertags for about 900,000 words in context. The supertags correspond to types of the simply typed linear lambda-calculus, enhanced with dependency decorations that capture grammatical roles supplementary to function-argument structures. On the basis of these types, ÆTHEL further provides 72,623 validated derivations, presented in four equivalent formats: natural-deduction and sequent-style proofs, linear logic proofnets, and the associated programs (lambda terms) for meaning composition. ÆTHEL's types and derivations are obtained by means of an extraction algorithm applied to the syntactic analyses of LASSY-Small, the gold standard corpus of written Dutch. We discuss the extraction algorithm and show how 'virtual elements' in the original LASSY annotation of unbounded dependencies and coordination phenomena give rise to higher-order types. We suggest some example use cases highlighting the benefits of a type-driven approach at the syntax-semantics interface. The following resources are open-sourced with ÆTHEL: the lexical mappings between words and types, a subset of the dataset consisting of 8,569 semantic parses, and the Python code that implements the extraction algorithm.
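To make the idea of dependency-decorated types concrete, here is a minimal Python sketch. It is not ÆTHEL's actual code or API: the class names `Atom` and `Arrow`, the helper `apply_type`, the atomic labels `NP` and `S`, and the `su` (subject) decoration are all illustrative assumptions. It only shows how a function type whose argument carries a grammatical role can be combined with a matching argument type; it does not enforce linearity across a whole derivation.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Atom:
    """An atomic type; the labels used below are illustrative only."""
    name: str

    def __str__(self) -> str:
        return self.name


@dataclass(frozen=True)
class Arrow:
    """A function type whose argument is decorated with a dependency label."""
    dep: str      # grammatical role of the argument (assumed label, e.g. "su")
    arg: "Type"   # type of the argument that is consumed
    res: "Type"   # type of the result that is produced

    def __str__(self) -> str:
        return f"({self.arg} -[{self.dep}]-> {self.res})"


Type = Union[Atom, Arrow]


def apply_type(fn: Type, arg: Type) -> Type:
    """Function application: consume one matching argument, return the result type."""
    if not isinstance(fn, Arrow) or fn.arg != arg:
        raise TypeError(f"cannot apply {fn} to {arg}")
    return fn.res


NP, S = Atom("NP"), Atom("S")

# Hypothetical supertag for an intransitive verb: a subject NP yields a sentence.
intransitive_verb = Arrow("su", NP, S)
sentence_type = apply_type(intransitive_verb, NP)
```

In this sketch, a higher-order type of the kind the abstract mentions would simply be an `Arrow` whose `arg` is itself an `Arrow`.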

research
02/08/2018

Classical Higher-Order Processes

Classical Processes (CP) is a calculus where the proof theory of classic...
research
02/02/2016

The Grail theorem prover: Type theory for syntax and semantics

As the name suggests, type-logical grammars are a grammar formalism base...
research
06/17/2015

Pragmatic Side Effects

In the quest to give a formal compositional semantics to natural languag...
research
06/06/2022

A Category Theoretic View of Contextual Types: from Simple Types to Dependent Types

We describe the categorical semantics for a simply typed variant and a s...
research
02/23/2023

SPINDLE: Spinning Raw Text into Lambda Terms with Graph Attention

This paper describes SPINDLE - an open source Python module implementing...
research
05/12/2020

A Frobenius Algebraic Analysis for Parasitic Gaps

The interpretation of parasitic gaps is an ostensible case of non-linear...
research
01/25/2020

Introduction of Quantification in Frame Semantics

Feature Structures (FSs) are a widespread tool used for decompositional ...
