CGELBank: CGEL as a Framework for English Syntax Annotation

10/01/2022
by   Brett Reynolds, et al.
0

We introduce the syntactic formalism of the Cambridge Grammar of the English Language (CGEL) to the world of treebanking through the CGELBank project. We discuss some issues in linguistic analysis that arose in adapting the formalism to corpus annotation, followed by quantitative and qualitative comparisons with parallel UD and PTB treebanks. We argue that CGEL provides a good tradeoff between comprehensiveness of analysis and usability for annotation, which motivates expanding the treebank with automatic conversion in the future.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2023

CGELBank Annotation Manual v1.0

CGELBank is a treebank and associated tools based on a syntactic formali...
research
05/15/2023

Using LLM-assisted Annotation for Corpus Linguistics: A Case Study of Local Grammar Analysis

Chatbots based on Large Language Models (LLMs) have shown strong capabil...
research
12/31/2020

UCCA's Foundational Layer: Annotation Guidelines v2.1

This is the annotation manual for Universal Conceptual Cognitive Annotat...
research
11/24/2021

For the Purpose of Curry: A UD Treebank for Ashokan Prakrit

We present the first linguistically annotated treebank of Ashokan Prakri...
research
08/27/2022

On Unsupervised Training of Link Grammar Based Language Models

In this short note we explore what is needed for the unsupervised traini...
research
04/19/2016

Syntactic and semantic classification of verb arguments using dependency-based and rich semantic features

Corpus Pattern Analysis (CPA) has been the topic of Semeval 2015 Task 15...
research
04/19/2022

Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Incorporating stronger syntactic biases into neural language models (LMs...

Please sign up or login with your details

Forgot password? Click here to reset