A Corpus of Adpositional Supersenses for Mandarin Chinese

03/18/2020
by   Siyao Peng, et al.
0

Adpositions are frequent markers of semantic relations, but they are highly ambiguous and vary significantly from language to language. Moreover, there is a dearth of annotated corpora for investigating the cross-linguistic variation of adposition semantics, or for building multilingual disambiguation systems. This paper presents a corpus in which all adpositions have been semantically annotated in Mandarin Chinese; to the best of our knowledge, this is the first Chinese corpus to be broadly annotated with adposition semantics. Our approach adapts a framework that defined a general set of supersenses according to ostensibly language-independent semantic criteria, though its development focused primarily on English prepositions (Schneider et al., 2018). We find that the supersense categories are well-suited to Chinese adpositions despite syntactic differences from English. On a Mandarin translation of The Little Prince, we achieve high inter-annotator agreement and analyze semantic correspondences of adposition tokens in bitext.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2018

Adpositional Supersenses for Mandarin Chinese

This study adapts Semantic Network of Adposition and Case Supersenses (S...
research
03/09/2020

Shallow Discourse Annotation for Chinese TED Talks

Text corpora annotated with language-related properties are an important...
research
06/12/2023

SE#PCFG: Semantically Enhanced PCFG for Password Analysis and Cracking

Much research has been done on user-generated textual passwords. Surpris...
research
02/22/2017

Improving Chinese SRL with Heterogeneous Annotations

Previous studies on Chinese semantic role labeling (SRL) have concentrat...
research
09/07/2022

That Slepen Al the Nyght with Open Ye! Cross-era Sequence Segmentation with Switch-memory

The evolution of language follows the rule of gradual change. Grammar, v...
research
02/26/2019

On the Idiosyncrasies of the Mandarin Chinese Classifier System

While idiosyncrasies of the Chinese classifier system have been a richly...
research
06/20/2022

Misspelling Semantics In Thai

User-generated content is full of misspellings. Rather than being just r...

Please sign up or login with your details

Forgot password? Click here to reset