Object-oriented lexical encoding of multiword expressions: Short and sweet

10/23/2018
by Agata Savary, et al.

Multiword expressions (MWEs) exhibit both regular and idiosyncratic properties. Their idiosyncrasy requires lexical encoding in parallel with their component words. Their (at times intricate) regularity, on the other hand, calls for means of flexible factorization to avoid redundant descriptions of shared properties. However, so far, non-redundant general-purpose lexical encoding of MWEs has not received a satisfactory solution. We offer a proof of concept that this challenge might be effectively addressed within eXtensible MetaGrammar (XMG), an object-oriented metagrammar framework. We first make an existing metagrammatical resource, the FrenchTAG grammar, MWE-aware. We then evaluate the factorization gain during incremental implementation with XMG on a dataset extracted from an MWE-annotated reference corpus.

research
05/12/2020

Detecting Multiword Expression Type Helps Lexical Complexity Assessment

Multiword expressions (MWEs) represent lexemes that should be treated as...
research
03/09/2016

Lexical bundles in computational linguistics academic literature

In this study we analyzed a corpus of 8 million words academic literatur...
research
10/29/2021

Overview of ADoBo 2021: Automatic Detection of Unassimilated Borrowings in the Spanish Press

This paper summarizes the main findings of the ADoBo 2021 shared task, p...
research
10/05/2017

Bilingual Words and Phrase Mappings for Marathi and Hindi SMT

Lack of proper linguistic resources is the major challenges faced by the...
research
05/27/2022

UAlberta at SemEval 2022 Task 2: Leveraging Glosses and Translations for Multilingual Idiomaticity Detection

We describe the University of Alberta systems for the SemEval-2022 Task ...
research
03/04/2017

Lexical Resources for Hindi Marathi MT

In this paper we describe some ways to utilize various lexical resources...
research
10/13/2020

RuSemShift: a dataset of historical lexical semantic change in Russian

We present RuSemShift, a large-scale manually annotated test set for the...
