AGRR-2019: A Corpus for Gapping Resolution in Russian

06/10/2019
by   Maria Ponomareva, et al.
0

This paper provides a comprehensive overview of the gapping dataset for Russian that consists of 7.5k sentences with gapping (as well as 15k relevant negative sentences) and comprises data from various genres: news, fiction, social media and technical texts. The dataset was prepared for the Automatic Gapping Resolution Shared Task for Russian (AGRR-2019) - a competition aimed at stimulating the development of NLP tools and methods for processing of ellipsis. In this paper, we pay special attention to the gapping resolution methods that were introduced within the shared task as well as an alternative test set that illustrates that our corpus is a diverse and representative subset of Russian language gapping sufficient for effective utilization of machine learning techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2020

Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpus

This article introduces the Wanca 2017 corpus of texts crawled from the ...
research
06/09/2017

Overview of the NLPCC 2017 Shared Task: Chinese News Headline Categorization

In this paper, we give an overview for the shared task at the CCF Confer...
research
02/02/2019

Making a Case for Social Media Corpus for Detecting Depression

The social media platform provides an opportunity to gain valuable insig...
research
06/16/2023

Sheffield's Submission to the AmericasNLP Shared Task on Machine Translation into Indigenous Languages

In this paper we describe the University of Sheffield's submission to th...
research
02/24/2021

SocialNLP EmotionGIF 2020 Challenge Overview: Predicting Reaction GIF Categories on Social Media

We present an overview of the EmotionGIF2020 Challenge, held at the 8th ...
research
05/26/2017

Helping News Editors Write Better Headlines: A Recommender to Improve the Keyword Contents & Shareability of News Headlines

We present a software tool that employs state-of-the-art natural languag...
research
01/04/2021

Are Eliminated Spans Useless for Coreference Resolution? Not at all

Various neural-based methods have been proposed so far for joint mention...

Please sign up or login with your details

Forgot password? Click here to reset