ArgRewrite V.2: an Annotated Argumentative Revisions Corpus

06/03/2022
by   Omid Kashefi, et al.
9

Analyzing how humans revise their writings is an interesting research question, not only from an educational perspective but also in terms of artificial intelligence. Better understanding of this process could facilitate many NLP applications, from intelligent tutoring systems to supportive and collaborative writing environments. Developing these applications, however, requires revision corpora, which are not widely available. In this work, we present ArgRewrite V.2, a corpus of annotated argumentative revisions, collected from two cycles of revisions to argumentative essays about self-driving cars. Annotations are provided at different levels of purpose granularity (coarse and fine) and scope (sentential and subsentential). In addition, the corpus includes the revision goal given to each writer, essay scores, annotation verification, pre- and post-study surveys collected from participants as meta-data. The variety of revision unit scope and purpose granularity levels in ArgRewrite, along with the inclusion of new types of meta-data, can make it a useful resource for research and applications that involve revision analysis. We demonstrate some potential applications of ArgRewrite V.2 in the development of automatic revision purpose predictors, as a training source and benchmark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2020

SemClinBr – a multi institutional and multi specialty semantically annotated corpus for Portuguese clinical NLP tasks

The high volume of research focusing on extracting patient's information...
research
06/11/2021

How Should Agents Ask Questions For Situated Learning? An Annotated Dialogue Corpus

Intelligent agents that are confronted with novel concepts in situated e...
research
03/12/2019

Extracting localized information from a Twitter corpus for flood prevention

In this paper, we discuss the collection of a corpus associated to tropi...
research
11/20/2019

Casting a Wide Net: Robust Extraction of Potentially Idiomatic Expressions

Idiomatic expressions like `out of the woods' and `up the ante' present ...
research
04/19/2017

A Large Self-Annotated Corpus for Sarcasm

We introduce the Self-Annotated Reddit Corpus (SARC), a large corpus for...
research
04/22/2016

SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies

We present a new resource for Swedish, SweLL, a corpus of Swedish Learne...
research
12/05/2019

Love Me, Love Me, Say (and Write!) that You Love Me: Enriching the WASABI Song Corpus with Lyrics Annotations

We present the WASABI Song Corpus, a large corpus of songs enriched with...

Please sign up or login with your details

Forgot password? Click here to reset