The Natural Stories Corpus

08/18/2017
by   Richard Futrell, et al.
0

It is now a common practice to compare models of human language processing by predicting participant reactions (such as reading times) to corpora consisting of rich naturalistic linguistic materials. However, many of the corpora used in these studies are based on naturalistic text and thus do not contain many of the low-frequency syntactic constructions that are often required to distinguish processing theories. Here we describe a new corpus consisting of English texts edited to contain many low-frequency syntactic constructions while still sounding fluent to native speakers. The corpus is annotated with hand-corrected parse trees and includes self-paced reading time data. Here we give an overview of the content of the corpus and release the data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/28/2022

The Copenhagen Corpus of Eye Tracking Recordings from Natural Reading of Danish Texts

Eye movement recordings from reading are one of the richest signals of h...
research
10/27/2022

Creating a morphological and syntactic tagged corpus for the Uzbek language

Nowadays, creation of the tagged corpora is becoming one of the most imp...
research
01/29/2023

A Discerning Several Thousand Judgments: GPT-3 Rates the Article + Adjective + Numeral + Noun Construction

Knowledge of syntax includes knowledge of rare, idiosyncratic constructi...
research
04/10/2017

Automatic semantic role labeling on non-revised syntactic trees of journalistic texts

Semantic Role Labeling (SRL) is a Natural Language Processing task that ...
research
04/26/2022

Disambiguation of morpho-syntactic features of African American English – the case of habitual be

Recent research has highlighted that natural language processing (NLP) s...
research
07/07/2023

Testing the Predictions of Surprisal Theory in 11 Languages

A fundamental result in psycholinguistics is that less predictable words...
research
01/13/2016

Predicting the Effectiveness of Self-Training: Application to Sentiment Classification

The goal of this paper is to investigate the connection between the perf...

Please sign up or login with your details

Forgot password? Click here to reset