The Copenhagen Corpus of Eye Tracking Recordings from Natural Reading of Danish Texts

04/28/2022
by   Nora Hollenstein, et al.
0

Eye movement recordings from reading are one of the richest signals of human language processing. Corpora of eye movements during reading of contextualized running text is a way of making such records available for natural language processing purposes. Such corpora already exist in some languages. We present CopCo, the Copenhagen Corpus of eye tracking recordings from natural reading of Danish texts. It is the first eye tracking corpus of its kind for the Danish language. CopCo includes 1,832 sentences with 34,897 tokens of Danish text extracted from a collection of speech manuscripts. This first release of the corpus contains eye tracking data from 22 participants. It will be extended continuously with more participants and texts from other genres. We assess the data quality of the recorded eye movements and find that the extracted features are in line with related research. The dataset available here: https://osf.io/ud8s5/.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2019

ZuCo 2.0: A Dataset of Physiological Recordings During Natural Reading and Annotation

We recorded and preprocessed ZuCo 2.0, a new dataset of simultaneous eye...
research
03/31/2023

WebQAmGaze: A Multilingual Webcam Eye-Tracking-While-Reading Dataset

We create WebQAmGaze, a multilingual low-cost eye-tracking-while-reading...
research
05/07/2018

Relating Eye-Tracking Measures With Changes In Knowledge on Search Tasks

We conducted an eye-tracking study where 30 participants performed searc...
research
08/18/2017

The Natural Stories Corpus

It is now a common practice to compare models of human language processi...
research
04/06/2022

EMMT: A simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios

We present the Eyetracked Multi-Modal Translation (EMMT) corpus, a datas...
research
10/20/2020

Individual corpora predict fast memory retrieval during reading

The corpus, from which a predictive language model is trained, can be co...
research
03/09/2023

SEAM: An Integrated Activation-Coupled Model of Sentence Processing and Eye Movements in Reading

Models of eye-movement control during reading, developed largely within ...

Please sign up or login with your details

Forgot password? Click here to reset