Character-based Surprisal as a Model of Human Reading in the Presence of Errors

02/02/2019
by   Michael Hahn, et al.
0

Intuitively, human readers cope easily with errors in text; typos, misspelling, word substitutions, etc. do not unduly disrupt natural reading. Previous work indicates that letter transpositions result in increased reading times, but it is unclear if this effect generalizes to more natural errors. In this paper, we report an eye-tracking study that compares two error types (letter transpositions and naturally occurring misspelling) and two error rates (10 unimpaired comprehension in spite of these errors, but error words cause more reading difficulty than correct words. Also, transpositions are more difficult than misspellings, and a high error rate increases difficulty for all words, including correct ones. We then present a computational model that uses character-based (rather than traditional word-based) surprisal to account for these results. The model explains that transpositions are harder than misspellings because they contain unexpected letter combinations. It also explains the error rate effect: upcoming words are more difficultto predict when the context is degraded, leading to increased surprisal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2021

Lip reading using external viseme decoding

Lip-reading is the operation of recognizing speech from lip movements. T...
research
02/07/2022

Selecting Seed Words for Wordle using Character Statistics

Wordle, a word guessing game rose to global popularity in the January of...
research
12/26/2020

Smartajweed Automatic Recognition of Arabic Quranic Recitation Rules

Tajweed is a set of rules to read the Quran in a correct Pronunciation o...
research
05/28/2019

A Cost Efficient Approach to Correct OCR Errors in Large Document Collections

Word error rate of an ocr is often higher than its character error rate....
research
07/01/2022

Quality increases as the error rate decreases

In this paper we propose an approach to the design of processes and soft...
research
05/03/2023

Explore the difficulty of words and its influential attributes based on the Wordle game

We adopt the distribution and expectation of guessing times in game Word...
research
03/12/2020

Comments on `Design and Implementation of Model-Predictive Control With Friction Compensation on an Omnidirectional Mobile Robot'

There are errors in the dynamics model in <cit.>. In addition, some deta...

Please sign up or login with your details

Forgot password? Click here to reset