Word Familiarity and Frequency

06/09/2018
by   Kumiko Tanaka-Ishii, et al.
0

Word frequency is assumed to correlate with word familiarity, but the strength of this correlation has not been thoroughly investigated. In this paper, we report on our analysis of the correlation between a word familiarity rating list obtained through a psycholinguistic experiment and the log-frequency obtained from various corpora of different kinds and sizes (up to the terabyte scale) for English and Japanese. Major findings are threefold: First, for a given corpus, familiarity is necessary for a word to achieve high frequency, but familiar words are not necessarily frequent. Second, correlation increases with the corpus data size. Third, a corpus of spoken language correlates better than one of written language. These findings suggest that cognitive familiarity ratings are correlated to frequency, but more highly to that of spoken rather than written language.

READ FULL TEXT

page 8

page 9

research
08/10/2020

When words collide: Bayesian meta-analyses of distractor and target properties in the picture-word interference paradigm

In the picture-word interference paradigm, participants name pictures wh...
research
10/04/2018

Building a language evolution tree based on word vector combination model

In this paper, we try to explore the evolution of language through case ...
research
03/30/2017

Neutral evolution and turnover over centuries of English word popularity

Here we test Neutral models against the evolution of English word freque...
research
05/31/2021

More than just Frequency? Demasking Unsupervised Hypernymy Prediction Methods

This paper presents a comparison of unsupervised methods of hypernymy pr...
research
02/01/2020

Novel Language Resources for Hindi: An Aesthetics Text Corpus and a Comprehensive Stop Lemma List

This paper is an effort to complement the contributions made by research...
research
05/07/2020

The Danish Gigaword Project

Danish is a North Germanic/Scandinavian language spoken primarily in Den...
research
07/29/2023

Automatic Extraction of the Romanian Academic Word List: Data and Methods

This paper presents the methodology and data used for the automatic extr...

Please sign up or login with your details

Forgot password? Click here to reset