Neural Language Models for Nineteenth-Century English

05/24/2021 ∙ by Kasra Hosseini, et al.

We present four types of neural language models trained on a large historical dataset of books in English, published between 1760 and 1900 and comprising 5.1 billion tokens. The language model architectures include static models (word2vec and fastText) and contextualized models (BERT and Flair). For each architecture, we trained a model instance using the whole dataset. Additionally, for the two static models we trained separate instances on text published before 1850, and for BERT we trained four instances on different time slices. Our models have already been used in various downstream tasks, where they consistently improved performance. In this paper, we describe how the models were created and outline their reuse potential.
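
As a minimal sketch of how such released models might be reused, assuming the BERT instances are distributed in a Hugging Face transformers-compatible format and the static instances in gensim format (the model paths below are placeholders, not the authors' actual release names):

    # Minimal reuse sketch; the model paths are hypothetical placeholders.
    from transformers import AutoTokenizer, AutoModelForMaskedLM  # contextualized (BERT) instances
    from gensim.models import Word2Vec                            # static (word2vec) instances

    # Load a BERT instance trained on a given time slice (hypothetical path).
    tokenizer = AutoTokenizer.from_pretrained("path/to/bert_1760_1900")
    bert_model = AutoModelForMaskedLM.from_pretrained("path/to/bert_1760_1900")

    # Load a word2vec instance trained on pre-1850 text (hypothetical path).
    w2v = Word2Vec.load("path/to/word2vec_pre1850.model")
    print(w2v.wv.most_similar("machine", topn=5))  # nearest neighbours in the historical embedding space

Loading the fastText and Flair instances would follow the same pattern, using gensim's FastText class and Flair's embedding classes, respectively.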


Code Repositories

AtypicalAnimacy

Repository for code underlying the paper 'Living Machines: A Study of Atypical Animacy' (COLING 2020)

