research
∙
05/04/2023
What changes when you randomly choose BPE merge operations? Not much
We introduce three simple randomized variants of byte pair encoding (BPE...
research
∙
02/28/2022
ParaNames: A Massively Multilingual Entity Name Corpus
This preprint describes work in progress on ParaNames, a multilingual pa...
research
∙
02/24/2022
Toward More Meaningful Resources for Lower-resourced Languages
In this position paper, we describe our perspective on how meaningful re...
research
∙
04/01/2021
Mining Wikidata for Name Resources for African Languages
This work supports further development of language technology for the la...
research
∙
03/20/2021