Deep learning-based language models pretrained on large unannotated text...
The multilingual BERT model is trained on 104 languages and meant to ser...
News articles such as sports game reports are often thought to closely f...
A common approach for improving OCR quality is a post-processing step ba...
In this paper we present a novel lemmatization method based on a
sequenc...
Driven by a large number of potential applications in areas like
bioinfo...