Czech Text Processing with Contextual Embeddings: POS Tagging, Lemmatization, Parsing and NER

09/08/2019
by   Milan Straka, et al.
0

Contextualized embeddings, which capture appropriate word meaning depending on context, have recently been proposed. We evaluate two meth ods for precomputing such embeddings, BERT and Flair, on four Czech text processing tasks: part-of-speech (POS) tagging, lemmatization, dependency pars ing and named entity recognition (NER). The first three tasks, POS tagging, lemmatization and dependency parsing, are evaluated on two corpora: the Prague Dependency Treebank 3.5 and the Universal Dependencies 2.3. The named entity recognition (NER) is evaluated on the Czech Named Entity Corpus 1.1 and 2.0. We report state-of-the-art results for the above mentioned tasks and corpora.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2020

Beheshti-NER: Persian Named Entity Recognition Using BERT

Named entity recognition is a natural language processing task to recogn...
research
06/29/2020

Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models

This paper describes our study on using mutilingual BERT embeddings and ...
research
11/11/2020

Overview of CAPITEL Shared Tasks at IberLEF 2020: Named Entity Recognition and Universal Dependencies Parsing

We present the results of the CAPITEL-EVAL shared task, held in the cont...
research
08/15/2021

DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants

Named entity recognition (NER) is usually developed and tested on text f...
research
02/26/2019

Entity Recognition at First Sight: Improving NER with Eye Movement Information

Previous research shows that eye-tracking data contains information abou...
research
10/29/2020

May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance

We investigate using Named Entity Recognition on a new type of user-gene...
research
08/24/2023

Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines

This paper presents a set of industrial-grade text processing models for...

Please sign up or login with your details

Forgot password? Click here to reset