Evaluating KGR10 Polish word embeddings in the recognition of temporal expressions using BiLSTM-CRF

04/03/2019
by   Jan Kocoń, et al.
0

The article introduces a new set of Polish word embeddings, built using KGR10 corpus, which contains more than 4 billion words. These embeddings are evaluated in the problem of recognition of temporal expressions (timexes) for the Polish language. We described the process of KGR10 corpus creation and a new approach to the recognition problem using Bidirectional Long-Short Term Memory (BiLSTM) network with additional CRF layer, where specific embeddings are essential. We presented experiments and conclusions drawn from them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2019

When Specialization Helps: Using Pooled Contextualized Embeddings to Detect Chemical and Biomedical Entities in Spanish

The recognition of pharmacological substances, compounds and proteins is...
research
03/30/2022

Detecting Unassimilated Borrowings in Spanish: An Annotated Corpus and Approaches to Modeling

This work presents a new resource for borrowing identification and analy...
research
04/26/2022

Approach to Predicting News – A Precise Multi-LSTM Network With BERT

Varieties of Democracy (V-Dem) is a new approach to conceptualizing and ...
research
11/09/2017

The Lifted Matrix-Space Model for Semantic Composition

Recent advances in tree structured sentence encoding models have shown t...
research
09/27/2017

Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition

Named Entity Recognition (NER) is one of the most common tasks of the na...
research
06/24/2019

Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation

Attention mechanisms have seen some success for natural language process...
research
10/30/2017

Deep word embeddings for visual speech recognition

In this paper we present a deep learning architecture for extracting wor...

Please sign up or login with your details

Forgot password? Click here to reset