Method of the coherence evaluation of Ukrainian text

by   S. D. Pogorilyy, et al.

Due to the growing role of the SEO technologies, it is necessary to perform an automated analysis of the article's quality. Such approach helps both to return the most intelligible pages for the user's query and to raise the web sites positions to the top of query results. An automated assessment of a coherence is a part of the complex analysis of the text. In this article, main methods for text coherence measurements for Ukrainian language are analyzed. Expediency of using the semantic similarity graph method in comparison with other methods are explained. It is suggested the improvement of that method by the pre-training of the neural network for vector representations of sentences. Experimental examination of the original method and its modifications is made. Training and examination procedures are made on the corpus of Ukrainian texts, which were previously retrieved from abstracts and full texts of Ukrainian scientific articles. The testing procedure is implemented by performing of two typical tasks for the text coherence assessment: document discrimination task and insertion task. Accordingly to the analysis it is defined the most effective combination of method's modification and its parameter for the measurement of the text coherence.


page 1

page 2

page 3

page 4


Assessment of text coherence based on the cohesion estimation

In this paper, a graph-based coherence estimation method based on the co...

Text Coherence Analysis Based on Deep Neural Network

In this paper, we propose a novel deep coherence model (DCM) using a con...

Evaluating text coherence based on the graph of the consistency of phrases to identify symptoms of schizophrenia

Different state-of-the-art methods of the detection of schizophrenia sym...

Coherence Models for Dialogue

Coherence across multiple turns is a major challenge for state-of-the-ar...

Happiness Entailment: Automating Suggestions for Well-Being

Understanding what makes people happy is a central topic in psychology. ...

Evaluation of Thematic Coherence in Microblogs

Collecting together microblogs representing opinions about the same topi...

Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model

Evaluating the readability of a text can significantly facilitate the pr...