A big data approach towards sarcasm detection in Russian

06/01/2023
by A. A. Gurin et al.

We present a set of deterministic algorithms for Russian inflection and automated text synthesis. These algorithms are implemented in a publicly available web service, www.passare.ru, which provides functions for inflecting single words, matching words, and synthesizing grammatically correct Russian text. Selected code and datasets are available at https://github.com/passare-ru/PassareFunctions/. The inflectional functions have been tested against OpenCorpora, an annotated corpus of the Russian language, compared with other solutions, and used to estimate the morphological variability and complexity of different parts of speech in Russian.


