Automatic Extraction of Bengali Root Verbs using Paninian Grammar

03/31/2020
by   Arijit Das, et al.
0

In this research work, we have proposed an algorithm based on supervised learning methodology to extract the root forms of the Bengali verbs using the grammatical rules proposed by Panini [1] in Ashtadhyayi. This methodology can be applied for the languages which are derived from Sanskrit. The proposed system has been developed based on tense, person and morphological inflections of the verbs to find their root forms. The work has been executed in two phases: first, the surface level forms or inflected forms of the verbs have been classified into a certain number of groups of similar tense and person. For this task, a standard pattern, available in Bengali language has been used. Next, a set of rules have been applied to extract the root form from the surface level forms of a verb. The system has been tested on 10000 verbs collected from the Bengali text corpus developed in the TDIL project of the Govt. of India. The accuracy of the output has been achieved 98 verified by a linguistic expert. Root verb identification is a key step in semantic searching, multi-sentence search query processing, understanding the meaning of a language, disambiguation of word sense, classification of the sentences etc.

READ FULL TEXT
research
07/23/2017

Rule-Based Spanish Morphological Analyzer Built From Spell Checking Lexicon

Preprocessing tools for automated text analysis have become more widely ...
research
03/31/2020

Improvement of electronic Governance and mobile Governance in Multilingual Countries with Digital Etymology using Sanskrit Grammar

With huge improvement of digital connectivity (Wifi,3G,4G) and digital d...
research
02/07/2017

Fixing the Infix: Unsupervised Discovery of Root-and-Pattern Morphology

We present an unsupervised and language-agnostic method for learning roo...
research
08/03/2023

Lexicon and Rule-based Word Lemmatization Approach for the Somali Language

Lemmatization is a Natural Language Processing (NLP) technique used to n...
research
05/20/2022

Uzbek affix finite state machine for stemming

This work presents a morphological analyzer for the Uzbek language using...
research
10/02/2020

Automatic Extraction of Rules Governing Morphological Agreement

Creating a descriptive grammar of a language is an indispensable step fo...
research
02/06/2023

Evolution of grammatical forms: some quantitative approaches

Grammatical forms are said to evolve via two main mechanisms. These are,...

Please sign up or login with your details

Forgot password? Click here to reset