Rational Kernels for Arabic Stemming and Text Classification

02/26/2015
by Attia Nehar, et al.

In this paper, we address the problems of Arabic text classification and stemming using transducers and rational kernels. We introduce a new stemming technique, the Pattern Based Stemmer, which relies on Arabic morphological patterns. The patterns are modelled as transducers, so stemming does not depend on any dictionary. Because stemming is performed with transducers, documents are themselves transformed into finite-state transducers. This document representation allows us to use and explore rational kernels as a framework for Arabic text classification. Stemming experiments are conducted on three word collections, and classification experiments are conducted on the Saudi Press Agency dataset. The results show that our approach is promising when compared with other approaches, especially in terms of accuracy, recall, and F1.
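To make the two ideas in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not give the pattern list, the transducer construction, or the specific kernel used. The sketch assumes a tiny hypothetical pattern list (`PATTERNS`), replaces the transducer with a plain character-by-character match, and uses an n-gram count kernel, which is a standard example of a rational kernel. The function names `match_pattern`, `stem`, `ngram_counts`, and `ngram_kernel` are illustrative.

```python
from collections import Counter

# Placeholder letters that mark the root slots in Arabic patterns (fa, ain, lam).
ROOT_SLOTS = "فعل"

# Hypothetical, tiny pattern list for illustration only (not from the paper).
PATTERNS = ["مفعول", "فاعل", "فعل"]


def match_pattern(word, pattern):
    """Return the extracted root if `word` fits `pattern`, else None."""
    if len(word) != len(pattern):
        return None
    root = []
    for w_ch, p_ch in zip(word, pattern):
        if p_ch in ROOT_SLOTS:
            root.append(w_ch)   # slot position: copy the word's letter into the root
        elif p_ch != w_ch:
            return None         # fixed pattern letter must match exactly
    return "".join(root)


def stem(word, patterns=PATTERNS):
    """Return the root from the first matching pattern; no dictionary is consulted."""
    for pattern in patterns:
        root = match_pattern(word, pattern)
        if root is not None:
            return root
    return word                 # no pattern matched: leave the word unchanged


def ngram_counts(tokens, n=2):
    """Count the contiguous n-grams in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_kernel(doc_a, doc_b, n=2):
    """Sum, over shared n-grams, the product of their counts in both documents."""
    counts_a, counts_b = ngram_counts(doc_a, n), ngram_counts(doc_b, n)
    return sum(c * counts_b[gram] for gram, c in counts_a.items())


if __name__ == "__main__":
    doc_a = [stem(w) for w in ["كاتب", "مكتوب", "كتب"]]
    doc_b = [stem(w) for w in ["مكتوب", "كاتب"]]
    print(doc_a, doc_b, ngram_kernel(doc_a, doc_b, n=1))
```

In this sketch, stemming maps each word to a root sequence and the kernel compares two documents by their shared n-grams of roots. A real pattern set would need to handle affixes, longer patterns, and ambiguous matches, none of which this toy version attempts.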


