A Comparative Study on Different Types of Approaches to Bengali document Categorization

01/27/2017
by   Md. Saiful Islam, et al.
0

Document categorization is a technique where the category of a document is determined. In this paper three well-known supervised learning techniques which are Support Vector Machine(SVM), Naïve Bayes(NB) and Stochastic Gradient Descent(SGD) compared for Bengali document categorization. Besides classifier, classification also depends on how feature is selected from dataset. For analyzing those classifier performances on predicting a document against twelve categories several feature selection techniques are also applied in this article namely Chi square distribution, normalized TFIDF (term frequency-inverse document frequency) with word analyzer. So, we attempt to explore the efficiency of those three-classification algorithms by using two different feature selection techniques in this article.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2013

Feature Selection Based on Term Frequency and T-Test for Text Categorization

Much work has been done on feature selection. Existing methods are based...
research
11/01/2019

Efficient Feature Selection techniques for Sentiment Analysis

Sentiment analysis is a domain of study that focuses on identifying and ...
research
10/06/2022

A Machine Learning Based Approach to Categorize Research Journals

In this modern technological era, categorization and ranking of research...
research
06/18/2017

Towards the Improvement of Automated Scientific Document Categorization by Deep Learning

This master thesis describes an algorithm for automated categorization o...
research
02/18/2012

Comparing SVM and Naive Bayes classifiers for text categorization with Wikitology as knowledge enrichment

The activity of labeling of documents according to their content is know...
research
01/16/2017

Semantic classifier approach to document classification

In this paper we propose a new document classification method, bridging ...
research
08/28/2022

Classification and Detection of Mesothelioma Cancer Using Feature Selection-Enabled Machine Learning Technique

Cancer of the mesothelium, sometimes referred to as malignant mesothelio...

Please sign up or login with your details

Forgot password? Click here to reset