Document Classification Using Distributed Machine Learning

02/10/2018
by   Galip Aydın, et al.
0

In this paper, we investigate the performance and success rates of Naïve Bayes Classification Algorithm for automatic classification of Turkish news into predetermined categories like economy, life, health etc. We use Apache Big Data technologies such as Hadoop, HDFS, Spark and Mahout, and apply these distributed technologies to Machine Learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2018

Classification of Scientific Papers With Big Data Technologies

Data sizes that cannot be processed by conventional data storage and ana...
research
04/30/2019

Semi-Unsupervised Lifelong Learning for Sentiment Classification

Lifelong machine learning is a novel machine learning paradigm which con...
research
09/12/2019

Machine Learning in/for Blockchain: Future and Challenges

Machine learning (including deep and reinforcement learning) and blockch...
research
11/14/2020

TDAsweep: A Novel Dimensionality Reduction Method for Image Classification Tasks

One of the most celebrated achievements of modern machine learning techn...
research
08/07/2017

Learning Theory of Distributed Regression with Bias Corrected Regularization Kernel Network

Distributed learning is an effective way to analyze big data. In distrib...
research
03/30/2021

Text Classification Using Hybrid Machine Learning Algorithms on Big Data

Recently, there are unprecedented data growth originating from different...
research
01/31/2020

Similarità per la ricerca del dominio di una frase

English. This document aims to study the best algorithms to verify the b...

Please sign up or login with your details

Forgot password? Click here to reset