Natural Language Processing using Hadoop and KOSHIK

08/15/2016
by Emre Erturk, et al.

Natural language processing (NLP) is a data analytics technology used widely in research areas such as artificial intelligence, human language processing, and translation. The explosive growth of data now poses many challenges for NLP. Hadoop is one platform that can process the large volumes of data that NLP requires. KOSHIK is an NLP architecture that runs on Hadoop and incorporates language processing components such as Stanford CoreNLP and OpenNLP. This study describes how to build a KOSHIK platform with the relevant tools and gives the steps for analyzing wiki data. Finally, it evaluates the advantages and disadvantages of the KOSHIK architecture and offers recommendations for improving processing performance.
