Distributed NLP

02/10/2018
by   Galip Aydın, et al.
0

In this paper we present the performance of parallel text processing with Map Reduce on a cloud platform. Scientific papers in Turkish language are processed using Zemberek NLP library. Experiments were run on a Hadoop cluster and compared with the single machines performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2021

Spark NLP: Natural Language Understanding at Scale

Spark NLP is a Natural Language Processing (NLP) library built on top of...
research
06/19/2021

TweeNLP: A Twitter Exploration Portal for Natural Language Processing

We present TweeNLP, a one-stop portal that organizes Twitter's natural l...
research
06/17/2015

Editorial for the First Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics

The workshop "Mining Scientific Papers: Computational Linguistics and Bi...
research
09/10/2021

How May I Help You? Using Neural Text Simplification to Improve Downstream NLP Tasks

The general goal of text simplification (TS) is to reduce text complexit...
research
02/28/2022

Detecting Stance in Scientific Papers: Did we get more Negative Recently?

In this paper, we classify scientific articles in the domain of natural ...
research
02/08/2020

autoNLP: NLP Feature Recommendations for Text Analytics Applications

While designing machine learning based text analytics applications, ofte...
research
02/02/2018

Measuring Spark on AWS: A Case Study on Mining Scientific Publications with Annotation Query

Annotation Query (AQ) is a program that provides the ability to query ma...

Please sign up or login with your details

Forgot password? Click here to reset