Spark NLP: Natural Language Understanding at Scale

01/26/2021
by Veysel Kocaman, et al.

Spark NLP is a Natural Language Processing (NLP) library built on top of Apache Spark ML. It provides simple, performant, and accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment. Spark NLP comes with 1100 pre-trained pipelines and models in more than 192 languages. It supports nearly all NLP tasks and modules, which can be used seamlessly in a cluster. Downloaded more than 2.7 million times and experiencing ninefold growth since January 2020, Spark NLP is used by 54% of healthcare organizations and is the world's most widely used NLP library in the enterprise.

research
04/08/2022

Classification of Natural Language Processing Techniques for Requirements Engineering

Research in applying natural language processing (NLP) techniques to req...
research
02/10/2018

Distributed NLP

In this paper we present the performance of parallel text processing wit...
research
04/20/2011

Understanding Exhaustive Pattern Learning

Pattern learning is an important problem in Natural Language Processing ...
research
07/06/2021

Shell Language Processing: Unix command parsing for Machine Learning

In this article, we present a Shell Language Preprocessing (SLP) library...
research
02/09/2022

pNLP-Mixer: an Efficient all-MLP Architecture for Language

Large pre-trained language models drastically changed the natural langua...
research
05/06/2023

ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification

Verification of machine learning models used in Natural Language Process...
research
06/22/2022

Enhancing Networking Cipher Algorithms with Natural Language

This work provides a survey of several networking cipher algorithms and ...
