A Scalable Framework for Multilevel Streaming Data Analytics using Deep Learning

07/15/2019
by   Shihao Ge, et al.
0

The rapid growth of data in velocity, volume, value, variety, and veracity has enabled exciting new opportunities and presented big challenges for businesses of all types. Recently, there has been considerable interest in developing systems for processing continuous data streams with the increasing need for real-time analytics for decision support in the business, healthcare, manufacturing, and security. The analytics of streaming data usually relies on the output of offline analytics on static or archived data. However, businesses and organizations like our industry partner Gnowit, strive to provide their customers with real time market information and continuously look for a unified analytics framework that can integrate both streaming and offline analytics in a seamless fashion to extract knowledge from large volumes of hybrid streaming data. We present our study on designing a multilevel streaming text data analytics framework by comparing leading edge scalable open-source, distributed, and in-memory technologies. We demonstrate the functionality of the framework for a use case of multilevel text analytics using deep learning for language understanding and sentiment analysis including data indexing and query processing. Our framework combines Spark streaming for real time text processing, the Long Short Term Memory (LSTM) deep learning model for higher level sentiment analysis, and other tools for SQL-based analytical processing to provide a scalable solution for multilevel streaming text analytics.

READ FULL TEXT
research
09/25/2020

A Big Data Lake for Multilevel Streaming Analytics

Large organizations are seeking to create new architectures and scalable...
research
08/07/2017

Real Time Analytics: Algorithms and Systems

Velocity is one of the 4 Vs commonly used to characterize Big Data. In t...
research
05/31/2019

Fast Online "Next Best Offers" using Deep Learning

In this paper, we present iPrescribe, a scalable low-latency architectur...
research
11/03/2019

A Streaming Analytics Language for Processing Cyber Data

We present a domain-specific language called SAL(the Streaming Analytics...
research
10/19/2019

Real-Time Lip Sync for Live 2D Animation

The emergence of commercial tools for real-time performance-based 2D ani...
research
05/02/2018

Architecture for Analysis of Streaming Data

While several attempts have been made to construct a scalable and flexib...
research
02/08/2017

Character-level Deep Conflation for Business Data Analytics

Connecting different text attributes associated with the same entity (co...

Please sign up or login with your details

Forgot password? Click here to reset