An Optimized Tri-store System for Multi-model Data Analytics

05/22/2023
by Xiuwen Zheng, et al.

Data science applications increasingly rely on heterogeneous data sources and analytics. This has led to growing interest in polystore systems, especially analytical polystores. In this work, we focus on a class of emerging multi-data-model analytics workloads that fluidly straddle relational, graph, and text analytics. Instead of a generic polystore, we build a “tri-store” system that is more aware of the underlying data models, so that it can better optimize execution and improve scalability and runtime efficiency. We name our system AWESOME (Analytics WorkbEnch for SOcial MEdia). It features a powerful domain-specific language named ADIL. ADIL builds on the query languages of the underlying engines (e.g., SQL and Cypher) and features native data types for succinctly specifying cross-engine queries and NLP operations, as well as automatic in-memory and query optimizations. Using real-world tri-model analytical workloads and datasets, we empirically demonstrate the functionalities of AWESOME for scalable data science applications and evaluate its efficiency.

research
12/01/2021

Processing Analytical Queries in the AWESOME Polystore [Information Systems Architectures]

Modern big data applications usually involve heterogeneous data sources ...
research
03/23/2017

Flare: Native Compilation for Heterogeneous Workloads in Apache Spark

The need for modern data analytics to combine relational, procedural, an...
research
12/14/2022

Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics

As modern data pipelines continue to collect, produce, and store a varie...
research
04/13/2023

SIGNAL – The SAP Signavio Analytics Query Language

This paper provides an introduction to and discussion of SIGNAL, an indu...
research
03/28/2014

DimmWitted: A Study of Main-Memory Statistical Analytics

We perform the first study of the tradeoff space of access methods and r...
research
03/05/2021

Extend the FFmpeg Framework to Analyze Media Content

This paper introduces a new set of video analytics plugins developed for...
research
05/03/2020

An Algebraic Approach for High-level Text Analytics

Text analytical tasks like word embedding, phrase mining, and topic mode...
