Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources

02/28/2018
by   Edmon Begoli, et al.
0

Apache Calcite is a foundational software framework that provides query processing, optimization, and query language support to many popular open-source data processing systems such as Apache Hive, Apache Storm, Apache Flink, Druid, and MapD. Calcite's architecture consists of a modular and extensible query optimizer with hundreds of built-in optimization rules, a query processor capable of processing a variety of query languages, an adapter architecture designed for extensibility, and support for heterogeneous data models and stores (relational, semi-structured, streaming, and geospatial). This flexible, embeddable, and extensible architecture is what makes Calcite an attractive choice for adoption in big-data frameworks. It is an active project that continues to introduce support for the new types of data sources, query languages, and approaches to query processing and optimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2018

An Approach to Handle Big Data Warehouse Evolution

One of the purposes of Big Data systems is to support analysis of data g...
research
10/02/2020

FedQPL: A Language for Logical Query Plans over Heterogeneous Federations of RDF Data Sources (Extended Version)

Federations of RDF data sources provide great potential when queried for...
research
10/07/2019

The Query Translation Landscape: a Survey

Whereas the availability of data has seen a manyfold increase in past ye...
research
08/09/2021

Towards a Generic Multimodal Architecture for Batch and Streaming Big Data Integration

Big Data are rapidly produced from various heterogeneous data sources. T...
research
08/25/2022

Mask-Mediator-Wrapper: A revised mediator-wrapper architecture for heterogeneous data source integration

This paper deals with the mediator-wrapper architecture. It is an import...
research
06/02/2003

GraXML - Modular Geometric Modeler

Many entities managed by HEP Software Frameworks represent spatial (3-di...
research
12/01/2021

Processing Analytical Queries in the AWESOME Polystore [Information Systems Architectures]

Modern big data applications usually involve heterogeneous data sources ...

Please sign up or login with your details

Forgot password? Click here to reset