
Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources

by Edmon Begoli, et al.
University of Waterloo
Hortonworks Inc.
Oak Ridge National Laboratory

Apache Calcite is a foundational software framework that provides query processing, optimization, and query language support to many popular open-source data processing systems such as Apache Hive, Apache Storm, Apache Flink, Druid, and MapD. Calcite's architecture consists of a modular and extensible query optimizer with hundreds of built-in optimization rules, a query processor capable of processing a variety of query languages, an adapter architecture designed for extensibility, and support for heterogeneous data models and stores (relational, semi-structured, streaming, and geospatial). This flexible, embeddable, and extensible architecture is what makes Calcite an attractive choice for adoption in big-data frameworks. It is an active project that continues to introduce support for new types of data sources, query languages, and approaches to query processing and optimization.



