MultiCategory: Multi-model Query Processing Meets Category Theory and Functional Programming

08/30/2021
by   Valter Uotila, et al.
0

The variety of data is one of the important issues in the era of Big Data. The data are naturally organized in different formats and models, including structured data, semi-structured data, and unstructured data. Prior research has envisioned an approach to abstract multi-model data with a schema category and an instance category by using category theory. In this paper, we demonstrate a system, called MultiCategory, which processes multi-model queries based on category theory and functional programming. This demo is centered around four main scenarios to show a tangible system. First, we show how to build a schema category and an instance category by loading different models of data, including relational, XML, key-value, and graph data. Second, we show a few examples of query processing by using the functional programming language Haskell. Third, we demo the flexible outputs with different models of data for the same input query. Fourth, to better understand the category theoretical structure behind the queries, we offer a variety of graphical hooks to explore and visualize queries as graphs with respect to the schema category, as well as the query processing procedure with Haskell.

READ FULL TEXT
research
09/01/2021

MORTAL: A Tool of Automatically Designing Relational Storage Schemas for Multi-model Data through Reinforcement Learning

Considering relational databases having powerful capabilities in handlin...
research
01/13/2022

A Formal Category Theoretical Framework for Multi-model Data Transformations

Data integration and migration processes in polystores and multi-model d...
research
04/13/2022

SkiQL: A Unified Schema Query Language

Most NoSQL systems are schema-on-read: data can be stored without first ...
research
01/16/2018

An Integration-Oriented Ontology to Govern Evolution in Big Data Ecosystems

Big Data architectures allow to flexibly store and process heterogeneous...
research
04/10/2022

Iceberg Sensemaking: A Process Model for Critical Data Analysis and Visualization

We offer a new model of the sensemaking process for data science and vis...
research
10/02/2018

Heterogeneous Replica for Query on Cassandra

Cassandra is a popular structured storage system with high-performance, ...
research
08/29/2016

Learning and Inferring Relations in Cortical Networks

A pressing scientific challenge is to understand how brains work. Of par...

Please sign up or login with your details

Forgot password? Click here to reset