SchemaDB: Structures in Relational Datasets

11/24/2021
by   Cody James Christopher, et al.
0

In this paper we introduce the SchemaDB data-set; a collection of relational database schemata in both sql and graph formats. Databases are not commonly shared publicly for reasons of privacy and security, so schemata are not available for study. Consequently, an understanding of database structures in the wild is lacking, and most examples found publicly belong to common development frameworks or are derived from textbooks or engine benchmark designs. SchemaDB contains 2,500 samples of relational schemata found in public repositories which we have standardised to MySQL syntax. We provide our gathering and transformation methodology, summary statistics, and structural analysis, and discuss potential downstream research tasks in several domains.

READ FULL TEXT
research
06/23/2023

Relational Playground: Teaching the Duality of Relational Algebra and SQL

Students in introductory data management courses are often taught how to...
research
06/28/2023

Toward a Scalable Census of Dashboard Designs in the Wild: A Case Study with Tableau Public

Dashboards remain ubiquitous artifacts for presenting or reasoning with ...
research
10/06/2021

Reconsidering Optimistic Algorithms for Relational DBMS

At DBKDA 2019, we demonstrated that StrongDBMS with simple but rigorous ...
research
09/20/2023

Relational Expressions for Data Transformation and Computation

Separate programming models for data transformation (declarative) and co...
research
10/22/2017

A Brief Comparison of Two Enterprise-Class RDBMSs

This paper is an extended version of a report from a student-developed s...
research
11/30/2022

Generating Realistic Synthetic Relational Data through Graph Variational Autoencoders

Synthetic data generation has recently gained widespread attention as a ...

Please sign up or login with your details

Forgot password? Click here to reset