Extracting JSON Schemas with Tagged Unions

06/12/2023
by   Stefan Klessinger, et al.
0

With data lakes and schema-free NoSQL document stores, extracting a descriptive schema from JSON data collections is an acute challenge. In this paper, we target the discovery of tagged unions, a JSON Schema design pattern where the value of one property of an object (the tag) conditionally implies subschemas for sibling properties. We formalize these implications as conditional functional dependencies and capture them using the JSON Schema operators if-then-else. We further motivate our heuristics to avoid overfitting. Experiments with our prototype implementation are promising, and show that this form of tagged unions can successfully be detected in real-world GeoJSON and TopoJSON datasets. In discussing future work, we outline how our approach can be extended further.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/10/2022

Designing a Visual Tool for Property Graph Schema Extraction and Refinement: An Expert Study

The design space of visual tools that aim to help people create schemas ...
research
02/28/2020

An Empirical Study on the Design and Evolution of NoSQL Database Schemas

We study how software engineers design and evolve their domain model whe...
research
02/27/2022

Negation-Closure for JSON Schema

JSON Schema is an evolving standard for describing families of JSON docu...
research
02/25/2022

Witness Generation for JSON Schema

JSON Schema is an important, evolving standard schema language for famil...
research
09/15/2017

A Rule-Based Approach to Analyzing Database Schema Objects with Datalog

Database schema elements such as tables, views, triggers and functions a...
research
10/14/2020

A new approach for extracting the conceptual schema of texts based on the linguistic Thematic Progression theory

The purpose of this article is to present a new approach for the discove...
research
11/28/2019

Type Safety with JSON Subschema

JSON is a popular data format used pervasively in web APIs, cloud comput...

Please sign up or login with your details

Forgot password? Click here to reset