The Forgotten Document-Oriented Database Management Systems: An Overview and Benchmark of Native XML DODBMSes in Comparison with JSON DODBMSes

02/03/2021
by   Ciprian-Octavian Truică, et al.
0

In the current context of Big Data, a multitude of new NoSQL solutions for storing, managing, and extracting information and patterns from semi-structured data have been proposed and implemented. These solutions were developed to relieve the issue of rigid data structures present in relational databases, by introducing semi-structured and flexible schema design. As current data generated by different sources and devices, especially from IoT sensors and actuators, use either XML or JSON format, depending on the application, database technologies that store and query semi-structured data in XML format are needed. Thus, Native XML Databases, which were initially designed to manipulate XML data using standardized querying languages, i.e., XQuery and XPath, were rebranded as NoSQL Document-Oriented Databases Systems. Currently, the majority of these solutions have been replaced with the more modern JSON based Database Management Systems. However, we believe that XML-based solutions can still deliver performance in executing complex queries on heterogeneous collections. Unfortunately nowadays, research lacks a clear comparison of the scalability and performance for database technologies that store and query documents in XML versus the more modern JSON format. Moreover, to the best of our knowledge, there are no Big Data-compliant benchmarks for such database technologies. In this paper, we present a comparison for selected Document-Oriented Database Systems that either use the XML format to encode documents, i.e., BaseX, eXist-db, and Sedna, or the JSON format, i.e., MongoDB, CouchDB, and Couchbase. To underline the performance differences we also propose a benchmark that uses a heterogeneous complex schema on a large DBLP corpus.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2021

Columnar Formats for Schemaless LSM-based Document Stores

In the last decade, document store database systems have gained more tra...
research
07/26/2017

A Comparative Analysis of XML Documents, XML Enabled Databases and Native XML Databases

With the increasing popularity of XML data and a great need for a databa...
research
06/12/2018

Performance evaluation for CRUD operations in asynchronously replicated document oriented database

NoSQL databases are becoming increasingly popular as more developers see...
research
04/25/2019

Ephemeral Data Handling in Microservices - Technical Report

In modern application areas for software systems --- like eHealth, the I...
research
02/23/2022

Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching

Semi-structured query systems for document-oriented databases have many ...
research
11/04/2019

Incremental extraction of a NoSQL database model using an MDA-based process

In recent years, the need to use NoSQL systems to store and exploit big ...
research
06/07/2019

Holistic evaluation of XML queries with structural preferences on an annotated strong dataguide

With the emergence of XML as de facto format for storing and exchanging ...

Please sign up or login with your details

Forgot password? Click here to reset