Search on Secondary Attributes in Geo-Distributed Systems

01/09/2018
by   Dimitrios Vasilas, et al.
0

In the age of big data, more and more applications need to query and analyse large volumes of continuously updated data in real-time. In response, cloud-scale storage systems can extend their interface that allows fast lookups on the primary key with the ability to retrieve data based on non-primary attributes. However, the need to ingest content rapidly and make it searchable immediately while supporting low-latency, high-throughput query evaluation, as well as the geo-distributed nature and weak consistency guarantees of modern storage systems pose several challenges to the implementation of indexing and search systems. We present our early-stage work on the design and implementation of an indexing and query processing system that enables realtime queries on secondary attributes of data stored in geo-distributed, weakly consistent storage systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2018

A Modular Design for Geo-Distributed Querying

Most distributed storage systems provide limited abilities for querying ...
research
12/12/2020

Cortex: Harnessing Correlations to Boost Query Performance

Databases employ indexes to filter out irrelevant records, which reduces...
research
03/15/2018

Global Stabilization for Causally Consistent Partial Replication

Causally consistent distributed storage systems have received significan...
research
08/27/2018

Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems

In recent years, the Log Structured Merge (LSM) tree has been widely ado...
research
02/01/2019

Incremental Techniques for Large-Scale Dynamic Query Processing

Many applications from various disciplines are now required to analyze f...
research
07/04/2017

Ingestion, Indexing and Retrieval of High-Velocity Multidimensional Sensor Data on a Single Node

Multidimensional data are becoming more prevalent, partly due to the ris...
research
11/26/2019

Distributed graphs: in search of fast, low-latency, resource-efficient, semantics-rich Big-Data processing

Large graphs can be processed with single high-memory or distributed sys...

Please sign up or login with your details

Forgot password? Click here to reset