FAST: Frequency-Aware Spatio-Textual Indexing for In-Memory Continuous Filter Query Processing

09/08/2017
by   Ahmed R. Mahmood, et al.
0

Many applications need to process massive streams of spatio-textual data in real-time against continuous spatio-textual queries. For example, in location-aware ad targeting publish/subscribe systems, it is required to disseminate millions of ads and promotions to millions of users based on the locations and textual profiles of users. In this paper, we study indexing of continuous spatio-textual queries. There exist several related spatio-textual indexes that typically integrate a spatial index with a textual index. However, these indexes usually have a high demand for main-memory and assume that the entire vocabulary of keywords is known in advance. Also, these indexes do not successfully capture the variations in the frequencies of keywords across different spatial regions and treat frequent and infrequent keywords in the same way. Moreover, existing indexes do not adapt to the changes in workload over space and time. For example, some keywords may be trending at certain times in certain locations and this may change as time passes. This affects the indexing and searching performance of existing indexes significantly. In this paper, we introduce FAST, a Frequency-Aware Spatio-Textual index for continuous spatio-textual queries. FAST is a main-memory index that requires up to one third of the memory needed by the state-of-the-art index. FAST does not assume prior knowledge of the entire vocabulary of indexed objects. FAST adaptively accounts for the difference in the frequencies of keywords within their corresponding spatial regions to automatically choose the best indexing approach that optimizes the insertion and search times. Extensive experimental evaluation using real and synthetic datasets demonstrates that FAST is up to 3x faster in search time and 5x faster in insertion time than the state-of-the-art indexes.

READ FULL TEXT
research
03/06/2020

Structural Indexing for Conjunctive Path Queries

Structural indexing is an approach to accelerating query evaluation, whe...
research
12/08/2020

A Foundation for Spatio-Textual-Temporal Cube Analytics (Extended Version)

Large amounts of spatial, textual, and temporal data are being produced ...
research
02/28/2023

WISK: A Workload-aware Learned Index for Spatial Keyword Queries

Spatial objects often come with textual information, such as Points of I...
research
06/16/2021

Sentiment Progression based Searching and Indexing of Literary Textual Artefacts

Literary artefacts are generally indexed and searched based on titles, m...
research
06/05/2023

Fast Search-By-Classification for Large-Scale Databases Using Index-Aware Decision Trees and Random Forests

The vast amounts of data collected in various domains pose great challen...
research
03/24/2022

Kratt: Developing an Automatic Subject Indexing Tool for The National Library of Estonia

Manual subject indexing in libraries is a time-consuming and costly proc...
research
11/11/2022

Efficient Immediate-Access Dynamic Indexing

In a dynamic retrieval system, documents must be ingested as they arrive...

Please sign up or login with your details

Forgot password? Click here to reset