Proximity Full-Text Search with a Response Time Guarantee by Means of Additional Indexes with Multi-Component Keys

Full-text search engines are important tools for information retrieval. In a proximity full-text search, a document is relevant if it contains query terms near each other, especially if the query terms are frequently occurring words. For each word in the text, we use additional indexes to store information about nearby words at distances from the given word of less than or equal to MaxDistance, which is a parameter. We had shown that additional indexes with three-component keys can be used to improve the average query execution time up to 94.7 times if the queries consist of high-frequency used words. In this paper, we present a new search algorithm with even more performance gains. We also present results of search experiments, which show that three-component key indexes enable much faster searches in comparison with two-component key indexes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/06/2020

Proximity full-text searches of frequently occurring words with a response time guarantee

Full-text search engines are important tools for information retrieval. ...
research
08/01/2021

Relevance ranking for proximity full-text search based on additional indexes with multi-component keys

The problem of proximity full-text search is considered. If a search que...
research
07/18/2020

About a structure of easily updatable full-text indexes

We consider strategies to organize easily updatable associative arrays i...
research
06/14/2020

An efficient algorithm for three-component key index construction

In this paper, proximity full-text searches in large text arrays are con...
research
11/18/2018

Proximity Full-Text Search with a Response Time Guarantee by Means of Additional Indexes

Full-text search engines are important tools for information retrieval. ...
research
01/27/2018

Using Additional Indexes for Fast Full-Text Search of Phrases That Contains Frequently Used Words

Searches for phrases and word sets in large text arrays by means of addi...
research
04/02/2020

Semantic Image Search for Robotic Applications

Generalization in robotics is one of the most important problems. New ge...

Please sign up or login with your details

Forgot password? Click here to reset