Construction of FuzzyFind Dictionary using Golay Coding Transformation for Searching Applications

03/22/2015
by   Kamran Kowsari, et al.
0

Searching through a large volume of data is very critical for companies, scientists, and searching engines applications due to time complexity and memory complexity. In this paper, a new technique of generating FuzzyFind Dictionary for text mining was introduced. We simply mapped the 23 bits of the English alphabet into a FuzzyFind Dictionary or more than 23 bits by using more FuzzyFind Dictionary, and reflecting the presence or absence of particular letters. This representation preserves closeness of word distortions in terms of closeness of the created binary vectors within Hamming distance of 2 deviations. This paper talks about the Golay Coding Transformation Hash Table and how it can be used on a FuzzyFind Dictionary as a new technology for using in searching through big data. This method is introduced by linear time complexity for generating the dictionary and constant time complexity to access the data and update by new data sets, also updating for new data sets is linear time depends on new data points. This technique is based on searching only for letters of English that each segment has 23 bits, and also we have more than 23-bit and also it could work with more segments as reference table.

READ FULL TEXT
research
11/02/2017

An Optimal Choice Dictionary

A choice dictionary is a data structure that can be initialized with a p...
research
11/04/2019

Nearly Optimal Static Las Vegas Succinct Dictionary

Given a set S of n (distinct) keys from key space [U], each associated w...
research
09/26/2017

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

This paper introduces a novel real-time Fuzzy Supervised Learning with B...
research
09/13/2022

A Hash Table Without Hash Functions, and How to Get the Most Out of Your Random Bits

This paper considers the basic question of how strong of a probabilistic...
research
01/14/2020

Semi-automatic methods for adding words to the dictionary of VepKar corpus based on inflectional rules extracted from Wiktionary

The article describes a technique for using English Wiktionary inflectio...
research
04/04/2020

Correction to: A Practical, Provably Linear Time, In-place and Stable Merge Algorithm via the Perfect Shuffle

We correct a paper previously submitted to CoRR. That paper claimed that...
research
10/31/2017

Replace or Retrieve Keywords In Documents at Scale

In this paper we introduce, the FlashText algorithm for replacing keywor...

Please sign up or login with your details

Forgot password? Click here to reset