Construction of FuzzyFind Dictionary using Golay Coding Transformation for Searching Applications

03/22/2015
by   Kamran Kowsari, et al.
0

Searching through a large volume of data is very critical for companies, scientists, and searching engines applications due to time complexity and memory complexity. In this paper, a new technique of generating FuzzyFind Dictionary for text mining was introduced. We simply mapped the 23 bits of the English alphabet into a FuzzyFind Dictionary or more than 23 bits by using more FuzzyFind Dictionary, and reflecting the presence or absence of particular letters. This representation preserves closeness of word distortions in terms of closeness of the created binary vectors within Hamming distance of 2 deviations. This paper talks about the Golay Coding Transformation Hash Table and how it can be used on a FuzzyFind Dictionary as a new technology for using in searching through big data. This method is introduced by linear time complexity for generating the dictionary and constant time complexity to access the data and update by new data sets, also updating for new data sets is linear time depends on new data points. This technique is based on searching only for letters of English that each segment has 23 bits, and also we have more than 23-bit and also it could work with more segments as reference table.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 7

11/02/2017

An Optimal Choice Dictionary

A choice dictionary is a data structure that can be initialized with a p...
11/04/2019

Nearly Optimal Static Las Vegas Succinct Dictionary

Given a set S of n (distinct) keys from key space [U], each associated w...
09/26/2017

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

This paper introduces a novel real-time Fuzzy Supervised Learning with B...
05/25/2011

Multiscale Geometric Methods for Data Sets II: Geometric Multi-Resolution Analysis

Data sets are often modeled as point clouds in R^D, for D large. It is o...
04/04/2020

Correction to: A Practical, Provably Linear Time, In-place and Stable Merge Algorithm via the Perfect Shuffle

We correct a paper previously submitted to CoRR. That paper claimed that...
09/20/2018

Small Uncolored and Colored Choice Dictionaries

A choice dictionary can be initialized with a parameter n∈N and subseque...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.