Diversification on Big Data in Query Processing

08/02/2018
by   Meifan Zhang, et al.
0

Recently, in the area of big data, some popular applications such as web search engines and recommendation systems, face the problem to diversify results during query processing. In this sense, it is both significant and essential to propose methods to deal with big data in order to increase the diversity of the result set. In this paper, we firstly define a set's diversity and an element's ability to improve the set's overall diversity. Based on these definitions, we propose a diversification framework which has good performance in terms of effectiveness and efficiency. Also, this framework has theoretical guarantee on probability of success. Secondly, we design implementation algorithms based on this framework for both numerical and string data. Thirdly, for numerical and string data respectively, we carry out extensive experiments on real data to verify the performance of our proposed framework, and also perform scalability experiments on synthetic data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2019

Probery: A Probability-based Incomplete Query Optimization for Big Data

Nowadays, query optimization has been highly concerned in big data manag...
research
10/22/2018

biggy: An Implementation of Unified Framework for Big Data Management System

Various tools, softwares and systems are proposed and implemented to tac...
research
08/26/2018

A MapReduce based Big-data Framework for Object Extraction from Mosaic Satellite Images

We propose a framework stitching of vector representations of large scal...
research
03/08/2019

Do we still need fuzzy classifiers for Small Data in the Era of Big Data?

The Era of Big Data has forced researchers to explore new distributed so...
research
07/30/2019

A performance comparison of Dask and Apache Spark for data-intensive neuroimaging pipelines

In the past few years, neuroimaging has entered the Big Data era due to ...
research
07/17/2020

Diversifying Anonymized Data with Diversity Constraints

Recently introduced privacy legislation has aimed to restrict and contro...
research
08/10/2021

Diversity-aware Web APIs Recommendation with Compatibility Guarantee

With the ever-increasing prevalence of web APIs (Application Programming...

Please sign up or login with your details

Forgot password? Click here to reset