Semantic and Influence aware k-Representative Queries over Social Streams

01/29/2019
by   Yanhao Wang, et al.
0

Massive volumes of data continuously generated on social platforms have become an important information source for users. A primary method to obtain fresh and valuable information from social streams is social search. Although there have been extensive studies on social search, existing methods only focus on the relevance of query results but ignore the representativeness. In this paper, we propose a novel Semantic and Influence aware k-Representative (k-SIR) query for social streams based on topic modeling. Specifically, we consider that both user queries and elements are represented as vectors in the topic space. A k-SIR query retrieves a set of k elements with the maximum representativeness over the sliding window at query time w.r.t. the query vector. The representativeness of an element set comprises both semantic and influence scores computed by the topic model. Subsequently, we design two approximation algorithms, namely Multi-Topic ThresholdStream (MTTS) and Multi-Topic ThresholdDescend (MTTD), to process k-SIR queries in real-time. Both algorithms leverage the ranked lists maintained on each topic for k-SIR processing with theoretical guarantees. Extensive experiments on real-world datasets demonstrate the effectiveness of k-SIR query compared with existing methods as well as the efficiency and scalability of our proposed algorithms for k-SIR processing.

READ FULL TEXT
research
04/06/2023

LSketch: A Label-Enabled Graph Stream Sketch Toward Time-Sensitive Queries

Graph streams represent data interactions in real applications. The mini...
research
07/06/2020

Topic-based Community Search over Spatial-Social Networks (Technical Report)

Recently, the community search problem has attracted significant attenti...
research
02/14/2018

Influential User Subscription on Time-Decaying Social Streams

Influence maximization which asks for k-size seed set from a social netw...
research
08/26/2019

NETR-Tree: An Eifficient Framework for Social-Based Time-Aware Spatial Keyword Query

The prevalence of social media and the development of geo-positioning te...
research
09/20/2017

ProbeSim: Scalable Single-Source and Top-k SimRank Computations on Dynamic Graphs

Single-source and top-k SimRank queries are two important types of simil...
research
03/15/2021

Online Topic-Aware Entity Resolution Over Incomplete Data Streams (Technical Report)

In many real applications such as the data integration, social network a...
research
03/22/2021

Efficient Processing of k-regret Minimization Queries with Theoretical Guarantees

Assisting end users to identify desired results from a large dataset is ...

Please sign up or login with your details

Forgot password? Click here to reset