An Improved Approach for Estimating Social POI Boundaries With Textual Attributes on Social Media

12/18/2020
by   Cong Tran, et al.
0

It has been insufficiently explored how to perform density-based clustering by exploiting textual attributes on social media. In this paper, we aim at discovering a social point-of-interest (POI) boundary, formed as a convex polygon. More specifically, we present a new approach and algorithm, built upon our earlier work on social POI boundary estimation (SoBEst). This SoBEst approach takes into account both relevant and irrelevant records within a geographic area, where relevant records contain a POI name or its variations in their text field. Our study is motivated by the following empirical observation: a fixed representative coordinate of each POI that SoBEst basically assumes may be far away from the centroid of the estimated social POI boundary for certain POIs. Thus, using SoBEst in such cases may possibly result in unsatisfactory performance on the boundary estimation quality (BEQ), which is expressed as a function of the F-measure. To solve this problem, we formulate a joint optimization problem of simultaneously finding the radius of a circle and the POI's representative coordinate c by allowing to update c. Subsequently, we design an iterative SoBEst (I-SoBEst) algorithm, which enables us to achieve a higher degree of BEQ for some POIs. The computational complexity of the proposed I-SoBEst algorithm is shown to scale linearly with the number of records. We demonstrate the superiority of our algorithm over competing clustering methods including the original SoBEst.

READ FULL TEXT

page 10

page 11

research
06/14/2018

Improved Density-Based Spatio--Textual Clustering on Social Media

DBSCAN may not be sufficient when the input data type is heterogeneous i...
research
06/09/2018

DIR-ST^2: Delineation of Imprecise Regions Using Spatio--Temporal--Textual Information

An imprecise region is referred to as a geographical area without a clea...
research
08/05/2019

Animal Wildlife Population Estimation Using Social Media Images Collections

We are losing biodiversity at an unprecedented scale and in many cases, ...
research
10/24/2015

Combine CRF and MMSEG to Boost Chinese Word Segmentation in Social Media

In this paper, we propose a joint algorithm for the word segmentation on...
research
03/02/2023

Building Dynamic Ontological Models for Place using Social Media Data from Twitter and Sina Weibo

Place holds human thoughts and experiences. Space is defined with geomet...
research
07/25/2019

A new approach (extra vertex) and generalization of Shoelace Algorithm usage in convex polygon (Point-in-Polygon)

In this paper we aim to bring new approach into usage of Shoelace Algori...

Please sign up or login with your details

Forgot password? Click here to reset