A natural language processing and geospatial clustering framework for harvesting local place names from geotagged housing advertisements

09/08/2018
by   Yingjie Hu, et al.
0

Local place names are frequently used by residents living in a geographic region. Such place names may not be recorded in existing gazetteers, due to their vernacular nature, relative insignificance to a gazetteer covering a large area (e.g., the entire world), recent establishment (e.g., the name of a newly-opened shopping center), or other reasons. While not always recorded, local place names play important roles in many applications, from supporting public participation in urban planning to locating victims in disaster response. In this paper, we propose a computational framework for harvesting local place names from geotagged housing advertisements. We make use of those advertisements posted on local-oriented websites, such as Craigslist, where local place names are often mentioned. The proposed framework consists of two stages: natural language processing (NLP) and geospatial clustering. The NLP stage examines the textual content of housing advertisements, and extracts place name candidates. The geospatial stage focuses on the coordinates associated with the extracted place name candidates, and performs multi-scale geospatial clustering to filter out the non-place names. We evaluate our framework by comparing its performance with those of six baselines. We also compare our result with four existing gazetteers to demonstrate the not-yet-recorded local place names discovered by our framework.

READ FULL TEXT
research
06/21/2018

An empirical study on the names of points of interest and their changes with geographic distance

While Points Of Interest (POIs), such as restaurants, hotels, and barber...
research
08/17/2018

Disambiguating fine-grained place names from descriptions by clustering

Everyday place descriptions often contain place names of fine-grained fe...
research
10/23/2020

Neural Code Completion with Anonymized Variable Names

Source code processing heavily relies on the methods widely used in natu...
research
06/28/2017

Generating Appealing Brand Names

Providing appealing brand names to newly launched products, newly formed...
research
12/17/2019

Function Naming in Stripped Binaries Using Neural Networks

In this paper we investigate the problem of automatically naming pieces ...
research
05/06/2019

Anonymized BERT: An Augmentation Approach to the Gendered Pronoun Resolution Challenge

We present our 7th place solution to the Gendered Pronoun Resolution cha...
research
07/22/2017

Identifying civilians killed by police with distantly supervised entity-event extraction

We propose a new, socially-impactful task for natural language processin...

Please sign up or login with your details

Forgot password? Click here to reset