
Autodetection and Classification of Hidden Cultural City Districts from Yelp Reviews

by Harini Suresh, et al.

Topic models are a way to discover underlying themes in an otherwise unstructured collection of documents. In this study, we use the Latent Dirichlet Allocation (LDA) topic model on a dataset of Yelp reviews to classify restaurants based on their reviews. Furthermore, we hypothesize that within a city, restaurants can be grouped into "clusters" based on both their location and the similarity of their topics. We use several clustering methods, including K-means clustering and a probabilistic mixture model, to uncover and classify districts within a city, both well-known and hidden (e.g., cultural areas like Chinatown, or hearsay like "the best street for Italian restaurants"). We use these models to display and label the clusters on a map. We also introduce a topic similarity heatmap that shows how similar each part of a city is to a new restaurant.
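To make the clustering idea concrete, here is a minimal sketch in plain Python, not the authors' implementation. It assumes each restaurant already has a topic distribution (for example, from LDA) and normalized map coordinates, then runs K-means on the concatenated location and topic features. The restaurant data, the `LOCATION_WEIGHT` trade-off, the deterministic farthest-point initialization, and the use of cosine similarity for the heatmap values are all illustrative assumptions; the abstract does not specify any of them.

```python
import math

# Hypothetical data: (name, x, y, topic_distribution). Coordinates are
# normalized to [0, 1]; topic vectors stand in for per-restaurant LDA output.
restaurants = [
    ("A", 0.10, 0.10, [0.80, 0.10, 0.10]),
    ("B", 0.12, 0.08, [0.70, 0.20, 0.10]),
    ("C", 0.09, 0.11, [0.90, 0.05, 0.05]),
    ("D", 0.90, 0.85, [0.10, 0.80, 0.10]),
    ("E", 0.88, 0.90, [0.05, 0.90, 0.05]),
    ("F", 0.92, 0.87, [0.10, 0.70, 0.20]),
]

LOCATION_WEIGHT = 1.0  # assumed trade-off between location and topic similarity


def features(x, y, topics):
    """Concatenate weighted location and topic proportions into one vector."""
    return [LOCATION_WEIGHT * x, LOCATION_WEIGHT * y] + list(topics)


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((u - v) ** 2 for u, v in zip(a, b))


def farthest_point_init(points, k):
    """Deterministic init (an assumption): start at the first point, then
    repeatedly add the point farthest from all centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    return centroids


def kmeans(points, k, max_iter=100):
    """Standard K-means: alternate assignment and centroid update steps."""
    centroids = farthest_point_init(points, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def cosine_similarity(a, b):
    """Cosine similarity between two topic vectors."""
    dot = sum(u * v for u, v in zip(a, b))
    return dot / (math.sqrt(sum(u * u for u in a)) * math.sqrt(sum(v * v for v in b)))


points = [features(x, y, t) for _, x, y, t in restaurants]
labels, centroids = kmeans(points, k=2)

# Heatmap values: similarity of a hypothetical new restaurant to each existing one.
new_topics = [0.80, 0.10, 0.10]
similarities = {name: cosine_similarity(new_topics, t) for name, _, _, t in restaurants}

for (name, *_), lab in zip(restaurants, labels):
    print(name, "cluster", lab, "similarity", round(similarities[name], 3))
```

Raising `LOCATION_WEIGHT` makes clusters more geographically compact, while lowering it lets topic similarity dominate; the right balance would depend on the city and the scale of the coordinates.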



Topic Model Supervised by Understanding Map

Inspired by the notion of Center of Mass in physics, an extension called...

Discovering Latent Patterns of Urban Cultural Interactions in WeChat for Modern City Planning

Cultural activity is an inherent aspect of urban life and the success of...

Dirichlet-vMF Mixture Model

This document is about the multi-document Von-Mises-Fisher mixture model...

Topic Analysis for Text with Side Data

Although latent factor models (e.g., matrix factorization) obtain good p...

Unification of HDP and LDA Models for Optimal Topic Clustering of Subject Specific Question Banks

There has been an increasingly popular trend in Universities for curricu...

Reconstructing Pompeian Households

A database of objects discovered in houses in the Roman city of Pompeii ...

An Analysis of Human-centered Geolocation

Online social networks contain a constantly increasing amount of images ...