Police Text Analysis: Topic Modeling and Spatial Relative Density Estimation

02/08/2022
by   Sarah Huestis-Mitchell, et al.
24

We analyze a large corpus of police incident narrative documents in understanding the spatial distribution of the topics. The motivation for doing this is that police narratives in each incident report contains very fine-grained information that is richer than the category that is manually assigned by the police. Our approach is to split the corpus into topics using two different unsupervised machine learning algorithms - Latent Dirichlet Allocation and Non-negative Matrix Factorization. We validate the performance of each learned topic model using model coherence. Then, using a k-nearest neighbors density ratio estimation (kNN-DRE) approach that we propose, we estimate the spatial density ratio per topic and use this for data discovery and analysis of each topic, allowing for insights into the described incidents at scale. We provide a qualitative assessment of each topic and highlight some key benefits for using our kNN-DRE model for estimating spatial trends.

READ FULL TEXT

page 6

page 7

page 8

page 9

research
01/31/2022

Guided Semi-Supervised Non-negative Matrix Factorization on Legal Documents

Classification and topic modeling are popular techniques in machine lear...
research
07/06/2021

Topic Modeling in the Voynich Manuscript

This article presents the results of investigations using topic modeling...
research
12/18/2019

Topic subject creation using unsupervised learning for topic modeling

We describe the use of Non-Negative Matrix Factorization (NMF) and Laten...
research
08/21/2022

SeNMFk-SPLIT: Large Corpora Topic Modeling by Semantic Non-negative Matrix Factorization with Automatic Model Selection

As the amount of text data continues to grow, topic modeling is serving ...
research
04/21/2021

Clustering Introductory Computer Science Exercises Using Topic Modeling Methods

Manually determining concepts present in a group of questions is a chall...
research
08/19/2023

Exploring the Power of Topic Modeling Techniques in Analyzing Customer Reviews: A Comparative Analysis

The exponential growth of online social network platforms and applicatio...
research
05/18/2019

Cross-referencing using Fine-grained Topic Modeling

Cross-referencing, which links passages of text to other related passage...

Please sign up or login with your details

Forgot password? Click here to reset