Crime Prediction using Machine Learning with a Novel Crime Dataset

11/03/2022
by   Faisal Tareque Shohan, et al.
0

Crime is an unlawful act that carries legal repercussions. Bangladesh has a high crime rate due to poverty, population growth, and many other socio-economic issues. For law enforcement agencies, understanding crime patterns is essential for preventing future criminal activity. For this purpose, these agencies need structured crime database. This paper introduces a novel crime dataset that contains temporal, geographic, weather, and demographic data about 6574 crime incidents of Bangladesh. We manually gather crime news articles of a seven year time span from a daily newspaper archive. We extract basic features from these raw text. Using these basic features, we then consult standard service-providers of geo-location and weather data in order to garner these information related to the collected crime incidents. Furthermore, we collect demographic information from Bangladesh National Census data. All these information are combined that results in a standard machine learning dataset. Together, 36 features are engineered for the crime prediction task. Five supervised machine learning classification algorithms are then evaluated on this newly built dataset and satisfactory results are achieved. We also conduct exploratory analysis on various aspects the dataset. This dataset is expected to serve as the foundation for crime incidence prediction systems for Bangladesh and other countries. The findings of this study will help law enforcement agencies to forecast and contain crime as well as to ensure optimal resource allocation for crime patrol and prevention.

READ FULL TEXT

page 9

page 14

page 15

page 16

page 19

research
04/26/2022

Using Machine Learning to Fuse Verbal Autopsy Narratives and Binary Features in the Analysis of Deaths from Hyperglycaemia

Lower-and-middle income countries are faced with challenges arising from...
research
07/04/2018

CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction

In this paper, we introduce the Chinese AI and Law challenge dataset (CA...
research
05/18/2018

Can machine learning identify interesting mathematics? An exploration using empirically observed laws

We explore the possibility of using machine learning to identify interes...
research
05/10/2020

Belief Rule Based Expert System to Identify the Crime Zones

This paper focuses on Crime zone Identification. Then, it clarifies how ...
research
08/18/2020

RTFN: Robust Temporal Feature Network

Time series analysis plays a vital role in various applications, for ins...
research
07/03/2018

BIN-CT: Urban Waste Collection based in Predicting the Container Fill Level

The fast demographic growth, together with the concentration of the popu...

Please sign up or login with your details

Forgot password? Click here to reset