Application of Data Science to Discover Violence-Related Issues in Iraq

06/14/2020
by   Merari González, et al.
0

Data science has been satisfactorily used to discover social issues in several parts of the world. However, there is a lack of governmental open data to discover those issues in countries such as Iraq. This situation arises the following questions: how to apply data science principles to discover social issues despite the lack of open data in Iraq? How to use the available data to make predictions in places without data? Our contribution is the application of data science to open non-governmental big data from the Global Database of Events, Language, and Tone (GDELT) to discover particular violence-related social issues in Iraq. Specifically we applied the K-Nearest Neighbors, Näive Bayes, Decision Trees, and Logistic Regression classification algorithms to discover the following issues: refugees, humanitarian aid, violent protests, fights with artillery and tanks, and mass killings. The best results were obtained with the Decision Trees algorithm to discover areas with refugee crises and artillery fights. The accuracy for these two events is 0.7629. The precision to discover the locations of refugee crises is 0.76, the recall is 0.76, and the F1-score is 0.76. Also, our approach discovers the locations of artillery fights with a precision of 0.74, a recall of 0.75, and a F1-score of 0.75.

READ FULL TEXT

page 5

page 10

research
01/20/2022

Scalable k-d trees for distributed data

Data structures known as k-d trees have numerous applications in scienti...
research
07/09/2022

Supervised Machine Learning for Effective Missile Launch Based on Beyond Visual Range Air Combat Simulations

This work compares supervised machine learning methods using reliable da...
research
02/24/2021

Sentiment Analysis of Code-Mixed Social Media Text (Hinglish)

This paper discusses the results obtained for different techniques appli...
research
10/27/2021

The chemical space of terpenes: insights from data science and AI

Terpenes are a widespread class of natural products with significant che...
research
08/19/2020

LMFAO: An Engine for Batches of Group-By Aggregates

LMFAO is an in-memory optimization and execution engine for large batche...
research
09/06/2021

Data Science Kitchen at GermEval 2021: A Fine Selection of Hand-Picked Features, Delivered Fresh from the Oven

This paper presents the contribution of the Data Science Kitchen at Germ...

Please sign up or login with your details

Forgot password? Click here to reset