Prediction of adverse events in Afghanistan: regression analysis of time series data grouped not by geographic dependencies

by   Krzysztof Fiok, et al.

The aim of this study was to approach a difficult regression task on highly unbalanced data regarding active theater of war in Afghanistan. Our focus was set on predicting the negative events number without distinguishing precise nature of the events given historical data on investment and negative events per each of predefined 400 Afghanistan districts. In contrast with previous research on the matter, we propose an approach to analysis of time series data that benefits from non-conventional aggregation of these territorial entities. By carrying out initial exploratory data analysis we demonstrate that dividing data according to our proposal allows to identify strong trend and seasonal components in the selected target variable. Utilizing this approach we also tried to estimate which data regarding investments is most important for prediction performance. Based on our exploratory analysis and previous research we prepared 5 sets of independent variables that were fed to 3 machine learning regression models. The results expressed by mean absolute and mean square errors indicate that leveraging historical data regarding target variable allows for reasonable performance, however unfortunately other proposed independent variables does not seem to improve prediction quality.


page 1

page 2

page 3

page 4


An analysis of deep neural networks for predicting trends in time series data

Recently, a hybrid Deep Neural Network (DNN) algorithm, TreNet was propo...

An exploratory time series analysis of total deaths per month in Brazil since 2015

In this article, we investigate the historical series of the total numbe...

Predicting Berth Stay for Tanker Terminals: A Systematic and Dynamic Approach

Given the trend of digitization and increasing number of maritime transp...

A Non-linear Function-on-Function Model for Regression with Time Series Data

In the last few decades, building regression models for non-scalar varia...

Modern strategies for time series regression

This paper discusses several modern approaches to regression analysis in...

A windowed correlation based feature selection method to improve time series prediction of dengue fever cases

The performance of data-driven prediction models depends on the availabi...

Estimating Conditional Transfer Entropy in Time Series using Mutual Information and Non-linear Prediction

We propose a new estimator to measure directed dependencies in time seri...