Use of Simulation Models for the Development of a Statistical Production Framework for Mobile Network Data with the simutils Package

01/20/2022
by B. Oancea, et al.

We propose using agent-based simulation models to develop statistical methods in Official Statistics, especially in relation to new digital data sources. We present a mobile network data simulator managed through the simutils R package, which also provides geospatial representations of the simulated data. While the synthetic data are produced by an external tool, the simutils package allows an R user to parameterize and run this external simulation tool, to build geospatial data structures from the simulation output, and to compute several aggregates. The geospatial data structures were also designed for use in a visualization package. Useful simulation models must incorporate real metadata from mobile telecommunication networks, which led us to include functionality that allows the user to specify and validate these metadata. All metadata are specified in XML files whose structure is defined in corresponding XSD files. The R package includes example data sets, and we show here how to validate the metadata, how to run a simulation, how to build the geospatial data structures, and how to compute different aggregates.
