Generating synthetic mobility data for a realistic population with RNNs to improve utility and privacy

01/04/2022
by   Alex Berke, et al.
0

Location data collected from mobile devices represent mobility behaviors at individual and societal levels. These data have important applications ranging from transportation planning to epidemic modeling. However, issues must be overcome to best serve these use cases: The data often represent a limited sample of the population and use of the data jeopardizes privacy. To address these issues, we present and evaluate a system for generating synthetic mobility data using a deep recurrent neural network (RNN) which is trained on real location data. The system takes a population distribution as input and generates mobility traces for a corresponding synthetic population. Related generative approaches have not solved the challenges of capturing both the patterns and variability in individuals' mobility behaviors over longer time periods, while also balancing the generation of realistic data with privacy. Our system leverages RNNs' ability to generate complex and novel sequences while retaining patterns from training data. Also, the model introduces randomness used to calibrate the variation between the synthetic and real data at the individual level. This is to both capture variability in human mobility, and protect user privacy. Location based services (LBS) data from more than 22,700 mobile devices were used in an experimental evaluation across utility and privacy metrics. We show the generated mobility data retain the characteristics of the real data, while varying from the real data at the individual level, and where this amount of variation matches the variation within the real data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2018

Generative Models for Simulating Mobility Trajectories

Mobility datasets are fundamental for evaluating algorithms pertaining t...
research
08/13/2018

Mitigating Location Privacy Attacks on Mobile Devices using Dynamic App Sandboxing

We present the design, implementation and evaluation of a system, called...
research
01/05/2023

Zen: LSTM-based generation of individual spatiotemporal cellular traffic with interactions

Domain-wide recognized by their high value in human presence and activit...
research
10/26/2020

Open Smartphone Data for Structured Mobility and Utilization Analysis in Ubiquitous Systems

The development and evaluation of new data mining methods for ubiquitous...
research
08/21/2018

MobilityMirror: Bias-Adjusted Transportation Datasets

We describe customized synthetic datasets for publishing mobility data. ...
research
07/17/2018

Privacy-preserving classifiers recognize shared mobility behaviours from WiFi network imperfect data

This paper proves the concept that it is feasible to accurately recogniz...
research
04/19/2022

Mobility Analysis Workflow (MAW): An accessible, interoperable, and reproducible container system for processing raw mobile data

Mobility analysis, or understanding and modeling of people's mobility pa...

Please sign up or login with your details

Forgot password? Click here to reset