MobilityMirror: Bias-Adjusted Transportation Datasets

08/21/2018
by   Luke Rodriguez, et al.
0

We describe customized synthetic datasets for publishing mobility data. Private companies are providing new transportation modalities, and their data is of high value for integrative transportation research, policy enforcement, and public accountability. However, these companies are disincentivized from sharing data not only to protect the privacy of individuals (drivers and/or passengers), but also to protect their own competitive advantage. Moreover, demographic biases arising from how the services are delivered may be amplified if released data is used in other contexts. We describe a model and algorithm for releasing origin-destination histograms that removes selected biases in the data using causality-based methods. We compute the origin-destination histogram of the original dataset then adjust the counts to remove undesirable causal relationships that can lead to discrimination or violate contractual obligations with data owners. We evaluate the utility of the algorithm on real data from a dockless bike share program in Seattle and taxi data in New York, and show that these adjusted transportation datasets can retain utility while removing bias in the underlying data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2020

On the Data Fight Between Cities and Mobility Providers

E-Scooters are changing transportation habits. In an attempt to oversee ...
research
12/03/2020

Origin-Aware Next Destination Recommendation with Personalized Preference Attention

Next destination recommendation is an important task in the transportati...
research
01/04/2022

Generating synthetic mobility data for a realistic population with RNNs to improve utility and privacy

Location data collected from mobile devices represent mobility behaviors...
research
06/29/2020

Passive Wi-Fi Monitoring in Public Transport: A case study in the Madeira Island

Transportation has become of evermore importance in the last years, affe...
research
09/14/2020

Private data sharing between decentralized users through the privGAN architecture

More data is almost always beneficial for analysis and machine learning ...
research
05/03/2019

In Defense of Synthetic Data

Synthetic datasets have long been thought of as second-rate, to be used ...
research
02/24/2022

Differentially-Private Publication of Origin-Destination Matrices with Intermediate Stops

Conventional origin-destination (OD) matrices record the count of trips ...

Please sign up or login with your details

Forgot password? Click here to reset