Imputing Missing Boarding Stations With Machine Learning Methods

03/10/2020
by Nadav Shalit, et al.

With the increase in population densities and environmental awareness, public transport has become an important aspect of urban life. Consequently, large quantities of transportation data are generated, and mining smart card data has become a standard method for understanding the travel habits of passengers. Public transport datasets, however, often lack data integrity; boarding stop information may be missing due to either imperfect acquisition processes or inadequate reporting. As a result, large quantities of observations and even complete sections of cities might be absent from the smart card database. We have developed a supervised machine learning method to impute missing boarding stops based on ordinal classification. In addition, we present a new metric, Pareto Accuracy, to evaluate algorithms where classes have an ordinal nature. Results are based on a case study in the Israeli city of Beer Sheva using one month of data. We show that our proposed method significantly outperforms current imputation methods and can improve the accuracy and usefulness of large-scale transportation data.

