Rethinking travel behavior modeling representations through embeddings

08/31/2019
by   Francisco C. Pereira, et al.

This paper introduces the concept of travel behavior embeddings, a method for re-representing discrete variables that are typically used in travel demand modeling, such as mode, trip purpose, education level, family type or occupation. This re-representation maps those variables into a latent space called the embedding space. The benefit is that such spaces allow for richer nuances than the typical transformations applied to categorical variables (e.g. dummy encoding, contrast encoding, principal components analysis). While latent variable representations are not new per se in travel demand modeling, the idea presented here brings several innovations: it is an entirely data-driven algorithm; it is informative and consistent, since the latent space can be visualized and interpreted based on distances between categories; it preserves the interpretability of coefficients, despite being based on neural network principles; and it is transferable, in that embeddings learned from one dataset can be reused for others, as long as travel behavior remains consistent between the datasets. The idea is strongly inspired by natural language processing techniques, namely the word2vec algorithm, which is behind recent developments such as automatic translation and next-word prediction. Our method is demonstrated using a mode choice model, and shows improvements of up to 60% with respect to the initial likelihood, and up to 20% with respect to the corresponding traditional model (i.e. using dummy variables) in out-of-sample evaluation. We provide a new Python package, called PyTre (PYthon TRavel Embeddings), that others can straightforwardly use to replicate our results or improve their own models. Our experiments are themselves based on an open dataset (swissmetro).
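
To make the contrast between dummy encoding and an embedding representation concrete, here is a minimal NumPy sketch. It is not the PyTre API, and it does not train anything. The category names, the embedding dimension `D`, and the random values in `E` are illustrative assumptions; in the method described above, the embedding matrix would be learned from data.

```python
import numpy as np

# Hypothetical categories for a "trip purpose" variable (illustrative only).
categories = ["commute", "business", "shopping", "leisure"]
index = {name: i for i, name in enumerate(categories)}

def one_hot(name):
    """Dummy (one-hot) encoding: one column per category."""
    vec = np.zeros(len(categories))
    vec[index[name]] = 1.0
    return vec

# Embedding matrix with one row per category and D latent dimensions.
# These values are random placeholders (an assumption for this sketch);
# a real embedding matrix would be estimated from observed behavior.
rng = np.random.default_rng(0)
D = 2
E = rng.normal(size=(len(categories), D))

def embed(name):
    """Embedding lookup, equivalent to one_hot(name) @ E."""
    return E[index[name]]

def distance(a, b):
    """Euclidean distance between two categories in the embedding space."""
    return float(np.linalg.norm(embed(a) - embed(b)))
```

Under dummy encoding every pair of distinct categories is equally far apart, whereas in the embedding space `distance("commute", "business")` and `distance("commute", "leisure")` can differ. That difference is what allows categories to be compared and visualized.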

