Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks

10/22/2020
by Shenhao Wang, et al.

Researchers often treat data-driven and theory-driven models as two disparate or even conflicting methods in travel behavior analysis. However, the two methods are highly complementary: data-driven methods are more predictive but less interpretable and robust, while theory-driven methods are more interpretable and robust but less predictive. Exploiting this complementarity, this study designs a theory-based residual neural network (TB-ResNet) framework, which synergizes discrete choice models (DCMs) and deep neural networks (DNNs) based on their shared utility interpretation. The TB-ResNet framework is simple: it uses a (δ, 1-δ) weighting to combine the simplicity of DCMs with the richness of DNNs, and to prevent both underfitting from the DCMs and overfitting from the DNNs. The framework is also flexible: three instances of TB-ResNets are designed based on the multinomial logit model (MNL-ResNets), prospect theory (PT-ResNets), and hyperbolic discounting (HD-ResNets), and they are tested on three data sets. Compared to pure DCMs, the TB-ResNets provide greater prediction accuracy and reveal a richer set of behavioral mechanisms, because the DNN component augments the utility function. Compared to pure DNNs, the TB-ResNets modestly improve prediction and significantly improve interpretation and robustness, because the DCM component stabilizes the utility functions and input gradients. Overall, this study demonstrates that it is both feasible and desirable to synergize DCMs and DNNs by combining their utility specifications under a TB-ResNet framework. Although some limitations remain, the TB-ResNet framework is an important first step toward creating mutual benefits between DCMs and DNNs for travel behavior modeling, with joint improvement in prediction, interpretation, and robustness.
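The (δ, 1-δ) weighting described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes that the DCM utility receives weight 1-δ and the DNN utility receives weight δ (the abstract does not say which component gets which weight), that the DCM is a linear-in-parameters MNL, and that the DNN is a single hidden ReLU layer shared across alternatives. All function and parameter names (`mnl_utility`, `dnn_utility`, `tb_resnet_probs`, `beta`, `w1`, `b1`, `w2`) are hypothetical.

```python
import numpy as np

def softmax(v):
    # Row-wise softmax over alternatives, shifted for numerical stability.
    z = v - v.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def mnl_utility(x, beta):
    # Assumed DCM part: linear-in-parameters utility.
    # x: (n, K, d) attributes of K alternatives; beta: (d,). Returns (n, K).
    return x @ beta

def dnn_utility(x, w1, b1, w2):
    # Assumed DNN part: one hidden ReLU layer shared across alternatives.
    # w1: (d, h), b1: (h,), w2: (h,). Returns (n, K).
    hidden = np.maximum(x @ w1 + b1, 0.0)
    return hidden @ w2

def tb_resnet_probs(x, beta, w1, b1, w2, delta):
    # Assumed combination: (1 - delta) * DCM utility + delta * DNN utility,
    # followed by a softmax to obtain choice probabilities.
    v = (1.0 - delta) * mnl_utility(x, beta) + delta * dnn_utility(x, w1, b1, w2)
    return softmax(v)

# Illustrative random inputs: 5 individuals, 3 alternatives, 4 attributes.
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 3, 4))
beta = rng.normal(size=4)
w1 = rng.normal(size=(4, 8))
b1 = np.zeros(8)
w2 = rng.normal(size=8)

probs = tb_resnet_probs(x, beta, w1, b1, w2, delta=0.3)
print(probs.shape)
```

Under these assumptions, setting `delta=0` recovers the pure MNL choice probabilities and `delta=1` recovers the pure DNN ones, so δ controls how far the DNN residual may move the utilities away from the theory-based specification.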


