Modeling Household Online Shopping Demand in the U.S.: A Machine Learning Approach and Comparative Investigation between 2009 and 2017

01/11/2021
by   Limon Barua, et al.
0

Despite the rapid growth of online shopping and research interest in the relationship between online and in-store shopping, national-level modeling and investigation of the demand for online shopping with a prediction focus remain limited in the literature. This paper differs from prior work and leverages two recent releases of the U.S. National Household Travel Survey (NHTS) data for 2009 and 2017 to develop machine learning (ML) models, specifically gradient boosting machine (GBM), for predicting household-level online shopping purchases. The NHTS data allow for not only conducting nationwide investigation but also at the level of households, which is more appropriate than at the individual level given the connected consumption and shopping needs of members in a household. We follow a systematic procedure for model development including employing Recursive Feature Elimination algorithm to select input variables (features) in order to reduce the risk of model overfitting and increase model explainability. Extensive post-modeling investigation is conducted in a comparative manner between 2009 and 2017, including quantifying the importance of each input variable in predicting online shopping demand, and characterizing value-dependent relationships between demand and the input variables. In doing so, two latest advances in machine learning techniques, namely Shapley value-based feature importance and Accumulated Local Effects plots, are adopted to overcome inherent drawbacks of the popular techniques in current ML modeling. The modeling and investigation are performed both at the national level and for three of the largest cities (New York, Los Angeles, and Houston). The models developed and insights gained can be used for online shopping-related freight demand generation and may also be considered for evaluating the potential impact of relevant policies on online shopping demand.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/23/2020

Machine Learning and value generation in Software Development: a survey

Machine Learning (ML) has become a ubiquitous tool for predicting and cl...
research
06/11/2021

Analyzing the Travel and Charging Behavior of Electric Vehicles – A Data-driven Approach

The increasing market penetration of electric vehicles (EVs) may pose si...
research
10/28/2019

The Study of Machine Learning Models in Predicting the Intention of Adolescents to Smoke Cigarettes

The use of electronic cigarette (e-cigarette) is increasing among adoles...
research
10/28/2019

A Study of Machine Learning Models in Predicting the Intention of Adolescents to Smoke Cigarettes

The use of electronic cigarette (e-cigarette) is increasing among adoles...
research
04/24/2023

Π-ML: A dimensional analysis-based machine learning parameterization of optical turbulence in the atmospheric surface layer

Turbulent fluctuations of the atmospheric refraction index, so-called op...
research
06/08/2023

Sequential mediation of parasocial relationships for purchase intention: PLS-SEM and machine learning approach

Companies employ social media influencers SMIs due to the compelling evi...

Please sign up or login with your details

Forgot password? Click here to reset