The transportation system in the U.S. produced 28% of the total greenhouse gas (GHG) emissions in 2018, which is the largest share from a single source. According to the United States Environmental Protection Agency in 2017, 81% of the GHG was CO2, which is a major contributor to climate change and global warming. The employment of information and communication technology (ICT) and connected and automated vehicles (CAVs) has been suggested as a potential solution to alleviate the undesirable social and environmental impacts of transportation systems. In particular, routing schemes for CAVs have an explicit impact on traffic [11, 1, 26, 51] and environmental characteristics.
Myopic routing is reactive, whereas anticipatory routing is proactive. Myopic routing uses the current network state, while anticipatory routing exploits the predicted conditions when routing vehicles to their destinations. Anticipatory routing is considered a promising approach to improve the traffic characteristics of a network and avoid congestion, especially at a high market penetration rate (MPR) of vehicles equipped with a routing system based on anticipatory information [10, 8, 32].
Eco-routing is a special case of routing that specifically considers the environmental aspects [30, 47]. Several studies in the literature developed myopic eco-routing systems. For instance, two studies developed single- as well as multi-objective eco-routing for CAVs and found encouraging results. A review paper classified the eco-routing models and illustrated the strengths and weaknesses of each category. The common limitations are related to the level of data point resolution, the scale of the case study, and the number of objectives optimized at once. With reference to anticipatory routing in general, studies are scarce in the literature, especially recent ones. For the few anticipatory routing studies, which consider only travel time as the routing objective [34, 8, 10], the limitations are related to the scale of the case study, the level of temporal and spatial resolution, the use of centralized solutions that suffer from scaling issues, and the use of reflective prediction models [34, 8, 10, 23]. The anticipatory routing studies found did not employ sophisticated predictive models, which is a major limitation. The forecasted travel time data points were provided by running the traffic simulation in advance [9, 24, 8] or were based on regression models between speed and other traffic variables, such as density. In most of the studies, the travel time of a given time step, obtained from the traffic simulation or historical data, was reused one prediction interval later for the anticipatory routing application.
Unlike previous studies, this work develops anticipatory multi-objective eco-routing strategies that can be implemented in the routing systems for connected and automated vehicles. Similar to the myopic routing schemes proposed in , a microscopic level of aggregation, an urban network as a case study, and a high level of spatial (link level) and temporal (1 minute) resolution are employed in this study. Unlike the previous anticipatory approaches that were centralized solutions, this study develops the anticipatory routing strategies based on a dynamic distributed routing framework i.e. End-to-End Routing for Connected and Automated Vehicles (E2ECAV) . In this study three eco-routing strategies are applied to optimize travel time (TT), GHG, as well as the combination of TT and GHG. The performance indicators considered for comparison between the different routing strategies are average TT, average vehicle kilometres travelled (VKT), total GHG, and total NOx emissions produced. Following are the main contributions:
Examination of the most representative GHG costing approach at highly disaggregate spatial (link level) and temporal (1 minute prediction interval) resolution.
Development of anticipatory multi-objective eco-routing strategies, while employing the predictive model for speed developed here and a GHG prediction model developed in .
Detailed comparison between myopic and anticipatory routing strategies.
A brief literature review of the existing eco-routing studies, their strengths, and their weaknesses is presented in Section 2. In addition, existing predictive models of travel time and GHG and anticipatory routing studies are discussed in Section 2. The specifications of the case study are given in Section 3. Section 4 includes the details related to the traffic and emission models, GHG costing approaches, and GHG and speed predictive models. Section 5 presents the results and the related discussion. Finally, Section 6 summarizes the major findings and future outlook.
Due to the advancements in sensing and communication technologies, their utilization in transportation systems, and the emergence of connected and automated vehicles, real-time high-resolution data have become available at large scale. Such data can be used to develop highly accurate prediction models that have the potential to be used for anticipatory routing with multiple objectives. In this section, the studies found in the literature related to myopic eco-routing, predictive models of travel time and GHG, and anticipatory routing are briefly presented.
A comprehensive review of eco-routing studies reported that the previous myopic eco-routing studies have predominantly used macroscopic-level traffic and emission models, are based on small case studies, used centralized routing mechanisms, and have optimized a single routing objective at a time. To overcome the aforementioned limitations, one study applied multi-objective eco-routing in a distributed routing framework. The authors used the per-lane weighted average for the GHG cost on links. Although normalizing by the number of lanes resulted in underestimation for the links with a large number of lanes, reductions in the travel time and emissions produced were noticed when the multi-objective routing was applied. Another study developed myopic multi-objective eco-routing strategies for CAVs, considering the cost of idling as a penalty at the downstream intersection of a link at every interval. The authors used the marginal cost for GHG in the objective function. They found that including the penalty cost contributed to reductions of 4% and 3% in the average travel time and total GHG produced, respectively, in the case of the multi-objective routing when compared to the single-objective routing.
Predictive models are an essential component of anticipatory routing. A comprehensive review of short-term traffic forecasting noted that studies mainly considered freeways as their case studies, relied on statistical models, and used a temporal resolution of five minutes in most cases. Freeways are utilized due to the complexity associated with urban congested networks. The vehicular dynamics in urban areas are subject to changes within a short time period (seconds), as a result of stop-and-go phenomena and the shorter length of the links when compared to freeways. A low level of temporal resolution is employed due to the scarcity of microscopic data points and the high computational power required. Statistical models, for example the autoregressive integrated moving average (ARIMA) model, are easy to use, but have limitations when dealing with complex non-linear relationships between the variables concerned.
When it comes to travel time prediction, there are two main streams: one predicts travel time directly, while the other predicts speed and consequently travel time. For directly forecasting travel time, several predictive models have been employed, including linear modelling, the nonlinear autoregressive with external inputs (NARX) model, the nonlinear autoregressive (NAR) model, clustering, neural networks (NNs), and deep neural networks [39, 13].
A large body of literature exists where travel time is implicitly predicted from speed, including [18, 52, 31, 53, 22, 21]. The common features of most of the aforementioned predictive models are the low temporal resolution and the small case study employed. Moreover, statistical models dominate, despite their inability to capture the complicated relationship between the variables concerned. Even when a large network was employed, the speed was predicted at a low temporal resolution. With regard to the predictive models for speed and travel time, it was found in the literature that long short-term memory (LSTM) outperformed other predictive approaches, including the ARIMA model.
With regard to the GHG predictive models, GHG emissions were predicted based on yearly data points of fuel, gross domestic product, or other economic factors [7, 6, 35]. The predictive models varied from statistical [38, 46] to deep neural network based. To overcome the limitations of the previous predictive models, namely the low spatial (national) and temporal (yearly) resolution, a predictive model based on LSTM was developed. The GHG emission rate (ER) at a link level and one-minute time resolution was predicted based on the most representative traffic indicators of the previous time intervals.
With reference to the anticipatory routing, several frameworks were proposed [8, 10, 32] to minimize travel time.
One study developed a framework for anticipatory routing in a network with only a single OD pair, 14 links, and eleven OD paths. The author developed a simulator with three major components. The three components are all time-dependent and consist of network conditions, path splits, and guidance messages. Three maps were used to illustrate the relationships between these three variables. The network loading map used the path splits to define the network conditions. The guidance map employed the network conditions to define the guidance messages. Finally, the routing map translated the guidance messages into path splits. In terms of the predicted traffic variables, the real-time traffic characteristics and other related data were forecasted at short- and medium-term for the anticipatory routing. Variable message signs (VMS) were employed to provide vehicles with the best route. Even when the best route was defined, the driver’s compliance was not guaranteed. Hence, the author incorporated a logit model to define the drivers’ path choice.
Another study proposed DynaMIT, which can be employed to generate real-time guidance for drivers. Off-line and real-time information were adopted. The off-line data as well as historical network conditions were used for the state estimation, while the real-time data were obtained from the control system. Two simulation tools were used: a demand simulator and a supply simulator. The demand simulator estimated and forecasted the origin-destination (OD) flow, departure time of drivers, mode, and route choice, while the supply simulator directly simulated the interactions between the demand and supply (network). Anticipatory routing was applied with travel time minimization as the routing objective. The predicted travel time was a function of the experienced travel time from the previous iteration. Speed on links was estimated based on a linear relationship with the density on the link concerned. The VMS were used to inform drivers of the best route. The authors found that anticipatory routing is promising, as it contributed to reductions in the travel time of vehicles. A further study suggested proactive re-routing strategies to reduce travel time. Congestion was predicted based on the ratio of density to jam density. The speed was predicted based on the Greenshields model, a linear relationship between density and speed, which is a limitation because the relationship is not linear in reality. When the density is low, the speed is underestimated. The authors found that their rerouting performed as well as the dynamic traffic assignment. Another example is a work that applied re-routing based on congestion prediction. One of the predictive models developed was based on the spatiotemporal correlation. The authors assumed that the traffic flow was constant during each prediction time interval, which is unrealistic.
Another study proposed a dynamic congestion model based on crowdsourcing in order to apply anticipatory routing for a set of cooperative vehicles by predicting the probability distribution of traffic conditions. The time interval adopted for the route updates was 1 minute, and the data were obtained from GPS traces and social media. The authors found that their approach outperformed the myopic routing approach.
To summarize, the existing predictive models are associated with limitations related to the spatial and temporal resolution. The anticipatory routing studies did not adopt efficient and accurate predictive models, and they were applied in the context of a centralized routing framework. Centralized routing frameworks require a large infrastructure investment, are highly sensitive to system failures, and involve a high degree of complexity in the case of a system upgrade. Hence, this study tackles the aforementioned limitations. To the best of our knowledge, our study is the first of its kind to apply anticipatory multi-objective eco-routing while deploying deep learning based predictive models in a distributed routing framework.
3 Case study
Downtown Toronto’s road network is adopted as a case study because it experiences high levels of recurrent congestion, especially during the morning peak period. Downtown Toronto is the financial centre of Canada and has the highest job density among the major cities in the country. The network is composed of 223 links and 76 intersections. Based on the 2019/2020 Toronto’s vital report, several factors contribute to the excessive congestion levels in Toronto. The population of Toronto has increased yearly by 1% since 2011. Toronto is also the most expensive major city in the country in terms of cost of living: over the last decade, ownership costs have grown four times faster than income, while renting costs have grown two times faster than income. The vehicular demand is provided by the Transportation Tomorrow Survey (TTS) for the period between 7:45am and 8:00am for the year 2014. Figure 1 illustrates the area, including the major roads. Links in the case study differ in speed limit, number of lanes, and number of directions; this high level of heterogeneity supports a realistic, generic application, especially for prediction. Speed limits of 10, 30, 40, 60, and 80 km/h apply to 2%, 1%, 30%, 59%, and 8% of the links, respectively. Links with 1, 2, 3, and 4 lanes account for 7%, 71%, 15%, and 7% of the links, respectively.
This section includes the specifications of the traffic and emission models, GHG costing approaches investigated, GHG and speed predictive models adopted, and the routing strategies taken into account. Figure 2 demonstrates the general framework followed in this study.
4.1 Traffic and emission models
Microscopic traffic and emission simulators are deployed in this study to obtain high-resolution data points at every second. The Intelligent Driver Model (IDM) is the car-following model adopted for the displacement estimation at every second, which is used to calculate the speed of vehicles. The second-by-second vehicular characteristics are captured and then used to estimate the space mean link indicators. The simulation ends when all of the vehicles reach their destinations. The link indicators (speed, density, GHG, and flow) are updated every minute.
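As an illustration of the car-following step, the following is a minimal sketch of the standard IDM acceleration equation with a one-second update. The parameter values (desired speed, time headway, maximum acceleration, comfortable deceleration, minimum gap, vehicle length) are illustrative assumptions and are not the values calibrated for this study; the simple update in `step` is likewise an assumption and not the simulator's actual integration scheme.

```python
import math


def idm_acceleration(v, gap, dv, v0=13.9, T=1.5, a_max=1.0, b=2.0, s0=2.0, delta=4.0):
    """Standard IDM acceleration in m/s^2.

    v   : speed of the following vehicle (m/s)
    gap : bumper-to-bumper distance to the leader (m), must be > 0
    dv  : approach rate, v - v_leader (m/s)
    All default parameter values are illustrative assumptions.
    """
    # Desired dynamic gap: minimum gap + headway term + braking interaction term.
    s_star = s0 + max(0.0, v * T + v * dv / (2.0 * math.sqrt(a_max * b)))
    return a_max * (1.0 - (v / v0) ** delta - (s_star / gap) ** 2)


def step(x, v, x_lead, v_lead, dt=1.0, veh_len=5.0):
    """Advance the follower by one time step (default 1 s); returns (position, speed)."""
    gap = max(x_lead - x - veh_len, 0.1)  # avoid division by zero at standstill
    acc = idm_acceleration(v, gap, v - v_lead)
    v_new = max(0.0, v + acc * dt)        # vehicles do not reverse
    x_new = x + 0.5 * (v + v_new) * dt    # displacement over the step
    return x_new, v_new
```

On an empty road the function returns roughly the maximum acceleration at standstill and roughly zero at the desired speed, which is the expected qualitative behaviour of the model.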
The Motor Vehicle Emission Simulator (MOVES) is the emission model employed to estimate the second by second GHG (in ) of every vehicle . The second-by-second emissions produced by vehicles are estimated based on the vehicle operating mode, which depends on the vehicle specific power (VSP) .
4.2 GHG costing approaches
For a robust eco-routing strategy, five different GHG costing approaches have been assessed, as illustrated in Table 1. The first approach (Equation 1) takes the GHG cost as the sum of the GHG emissions of the vehicles on the studied link during an interval, using a binary variable that equals 1 if a vehicle is on the studied link at a given time and 0 otherwise. As the links in the network have different numbers of lanes, the second approach (Equation 2) normalizes the total GHG cost by the number of lanes. The third approach (Equation 3) considers a higher temporal resolution: it is the weighted average of the GHG produced on the link. At every second of an interval (1-60 seconds), the GHG produced by the vehicles is multiplied by a weight equal to the index of that second. The GHG emitted at second 1 is therefore multiplied by a weight of 1, while the GHG emitted at second 60 is multiplied by a weight of 60, so the most recent seconds of an interval carry a higher weight in the link cost. The sum is then divided by the sum of the weights. Because this approach takes the GHG cost at every second, it has a higher temporal resolution than the first two approaches. The fourth approach (Equation 4) follows the same logic as the third, but divides by the number of lanes of the link to normalize. Finally, the fifth approach is the marginal cost of one vehicle traversing the studied link. The marginal cost, as shown in Equation 5, depends on an estimated emission rate (ER) and the TT of the interval on the link. TT is obtained following Equation 6 from the link length and the link speed of the time interval.
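The five costing approaches can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the input `per_second_ghg` is assumed to already hold the GHG (in grams) summed over the vehicles on the link for each second of the interval, and the marginal cost is assumed to be the product of the emission rate and the travel time.

```python
def total_ghg(per_second_ghg):
    """Approach 1: total GHG on the link over the interval (g)."""
    return sum(per_second_ghg)


def total_ghg_per_lane(per_second_ghg, lanes):
    """Approach 2: total GHG normalized by the number of lanes (g/lane)."""
    return total_ghg(per_second_ghg) / lanes


def weighted_avg_ghg(per_second_ghg):
    """Approach 3: weighted average, weight = index of the second (1..60)."""
    weights = range(1, len(per_second_ghg) + 1)
    return sum(w * g for w, g in zip(weights, per_second_ghg)) / sum(weights)


def weighted_avg_ghg_per_lane(per_second_ghg, lanes):
    """Approach 4: weighted average normalized by the number of lanes."""
    return weighted_avg_ghg(per_second_ghg) / lanes


def link_travel_time(length_m, speed_kmh):
    """Link travel time in seconds (length / speed); speed must be > 0."""
    return length_m / (speed_kmh / 3.6)


def marginal_ghg(er_g_per_s, length_m, speed_kmh):
    """Approach 5: marginal cost of one vehicle, assumed here to be ER x TT (g)."""
    return er_g_per_s * link_travel_time(length_m, speed_kmh)
```

Because the weights grow with the second index, the same amount of GHG emitted late in the interval yields a higher weighted-average cost than if it were emitted early, which is the recency effect described in the text.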
4.3 GHG emission rate and speed predictive models
Two separate LSTM networks have been trained to predict the variables, TT and GHG, required for the anticipatory multi-objective routing. LSTM has been chosen as it outperforms the statistical models for time series data, such as ARIMA. It also overcomes the shortcoming of standard neural networks, such as the vanishing gradient problem.
With reference to the LSTM architecture and hyper-parameters, it has been found that the selection of predictors, the number of sequences, and the set of hyper-parameters [40, 20] have an explicit impact on the prediction performance. In addition, increasing the depth of the NNs may also introduce further enhancements [37, 19]. The efficiency of the hyper-parameter tuning process is also important. In this study, a comprehensive correlation analysis has been conducted for each of the predicted responses, GHG ER and speed, as discussed in Section 5.1.2. For each of the predicted variables, the most representative predictors and the number of previous time steps (sequences) used in the model are first defined based on the correlation analysis. Then the hyper-parameters are tuned in two stages. The manual tuning stage narrows the search range of the hyper-parameters concerned, for a more efficient systematic tuning stage based on Bayesian optimization. Further details related to the predictive LSTM network can be found in the referenced work. To compare the trained LSTM networks, four indicators are utilized: 1) the correlation coefficient between observed and predicted GHG ERs (in g/sec), 2) the fit to the ideal 45° line, 3) the root mean square error (RMSE), and 4) R².
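The comparison indicators can be computed as in the following minimal sketch. The function name is hypothetical, the slope of the regression of predicted on observed values is used here as one possible measure of the fit to the 45° line (an assumption), and R² is computed as one minus the ratio of the residual to the total sum of squares.

```python
import math


def prediction_scores(observed, predicted):
    """Return correlation coefficient, slope of predicted vs. observed, RMSE, and R^2."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_o) ** 2 for o in observed)
    s_pp = sum((p - mean_p) ** 2 for p in predicted)
    s_op = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    return {
        "r": s_op / math.sqrt(ss_tot * s_pp),  # Pearson correlation coefficient
        "slope": s_op / ss_tot,                # 1.0 means parallel to the 45-degree line
        "rmse": math.sqrt(ss_res / n),         # same unit as the predicted variable
        "r2": 1.0 - ss_res / ss_tot,           # coefficient of determination
    }
```

A constant offset between observed and predicted values leaves the correlation coefficient at 1 while degrading the RMSE and R², which is why the indicators are reported together.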
4.3.1 Data collection
This step is essential for the development of the predictive models of GHG ER and speed. The quality of the data collected contributes to how reflective the predictive models are. The data points are extracted from an agent-based traffic model developed in . The demand is synthesized based on actual data from the Transportation Tomorrow Survey (TTS). To train the LSTM networks, 80% of the data is employed, while 20% is used for testing. The training and testing sets are 48,652 and 12,159 data points, respectively for the LSTM predictive models. The high level of heterogeneity of the traffic and environmental variables contributes to more generic predictive models. A wide variety of traffic and environmental conditions are captured and used for training the predictive models. To produce different traffic conditions, different demand levels and different departure time distributions are adopted.
The number of vehicles varies from 2,437 to 6,988, representing 0.7 to 2 times the actual demand in the year 2014. The departure time distributions employed to generate the data are exponential, uniform, and normal. Figure 7 illustrates the statistical analysis of the three fundamental traffic variables (speed, flow, and density) in addition to the GHG ER in the data set employed for training the LSTM network. Figure 7(a) shows that the mode and average of speed are 40 km/h and 56.16 km/h, respectively, and that speed ranges from 0 to 80 km/h. Density (veh/km.lane) in Figure 7(b) and flow (veh/h) in Figure 7(c) represent different traffic conditions due to the wide range of their values. Finally, the GHG ER ranges from less than 1 g/sec to more than 5 g/sec, as in Figure 7(d).
4.3.2 Distributed routing framework
A dynamic distributed routing framework, End-to-End Connected Autonomous Vehicles (E2ECAV) , is used for testing the proposed routing strategies. E2ECAV is based on a network of Intelligent Intersections (I2s) that can dynamically route connected and automated vehicles (CAVs). Two types of communication are employed, the vehicle to infrastructure (V2I) and infrastructure to infrastructure (I2I). Via the local communication between the agents, the I2s develop a coherent view of the network. For further description of E2ECAV and various applications we refer the readers to [12, 16, 45, 11, 1].
4.4 Routing strategies
Three strategies are examined for both myopic and anticipatory routing, as illustrated in Table 2. Equation 7 presents the general formula followed based on the routing objective: TT, GHG, or TT&GHG. In Equation 7, the cost of a path is accumulated over the links of the path from the travel time on each link and the emissions on each link (GHG in our case), each multiplied by its associated weight. Travel time and emissions are of different units. When multi-objective routing is applied, converting them to a consistent unit (e.g. monetary value) is the solution for a realistic application, and the weights in Equation 7 are used for this normalization. Every routing strategy is run for five replications with different seeds to account for stochasticity. When the routing strategy is myopic, the current time step values of the variables concerned are adopted. When the routing strategy is anticipatory, the values predicted by the developed predictive models are employed. For instance, under the myopic TT strategy, TT is obtained from the current time step value of speed following Equation 6, whereas under the anticipatory TT&GHG strategy, the TT and GHG predicted by the best trained LSTM networks are the routing objectives. The TT cost of links at every minute follows Equation 6. The link GHG cost in this study is chosen based on the analysis of the different GHG costing approaches in Equations 1 to 5.
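The effect of the weights on the chosen path can be sketched with a small example. This is a minimal illustration under stated assumptions: the path cost is assumed to be the sum over links of weighted travel time plus weighted GHG, the route is found with a plain centralized Dijkstra search rather than the distributed E2ECAV mechanism, and the graph, weights, and names are hypothetical. Myopic and anticipatory routing would differ only in whether the link values passed in are current or predicted.

```python
import heapq


def link_cost(tt, ghg, w_tt, w_ghg):
    """Weighted link cost; the weights bring TT and GHG to a common unit."""
    return w_tt * tt + w_ghg * ghg


def best_path(graph, origin, destination, w_tt, w_ghg):
    """Dijkstra search on graph = {node: [(next_node, tt, ghg), ...]}.

    Returns (path_cost, list_of_nodes), or (inf, []) if unreachable.
    """
    queue = [(0.0, origin, [origin])]
    settled = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == destination:
            return cost, path
        if node in settled:
            continue
        settled.add(node)
        for nxt, tt, ghg in graph.get(node, []):
            if nxt not in settled:
                new_cost = cost + link_cost(tt, ghg, w_tt, w_ghg)
                heapq.heappush(queue, (new_cost, nxt, path + [nxt]))
    return float("inf"), []


# Hypothetical network: a fast, high-emission route (A-B-D) and a
# slower, low-emission route (A-C-D). TT in minutes, GHG in grams.
example_graph = {
    "A": [("B", 1.0, 100.0), ("C", 1.5, 40.0)],
    "B": [("D", 1.0, 100.0)],
    "C": [("D", 1.5, 40.0)],
}
```

With the travel time weight alone the search selects A-B-D, with the GHG weight alone it selects A-C-D, and with a travel time weight of 1 and a GHG weight of 0.01 the lower-emission route is still preferred in this example.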
5 Results and discussion
Here we compare the results of the anticipatory routing strategies with those of the myopic strategies. To achieve this, predictive models of the related variables, i.e. GHG cost and speed, are required. The LSTM-based predictive model for the GHG cost variable was developed in earlier work, but its major findings are shared in the related sections below. The speed predictive model is developed in this work in Section 5.1. The comparison between the routing strategies is conducted in Sections 5.2 to 5.4.
5.1 Development of the predictive models
Before developing the predictive models, the most representative GHG costing approach is investigated in Section 5.1.1. A comprehensive correlation analysis is applied, for the GHG ER and speed, in Section 5.1.2 followed by the predictive models developed in Section 5.1.3.
5.1.1 GHG costing approaches
This analysis is applied to the GHG routing objective under myopic routing, which is the base case, to illustrate which GHG costing approach is the most suitable for our application. Figure (a) shows that normalizing by the number of lanes (Equations 2 and 4) is associated with a slight enhancement compared to the corresponding non-normalized approaches (Equations 1 and 3). This is due to the different numbers of lanes of the links in the network, which makes the total GHG not reflective of the actual conditions. Under the non-normalized approaches, if the GHG costs of a two-lane link and a four-lane link are 70 and 100 grams, respectively, the two-lane link would be prioritized. However, dividing by the number of lanes (35 versus 25 grams per lane) shows that the four-lane link should be prioritized under the per-lane approaches. A reduction in the average TT of around 3% is observed for each per-lane approach compared to its non-normalized counterpart. Using the total GHG on links (Equation 1) triggers the highest average TT, average VKT, total GHG, and total NOx of 16 minutes, 2 km, 2,518 kg, and 0.716 kg, respectively. The explanation is that this approach does not consider any of the traffic characteristics on the link, such as speed, density, or flow. A total GHG emission of 100 grams can correspond to two dramatically different sets of traffic characteristics: a very congested short link of low capacity, or an uncongested long link of high capacity.
The weighted-average costing illustrates a significant reduction in the average TT, average VKT, total GHG, and total NOx of 33%, 21%, 32%, and 25%, respectively, when compared to the total GHG costing. The main justification is the higher temporal resolution adopted. Furthermore, giving higher weights to the most recent seconds contributes to the improvements in terms of average TT, average VKT, total GHG, and total NOx: the closer to the prediction update, the more weight is given to the GHG produced by the vehicles. The marginal cost of one vehicle, based on the GHG ER and TT on the studied link, is very much comparable to the weighted-average costing in terms of performance, despite the lower temporal resolution of the marginal cost (1 minute) when compared to the weighted average (1 second). The marginal cost is the most representative and suitable costing approach, as it is based on the reflective ER and speed on the studied link, as illustrated in Equation 5. Due to the quasi-convex relationship between speed and GHG ER, too low or too high a speed will trigger higher emission rates. Links with too high a speed are associated with high GHG ERs, but less travel time. Links with too low a speed contribute adversely not only to the GHG ER, but also to the travel time on the links. It is expected that the GHG marginal cost will search for the optimal combination of speed, VKT, and GHG ER to satisfy the optimization in Equation 7. The marginal cost is the GHG cost used for the myopic and anticipatory routing strategies in this study.
5.1.2 Correlation analysis of the predicted variables
This analysis is an essential step for reliably selecting the predictors and the number of sequences of the LSTM models developed. For the GHG ER correlation analysis, details can be found in the referenced work. A comprehensive list of variables has been assessed to develop reliable predictive models. Not only the link characteristics (speed, density, flow, and delay, i.e. the difference between the free flow travel time and the actual travel time), but also the in-links characteristics (speed, density, and flow) are included in the correlation analysis. The in-links characteristics are included to enhance the spatial dimension: the traffic state on the upstream links at a given time indicates how the traffic condition will be on the downstream links at a later time. Five time steps (minutes) are considered for this analysis, as modifications in the traffic state are detected within this period. The maximum link length in the network is around 450 meters. The speed range is from 0 to 80 km/h. Under the free flow traffic condition, the maximum travel time required to traverse a link is around 0.8 minute. The GHG ER at the sixth minute is highly correlated with speed, GHG ER, density, and in-links speed over the last five minutes, followed by the rest of the variables. The GHG emission estimation depends primarily on speed, which justifies the strong correlation. Speed and density have a more or less monotonically decreasing relationship, which explains the strong correlation between the GHG ER and density. In terms of the correlation analysis for speed, Figure 11 shows the linear correlation between speed at the sixth minute and both the traffic and environmental indicators of the previous five minutes. As in the case of the GHG ER correlation analysis, not only the indicators on the studied links are considered, but also the in-links characteristics, to better reflect the spatial dimension.
Figure 11 shows an increase in the correlation factor of all the variables and minutes, except for delay. The four variables most highly correlated with speed at the sixth minute are speed, density, flow per lane, and in-links speed. The correlation coefficients of 0.70 with density and 0.36 with flow follow from the relationships between speed, density, and flow shown in the transportation fundamental diagrams. Speed and density are associated with a monotonically decreasing relationship.
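The lagged correlation analysis described above can be sketched as follows. This is a minimal illustration assuming per-minute series for a single link; the function names are hypothetical, and the Pearson coefficient is computed between the target at a given minute and a predictor observed one to five minutes earlier.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


def lagged_correlations(target, predictor, max_lag=5):
    """Correlate target[t] with predictor[t - lag] for lag = 1..max_lag."""
    result = {}
    for lag in range(1, max_lag + 1):
        # predictor[:-lag] lines up element i with target element i + lag.
        result[lag] = pearson(predictor[:-lag], target[lag:])
    return result
```

Applied to each candidate predictor (link and in-link speed, density, flow, delay, GHG ER), the resulting coefficients per lag can be ranked to select the predictors and the number of sequences.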
5.1.3 GHG and speed predictive models
For each of the predicted variables, GHG ER and speed, a comprehensive list of predictors and numbers of sequences has been examined. The best predictive LSTM networks for both the GHG ER and speed have two hidden layers, with systematically tuned hyper-parameters. In terms of the predictors, for GHG ER forecasting the best set consists of three sequences of speed, GHG ER, density, and in-links speed. For speed prediction, the best setting also uses three sequences, of speed, density, and in-links speed. In terms of the hyper-parameters, two solvers are considered: the adaptive moment estimation (Adam) method and stochastic gradient descent with momentum (sgdm).
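To make the notion of "three sequences" of predictors concrete, the following sketch builds input windows of three consecutive time steps and pairs each window with the value of the predicted variable at the next time step. The record layout, the key names (`speed`, `ghg_er`, `density`, `inlink_speed`), and the numbers are hypothetical; the sketch only illustrates the windowing idea, not the study's data pipeline.

```python
def make_sequences(records, feature_keys, target_key, seq_len=3):
    """Build (input window, next-step target) pairs from per-minute link records.

    Each input window holds `seq_len` consecutive time steps, and each time
    step holds the values of `feature_keys` in the given order.
    """
    inputs, targets = [], []
    for t in range(seq_len, len(records)):
        window = [
            [records[k][f] for f in feature_keys]
            for k in range(t - seq_len, t)
        ]
        inputs.append(window)
        targets.append(records[t][target_key])
    return inputs, targets

# Hypothetical per-minute observations for one link (illustrative values only).
records = [
    {"speed": 52.0, "ghg_er": 1.8, "density": 14.0, "inlink_speed": 55.0},
    {"speed": 47.0, "ghg_er": 2.1, "density": 18.0, "inlink_speed": 50.0},
    {"speed": 40.0, "ghg_er": 2.6, "density": 24.0, "inlink_speed": 44.0},
    {"speed": 35.0, "ghg_er": 2.9, "density": 29.0, "inlink_speed": 39.0},
    {"speed": 33.0, "ghg_er": 3.0, "density": 31.0, "inlink_speed": 36.0},
]

# Predictor set reported for GHG ER forecasting; the target is the next-step GHG ER.
ghg_features = ["speed", "ghg_er", "density", "inlink_speed"]
X, y = make_sequences(records, ghg_features, target_key="ghg_er", seq_len=3)
print(len(X), "samples, each with", len(X[0]), "time steps of", len(X[0][0]), "features")
```

For speed prediction, the same helper would be called with the three reported predictors (speed, density, and in-links speed) and `target_key="speed"`.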
Several hyper-parameters are tuned: the initial learning rate, max epochs, learning rate drop factor, momentum, learning rate drop period, the number of hidden units of the first LSTM (hidden) layer, and the number of hidden units of the second LSTM layer when used. The first stage of tuning is manual and serves to narrow the search range of the optimal hyper-parameters. The next stage applies Bayesian optimization [50, 4] systematically over the narrowed search range obtained from the manual tuning stage. The training results of the best LSTM predictive networks of the GHG ER and speed are shown in Figure 14.
For the prediction performance, four indicators are used in this study: the correlation coefficient between observed and predicted GHG ERs (in g/sec), the fit to the ideal straight line, which reflects precision, the R statistic, and the RMSE, which reflects accuracy . The correlation coefficients of the GHG ER and speed predictions are 0.77 and 0.92, respectively. The RMSEs of the GHG ER and speed predictive models are 0.36 gram and 5.95 km/h, respectively. The speed predictive model performs noticeably better than the GHG ER model. This is due to the complicated relationship between the GHG ER and the predictors, while speed has more straightforward relationships with the predictors used. For instance, speed and density have a monotonically decreasing relationship , while the GHG ER has a quasi-convex relationship with the most important predictor (speed) .
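As a worked illustration of the indicators above, the sketch below computes the correlation coefficient, the RMSE, and the coefficient of determination between observed and predicted values. Treating the R statistic as the coefficient of determination is an assumption, and the sample values are hypothetical rather than results from this study.

```python
import math

def pearson_r(observed, predicted):
    """Linear correlation coefficient between observed and predicted values."""
    n = len(observed)
    mo, mp = sum(observed) / n, sum(predicted) / n
    cov = sum((o - mo) * (p - mp) for o, p in zip(observed, predicted))
    so = math.sqrt(sum((o - mo) ** 2 for o in observed))
    sp = math.sqrt(sum((p - mp) ** 2 for p in predicted))
    return cov / (so * sp)

def rmse(observed, predicted):
    """Root mean squared error, in the same unit as the predicted variable."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def r_squared(observed, predicted):
    """Coefficient of determination: 1 - residual sum of squares / total sum of squares."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Hypothetical observed and predicted link speeds in km/h (illustrative only).
observed = [30.0, 42.0, 55.0, 61.0, 48.0]
predicted = [33.0, 40.0, 52.0, 57.0, 50.0]

print(f"r    = {pearson_r(observed, predicted):.3f}")
print(f"RMSE = {rmse(observed, predicted):.3f} km/h")
print(f"R^2  = {r_squared(observed, predicted):.3f}")
```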
It is noticed in both Figure (a) and (b) that true values higher than 4 g/sec and 60 km/h, respectively, are not predicted with a high level of accuracy. This stems from the fact that data points reflecting these conditions are much less frequent than those of the other conditions, as in Figure 7. The data points with a GHG ER greater than 4 g/sec, as in Figure (d), are only 0.007% of the total GHG ER data points. High GHG is also associated with either low or high speed, probably because of the quasi-convex relationship of GHG ER with speed . Similarly, the data points of speed higher than 60 km/h and less than 20 km/h represent only 11% of the total data points, while the data points of speed between 20 and 60 km/h represent 89%, as in Figure (a).
5.2 Routing strategies analysis
As illustrated in Table 2, six routing strategies are analyzed using the E2ECAV  routing framework. Single and multiple objectives are considered under both myopic and anticipatory routing. Mean TT, mean VKT, total GHG, and total NOx are the performance indicators taken into account, and the results are shown in Figure 17. NOx is the pollutant that reflects the impact on public health . It is therefore important to include NOx as a performance indicator when assessing the impact of the different routing strategies. It is important to note that logical constraints have been included for a realistic application while estimating the cost on links. When either the predicted GHG ER or the predicted speed for a time step is negative, the value is set to zero. A predicted speed that surpasses the link speed limit is set to the link speed limit. With regard to the results, Figure 17 illustrates that the anticipatory routing strategies outperform the myopic ones regardless of the routing objective. The justification of this outcome is threefold. Firstly, a reflective costing approach is used for the GHG emissions (when GHG is part of the optimization process), which optimizes not only the GHG ER but also, implicitly, the travel time, so that re-routing is avoided. The marginal cost prioritizes the links with speed close to the optimal value based on the quasi-convex relationship between the GHG ER and speed . Secondly, sophisticated predictive models developed from high resolution data points are adopted. Lastly, by taking into account the traffic conditions and their evolution, routing is proactive rather than simply reactive to the current conditions.
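The logical constraints on the predicted values mentioned above can be sketched as a simple clamping step applied before the link cost is estimated. The function and parameter names are hypothetical; only the two rules (negative predictions set to zero, predicted speed capped at the link speed limit) come from the text.

```python
def apply_logical_constraints(pred_ghg_er, pred_speed, speed_limit):
    """Clamp model predictions to physically plausible values.

    - A negative predicted GHG ER or speed is set to zero.
    - A predicted speed above the link speed limit is set to the limit.
    """
    ghg_er = max(0.0, pred_ghg_er)
    speed = min(max(0.0, pred_speed), speed_limit)
    return ghg_er, speed

# Hypothetical predictions for three links: (GHG ER in g/sec, speed in km/h, limit in km/h).
raw_predictions = [(-0.2, 95.0, 80.0), (1.5, -3.0, 60.0), (1.5, 40.0, 60.0)]
constrained = [apply_logical_constraints(g, s, limit) for g, s, limit in raw_predictions]
print(constrained)
```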
The performance trend for myopic routing strategies is similar to that of the anticipatory routing strategies. From the worst to the best, TT is followed by GHG and TT&GHG in terms of the four performance indicators. Whether routing is myopic or anticipatory, optimizing TT yields the worst performance indicators compared to when the routing objective is GHG or TT&GHG. The justification is that when TT is the objective, all that matters is the TT spent on the links, regardless of the VKT, GHG, and NOx indicators. The vehicles are distributed in the network to achieve the least TT, but this comes at the cost of longer distances travelled and more GHG and NOx produced. Nevertheless, GHG and GHG introduce a decrease in the total GHG produced of 11% and 6% compared to TT and TT, respectively. This improvement can be directly linked to the marginal costing approach as in Equation 5, which takes into account not only the GHG ER, but also the speed on links. In other words, when the routing objective is GHG, the chosen links are defined based on the best combination of the GHG ER and speed that minimizes the cost following Equation 7. The relationship between GHG and speed is quasi-convex , so that too low or too high a speed contributes to a higher GHG ER and eventually a higher GHG cost. The multi-objective TT&GHG outperforms both TT and GHG in terms of all performance indicators. A reduction in average TT, average VKT, total GHG, and total NOx of 17%, 13%, 16%, and 14%, respectively, is observed for the TT&GHG strategy compared to the TT strategy. The reduction in the performance indicators of TT&GHG is marginal compared to the GHG routing strategy. The TT&GHG routing objective controls the TT cost and does not allow it to neglect the GHG objective. Thus, TT is reduced as long as it still satisfies the objective of reducing GHG. Paths of longer distances are not chosen, as this would trigger an increase in both the GHG and NOx produced, as in the TT routing strategy.
Comparing the best anticipatory routing strategy, TT&GHG, to the myopic TT strategy, a reduction in average TT, average VKT, total GHG, and total NOx of 17%, 22%, 18%, and 20%, respectively, is noticed. With reference to the NOx variable, it has been found that the relationship between NOx and speed is quasi-convex . Moreover, previous studies have confirmed that at high speeds NOx is sensitive to aggressive driving . Figure (b) shows that when GHG is part of the routing objective (GHG or TT&GHG), less NOx is produced than when TT is the objective, regardless of whether the routing protocol is myopic or anticipatory. It can be concluded that the additional time and longer trips experienced by vehicles in the case of myopic and anticipatory routing contribute to the increase in the GHG and NOx emissions produced.
5.3 Path analysis
For this analysis, one vehicle is chosen randomly and its myopic and anticipatory paths with different routing objectives are investigated. Comparing the myopic routes in Figure 18 with the anticipatory routes in Figure 19 for each of the objectives shows that there is more re-routing in the former, contributing to longer trips and probably more time in the network, as illustrated in Section 5.2. The main explanation is that the cost of links is based on the current traffic conditions and does not consider the evolution of traffic in the future. This analysis supports the findings in Figure (a), which demonstrates the decrease in average TT and VKT when anticipatory routing is adopted. Whether the routing protocol is myopic or anticipatory, comparing the path lengths of the TT, GHG, and TT&GHG routing strategies illustrates that the path of the TT strategy is the longest. The main justification is that when TT is the objective, vehicles are distributed in the network utilizing uncongested links to achieve the least TT regardless of the distance travelled and the GHG and NOx produced. Figure 19 illustrates that instead of taking the west-east route of two links, the south-north route of five links is chosen. The vehicle traversed an additional distance of around 700 meters when taking the south-north route compared to the GHG and TT&GHG routing strategies. The speed on the south-north route is around 38 km/h, while the speed on the west-east route is around 1 km/h. This asserts that vehicles are distributed in the network to links of high speed to minimize the TT regardless of the distance travelled. When TT is the objective, the time spent on the links is optimized, while when GHG is part of the routing objective the links of optimal speed are prioritized as long as the GHG marginal cost is minimized. The length of routes of the GHG and TT&GHG routing strategies is comparable, as illustrated in Figure 19.
5.4 Network level analysis
To examine the effect at the network level of the myopic and anticipatory routing strategies while adopting different routing objectives, the average speed, GHG, and NOx produced over time are examined. The demand is loaded at 7:45am and the total demand is in the network at around 8:00am, which represents the peak of the congestion. Comparing Figure 20 to Figure 21 shows that the network is loaded and unloaded quicker in the case of anticipatory routing. Particularly, for myopic TT the vehicles spend 15% more time in the network compared to anticipatory TT. This finding is aligned with those for TT, VKT, GHG, and NOx in Figure (a) for myopic TT compared to anticipatory TT. Figures 18 and 19 demonstrate the re-routing in the case of myopic and anticipatory routing strategies, respectively. The additional re-routing noticed in the case of myopic TT, which triggers longer trip lengths compared to anticipatory TT, contributes to the additional time spent in the network as well. The throughput of TT is less compared to GHG and TT&GHG. It takes around 8% and 10.6% less time to load and unload the network for GHG and TT&GHG, respectively, compared to TT. The main explanation is that when travel time minimization is the objective, all link options are analyzed and the final cost of travel time does not take into account the VKT as long as the objective is minimized. However, when GHG is part of the optimization process, the links of optimal speed are prioritized and re-routing is averted. The GHG marginal cost as in Equation 5 makes sure that vehicles spend the least time and travel the least distance while the objective is minimized, which is observed in Figures 18 and 19. The average speed till 8:10am of the myopic routing strategies in Figure 20 is almost identical for the three routing strategies, while for the anticipatory routing GHG and TT&GHG are associated with a slightly higher speed than TT, as in Figure 21.
This asserts the importance and positive impact of anticipatory routing, which takes into account the future state of traffic conditions in the network. After 8:10am and till the end of the simulation, the increase in speed for GHG and TT&GHG compared to TT under anticipatory routing is higher than the corresponding increase under myopic routing. The reason is that the vehicles reach their destination faster in the case of anticipatory routing compared to myopic routing, which means fewer vehicles are in the network in the former case.
The main difference between Figures 20 and 21 is that the period of time over which vehicles produce GHG is longer in the former, as the anticipatory routing includes the future state of the traffic conditions and deals with the changes proactively compared to the myopic routing. In addition, the GHG costing approach takes into account not only the GHG ER, but also the speed and VKT implicitly. The additional time and VKT experienced by the vehicles in the case of TT, as shown in Figure (a), contribute to the higher levels of GHG compared to GHG and TT&GHG in Figure 20, especially after the congestion peak at 8:05am. The number of vehicles in the network is an essential factor to keep in mind. On the other hand, comparing the GHG over time of TT to GHG and TT&GHG illustrates less variation.
NOx over time follows the same trend as GHG over time and is associated with lower emissions over time when anticipatory routing is utilized, as in Figure 21 compared to Figure 20. For the NOx over time analysis, as shown in Figures 20 and 21, the high values till around 8:00am are due to the high speed of the uncongested network. After 8:00am the number of vehicles and the speed level control the NOx produced. At 8:00am, the complete demand is in the network and the speed is the variable with direct impact on the NOx produced. It is observed that for myopic TT in Figure 20 and anticipatory TT in Figure 21, NOx is higher than in the cases when GHG or TT&GHG is the routing objective. This is because vehicles were directed to longer paths of higher speeds to minimize the travel time. NOx is sensitive to aggressive driving , which makes higher speed links unfavourable. However, the reduction in the TT and VKT experienced in the network in the case of anticipatory routing means higher throughput over time. The higher throughput contributes to less NOx over time.
It can be concluded that using predicted link cost is associated with significant improvements at the network level. Furthermore, utilizing the GHG marginal cost, which takes into account not only the GHG ER, but also the speed and VKT, is very effective and outperforms the travel time cost in terms of all performance indicators.
6 Conclusion and potential directions
Current eco-routing studies are predominantly associated with limitations related to the aggregation level used in the traffic flow models, scale of the case study, centralized routing, and the number of objectives optimized simultaneously .  and  overcame these limitations and applied myopic multi-objective eco-routing strategies in a distributed routing framework with favourable outcomes. However, the technological advancements related to ICT and CAVs have not yet been exploited completely. Hence, this study suggested anticipatory multi-objective eco-routing strategies using a distributed routing system for connected and automated vehicles, i.e., E2ECAV . Predictive models of GHG ER and speed were developed and used. A deep learning based time series model, i.e., LSTM, was trained and systematically tuned. For sequential data, LSTM is known to be one of the most powerful recurrent NN architectures . Furthermore, the LSTM model employed here outperformed commonly used statistical time series models, e.g., ARIMA, and clustering .
The major findings of this study are as follows. Anticipatory routing strategies outperform the myopic ones due to the inclusion of future traffic conditions in the route calculations. The paths of myopic routing strategies demonstrate a high degree of re-routing, as the cost does not consider the future traffic and environmental conditions. Routing based on GHG as the objective is associated with a noticeable reduction in average TT, average VKT, total GHG, and total NOx compared to the case where TT is the objective. This stems from the costing approach for the marginal cost in Equation 5, which results in the best combination of speed and GHG ER on links that minimizes the GHG cost. Re-routing is minimized when GHG is part of the optimization process, as the increase in the VKT has a negative impact on the GHG produced. The GHG routing objective contributes to lower TT and VKT, which leads to less GHG and NOx in the network. For myopic and anticipatory routing, the TT&GHG routing strategy introduces a slight enhancement in terms of the four performance indicators compared to the GHG routing strategy. Comparison of the former to the latter would be like comparing the system optimal to the User Equilibrium (UE) . For the best anticipatory routing strategy, which optimizes not only TT but also GHG, a reduction in average TT, average VKT, total GHG, and total NOx of 17%, 21%, 18%, and 20%, respectively, is noticed compared with the myopic routing aiming at TT minimization.
For future work, utilizing real data points from sensors instead of simulated data would result in higher heterogeneity in the data and ensure robustness in the models. The constrained eco-routing concept is an important aspect to be tackled, as it illustrates the trade-offs compared to the regular eco-routing application. As a 100% MPR of CAVs is employed in this study, the impact of different MPRs should be taken into account. The most preferable MPR of CAVs could vary from one traffic condition to another, and this has to be defined. The predictive models utilized in this study can be further enhanced by using more data points to represent the conditions with a low frequency of occurrence. In addition, predictive models can be developed based on categorized characteristics of links, e.g., speed limit, number of lanes, etc., for further enhancements. With regard to the scalability aspect, it is suggested that anticipatory routing be applied in a larger network with both uninterrupted and interrupted traffic flow. Nevertheless, the predictive models should accommodate the difference between the two types of traffic flow. The employed distributed routing framework adopts only V2I and I2I communication. The impact of incorporating V2V communication is another suggestion for future work. Incorporating anticipatory routing strategies as options in personal navigation platforms would contribute to more efficient and sustainable transportation systems. Despite the strong contradiction between the NOx and speed, including the NOx as a routing objective is preferred.
-  L. Alfaseeh, S. Djavadian, and B. Farooq. Impact of distributed routing of intelligent vehicles on urban traffic. In 2018 IEEE International Smart Cities Conference (ISC2), pages 1–7. IEEE, 2018.
-  L. Alfaseeh, S. Djavadian, R. Tu, B. Farooq, and M. Hatzopoulou. Multi-objective eco-routing in a distributed routing framework. In 2019 IEEE International Smart Cities Conference (ISC2), pages 747–752. IEEE, 2019.
-  L. Alfaseeh and B. Farooq. Multifactor taxonomy of ecorouting models and future outlook. Journal of Sensors, 2020, 2020.
-  L. Alfaseeh, R. Tu, B. Farooq, and M. Hatzopoulou. Greenhouse gas emission prediction on road network using deep sequence learning. Under review, arXiv:2004.08286v1, 2020.
-  L. Amarpuri, N. Yadav, G. Kumar, and S. Agrawal. Prediction of co 2 emissions using deep learning hybrid approach: A case study in indian context. In 2019 Twelfth International Conference on Contemporary Computing (IC3), pages 1–6. IEEE, 2019.
-  B. Ameyaw and L. Yao. Analyzing the impact of gdp on co2 emissions and forecasting africa’s total co2 emissions with non-assumption driven bidirectional long short-term memory. Sustainability, 10(9):3110, 2018.
-  B. Ameyaw, L. Yao, A. Oppong, and J. K. Agyeman. Investigating, forecasting and proposing emission mitigation pathways for co2 emissions from fossil fuel combustion only: A case study of selected countries. Energy policy, 130:7–21, 2019.
-  M. Ben-Akiva, M. Bierlaire, D. Burton, H. N. Koutsopoulos, and R. Mishalani. Network state estimation and prediction for real-time traffic management. Networks and spatial economics, 1(3-4):293–318, 2001.
-  A. Bilali, G. Isaac, S. Amini, and N. Motamedidehkordi. Analyzing the impact of anticipatory vehicle routing on the network performance. Transportation Research Procedia, 41:494–506, 2019.
-  J. A. Bottom. Consistent anticipatory route guidance. PhD thesis, Massachusetts Institute of Technology, 2000.
-  S. Djavadian and B. Farooq. Distributed dynamic routing using network of intelligent intersections. In ITS Canada ACGM (2018), 2018.
-  S. Djavadian, R. Tu, B. Farooq, and M. Hatzopoulou. Multi-objective eco-routing for dynamic control of connected & automated vehicles. arXiv preprint arXiv:2005.00815, 2020.
-  Y. Duan, Y. Lv, and F.-Y. Wang. Travel time prediction with lstm neural network. In 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pages 1053–1058. IEEE, 2016.
-  M. Elhenawy, H. Chen, and H. A. Rakha. Dynamic travel time prediction using data clustering and genetic programming. Transportation Research Part C: Emerging Technologies, 42:82–98, 2014.
-  U. EPA. Sources of greenhouse gas emissions. Retrieved December, 2020.
-  B. Farooq and S. Djavadian. Distributed traffic management system with dynamic end-to-end routing, u.s. provisional patent service no. 62/865,725, 2019.
-  U. T. T. Force. The high cost of congestion in canadian cities. Council of Ministers Transportation and Highway Safety, 2012.
-  Y. Gu, W. Lu, L. Qin, M. Li, and Z. Shao. Short-term prediction of lane-level traffic speeds: A fusion deep learning model. Transportation research part C: emerging technologies, 106:1–16, 2019.
-  M. Hermans and B. Schrauwen. Training and analysing deep recurrent neural networks. In Advances in neural information processing systems, pages 190–198, 2013.
-  F. Hutter, J. Lücke, and L. Schmidt-Thieme. Beyond manual tuning of hyperparameters. KI-Künstliche Intelligenz, 29(4):329–337, 2015.
-  S. Innamaa. Short-term prediction of traffic situation using mlp-neural networks. In Proceedings of the 7th world congress on intelligent transport systems, Turin, Italy, pages 6–9, 2000.
-  S. Ishak and C. Alecsandru. Optimizing traffic prediction performance of neural networks under various topological, input, and traffic condition settings. Journal of Transportation Engineering, 130(4):452–465, 2004.
-  D. E. Kaufman, R. L. Smith, and K. E. Wunderlich. An iterative routing/assignment method for anticipatory real-time route guidance. In Vehicle Navigation and Information Systems Conference, 1991, volume 2, pages 693–700. IEEE, 1991.
-  K. Kim, M. Kwon, J. Park, and Y. Eun. Dynamic vehicular route guidance using traffic prediction information. Mobile Information Systems, 2016, 2016.
-  D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
-  J. Lee and B. Park. Evaluation of route guidance strategies based on vehicle-infrastructure integration under incident conditions. Transportation research record, 2086(1):107–114, 2008.
-  Z. Liang and Y. Wakahara. Real-time urban traffic amount prediction models for dynamic route guidance systems. EURASIP Journal on Wireless Communications and Networking, 2014(1):85, 2014.
-  Z. C. Lipton, J. Berkowitz, and C. Elkan. A critical review of recurrent neural networks for sequence learning. arXiv preprint arXiv:1506.00019, 2015.
-  S. Liu and Q. Qu. Dynamic collective routing using crowdsourcing data. Transportation Research Part B: Methodological, 93:450–469, 2016.
-  L. Luo, Y.-E. Ge, F. Zhang, and X. J. Ban. Real-time route diversion control in a model predictive control framework with multiple objectives: Traffic efficiency, emission reduction and fuel economy. Transportation Research Part D: Transport and Environment, 48:332–356, 2016.
-  X. Ma, Z. Tao, Y. Wang, H. Yu, and Y. Wang. Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transportation Research Part C: Emerging Technologies, 54:187–197, 2015.
-  H. S. Mahmassani. Development and testing of dynamic traffic assignment and simulation procedures for atis/atms applications. 1994.
-  A. S. Mane and S. S. Pulugurtha. Link-level travel time prediction using artificial neural network models. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC), pages 1487–1492. IEEE, 2018.
-  J. Pan, I. S. Popa, K. Zeitouni, and C. Borcea. Proactive vehicular traffic rerouting for lower travel time. IEEE Transactions on vehicular technology, 62(8):3551–3568, 2013.
-  H.-T. Pao and C.-M. Tsai. Modeling and forecasting the co2 emissions, energy consumption, and economic growth in brazil. Energy, 36(5):2450–2458, 2011.
-  C. S. Papacostas and P. D. Prevedouros. Transportation engineering and planning. 1993.
-  R. Pascanu, C. Gulcehre, K. Cho, and Y. Bengio. How to construct deep recurrent neural networks. arXiv preprint arXiv:1312.6026, 2013.
-  A. Rahman and M. M. Hasan. Modeling and forecasting of carbon dioxide emissions in bangladesh using autoregressive integrated moving average (arima) models. Open Journal of Statistics, 7(4):560–566, 2017.
-  X. Ran, Z. Shan, Y. Fang, and C. Lin. An lstm-based method with attention mechanism for travel time prediction. Sensors, 19(4):861, 2019.
-  N. Reimers and I. Gurevych. Optimal hyperparameters for deep lstm-networks for sequence labeling tasks. arXiv preprint arXiv:1707.06799, 2017.
-  C. Robert. Machine learning, a probabilistic perspective, 2014.
-  J. Snoek, H. Larochelle, and R. P. Adams. Practical bayesian optimization of machine learning algorithms. In Advances in neural information processing systems, pages 2951–2959, 2012.
-  Toronto. Vitalsigns. 2020.
-  M. Treiber, A. Hennecke, and D. Helbing. Congested traffic states in empirical observations and microscopic simulations. Physical Review E - Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 62(2):1805–1824, 2000.
-  R. Tu, L. Alfaseeh, S. Djavadian, B. Farooq, and M. Hatzopoulou. Quantifying the impacts of dynamic control in connected and automated vehicles on greenhouse gas emissions and urban no2 concentrations. Transportation Research Part D: Transport and Environment, 73:142–151, 2019.
-  C. Tudor. Predicting the evolution of co2 emissions in bahrain with automated forecasting methods. Sustainability, 8(9):923, 2016.
-  G. H. Tzeng and C.-H. Chen. Multiobjective decision making for traffic assignment. IEEE Transactions on Engineering Management, 40(2):180–187, 1993.
-  USEPA. Exhaust emission rates for light-duty on-road vehicles in moves2014: Final report, 2015.
-  E. I. Vlahogianni, M. G. Karlaftis, and J. C. Golias. Short-term traffic forecasting: Where we are and where we’re going. Transportation Research Part C: Emerging Technologies, 43:3–19, 2014.
-  J. Wu, X.-Y. Chen, H. Zhang, L.-D. Xiong, H. Lei, and S.-H. Deng. Hyperparameter optimization for machine learning models based on bayesian optimization. Journal of Electronic Science and Technology, 17(1):26–40, 2019.
-  X. Yang and W. W. Recker. Modeling dynamic vehicle navigation in a self-organizing, peer-to-peer, distributed traffic information system. Journal of intelligent transportation Systems, 10(4):185–204, 2006.
-  B. Yao, C. Chen, Q. Cao, L. Jin, M. Zhang, H. Zhu, and B. Yu. Short-term traffic speed prediction for an urban corridor. Computer-Aided Civil and Infrastructure Engineering, 32(2):154–169, 2017.
-  M. Yildirimoglu and N. Geroliminis. Experienced travel time prediction for congested freeways. Transportation Research Part B: Methodological, 53:45–63, 2013.
-  S. K. Zegeye, B. De Schutter, H. Hellendoorn, and E. Breunesse. Model-based traffic control for balanced reduction of fuel consumption, emissions, and travel time. In Proceedings of the 12th IFAC Symposium on Transportation Systems (2009), pages 149–154, 2009.
-  G. P. Zhang. Time series forecasting using a hybrid arima and neural network model. Neurocomputing, 50:159–175, 2003.
-  T. Zhang, J. Jin, H. Yang, H. Guo, and X. Ma. Link speed prediction for signalized urban traffic network using a hybrid deep learning approach. In 2019 IEEE Intelligent Transportation Systems Conference (ITSC), pages 2195–2200. IEEE, 2019.
-  X. Zhang and J. A. Rice. Short-term travel time prediction. Transportation Research Part C: Emerging Technologies, 11(3-4):187–210, 2003.
-  J. Zhao, J. Zhang, S. Jia, Q. Li, and Y. Zhu. A mapreduce framework for on-road mobile fossil fuel combustion co 2 emission estimation. In 2011 19th International Conference on Geoinformatics, pages 1–4. IEEE, 2011.