Log In Sign Up

Competitors-Aware Stochastic Lap Strategy Optimisation for Race Hybrid Vehicles

World Endurance Championship (WEC) racing events are characterised by a relevant performance gap among competitors. The fastest vehicles category, consisting in hybrid vehicles, has to respect energy usage constraints set by the technical regulation. Considering absence of competitors, i.e. traffic conditions, the optimal energy usage strategy for lap time minimisation is typically computed through a constrained optimisation problem. To the best of our knowledge, the majority of state-of-the-art works neglects competitors. This leads to a mismatch with the real world, where traffic generates considerable time losses. To bridge this gap, we propose a new framework to offline compute optimal strategies for the powertrain energy management considering competitors. Through analysis of the available data from previous events, statistics on the sector times and overtaking probabilities are extracted to encode the competitors' behaviour. Adopting a multi-agent model, the statistics are then used to generate realistic Monte Carlo (MC) simulation of their position along the track. The simulator is then adopted to identify the optimal strategy as follows. We develop a longitudinal vehicle model for the ego-vehicle and implement an optimisation problem for lap time minimisation in absence of traffic, based on Genetic Algorithms. Solving the optimisation problem for a variety of constraints generates a set of candidate optimal strategies. Stochastic Dynamic Programming is finally implemented to choose the best strategy considering competitors, whose motion is generated by the MC simulator. Our approach, validated on data from a real stint of race, allows to significantly reduce the lap time.


Design optimisation of piezoelectric energy harvesters for bridge infrastructure

Vibrational energy harvested from the bridge excitation due to the traff...

A Case Study of Vehicle Route Optimization

In the last decades, the classical Vehicle Routing Problem (VRP), i.e., ...

Multi-Agent Chance-Constrained Stochastic Shortest Path with Application to Risk-Aware Intelligent Intersection

In transportation networks, where traffic lights have traditionally been...

Evolving Pacing Strategies for Team Pursuit Track Cycling

Team pursuit track cycling is a bicycle racing sport held on velodromes ...

I Introduction

In motorsport events, strategic decisions may have a relevant impact on the result of the competition. Particularly, with reference to World Endurance Championship (WEC) [6], there are four vehicle categories characterised by very different performance. This results in the formation of traffic congestion, obliging the fastest category, i.e. Le Mans Prototype 1 (LMP1), to overlap other vehicles many times during the six hours of an event. LMP1 consists in hybrid vehicles that have to respect constraints set by the technical regulation [10] for the usage of both the electrical and thermal energy. Race engineers define how to use the energy budget in an optimal way along the track with the aim to minimise the lap time.

In the last years, many state-of-the-art works have addressed the problem of lap strategy optimisation for race electric [2, 25, 5, 16, 7, 8] and hybrid [11, 22, 24] vehicles. In the majority of the cases, the formulations include very detailed and complex models of the powertrain, but do not include the presence of competitors in the driving scenario, which instead represents a fundamental aspect for what said above. Competitors are therefore neglected due to the difficulties in effectively simulating their behaviour, as well as including this information into the optimisation problem, which may become burdensome and not real-time solvable. However, this may result in overtakings being performed in points along the track where a significant deviation from the pre-computed optimal trajectory is necessary, thus determining relevant time losses.

Statement of contributions. To bridge this gap, we propose a computationally efficient procedure to offline identify the best strategy for the energy budget utilisation along the track, taking into account realistic traffic conditions. By best strategy we refer to the one that statistically minimises the lap time while respecting the technical regulation.

Our contribution is threefold.

Statistical modelling of the competitors’ behaviour

we perform a statistical analysis to practically describe the behaviour of the competitors along the track in WEC events, in terms of travel time and their mutual interactions, i.e. overtaking probability. The statistical analysis is necessary since we do not know the behavioural model of the competitors. Statistical analyses of previous events have been carried out to extract the free sector times and overtaking probability distributions. The latter refers to the probability of occurrence of overtakings between two vehicle categories along the track. This information is later employed to generate realistic simulations of the competitors’ behaviour.

Computation of traffic-free optimal strategies

theoretically, it would be necessary to define the best points of application of the electric motor in real time according to the actual traffic conditions, but this task is computationally too expensive to guarantee sufficiently fast cycle times. Therefore, the following solution is proposed.

We generate a set of possible traffic-free optimal solutions, which aim to minimise the lap time while respecting the constraints imposed by the technical regulation. The set of varied solution is obtained by imposing different extra constraints on the distribution of the powertrain energy budget along the track, according to engineering expertise. The optimisations are solved using Genetic Algorithms, for their computational quickness and ease of tuning. This choice is supported by a comparison with a more classical Mixed Integer Quadratically Constrained Program (MIQCP). In both cases, the ego-vehicle is described through a longitudinal model. The set of traffic-free candidate optimal strategies will be finally tested in real traffic conditions to determine the most suitable one, as explained in the next point.

Optimal strategy in presence of traffic

to identify the optimal strategy in presence of traffic, we propose an evaluation metrics based on Stochastic Dynamic Programming (SDP)

[20, 21]. The dataset used by SDP is generated through the Monte Carlo (MC) [1] simulations, relying on a multi-agent influence/reaction model and on the previously computed statistics of the competitors.

Sections organisation. The remainder of this paper is organised as follows. Sec. II presents state-of-the-art related work. In Sec. III, we perform the statistical analysis of the past WEC events, to compute the free sector times and overtaking probability distributions. Sec. IV details the longitudinal vehicle model that is used to solve the powertrain energy budget optimisation problem. The latter, considering traffic-free conditions, is then formulated and solved in Sec. V, comparing a Mixed Integer Quadratically Constrained Program (MIQCP) formulation and Genetic Algorithms. In Sec. VI, we show how to generate Monte Carlo numerical simulations of the competitors’ positions along the track, employing the free sector times and the overtaking probabilities distributions, as well as a multi-agent model. Finally, in Sec. VII, the previously computed traffic-free strategies are combined with the traffic-aware Monte Carlo simulations, and Stochastic Dynamic Programming is employed to evaluate the statistically best strategy in presence of traffic. Finally, conclusions are discussed in Sec. VIII.

Ii Related work

Simulating the competitors’ motion in racing events is an active field of research. A realistic simulator for circuit motorsports is proposed in [15]. It includes the effects of tire degradation, fuel mass loss, pit stops and overtaking. However, it employs a lap-wise discretisation, which is incompatible with the goal of our work. In fact, we aim to generate an optimal lap strategy that is section-wise. A discrete-event simulation model is developed in [4], which is suitable for decision making to define the race strategy. The track is divided into sections. For each of them, the vehicle and environment characteristics, such as the fuel mass and the air resistance penalty, are taken into account. However, the computation of the lap time is performed using a deterministic approach, without considering the uncertainty related to the interactions between the vehicles, which is our target.

Many state-of-the-art works deal with energy management for electric and hybrid vehicles under a lower-level perspective compared to our work. Time-optimal energy management and gear shift for hybrid race cars is investigated in [11]. Given fuel and battery consumption targets, they implement a computationally efficient algorithm to solve the problem, mixing convex optimisation, Dynamic Programming and the Pontryagin’s minimum principle. In [23], a similar problem is solved for real-time control of the Formula 1 power unit using a two-level Model Predictive Control scheme. Minimum-lap-time optimisation for all-wheel drive electric race cars is presented in [8]. An optimal adaptive race strategy for Formula-E cars is presented in [2]. It is based on an adaptive equivalent consumption minimisation strategy (A-ECMS) approach, and compared with a global optimal benchmark provided by Dynamic Programming. In [25]

, they introduce a lap strategy optimisation method based on a Big Bang - Big Crunch approach for Solar cars in long-distance races. Finally, for Solar cars, heuristic methods are compared in

[5]. One of the implemented methods is Genetic Algorithms, which is adopted in our framework.

The works presented here accurately represent the dynamics of the vehicle powertrain. They cannot however be employed in our framework since they neglect competitors and overtakings, which affect higher-level strategic decisions.

Iii Statistical analysis of the competitors’ performance

One of the key contributions of the proposed methodology for lap strategy optimisation is modelling competitors’ motion along the track. Modelling and simulating the behaviour of the competitors allows in fact to design and evaluate lap strategies that can be effectively actuated in realistic racing situations. In this section, we detail how to extract a set of useful statistical indices from the publicly available fraction of data collected in previous races by WEC. The set of statistical indices is meant to synthetically describe the competitors’ performance during the race, and they will be lately used to simulate their motion along the track.

Being WEC events long-lasting races, the generated amount of data provides a relevant statistical basis for our scope. However, since GPS data of the vehicles are provided by Federation Internationale de l’Automobile (FIA) to every team (for their own vehicle only) during the race but they are not publicly available, the proposed approach relies on an alternative procedure. Based on the statistical distribution of sector times data, we aim to describe the behaviour of each competitor during the race in a probabilistic fashion. The following sections illustrate the entire procedure.

Iii-a Dataset and data cleaning

In motorsport competitions, circuits are typically divided into three main sectors. The sector times indicate the time interval spent by a car in each sector of the circuit for a specific lap. Differently from the GPS data, the sector time database is freely available111 An example of the database structure is reported in Table I, where the columns , and contain the three sector times.

# Lap Stop Elapsed Class Group Team
[s] [s] [s] [s]
1 1 33.978 38.779 32.358 105.115 LMP1 H Porsche
1 2 33.846 37.727 31.753 208.441 LMP1 H Porsche
1 3 33.340 37.789 36.823 316.393 LMP1 H Porsche
77 30 40.246 46.643 40.224 3833.398 LMGTE Am Porsche
77 31 B 40.622 47.265 45.367 3966.652 LMGTE Am Porsche
77 32 121.453 45.261 38.340 4171.706 LMGTE Am Porsche
TABLE I: General structure of the FIA sector times database

To eliminate spurious laps from the dataset, f.i. laps led by the safety car, the corresponding sector times have been clustered through the DBSCAN algorithm [13]

, and only the data belonging to the cluster with the fastest sector times have been considered for successive analyses. The clusters of fast sector times and the outliers are shown in Figure

1 for each car.

(a) .
(b) .
(c) .
Fig. 1: Clusters subdivision of the sector times according to DBSCAN.

Iii-B Forecasting the competitors’ speed profiles

Resorting to the sector times, we aim to estimate the speed profile of each competitor. Then, it is straightforward to derive an estimate of the competitors position during the race, which is fundamental to statistically model their behaviour.

The proposed procedure to estimate the speed profiles is now described. Considering a lap performed by our vehicle in absence of traffic and without using KERS, we extract a reference profile from GPS data, as shown in Fig. 2. We then scale it according to the measured sector times to estimate the speed profile of each competitor. The adopted linear scaling equation is



  • is the -th sector time performed by the car in the reference video;

  • is the speed of the car in the reference video, at the frame of the -th sector;

  • is the -th sector time of the -th lap performed by the -th competitor;

  • is the reconstructed speed of the -th competitor, at the frame in the -th sector of the -th lap;

  • indicates the three sectors;

  • , with being the total number of laps performed by the competitor ;

  • , with being the total number of competitors.

Fig. 2: Speed profile reconstructed using the linear scaling approach.

Iii-C Statistics of the competitors’ behaviour

Referring to the predicted speed profiles, the following statistics have been extracted from the cleaned dataset:

  • the free sector times distributions;

  • the overtaking probabilities.

The free sector times are defined as the sector times performed by a competitor with the preceding vehicle far at least 100 m for the whole duration of the sector. Under these conditions, it is possible to assume that the competitor performance and behaviour have not been influenced by the other competitors. Free sector times are useful to forecast the performance of competitors in absence of interactions. In Figure 3, an example of the sector times distribution is shown for a competitor.

Fig. 3: Free sector times distributions for a single competitor.

The overtaking probabilities are modelled as functions of the section and category of the two competitors involved. These statistics are useful to practically describe the interactions between competitors along the sectors. Sections are a finer subdivision of the circuit with respect to the sectors and they allow to distinguish between straights and curves, which are typically characterised by very different probabilities of overtaking. Figure 4 represents an arbitrary subdivision of the circuit of Bahrain into sections, which has been considered as the test circuit to validate the proposed approach.

Fig. 4: Subdivision of the Bahrain circuit into sections.

Denoting with the category of the vehicle that is attempting an overtake and with the category of the vehicle that may be overtaken, the overtaking probability of the pair along section is computed as

where is the number of overtakings of on in section , and is the number of times A and B have been in section with at most meters distance one from the other. Let us define the following notation:

  • LMP1: Le Mans Prototype 1;

  • LMP2: Le Mans Prototype 2;

  • LMGTE Pro: Le Mans Grand Touring Endurance Professionals;

  • LMGTE Am: Le Mans Grand Touring Endurance Amateurs.

Then, we can represent the overtaking probabilities for each pair of vehicle categories and each section. An example is provided in Figure 5, for the classes LMP1 and LMP2.

(a) Class LMP1.
(b) Class LMP2.
Fig. 5: Examples of overtaking probabilities distributions.

Iv Ego-vehicle model

The statistics derived in the previous section are used to model the behaviour of the competitors, whose actual dynamics is unknown due to lack of information. Considering the ego-vehicle instead, for which we aim to design the optimal lap strategy, the necessary information to model its dynamics is supposed to be available. Therefore, we propose here a dynamic model of the ego-vehicle, which is used to study the effect of the electrical energy usage on the speed profile. In the next section, the model will be used to compute optimal lap strategies.

The designed model is the result of a trade-off between computational complexity and accuracy in the description of the relevant dynamics. The longitudinal dynamics of the vehicle is modelled as



  • m is the total vehicle mass, accounting also for the fuel and the driver;

  • g is the gravity acceleration;

  • is the ground slope;

  • is the vehicle speed;

  • h is the height with respect to the road at which the vehicle centre of mass is located;

  • is the resistive rolling force at the front wheels;

  • is the resistive rolling force at the rear wheels;

  • is the aerodynamic resistive force;

  • is the thrust applied to ground by the electric motor through the front tires;

  • is the thrust applied to ground by the combustion engine through the rear tires;

  • and are the aerodynamic downforces, split between the front and rear tires, respectively;

  • and are the front and rear vertical tire forces, respectively;

  • is the distance between the front and rear axles.

A schematic representation of the involved quantities is depicted in Figure 6.

Fig. 6: Schematic representation of the ego-vehicle (LMP1).

Being the torque curves of the electric motor and of the combustion engine known, it is possible to write


With reference to (3) and (4), represents the theoretically available thrust at the rear wheels provided by the combustion engine, is the engine torque, is the engine speed in rpm, is the gear ratio and is the gear selected by the driver. Similarly, represents the theoretical thrust provided by the electric motor at the front wheels, having in this case just one transmission ratio available.

The amount of thrust transferred from the motors to ground depends on the maximum forces that can be generated by the tires through friction. The absolute value of the maximum tire forces and at the front and rear wheels, respectively, are given by



the tire-road friction coefficient. The vertical forces can be calculated through moments equilibrium as




is the air density, is the lift coefficient and is the reference surface. The aerodynamic force is given by


where is the drag coefficient. Finally, the resistive rolling forces are


being the rolling resistance coefficient.

The maximum longitudinal thrust that each tire can generate through friction can be calculated by vector difference between the maximum tire force and the lateral tire force experienced during curves. The lateral tire forces

and at the front and rear tires, respectively, can be computed with fair approximation considering a curve with a radius of curvature and constant vehicle speed as


Therefore, the absolute value of the maximum longitudinal forces and exchangeable with ground by the front and rear tires, respectively, are


The driving/braking torques commanded by the vehicle powertrain depend on the usage mode. In the WEC events under analysis, there are four different modes. The vehicle central unit can decide whether to power the electric motor and combustion engine at the same time (mode 1), power the combustion engine only (mode 2), undergo sailing222Sailing is the condition for which the combustion engine is automatically powered off by the control unit to satisfy the technical constraint on the fuel usage, even if the driver applies full throttle. In this powertrain usage mode, the vehicle is decelerated by the aerodynamic forces and by the intervention of the KERS, as explained later. (mode 3) or actuate the brakes (mode 4). Different modes imply different longitudinal tire forces, and thus different vehicle accelerations. In modes 1 and 2, the absolute value of the longitudinal tire forces can be computed as the minimum value between the thrust that the motors are capable to provide and the thrust that the tires are capable to transfer, that is


In mode 3, the total longitudinal force is equal to the force deriving from the electric torque during sailing


Finally, during braking the total longitudinal force is


where is the deceleration force generated by brakes.

Considering a spatial discretisation of m, the speed profile is reconstructed using the longitudinal model and compared to a reference one. Given the vehicle velocity and the throttle/brake commands at the current spatial discretisation point, all of the quantities are evaluated through (2)-(14), so as to obtain the vehicle acceleration , and then the velocity at the next discretisation point. The procedure is repeated iteratively for each discretisation point.

Hereinafter, the speed profile computed for the Bahrain circuit is compared with the real one that the reference car experienced during the real race. The results are shown in Fig. 7.

Fig. 7: Comparison between the reference speed profile and the one computed by means of the longitudinal vehicle model.

It is evident that the longitudinal vehicle model generates an overestimated speed profile with respect to the real one. This is due to different effects that have not been taken into account by the model. Therefore, three corrective coefficients are introduced into the model and properly tuned to better fit the real data. The corrective coefficients are:

  • the engine coefficient, which scales the thrust provided by the combustion engine;

  • the adherence coefficient, which scales the friction coefficient in low-speed curves;

  • the downforce coefficient, which scales the lift coefficient in high-speed curves.

After introducing the corrective coefficients into the model, the two speed profiles result to be coherent, as shown in Figure 8.

Fig. 8: Comparison between the reference speed profile and the one computed by means of the longitudinal vehicle model, after tuning the corrective coefficients.

V Traffic-free lap strategy optimisation

Having developed and validated the ego-vehicle model, it is now possible to formulate and solve an optimisation problem for the lap time minimisation, through the management of the powertrain energy budget. We remark that, at this stage, we aim to identify the best points of activation of the electric motor in a lap, considering absence of traffic.

Theoretically, the optimal strategy should be computed online during the real race, in order to take into account the actual traffic conditions. However, being this computation time consuming, we offline compute a set of traffic-free energy management strategies, and subsequently evaluate the best one in simulated traffic conditions (see Sec. VII).

The optimisation problem was initially solved using a MIQCP formulation. Although the method can provide optimal solutions, it has many shortcomings. First of all, it is time consuming, even for the resolution of a single optimisation problem. Therefore, generating a set of offline strategies would have been intractable. Moreover, numerical issues can degrade the performance of the solver, e.g. the presence of dense columns in the resulting matrices. The solver detects and eliminates as many dense columns as possible before optimising, but this may cause numerical instability. Finally, the problem could be ill-conditioned, which may lead to inconsistent results.

These issues may be attenuated by tuning the solver parameters, which is a difficult and time-consuming operation. To cope with this, we propose an alternative method to solve the optimisation problem, based on Genetic Algorithms (GA), which provides suboptimal solutions whereas it does not suffer from the aforementioned problems. The MIQCP approach is presented just to provide a baseline for assessing the suitability of the GA. Then, the optimisation is solved multiple times with GA, varying the constraints on the electric motor usage, to generate the set of candidate lap strategies.

The basic optimisation problem, written according to the spatial discretisation, is given by

subject to (15b)


  • is the lap time;

  • is the optimisation variable, and represents the powertrain usage mode at each spatial discretisation point along the track through a one-hot encoding vector of four elements;

  • is the consumed electrical energy in a lap;

  • is the maximum allowed electrical energy consumption in a lap;

  • represents the kilograms of fuel consumption in a lap;

  • represents the maximum allowed kilograms of fuel consumption in a lap;

  • is the amount of recovered electrical energy in a lap;

  • is the minimum allowed amount of recovered electrical energy in a lap.

The constraints (15c) and (15d) are set by the WEC technical regulation. Referring to the Bahrain International Circuit, the corresponding limits are and . Instead, the constraint (15e) is necessary to keep the state of charge of the battery greater than or equal to a constant value at the end of each lap. Thus, the minimum amount of recovered electrical energy must be equal to the amount of consumed electrical energy, that is


Since the electrical energy can be recovered through both the Heat Energy Recovery System (HERS), , and the Kinetic Energy Recovery System (KERS), , we can write


where the amount recovered through the HERS is track dependent and fixed for each lap.

In the next sections, the terms in (15) will be linked to the longitudinal vehicle dynamics. Lap time optimisation based on a MIQCP formulation is detailed in Sec. V-A, whereas the GA approach is presented in Sec. V-B.

V-a MIQCP solver

We decide to divide the track into subportions with spatial discretisation m. The cost function and constraints are then reformulated as functions of the optimisation variables.

V-A1 Cost function

We first link the vehicle acceleration to the lap time, whose minimisation is the objective of (15). The kinetic energy of the vehicle satisfies


where refers to the generic curvilinear abscissa. Identifying with the discretisation points along the curvilinear abscissa, i.e. for the generic variable it holds that , the above equation can be discretised using the forward Euler method, obtaining




Computing the vehicle speed through the above equation would involve a square root operator. In order to preserve linearity, which is convenient for solving optimisation problems, the method presented in [12]

is employed. It is possible to prove that a geometric mean inequality constraint written as


where , can be reformulated as a second-order conic constraint


Relaxing (20) as


and taking the square root from both sides we obtain


Hence, it is possible to reformulate the relaxed constraint as the following convex quadratic constraint


linking the speed and the kinetic energy. The inverse of speed is usually defined as ‘lethargy’,


that is the spatial derivative of time. Since (26) is a nonlinear constraint, we relax it and transform into a convex quadratic constraint


Finally, the lap time can be expressed as


Therefore, the relationship between the control inputs , and the objective is fully described by (19), (25) and (27), resorting to the intermediate variables and .

V-A2 Constraints

To make the constraints of (15) explicit, it is necessary to calculate the consumed fuel and used/recovered electrical energy in each portion of the track. The technical regulation defines the maximum fuel consumption per second at full thrust, that is . Given the combustion engine torque , the amount of consumed fuel kilograms can be computed as


From the thrust provided by the electric motor and its efficiency in traction , it is straightforward to compute the used electrical energy usage as


The electrical energy can be recovered through the KERS both during sailing or in the braking phase. In the first case, the KERS intervenes to partly recover the kinetic energy (). In the second case, instead, the energy is recovered through a braking force , lower in module than the maximum value that can be guaranteed by the brakes and the adherence with ground. Hence, according to the situation, the recovered electrical energy can be estimated using one of the two expressions


where is the efficiency of the electric motor in recuperation phase. The constraints of (15) can be finally reformulated as


The optimisation problem was built using the toolbox YALMIP [18]. The resulting Mixed Integer Quadratically Constrained Program (MIQCP) was then solved with IBM® ILOG® CPLEX® version 12.8.0. The optimised speed profile is shown in Figure 9, as compared with the reference one. There is significant coherence between the reference speed profile and the one resulting from optimisation. Finally, the points of activation of the electric motor are highlighted in Figure 10, from which we can conclude that the solver decides to use the electric motor in the first part of the straights, achieving the highest possible speed in the shortest time. This is coherent with the behaviour of professional human drivers.

Fig. 9: Comparison between the reference speed profile and the optimal one, obtained by solving the MIQCP.

Fig. 10: MIQCP-based optimal speed profile with detail of the electric motor activation points.

V-B Genetic-Algorithms-based solver

As previously mentioned, we now propose an alternative resolution method for the optimisation problem based on Genetic Algorithms. The circuit is divided into regions, each one from the beginning of a straight to the beginning of the following one. Differently from the MIQCP approach, whose optimisation variable embodies the powertrain usage mode directly, we consider optimisation variables for each region, that is the electrical energy consumption and the fuel consumption . The total number of optimisation variables in a lap is therefore . We further assume that the energy budget of each region, encoded by the optimisation variables, is actuated continuously from the beginning of the region (straight) until it is completely consumed. This assumption is supported by the fact that the optimal way to apply energy is to use it in the first part of the acceleration region, as known to the experts of the field. Finally, we highlight that, given an instance of the optimisation variables and the previous assumption, it is possible to uniquely identify the corresponding powertrain usage mode, speed profile and lap time.

With reference to the traditional nomenclature of GA, an instance of the optimisation variables constitutes the genome of an individual of the population. We employ a discretisation step of kJ for the electrical energy consumption and g for the fuel consumption, and select the optimisation variables ranges as and , respectively. The discretisation speeds up mutations and crossovers to reach the optimised solution, and constitutes a fair approximation being the step sizes three orders of magnitude lower than the maximum value that the variables can take. The inequality constraints (15c) and (15d) are then enforced through linear inequalities on the optimisation variables. Finally, the fitness function is the lap time generated by the genome.

A reasonable initial population was provided to the GA to hot start the optimisation and improve computational times. The simplest way to obtain a hot-start genome in the population is to compute the amount of electrical energy and fuel consumed in each region of the circuit in a real lap. The chosen reference lap was performed in absence of interactions with competitors and with the lowest lap time possible, so as to be close to the optimal solution. Another important hyperparameter to be carefully selected is the population size, which is critical for the convergence of the GA. Literature provides some heuristic methods to select the dimension of the population. Anyway, for the majority of the problems they indicate that an increase in the population size statistically reduces the error between the solution found by the GA and the exact solution

[14], at the expense of an increase in the computational time. The most common rule of thumb establishes that the dimension of the population needs to be at least equal to the size of the genome. Converting the variables from decimal to binary, as typical for GA, we obtain


from which the initial population needs to have at least individuals. After analysis of the results, we finally changed the population size to individuals to obtain stable and repeatable solutions.

Differently from the previous method based on the MIQCP formulation, which generates a single optimal solution in absence of competitors, we employ the GA to create a set of optimal energy strategies, whose efficacy will be then evaluated in realistic traffic conditions. The evaluation is performed in the next section. The set of strategies is obtained by launching the GA many times, each with different constraints on the electrical energy usage. The underlying reasoning for heuristically imposing extra constraints is the following. Since the strategies will be evaluated in complex traffic scenarios, activating the electric motor where overtaking is easier to be performed, e.g. a straight, can be beneficial to reduce the time loss related to the maneuver and the lap time. Table II shows the constraints applied in each strategy and the corresponding optimal lap time. It can be noticed that the applied constraints have an action range of at least 100 meters. This is done to respect the sensitivity/capability of the driver to manually intervene on the power-off/on button of the electric motor. Some exemplary optimal strategies are shown in Figure 11.

Strategies # Lap time (s) Electrical Energy constraints
1 104.028 No constraints
2 104.092 No KERS first 100 meters straight
3 104.152 No KERS first 100 meters straight
4 104.273 No KERS first 110 meters straight
5 104.290 No KERS first 100 meters straight
6 104.310 No KERS first 200 meters straight
7 104.373 No KERS first 100 meters straight
8 104.396 No KERS first 300 meters straight
9 104.402 No KERS first 100 meters and straights
10 104.415 No KERS first 100 meters and straights
11 104.417 No KERS first 200 meters straight
12 104.566 No KERS first 100 meters and straights
13 104.603 No KERS first 200 meters straight
14 104.674 No KERS first 70 meters straight
15 104.733 No KERS first 140 meters straight
TABLE II: Strategies and their Electrical Energy constraints generated by the Genetic Algorithm
(a) Strategy 1.
(b) Strategy 5.
(c) Strategy 13.
(d) Strategy 10.
Fig. 11: Speed profiles resulting from the GA-based optimisation. The green lines represent the points of activation of the electric motor.

Vi Multi-agent simulations of the competitors’ behaviour

The approach presented in Sec. V generated a set of offline strategies for the lap time minimisation in absence of competitors. To evaluate them in realistic race conditions, where competitors are present along the track and mutually interact with each other, it is necessary to simulate their behaviour. In this section, we explain how to build multi-agent Monte Carlo simulations of the competitors’ behaviour from the statistics computed in Sec. III.

The forecasts have to consider a time horizon that is long enough to allow the ego-car to complete the lap. For this reason, we decide to perform two-laps-long forecasts starting from the initial condition. The positions of the competitors in absence of mutual interactions are forecast by means of the free sector times probabilistic distributions, derived from Sec. III. With reference to Figure 3, we remark that the associated probability distributions are not Gaussian. To cope with this, we perform random sampling, with enough extractions to guarantee that the probabilistic distributions of the competitors’ behaviour can be reliably approximated. The forecast position along the track of an exemplary competitor for different simulations is shown in Figure 12.

Fig. 12: Simulated distribution of forecast positions of an exemplary competitor in absence of mutual interactions.

At this point, we introduce the interactions between the competitors. To do so, a multi-agent-based model has been developed, resorting to the Influence/Reaction principle [19, 3, 9]. Each competitor is considered as a rational agent that tries to minimise its lap time. If a competitor gets closer to the preceding one, it perceives its influence and has to react, performing overtaking or following. When an overtaking may be performed, the category of the involved vehicles is evaluated and, taking into account the section along the circuit where the overtaking may occur, the overtaking probability is computed from the statistics derived in Sec. III. A random number in the range is extracted and compared to the overtaking probability. If it is lower, then overtaking succeeds, otherwise the car keeps on following the preceding car for the entire length of the section. The procedure is then repeated at the successive section.

Repeating these steps for all possible overtakings involving the competitors along the circuit generates a Monte Carlo numerical simulation. Two examples of Monte Carlo simulation are reported in Figure 13.

(a) Example 1.
(b) Example 2.
Fig. 13: Monte Carlo numerical simulations of the competitors’ motion along the track. The colours of the lines represent different categories: red for LMP1, blue for LMP2, green for LMGTE Pro and finally black for LMGTE Am. The circles, instead, have the following meaning: the blue one represents the position along the circuit where two vehicles get close enough to activate the Influence/Reaction principle, whereas the black one defines the position where the overtaking occurs.

To reliably forecast the behaviour of the competitors, it is necessary to generate a significant number of Monte Carlo numerical simulations. This means performing a large amount of random samplings of free sector times and overtaking probabilities. These computations may however be time consuming. Indeed, they have to be performed in real time at the beginning of each lap, as they depend on the actual positions of the competitors along the circuit. A solution to this issue can be addressed through parallel computing, since the simulations are completely independent from each other. The statistical analysis of the outputs and evaluation of the optimal strategy in presence of realistic traffic conditions is described in the next section.

Vii Stochastic lap strategy optimisation in traffic conditions

Optimal strategies for the powertrain energy budget in absence of traffic have been presented in Sec. V. On the contrary, the Monte Carlo approach defined in VI allows to forecast the positions of the competitors, accounting for their mutual interactions. We now propose a novel way to combine the two features and build a stochastic optimal solver, aiming to identify the strategy that statistically guarantees the minimum lap time in presence of traffic.

Stochastic Dynamic Programming (SDP), which is the method we employ in this work, is an optimisation-based method for making sequential decisions under uncertainty. In our framework, uncertainty is associated to the occurrence of overtakings. Being SDP a discrete decision-making process and considering stages , the following ingredients are defined at each timestep :

  • an initial state , where is the set of feasible states at stage ;

  • a decision variable , where is the set of feasible actions that can be chosen at stage ;

  • an immediate cost or reward function , representing the incurred cost or the gained reward at stage if an action is chosen at the state ;

  • a state transition function , defining the state change from towards ;

  • a discount factor ;

  • a conditional probability , representing the probability to move into state given the current state and the chosen action .

The optimal control problem can be iteratively described through the value function at the generic stage as


which represents the expected optimal cost that can be attained from the state if the optimal action is chosen.

According to (34), the optimal strategy is the one that minimises the value function at the initial stage of the problem, and can be computed resorting to policy iteration. However, we have already selected fifteen strategies a priori in Sec. V, through the energy budget optimisation problem. Thus, we do not perform policy iteration, but policy evaluation: the value function deriving from each of the pre-computed strategies is evaluated. Finally, the best strategy is identified as the one that minimises the value function at the first stage.

To apply this technique, we first test each of the fifteen optimal energy strategies in each of the Monte Carlo numerical simulations. Figure 14 reports an example of this procedure.

Fig. 14: Testing the energy budget strategy in a Monte Carlo numerical simulation. The thick red line represents the spatial profile of the reference car, whereas the other lines represent the competitors.

The parameters of this technique have been defined in the following way. The variable , representing the state of the system, consists in the negative number of the possible overtakings to be performed in the reference lap. Indeed, the positions of the reference car and of the competitors are known from respectively the optimisation problem and the Monte Carlo numerical simulations. Thus, for each combination of optimal strategies and numerical simulations it is possible to compute the number of possible overtakings that the reference car can perform in the lap. For instance, with reference to Figure 14, three overtakings can occur. This implies that the state of the system is initialised to . When an overtaking occurs, the state is augmented by 1. The decision variable consists in the usage of the powertrain energy budget, either coming from the combustion engine and from the electric motor. The variable refers to the probability of changing the state from to , once a specific decision variable is chosen. Since we have defined the state as the number of possible overtakings to be performed in a lap, this probability is exactly the overtaking probability that we have drawn from the analysis of previous years events as a function of the category of the involved vehicles and of the section where the overtaking may occur.

The variable is defined as a cost. When the reference car approaches a competitor, according to the categories and the section where the vehicles are, there is a specific overtaking probability. If overtaking is not performed, the ego-vehicle needs to slow down, which involves a time loss. The latter can be computed as the difference between the time instant at which the reference car has reached the end of the section behind the preceding car and the instant of time at which the reference car would have reached the end of the section without traffic. Moreover, if the reference car is obliged to stay behind the preceding car in a section, it has to reduce its speed. Once it is able to overtake the slower vehicle in a successive section, it needs time before reaching its reference optimal speed profile again. This time loss is also added to the cost.

The parameter is a factor that discounts future costs, which is typically employed in infinite-time problems. Since we deal with a finite-time problem, the parameter is set equal to .

The stages of SDP coincide with the sections defined for the computation of the overtaking probabilities in Sec. III

. In each section the algorithm evaluates whether the reference car may perform an overtaking. If a competitor is close enough, there is a specific probability of performing the overtaking. If overtaking succeeds, the state is augmented by one and there is no associated time loss. On the contrary, the reference car is obliged to follow the preceding vehicle, the state does not change, and a time loss is introduced in the cost function It is convenient to graphically represent all of the probabilistic events rising from overtakings in a lap. In this regard, we adopt a suitable and common representation called Decision Tree. To build Decision Trees, the possible occurrence of an overtaking is evaluated in each section from the beginning to the end of the track (forward pass

[17]). Then, a backward pass computes the value function at the initial stage of the problem, i.e. . It consists in multiplying the probability of an event with the associated cost values, and summing them up stage by stage from the end to the beginning, according to (34). A portion of the Decision Trees for the optimal strategy is reported in Figure 15.

(a) Probability Decision Tree. The number above the state represents the probability of occurrence of a particular event.
(b) Cost Decision Tree. The number above the state represents the cost of a particular event, i.e. its associated time loss expressed in seconds.
Fig. 15: Example of Probability and Cost Decision Trees. The negative value over each circle represents the state of the system, i.e. the number of overtakings still to be performed in the lap. A red circle with state equal to 1 indicates that all possible overtakings have already been performed for that probabilistic event. The left column of numbers identifies the decision stages where overtakings may occur, whereas the right column specifies the section along the track.

The value functions at time zero of each strategy are reported in Table III. The fifth strategy results to be the optimal one, having the lowest time loss. Despite this strategy is approximately 0.3 s slower than the first one in absence of traffic conditions, i.e. the reference car, it is statistically faster of approximately 0.5 s considering traffic conditions. Moreover, the fifth strategy has a high statistical significance, since it resulted to be the optimal strategy in approximately 70% of the Monte Carlo numerical simulations. Further graphical interpretations are provided in Figures 16 and 17 to support our approach. They both show that the optimal strategy tends to save electric power and release it to perform overtakings only in portions of the track where they produce lower time losses.

Strategy #
1 2.131
2 1.987
3 1.792
4 1.884
5 1.662
6 2.092
7 2.176
8 2.338
9 2.359
10 2.421
11 2.428
12 2.619
13 2.211
14 2.886
15 2.534
TABLE III: Value function at the initial stage from Stochastic Dynamic Programming

Fig. 16: Analysis of time losses due to overtakings in a full lap. The black-orange-red colours of the sections represent increasing time losses due to overtakings. The blue circle marks the position where the first overtaking is probable to occur, according to the forecast of the competitors positions and to the default energy budget strategy, i.e. the first strategy. The red circle expresses the same concept for the fifth energy strategy, i.e. the optimal one. Since this strategy forces the solver not to use the electric motor in the first straight, the ego-car is expected to perform the first overtaking further along the track. This highlights that the proposed approach is suitable to shift overtakings to positions where they cause lower time losses.

Fig. 17: Speed profiles resulting from the first and fifth strategy. The optimal strategy reaches a lower top speed in the first straight, due to a lower usage of the electrical energy. The saved energy is then spread into the last two straights, where it is easier to perform overtakings and reach higher speed.

The analysis can be extended to a sequence of laps, i.e. a stint of the race. Considering the fifth stint of the Bahrain 2017 event, we launch a Monte Carlo numerical simulation starting with the initial positions of the competitors, and identify the best among the fifteen pre-computed strategies through SDP. The positions of the ego-car during the real race are then replaced by the space profile deriving from the optimal strategy. Then, SDP is employed a second time to evaluate the statistical time loss due to overtakings that would have been generated through the optimal strategy. Figure 18 shows that our method can lead to a time gain of s compared to the original strategy used during the race. Running the procedure multiple times, we obtained an upper bound equal to s and a lower bound of

s, with confidence interval

. Figure 19 represents the temporal gain lap by lap, with associated confidence interval. It is evident that the application of the proposed method can have a significant impact on the result of an event, since the time difference between the winner and the competitors is usually of a few seconds.

Fig. 18: Result from the application of Stochastic Dynamic Programming to an entire stint of the race. The red thick line represents the space profile deriving from the suggested strategy at the final lap of the stint, whereas the red thick dotted line represents the positions occupied by the reference car during the last lap of the stint in the real race. The other lines represent the competitors’ motion. Adopting our approach to an entire stint of 26 laps, a time gain of 6.44 s would have been achieved.

Fig. 19: Lap-by-lap temporal gain generated by our approach.

Viii Conclusions

In this work, we have presented an efficient procedure to generate optimal lap strategies for LMP1 hybrid electric vehicles in WEC events. Particularly, the framework computes the optimal energy budget utilisation that statistically minimises the ego-vehicle lap time in presence of competitors, while complying with the technical regulations.

Our approach relies on multiple contributions. First, we have shown how to extract meaningful statistics regarding the competitors’ motion along the track, using the limited amount of publicly available data from previous races. The statistics model the sector times and overtaking probability distributions of each vehicle type. They are used to develop realistic Monte Carlo simulations of the agents motion and interactions in a lap. Second, we have computed a set of candidate traffic-free solutions to the ego-vehicle lap strategy optimisation problem using Genetic Algorithms, presenting them as a more computationally efficient alternative to the classical MIQCP formulation. Finally, the traffic-aware optimal solution is statistically identified among the candidate traffic-free policies using Stochastic Dynamic Programming. Relying on several Monte Carlo simulations of the competitors’ motion, Stochastic Dynamic Programming is used to evaluate the best strategy among the candidate ones.

To validate our approach, we have applied it to a stint of a real race. The results show that our approach leads to an average time gain of s, compared to the strategy that was actuated during the real race. The time gain is particularly significant, since the time difference between the winner and the leading vehicles is usually of a few seconds in WEC events.

Finally, we highlight that our strategy can be easily extended to other types of racing events. In fact, it only relies on an ego-vehicle longitudinal model and on the sector times data of the vehicles from previous races, which are commonly available. The extension to different types of events is left to future works.


This work has been developed in collaboration with Dr. Ing. h.c. F. Porsche AG. We would like to thank the Porsche Motorsport Team for providing sensitive data on WEC events and their vehicle. A sincere thanks to Stephen Mitas, Porsche Chief Race Engineer, and to Emiliano Giangiulio and Vincenzo Scali, Porsche engineers, for contributing to the establishment of this work and for their support.


  • [1] V.N. Alexandrov, C. G. Martel, and J. Straßburg (2011) Monte carlo scalable algorithms for computational finance. Procedia Computer Science 4 (Complete), pp. 1708–1715 (English). External Links: Document Cited by: §I.
  • [2] P. G. Anselma (2021) Optimal adaptive race strategy for a Formula-E car. Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, pp. 09544070211047343 (en). Note: Publisher: IMECHE External Links: ISSN 0954-4070, Document Cited by: §I, §II.
  • [3] J. Barthélemy and T. Carletti (2017) An adaptive agent-based approach to traffic simulation. Transportation Research Procedia 25, pp. 1238–1248. Note: World Conference on Transport Research - WCTR 2016 Shanghai. 10-15 July 2016 External Links: ISSN 2352-1465, Document Cited by: §VI.
  • [4] J. Bekker and W. Lotz (2009) Planning formula one race strategies using discrete-event simulation. Journal of the Operational Research Society 60 (7), pp. 952–961. External Links: Document Cited by: §II.
  • [5] E. Betancur, G. Osorio-Gómez, and J. C. Rivera (2017-10) Heuristic Optimization for the Energy Management and Race Strategy of a Solar Car. Sustainability 9 (10), pp. 1576 (en). External Links: ISSN 2071-1050, Document Cited by: §I, §II.
  • [6] A. Boretti (2018) CHAPTER 3 World Endurance Championship: 2012 to Present. In Prototype Powertrain in Motorsport Endurance Racing: an SAE Technical Paper Compilation, pp. 15–23. Note: Conference Name: Prototype Powertrain in Motorsport Endurance Racing: an SAE Technical Paper Compilation External Links: ISBN 978-0-7680-9160-1 Cited by: §I.
  • [7] O. Borsboom, C. A. Fahdzyana, T. Hofman, and M. Salazar (2021-09) A Convex Optimization Framework for Minimum Lap Time Design and Control of Electric Race Cars. IEEE Transactions on Vehicular Technology 70 (9), pp. 8478–8489. Note: Conference Name: IEEE Transactions on Vehicular Technology External Links: ISSN 1939-9359, Document Cited by: §I.
  • [8] S. Broere and M. Salazar (2021-11) Minimum-lap-time Control Strategies for All-wheel Drive Electric Race Cars via Convex Optimization. Note: Comment: Submitted to ECC 2022 External Links: Link Cited by: §I, §II.
  • [9] J. Dávila, M. Uzcátegui, and K. Tucci (2005) A multi-agent theory for simulation. In International Conference on Modelling, Simulation and Optimization, Oranjestad, Aruba. Cited by: §VI.
  • [10] F. I. de l’Automobile (2018) Regulations. External Links: Link Cited by: §I.
  • [11] P. Duhr, G. Christodoulou, C. Balerna, M. Salazar, A. Cerofolini, and C. H. Onder (2021) Time-optimal gearshift and energy management strategies for a hybrid electric race car. Applied Energy 282, pp. 115980 (en). External Links: ISSN 0306-2619 Cited by: §I, §II.
  • [12] S. Ebbesen, M. Salazar, P. Elbert, C. Bussi, and C. H. Onder (2018) Time-optimal control strategies for a hybrid electric race car. IEEE Transactions on Control Systems Technology 26 (1), pp. 233 – 247 (Undetermined). External Links: Document Cited by: §V-A1.
  • [13] M. Ester, H. Kriegel, J. Sander, and X. Xu (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, KDD’96, pp. 226–231. Cited by: §III-A.
  • [14] S. P. Gotshall and B. Rylander (2002) Optimal population size and the genetic algorithm. Cited by: §V-B.
  • [15] A. Heilmeier, M. Graf, and M. Lienkamp (2018-11) A Race Simulation for Strategy Decisions in Circuit Motorsports. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC), pp. 2986–2993. Cited by: §II.
  • [16] T. Herrmann, F. Passigato, J. Betz, and M. Lienkamp (2020-09) Minimum Race-Time Planning-Strategy for an Autonomous Electric Racecar. In 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), pp. 1–6. External Links: Document Cited by: §I.
  • [17] P. A. Jensen and J. F. Bard (2002) Operations research models and methods. 1 edition, John Wiley & Sons Inc.. Cited by: §VII.
  • [18] J. Lofberg (2004) YALMIP : a toolbox for modeling and optimization in matlab. In 2004 IEEE International Conference on Robotics and Automation (IEEE Cat. No.04CH37508), Vol. , pp. 284–289. External Links: Document Cited by: §V-A2.
  • [19] F. Michel (2007) The irm4s model: the influence/reaction principle for multiagent based simulation. In AAMAS ’07, Cited by: §VI.
  • [20] M. L. Puterman (2005) Markov decision processes: discrete stochastic dynamic programming. 1 edition, John Wiley & Sons Inc.. Cited by: §I.
  • [21] S. Ross (1983) Introduction to stochastic dynamic programming. Academic Press Inc.. Cited by: §I.
  • [22] M. Salazar, C. Balerna, P. Elbert, F. P. Grando, and C. H. Onder (2017-12) Real-Time Control Algorithms for a Hybrid Electric Race Car Using a Two-Level Model Predictive Control Scheme. IEEE Transactions on Vehicular Technology 66 (12), pp. 10911–10922. Note: Conference Name: IEEE Transactions on Vehicular Technology External Links: ISSN 1939-9359 Cited by: §I.
  • [23] M. Salazar, C. Balerna, P. Elbert, F. P. Grando, and C. H. Onder (2017) Real-time control algorithms for a hybrid electric race car using a two-level model predictive control scheme. IEEE Transactions on Vehicular Technology 66 (12), pp. 10911–10922. External Links: Document Cited by: §II.
  • [24] M. Salazar, P. Elbert, S. Ebbesen, C. Bussi, and C. H. Onder (2017-11) Time-optimal Control Policy for a Hybrid Electric Race Car. IEEE Transactions on Control Systems Technology 25 (6), pp. 1921–1934. Note: Conference Name: IEEE Transactions on Control Systems Technology External Links: ISSN 1558-0865, Document Cited by: §I.
  • [25] E. Yesil, A. O. Onol, A. Icke, and O. Atabay (2013-11) Strategy optimization of a solar car for a long-distance race using Big Bang — Big Crunch optimization. pp. 521–526. External Links: Document Cited by: §I, §II.