Defending a Perimeter from a Ground Intruder Using an Aerial Defender: Theory and Practice

09/07/2021 ∙ by Elijah S. Lee, et al. ∙ NYU college University of Pennsylvania 0

The perimeter defense game has received interest in recent years as a variant of the pursuit-evasion game. A number of previous works have solved this game to obtain the optimal strategies for defender and intruder, but the derived theory considers the players as point particles with first-order assumptions. In this work, we aim to apply the theory derived from the perimeter defense problem to robots with realistic models of actuation and sensing and observe performance discrepancy in relaxing the first-order assumptions. In particular, we focus on the hemisphere perimeter defense problem where a ground intruder tries to reach the base of a hemisphere while an aerial defender constrained to move on the hemisphere aims to capture the intruder. The transition from theory to practice is detailed, and the designed system is simulated in Gazebo. Two metrics for parametric analysis and comparative study are proposed to evaluate the performance discrepancy.

READ FULL TEXT VIEW PDF
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

page 5

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

I Introduction

The pursuit-evasion games (PEGs) have been widely investigated over the past years and are used in many applications including mobile robotics [4]. There are many variants of PEGs under different assumptions on the players and the environments. One route is to consider the players as point particles [9, 15]. Liang et al. [9] address a PEG with three point particles: target, attacker, and defender. The attacker aims to capture the target while avoiding the defender, and the defender aims to defend the target while trying to capture the attacker. Other work [15] allows the point particle players move in three dimensions and solves the differential games with three defenders and one intruder with equal speeds.

Researchers also have focused on solving the PEGs with real robots [14, 1, 2]. Vidal et al. [14] propose a hierarchical hybrid system to implement the pursuit-evasion game scenario on real UAVs and UGVs. The PEG between two pursuing and one evading unmanned aerial vehicles is solved in [1]

. Deep learning based approach for vision-based UAV pursuit-evasion is implemented in

[2].

This work formulates a variant of pursuit-evasion game known as the target-guarding problem [5]. In this problem, intruders aim to reach the target without being captured by defenders, while defenders try to capture intruders [9, 12, 7]. When defenders are constrained to move along the perimeter of the target region, we call this problem as perimeter defense game, which we refer to [13] for detailed survey. Previous works consider the players as point particles with first-order assumptions [9, 5, 12, 7, 13]. The optimal strategies are theoretically proved for multiplayer game on two-dimensional convex shapes [12] and for three-dimensional game on hemisphere [7].

Fig. 1: Hemisphere perimeter defense game in theory and practice shows the discrepancy in executing optimal strategies.

There are many challenges in realizing the perimeter defense system with real robots, and the major challenges lie in coping with discrepancy between first-order assumptions of point particles and dynamics of real robots, as shown in Fig. 1. This work simulates the perimeter defense based on Unmanned Aerial Vehicles (UAVs). UAVs are deployed in various space such as power plant [6], penstock [11], forest [3], or disaster sites[8], which are good perimeter defense applications.

This paper extends the previous work [7] on hemisphere defense to apply the derived theory from point particles to real robots. We observe any performance discrepancy in relaxing first-order assumptions and discuss how to reduce such discrepancy. The contributions of the paper are (i) realizing perimeter defense between aerial defender and ground intruder from theory to practice; and (ii) performing parametric analysis of system scales and comparative study of strategies to evaluate the performance discrepancy in relaxing first-order assumptions.

Section II formulates the problem. Section III addresses the transition from point particle to real robot. To evaluate the discrepancy, two metrics are proposed in Section IV. Section V provides experimental results, and Section VI concludes the paper.

Ii Problem Formulation and Preliminaries

Consider a hemisphere perimeter with radius of as shown in Fig. 2. The defender is constrained to move on the hemisphere while the intruder is constrained to move on the ground plane. The positions of the players in spherical coordinates are: and , where and are the azimuth and elevation angles, which gives the relative position as: , where and (see Fig. 2). Without loss of generality, we assume the defender’s maximum speed is 1. The intruder is assumed to have a maximum speed (otherwise, intruder always has a strategy to win the game). We denote that the velocities of defender and intruder are and , respectively. The game ends at time with intruder’s win if and , whereas it ends with defender’s win if and . We call as the terminal time.

Ii-a Optimal breaching point for point particle

Given , , we call breaching point as a point on the perimeter at which the intruder tries to reach the target, as shown in Fig. 2. We call the azimuth angle that forms the breaching point as breaching angle, denoted by , and call the angle between and the tangent line at as approach angle, denoted by .

It is proved in [7] that given the current positions of defender and intruder as point particles, there exists a unique breaching point such that the optimal strategy for both defender and intruder is to move towards it, known as optimal breaching point. The breaching angle and approach angle corresponding to the optimal breaching point are known as optimal breaching angle, denoted by , and optimal approach angle, denoted by .

As stated in [7], although there exists no closed form solution for and , they can be computed at any time by solving two governing equations:

(1)

and

(2)

Ii-B Target time and payoff function

We call the target time as the time to reach and define as the defender target time, as the intruder target time, and the following as payoff function:

(3)

The defender reaches faster if and the intruder reaches faster if . Thus, the defender aims to minimize while the intruder aims to maximize it.

Fig. 2: The coordinate system and relevant variables.

Ii-C Optimal strategies and Nash equilibrium

It is proven in [7] that the optimal strategies for both defender and intruder are to move towards the optimal breaching point at their maximum speed at any time.

Let and be the continuous and that lead to so that and , and let and be the optimal strategies that minimize and , respectively, then the optimality in the game is given as a Nash equilibrium:

(4)

Iii From Theory to Practice

This section discusses the transition from theory to practice in executing the optimal strategies for hemisphere perimeter defense game.

Iii-a From point particle to three-dimensional robot

The major challenge in bringing the theory closer to practice lies in representing the agents as three-dimensional robot. Previous works in perimeter defense focused on the point particle [9, 5, 12, 7, 13] to represent the defender and intruder, and the following are the assumptions made for the point particle, which may not hold true in working with the three-dimensional robot:

  • It has no volume and thus it is scale invariant

  • It moves with desired velocity instantly and precisely

  • If it moves at its maximum speed, the speed is consistent along the trajectory

  • It can accurately detect other agent’s positions and react to it simultaneously

  • Optimal trajectory obeys first-order assumptions

This work aims to simulate robots using the optimal strategies derived from theory, observe any discrepancy between the performances from theory and practice, and discuss how relaxing aforementioned assumptions would lead to the discrepancy. The employed robot is an UAV in Fig. 3. It has a dimension of 735mm 735mm 200mm and the mass of 1.8kg. This UAV well represents a three-dimensional robot relaxing the point particle assumptions.

Fig. 3: An UAV carrying stereo cameras and Ouster lidar
Fig. 4: Overall controller for cross feedback system

Iii-B Optimal breaching point in practice

In practice, we relax the assumption of scale invariance. The radius of hemisphere becomes a relevant variable , as denoted in Fig. 2. Accordingly, (2) in practice becomes

(5)

although (1) stays the same in practice since all the parameters are scale invariant. We can solve for and by solving two governing equations (1) and (5), and obtain the corresponding optimal breaching point in practice.

Iii-C Controller

As mentioned in the previous assumptions [7], the controller used for theoretical approach was simply perfect. Both defender and intruder reacted to each other without any delay, and desired dynamics were enforced precisely and instantly to result in continuous optimal trajectories as shown in Fig. 1

. The agents also identify each other’s states without any error at any time (e.g. perfect state estimation).

The overall controller for practical approach is illustrated in Fig. 4. All modules represent the nodes in ROS and are connected by rostopics for real-time implementation. For each agent, the strategy module subscribes to the ground truth odometry that are published by both UAVs to calculate position command. In this way, state estimation is based on ground truth and becomes highly accurate. Then, the position command is fed into the SO3-based controller based on [10]

to generate SO3 command. The SO3 commands consist of thrust and moment that control the robot in low level. Finally, the simulated robot publishes the ground truth odometry to be fed into the SO3-based controller for closed loop control. Note that the defender and intruder strategy modules take in both agents’ odometry. For this cross feedback system, the two odometry data are stored into queues and used for synchronized computation of the optimal strategy.

Iii-D Executing optimal strategies

As stated in Sec.II-C, both defender and intruder move at their maximum speeds in executing optimal strategies in theory. It is also worth noting that the optimal trajectories are continuously changing based on the configuration of agents at given time because the optimal breaching point is newly calculated at any time to reflect the change in positions of the defender and intruder.

For this reactive and adversarial environment, we enforce the players to follow an infinitesimal segment of computed optimal trajectory at any time to maintain consistent maximum speeds. In Fig. 2, given the optimal breaching point and current configurations of defender and intruder and , their current optimal trajectories are and , respectively. Based on the movements of defender and intruder, the optimal trajectories as well as the optimal breaching point may change over time, so we set the goal trajectory and as segments of original trajectories by taking the length of and for defender and intruder, respectively. We give position command towards the end point of the infinitesimal segment, which can be viewed as controlling an instantaneous velocity of agents.

First, we aim to compute the position of after infinitesimal defender movement . For simplicity, assume is on the zx-plane (i.e. ) by rotating the hemisphere by about the z-axis, and we will rotate the coordinates by back to the original orientation at the end. Given the radius , azimuth angle , and , we know the following: (i) is on the hemisphere with a center on the origin and a radius of ; (ii) The arc distance between and is ; (iii) is on the plane . The conditions (i) and (ii) give

(6)
(7)

We know the equation of a plane is given by

(8)

Therefore, the plane is uniquely determined by three points , , and , 0, . With the condition (iii), (8) becomes

(9)

Together with (6), (7) and (9), we get

(10)

where

The final computation of the position is summarized in Algorithm 1. Notice that line 4 and 5 rotate the system back to the original configuration. Similarly, the position of after infinitesimal intruder movement is computed in Algorithm 2, and overall agents’ strategies are summarized in Algorithm 3.

0:  
0:  Updated defender positions
1:  Set defender’s small movement
2:  
3:  Evaluate using (10)
4:  
5:  
Algorithm 1 [] = MoveDefender()
0:  
0:  Updated intruder positions
1:  Set intruder’s small movement
2:  
3:  
4:  
5:  
6:  
Algorithm 2 [] = MoveIntruder()
0:  Agent odometry
0:  Position command (updated for defender and updated for intruder)
1:  Initialize using odometry
2:  if  or  then
3:     Compute optimal breaching angle
4:     if Defender Strategy then
5:        [] = MoveDefender()
6:     else if Intruder Strategy then
7:        [] = MoveIntruder()
8:     end if
9:  end if
Algorithm 3 Defender/Intruder Strategy

Iv Discrepancy Analysis

In this section, we propose two metrics to evaluate the discrepancy between the performances from theory and practice. One metric measures the discrepancy from relaxing the scale invariant assumption made for point particle. This metric is crucial since the discrepancy between small and large scales would not allow scalability, which is essential for real-world applications. The other metric investigates if the traversed trajectory is the same as the optimal trajectory derived from theory by comparing its performances with the performance of other baseline strategy. We chose these metrics to evaluate the discrepancy because other point particle assumptions (e.g. instant velocity change, simultaneous detection and reaction, and no fluctuation in maximum speed) highly depend on the hardware. A robot with better computing power will be closer to meet these assumptions, so we focus on the discrepancy in system scales and strategies.

Iv-a Discrepancy in system scales

Fig. 5

shows different scales of environment (e.g. hemisphere) for perimeter defense game. Since the robot size is fixed, this variance may impact the dynamics of the UAV. For instance, if the robot traverses along the surface of small hemisphere, it may enforce the controller to execute abrupt turns and the resulting trajectory may not exactly follow the hemisphere surface.

Fig. 5: Different system scales of environment and UAVs with a fixed scale. Left: Radius of 3. Middle: Radius of 10. Right: Radius of 100.
Fig. 6: Game setting for computing . Left: Initial configuration when the game begins. Right: Final configuration at terminal time .

To account for such performance discrepancy, we propose a metric as shown in Fig. 6. This metric computes the geodesic between and , which represent the positions of defender and intruder at terminal time , respectively. We purposely set the game for the intruder to win (e.g. intruder reaches to perimeter before defender does) because more interesting dynamics are happening for defender so we fix the behavior of intruder and evaluate that is normalized by radius for different set of system scales. We also compare with the outcome derived from first-order assumptions and observe how the relaxed point particle assumptions impact , which would be closer to the theoretical outcome in ideal conditions.

Iv-B Discrepancy in strategies

We define a metric , shown as in Fig. 7, to understand the discrepancy in different defender strategies. This metric measures the distance between and at terminal time , which represents the distance between the agents at the end of the game. To obtain this metric, we intentionally set the game for the defender to win so that lies on the base of the hemisphere. In this manner, any plausible defender strategy can be evaluated because the game ends only if the defender finishes executing its strategy. If is to be used as the metric, the game would end before defender fully executes the strategy, which makes it challenging to compare the performance.

In comparing strategies, we run two defender strategies: (1) optimal strategy that moves defender towards the breaching point at its maximum speed at any time; and (2) baseline strategy that moves defender at its maximum speed at any time towards the azimuth angle of intruder, which is towards that is an intersection of and the base of hemisphere as shown in Fig. 2. A strategy with larger outperforms the other since it secures more time to defend the intruder.

Fig. 7: Game setting for computing . Left: Initial configuration when the game begins. Right: Final configuration at terminal time .

V Experiments

This section evaluates the performance discrepancy between theory and practice based on system scales and strategies. We run all experiments using Gazebo with ROS.

V-a Parametric analysis of system scales

In the parametric analysis, we vary the system scales by changing the radius of the hemisphere. A total of 22 different radii from 3 to 300 are used, and multiple experiments are run for each radius to compute . Given radius , the experiments are run with an initial configuration , infinitesimal movement (lead to ), and .

Fig. 8: Parametric analysis. Magnified view shows small radius regime.
Fig. 9: The xyz positions of the defender and intruder with three set of radii. UAV trajectories converge to first-order assumptions as increases.
Fig. 10: Comparative study of baseline and optimal strategy.

Fig. 8 shows the change in as a function of radius. is normalized by radius so it should be consistent in theory regardless of system scales. It can be seen that stays relatively small in the regime of small radius (e.g. ) and converges to the value derived from first-order assumptions as

increases. The error bar indicates that the standard deviation decreases as

increases. Fig. 9 confirms that small radius regime may result in non-smooth trajectories and the UAV trajectories converge to first-order assumptions as increases. We infer that in the small radius regime, UAV dynamics is sensitive to control inputs since the scale of robot is comparable to the scale of environment. The UAV may accelerate well to get up to the maximum speed so is lower and inconsistent with the outcome from the first-order assumptions.

In this way, the point particle assumption that agents move with desired velocity instantly is much relaxed. Accordingly, to reduce the discrepancy between theory and practice, a larger scale of environment relative to that of robot is preferred, and sensitivity analysis for the system would help the robot better obey the scale invariant assumptions.

V-B Comparative study of strategies

For this study, we compute for optimal and baseline strategies. Seven trials with different configuration are conducted to observe the performance discrepancy. An infinitesimal defender movement (lead to ), an infinitesimal intruder movement (lead to ), and are used to guarantee that the defender reaches to the base of hemisphere earlier than the intruder does.

Fig. 10 shows the variations of normalized by radius for the two strategies in different trials. Although the discrepancy is very small, it is worth noting that the baseline strategy sometimes outperforms the optimal strategy. This tendency supports the claim that there exists discrepancy between theory and practice and that the optimal strategy derived from first-order assumptions could perform worse than other strategies in high-order dynamics. This discrepancy is due to the delay in executing the optimal strategy because computing the optimal breaching point takes some time. As a reference, one cycle of the cross feedback system in Fig. 4 takes about 70 msec, and the optimal breaching point calculation takes the major portion of it, which is about 10 msec. To compensate for the delay, robots with higher computing power can be employed.

Vi Conclusion

This paper aims to apply the theory with first-order assumptions derived from the hemisphere perimeter defense game to robots with realistic models, and observe performance discrepancy in relaxing point particle assumptions. We study the transition from theoretical point particles to practical three-dimensional robots using UAV models in Gazebo. To evaluate the performance discrepancy, two analyses are conducted. In the parametric analysis, it is found that UAV trajectories converge to first-order assumptions as the radius of the hemisphere increases. The comparative study shows that the optimal strategy derived from first-order assumptions can perform worse than other strategies in high-order dynamics due to the delay in executing the strategy. To cope with this discrepancy, the future work will focus on changing the first-order model to better capture the high dynamics of the UAVs and design pursuit strategies for the new model.

References

  • [1] A. Alexopoulos, T. Schmidt, and E. Badreddin Cooperative pursue in pursuit-evasion games with unmanned aerial vehicles. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4538–4543. Cited by: §I.
  • [2] S. Arola and M. A. Akhloufi (2019) UAV pursuit-evasion using deep learning and search area proposal. In 2019 IEEE international conference on robotics and automation, Cited by: §I.
  • [3] S. W. Chen, G. V. Nardari, E. S. Lee, C. Qu, X. Liu, R. A. F. Romero, and V. Kumar (2020) SLOAM: semantic lidar odometry and mapping for forest inventory. IEEE Robotics and Automation Letters 5 (2), pp. 612–619. Cited by: §I.
  • [4] T. H. Chung, G. A. Hollinger, and V. Isler (2011) Search and pursuit-evasion in mobile robotics. Autonomous robots 31 (4), pp. 299. Cited by: §I.
  • [5] R. Isaacs (1999) Differential games: a mathematical theory with applications to warfare and pursuit, control and optimization. Courier Corporation. Cited by: §I, §III-A.
  • [6] E. S. Lee, G. Loianno, D. Thakur, and V. Kumar (2020) Experimental evaluation and characterization of radioactive source effects on robot visual localization and mapping. IEEE Robotics and Automation Letters 5 (2), pp. 3259–3266. Cited by: §I.
  • [7] E. S. Lee, D. Shishika, and V. Kumar (2020) Perimeter-defense game between aerial defender and ground intruder. In 2020 59th IEEE Conference on Decision and Control (CDC), pp. 1530–1536. Cited by: §I, §I, §II-A, §II-A, §II-C, §III-A, §III-C.
  • [8] S. Lee, D. Har, and D. Kum (2016) Drone-assisted disaster management: finding victims via infrared camera and lidar sensor fusion. In 2016 3rd Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE), pp. 84–89. Cited by: §I.
  • [9] L. Liang, F. Deng, Z. Peng, X. Li, and W. Zha (2019) A differential game for cooperative target defense. Automatica 102, pp. 58–71. Cited by: §I, §I, §III-A.
  • [10] D. Mellinger and V. Kumar (2011) Minimum snap trajectory generation and control for quadrotors. In 2011 IEEE international conference on robotics and automation, pp. 2520–2525. Cited by: §III-C.
  • [11] T. Nguyen, S. S. Shivakumar, I. D. Miller, J. Keller, E. S. Lee, A. Zhou, T. Özaslan, G. Loianno, J. H. Harwood, J. Wozencraft, et al. (2019) Mavnet: an effective semantic segmentation micro-network for mav-based tasks. IEEE Robotics and Automation Letters 4 (4), pp. 3908–3915. Cited by: §I.
  • [12] D. Shishika and V. Kumar (2018) Local-game decomposition for multiplayer perimeter-defense problem. In 2018 IEEE Conference on Decision and Control (CDC), pp. 2093–2100. Cited by: §I, §III-A.
  • [13] D. Shishika and V. Kumar (2020) A review of multi agent perimeter defense games. In

    International Conference on Decision and Game Theory for Security

    ,
    pp. 472–485. Cited by: §I, §III-A.
  • [14] R. Vidal, O. Shakernia, H. J. Kim, D. H. Shim, and S. Sastry (2002) Probabilistic pursuit-evasion games: theory, implementation, and experimental evaluation. IEEE transactions on robotics and automation 18 (5), pp. 662–669. Cited by: §I.
  • [15] R. Yan, Z. Shi, and Y. Zhong (2019) Construction of the barrier for reach-avoid differential games in three-dimensional space with four equal-speed players. In 2019 IEEE 58th Conference on Decision and Control (CDC), pp. 4067–4072. Cited by: §I.