A Virtual Motion Camouflage Approach for Cooperative Trajectory Planning of Multiple UCAVs
Abstract
This paper investigates cooperative trajectory planning of multiple unmanned combat aerial vehicles (multi-UCAV) in performing autonomous cooperative air-to-ground target attack missions. By integrating an approximate allowable attack region model, several constraint models, and a multicriterion objective function, the problem is formulated as a cooperative trajectory optimal control problem (CTOCP). Then, a virtual motion camouflage (VMC) for cooperative trajectory planning of multi-UCAV, combining with the differential flatness theory, Gauss pseudospectral method (GPM), and nonlinear programming, is designed to solve the CTOCP. In particular, the notion of the virtual time is introduced to the VMC problem formulation to handle the temporal cooperative constraints. The simulation experiments validate that the CTOCP can be effectively solved by the cooperative trajectory planning algorithm based on VMC which integrates the spatial and temporal constraints on the trajectory level, and the comparative experiments illustrate that VMC based algorithm is more efficient than GPM based direct collocation method in tackling the CTOCP.
1. Introduction
Nowadays, it is an active research area to perform autonomous cooperative air-to-ground target attack (CA/GTA) missions using multiple unmanned combat aerial vehicles (multi-UCAV) [1]. However, compared with single UCAV planning and coordinated formation control problems [2], new technical challenges in the CA/GTA missions are emerging. The cooperative trajectory planning is one of the key challenging technologies, due to its high dimensionality, severe equality and inequality constraints involved, and the requirement of spatial-temporal cooperation of multi-UCAV.
Recently, various algorithms have been developed to solve this cooperative trajectory planning problem [3, 4], including artificial neural network methods [5], sample-based planning methods [6], maneuver automation (MA) [7], and optimal control methods. There is no doubt that the optimal control theory is the most natural framework for this type of problem with dynamic constraints [8]. However, the rapid solution to optimal control problems (OCPs) for complicated nonlinear systems, such as UCAVs, is a challenging task [9]. Analytical solutions are seldom available or even possible. As a result, one usually resorts to numerical techniques [3]. The techniques can be classified into two general types, namely, indirect and direct methods. Indirect methods [10] solve the OCPs by formulating the first-order optimality conditions, applying Pontryagin’s Maximum Principle and nonlinear programming (NLP) to tackle the resulting two-point boundary value problem numerically. Direct methods [11], on the other hand, are devoted to reduce the OCPs to finite-dimensional NLP problems by discretization and parameterization of subsets of the state and control vectors and then are solved by developed optimizers. As one of direct methods, Gauss pseudospectral method (GPM) is applied widely and efficiently in the trajectory optimization field [12–14], due to its advantages of fewer parameters and higher precision in the calculation procedure. However, the method to achieve the trajectory planning for a single vehicle or cooperative multivehicle required a high computational load. To reduce the computational complexity, the inverse dynamics methods [15, 16] and differential flatness theory (DFT) based methods [17–20] are introduced. Compared to pseudospectral methods, these methods can use any models and any performance indexes and transform the OCP into a low dimensional NLP problem. However, they are still time-consuming. Particularly, when the number of the vehicles is too large, the computational cost will be unacceptable. For this purpose, the virtual motion camouflage (VMC) algorithm [21–23] is introduced into this work. Inspired by the biological motion known as the motion camouflage, the VMC algorithm can dramatically reduce the problem dimension and the computational cost. Some numerical simulations [22–24] suggest that its computational speed could be much faster than the pseudospectral methods. Motivated by these advantages, this paper employs virtual motion camouflage approach to develop cooperative trajectory planning algorithms.
For the cooperative trajectory planning of multi-UCAV, the objective is to generate dynamically feasible trajectories for UCAVs, which can guide them to the goals in the shortest time or distance, without collisions with obstacles or each other. To date, a number of theories and techniques have been developed to accomplish the cooperative tasks, including several collision avoidance techniques and time adjustment strategies [17, 25–30]. Bollino and Lewis [25] addressed the optimal path planning of UAVs in obstacle-rich environments and proposed a collision-free path planning for multi-UAV using optimal control techniques and pseudospectral methods. Kuwata and How [26] presented a cooperative distributed robust trajectory optimization approach, using RH-MILP with independent dynamics but coupled objectives and hard constraints. To improve the convergence rate, the virtual motion camouflage method was applied to the cooperative electronic combat air vehicles (ECAVs) [27] and unmanned aerial vehicle (UAV) formation control [28]. Reference [27] described how an interesting bioinspired motion strategy can be used to design real-time trajectories for cooperative electronic combat air vehicles. The constant speed motion camouflage law was developed to derive the feasible condition of a constant speed ECAVs and PT coherent mission, and its feasibility conditions were found. Reference [28] proposed a divide and conquer hierarchical approach in three levels to solve the UAV formation flight trajectory plan problem considering dynamics, state, and control variable inequality and equality constraints. These approaches mentioned above could generate collision-free trajectories, but the temporal cooperation was ignored. In contrast, Lian [17] introduced a differential flatness theory (DFT) based method to optimally formulate the cooperative path planning for multiagent dynamical systems considering spatial and temporal constraints. McLain and Beard [29] proposed the coordination variables (CV) and coordination function (CF) based strategy to achieve cooperative timing among teams of vehicles by coordinating the velocity and path length of each vehicle. But these approaches failed to deal with the dynamics constraints of vehicles, and the generated paths were not always flyable and smooth. Kaminer et al. [30] presented a general framework for coordinated control problem of multiple autonomous agents. On the basis of decoupling space and time in the problem formulation, it reduced the number of optimization parameters and made it easy to implement optimization in real time. However, the time coordination lays in the path following and the design of control laws in path following algorithms was much more difficult. Although the previous investigations have described several valuable strategies in the cooperative path planning, these methods cannot tackle the point-to-region cooperative trajectory planning for CA/GTA missions directly, which needs to integrate both the spatial and temporal constraints on the trajectory level.
To address the problems mentioned previously, a novel cooperative trajectory planning method for multi-UCAV in performing the CA/GTA missions is presented in this paper. Firstly, some constraints including individual and cooperative constraints are modeled. Particularly, an approximate allowable attack region is built for the critical terminal constraints. Then, after the multicriterion objective function is constructed, the cooperative trajectory planning problem is formulated as a cooperative trajectory optimal control problem (CTOCP). Owing to the temporal constraints, a notion of the virtual time is introduced and the VMC problem is reformulated in the virtual time domain. Inspired by VMC, DFT, and GPM algorithms, a new cooperative trajectory planning algorithm in an optimal control framework is proposed. The proposed approach is demonstrated by two typical CA/GTA examples, and the comparative experiments between GPM and VMC are carried out. The results show that the proposed approach is feasible, effective, and efficient.
The rest of the paper is organized as follows. In Section 2, the integrated model is set up. In Section 3, the problem is formulated as a CTOCP followed by a multicriteria objective function. Section 4 develops a novel cooperative trajectory planning algorithm based on virtual motion camouflage in virtual time domain, which can solve the problem efficiently. Section 5 presents several numerical examples, and finally the conclusions and future works are outlined in Section 6.
2. Modeling
2.1. Aircraft Model
2.2. Allowable Attack Region Model
2.3. No-Fly Zone Model
In the battlefield environment, the no-fly zone (NFZ) is an area where vehicles are not permitted to fly over, due to the presence of military restrictions (e.g., armed enemies and the missile killing range), the physical obstacles (e.g., mountains and buildings within the natural and urban environment, resp.), and civil restrictions mainly due to safety reasons (e.g., densely populated areas and severe weather condition zones).

2.4. Cooperative Constraint Model
In order to make the UCAVs arrive in the AARs of targets safely in an expected time sequence, cooperative trajectories should satisfy the following constraints.
2.4.1. Spatial Constraints
2.4.2. Temporal Constraints
3. Problem Formulation
3.1. Objective Function
As can be seen, the individual objective function Jj is defined by the weighted sum of the three separate running cost terms with appropriate weighting factors , , and . The three criteria (13)–(15) represent different physical meanings, and they are difficult to be compared directly. Hence the selection of these weighting factors is a skilled technique. In the experimentations, the multi-objective fuzzy optimization method proposed by Wang et al. [37] is employed. The method includes two main steps: (a) normalizing each single objective function and (b) solving the membership function about each criterion by using fuzzy distribution function.
3.2. Cooperative Trajectory Planning Problem Formulation
As stated above, to obtain optimal or suboptimal cooperative trajectories, the cooperative trajectory planning for the CA/GTA missions can be formulated as a cooperative trajectory optimal control problem (CTOCP).
Problem 1 (CTOCP). Find the trajectories, which drive the system from given initial conditions to desired final conditions over time horizons [t0, tf], while the cooperative objective function is minimized as
4. Virtual Motion Camouflage Based Cooperative Trajectory Planning
To tackle the cooperative trajectory planning for CA/GTA missions, an efficient virtual motion camouflage based planning method is proposed for numerically solving the CTOCP, which combines with the benefits of several other classical trajectory generation techniques, including DFT, GPM, and NLP. Correspondingly, this task contains three primary steps. The first is to determine outputs by VMC and DFT in the virtual time domain, such that the dynamics system can be mapped to a lower-dimensional output space. Then, GPM is used to discretize the outputs. Finally, the system is transformed into a NLP problem and the sequential quadratic programming (SQP) is used to solve the NLP to minimize the cost and subject to constraints in output space.
4.1. Virtual Motion Camouflage and Problem Formulation in the Output Space
4.1.1. Virtual Motion Camouflage
From [23], it can be known that when the proposed VMC suboptimal trajectory design method is used, the complexity of the problem can be reduced by (nsv + mcv) times from the collocation method or nsv times from the differential inclusion, where nsv and mcv are the numbers of the state variables and the control variables. In VMC framework, if the dimension of the position vector xa(t) is one (i.e., na = 1), the dimension of the problem is the same as that of the differential inclusion method. Otherwise, when na > 1, the reference point can be fixed or regarded as a parameter to be optimized, whereas VPT can be predefined or predetermined by the initial and terminal conditions of the aggressor trajectory. Thus, the dimension of the problem is lowered from na to one, such that the solving speed can increase.
4.1.2. VMC Problem Formulation in Virtual Time Domain
Hence, the Problem 1 can be reformulated in virtual time domain (VTD) as follows.
Problem 2 (CTOCP-VTD). Minimize the cooperative objective function (19) of all UCAVs represented with respect to the new independent variable as
In the VMC framework, the trajectory position ρa can be defined as the aggressor position vector. As described above, the state and the control variables of the dynamics system can be recovered from the flat outputs. Hence, according to (22), these variables are also functions of the PCP, the predefined virtual prey motion, the reference point, and their corresponding derivatives. Once the predefined virtual prey motion and the reference point are selected, the system of cooperative trajectory planning of UCAVs, including the objective function and the constraints, is mapped to a lower-dimensional output space (here, the dimension is one). Obviously, the dynamic constraints of this system (27) can be automatically satisfied. Therefore, the Problem 2 can be redefined as follows.
Problem 3 (CTOCP-VTD in the output space). Consider
4.2. Collocation Using Gauss Pseudospectral Method
In order to solve the Problem 3 through a NLP algorithm, the PCP history should be discretized into c = 0,1, …, Nc, Nc + 1 nodes, with and . In this paper, GPM is selected as the discretization method [12], which is an orthogonal collection method where the collocation points are the Legendre-Gauss (LG) points. Similar to other pseudospectral methods, a finite basis of global interpolating polynomials is used to approximate the state and control space at a set of discretization nodes in the GPM and then the optimal control equations can be transformed into nonlinear algebra equations, such that the OCP can be solved by a NLP algorithm.
As described above, the continuous OCP can be transformed to the discretized NLP problem. The parameters to be optimized are the PCP nodes . When the prey motion ρp,c is equal to the aggressor motion ρa,c for c = 0, Nc + 1, one can assume that , so the boundary conditions are no longer considered. And the discretized NLP problem can be denoted as follows.
Problem 4 (CTP-NLP). Its standard form is denoted as
Then the resulting Problem 4 can be solved through well-developed algorithms, such as the SQP algorithm. In this paper, TOMLAB/SNOPT software toolbox is chosen due to its advantages on solution effectiveness for the large-scale NLP problems [40].
4.3. Convergence and Initial Guess
As analyzed in [41], the GPM exhibits global convergence properties in many applications. In addition, the convergence of the virtual motion camouflage has been proved by Xu et al. [21, 23], in which [23] showed that if the problem was only one-dimensional, the VMC based trajectory design method could converge to the optimal solution, but when the problem was multi-dimensional and highly nonlinear, the result might be suboptimal in the full solution space, and [21] showed that the VMC framework together with the pseudospectral discretization method could converge to the optimal solution.
By Introducing elastic programming concept and some advances in both mathematical programming techniques and pseudospectral methods, Ross and Gong [42] designed a guess-free trajectory optimization algorithm. These advances have the effect of eliminating the guessing problem in the trajectory optimization, which has been used in the current GPM algorithm. In the proposed VMC method, the motion of the virtual prey and a reference point are guessed. However, both of them have direct physical relation to the problem. Typically the virtual prey motion is selected according to the boundary conditions and a proper guess is not a difficult task [23]. In this paper, the trivial guess for the reference point can be the initial point (IP), and an initial guess of vc = 1 is chosen for c = 1, …, Nc. The virtual prey motion is defined as a 2-order curve, determined by the initial and terminal conditions.
5. Numerical Examples
The basic ideas presented in this paper are illustrated in the following three examples. The specific vehicle platform used in simulations is the Storm shadow UCAV. Its relevant parameters are all taken from [43], summarized in Table 1.
Item | Minimum value | Maximum value |
---|---|---|
Flight altitude (m) | zmin = 200 | zmax = 8 000 |
Airspeed (m/s) | Vmin = 60 | Vmax = 300 |
Flight-path angle (deg) | γmin = −89 | γmax = 89 |
Roll angle (deg) | μmin = −80 | μmax = 80 |
Tangential load factor (g) | nx,min = −0.725 | nx,max = 0.91 |
Normal load factor (g) | nz,min = −3.2 | nz,max = 8 |
Roll angle rate (deg/s) | ||
Rate of change of tangential load factor (g/s) | ||
Rate of change of normal load factor (g/s) |
The experimental test environment is a rectangle area of 30 × 40 km2, as shown in Figure 2, where the NFZ1 caused by the severe weather condition is modeled as a cuboid with infinite altitude, and the NFZs caused by enemy threats are modeled as two hemispheres, denoted as THT1 and THT2. According to Figure 1, the shape parameters can be set as follows: for the NFZ1, k1 = k2 = k3 = k4 = 1, p = 2, and the length of a side is set as 8 km; for THT1 and THT2, k1 = k2 = k4 = 1, k3 = 2, p = 6, and the radius of the threats can be set as 7 km and 9 km, respectively. In order to avoid contact, a safe buffer bs (bs = 200 m) is added around the outside edge of each obstacle/threat boundary. All the results presented below are generated using TOMLAB/SNOPT software toolbox on a 2.4 GHz CPU and 2 G RAM computer running Windows XP and MATLAB R2009b. Table 2 summarizes the parameters used in the algorithm. To simplify the problem, the same weight coefficients for the objective function of each UCAV are set, and it is assumed that the target assignment is already completed before.
Item | Parameter | Value |
---|---|---|
Minimum safety radius (m) | dsafe | 500 |
Weighting factors of the objective function | wt | 0.3 |
wp | 0.4 | |
wr | 0.3 | |
Weighting factors of the robust subobjective | 0.5 | |
0.3 | ||
0.2 | ||
Safe buffer of NFZs (m) | bs | 200 |

5.1. Example 1: Cooperative Trajectories of Two UCAVs Arriving Simultaneously
In this example, two UCAVs cooperatively attack two stationary ground targets while avoiding a series of static obstacles/threats detected and collision with each other en route and meeting aircraft dynamical constraints, especially simultaneous arrival constraint. UCAV1 and UCAV2 start at each initial point, that is, IP1 (10, 2, 2) km and IP2 (17, 2, 2) km. Then they fly into the AARs of two targets, TAR1 (4, 34, 0) km and TAR2 (13, 40, 0) km, respectively. The initial remaining three states and control inputs of each UCAV are preset as , , and , and the terminal remaining three states of each UCAV are predetermined as and . In VMC framework, the reference point ρref for each UCAV is presumed as its IP, and the virtual prey path is predefined as a 2-order curve, determined by the initial and terminal conditions.
Figure 2 shows the overall collision-free attack trajectories of two UCAVs, whose total flight time is equal, equaling to 124 s. It means that two UCAVs arrive simultaneously. As can be seen from Figure 2, along the resulting smooth trajectories, the UCAVs can avoid collision with all obstacles, threats, and the other en route and then successfully fly into the AARs (fuchsin areas) to perform weapon delivery missions. In addition, the approximate weapon trajectories are drawn to illustrate the attack process. The PCP histories of two UCAVs are shown in Figure 3, which are smooth and lie in the range from 0 to 1. And the time histories of the UCAVs’ states (V, γ, ψ) and control inputs (μ, nx, nz) are shown in Figure 4. It is clear that the constraints on these variables, especially the cooperative constraints, are all satisfied (see Tables 1 and 2), which means that the resulting trajectories are feasible and safe.


5.2. Example 2: Cooperative Trajectories of Multi-UCAV Arriving in Sequence
In this example, three UCAVs attack two stationary ground targets cooperatively. The only additional requirement is that the UCAVs arrive at their AARs in sequence rather than simultaneously, and the intervals between UCAVs are set to 20 s and 30 s, respectively, denoted as Δ12 = 20 s and Δ23 = 30 s. The coordinates of the initial points IP1, IP2 and two targets TAR1, TAR2 are the same with those in Example 1, and the third initial point is IP3 (4, 2, 2) km. The result of the previously finished target assignment is that the UCAV1 and UCAV2 attack TAR1 and UCAV3 attacks the other one. The initial remaining three states and control inputs of the UCAV3 are preset as and , and the terminal remaining three states of the UCAV3 are predetermined as . In VMC framework, the reference point and the virtual prey path are predetermined, using the same strategy as the one in Example 1.
The overall collision-free attack trajectories of multi-UCAV are shown in Figure 5, and the total flight time of three UCAVs is 100 s, 120 s, and 150 s, respectively. It can be clearly demonstrated that UCAVs can avoid all obstacles or threats and successfully fly into the AARs in sequence to perform the weapon delivery. The PCP histories of three UCAVs are shown in Figure 6. And Figure 7 shows the distance between each pair of UCAVs. From it, one can find that the minimum distance between each UCAV is more than the minimal safety radius of dsafe (dsafe = 500 m); that is, the UCAVs can avoid collision with each other en route. Figure 8 shows the time histories of the UCAVs’ states (V, γ, ψ) and control inputs (μ, nx, nz). Obviously, the resulting trajectories are feasible and safe, since all the constraints listed in Tables 1 and 2 are satisfied.




In order to validate the convergence rate of VMC based algorithm in cooperative trajectory planning for multi-UCAV further, the comparative experiments are carried out with different numbers of UCAVs (Nv is set as 1, 2, 3, 5, 8, and 10, resp.) and the same number of nodes (Nc = 20). The performance comparison of multi-UCAV is summarized in Table 3. As can be seen clearly, along with the increase of the number of UCAVs, the average planning time is rising at a slow rate, which is acceptable for the preplanning applications. Particularly, for one UCAV, the average planning time is 2.21 s. When the number of UCAVs increases to 10, the average planning time increases by only more than quadruple of that for one UCAV, which is 9.62 s.
Number of UCAVs (Nv) | Number of optimization variables | Average planning time (s) | Ratio of the planning time (%) |
---|---|---|---|
1 | (20 × 1 + 1) × 1 = 21 | 2.21 | — |
2 | (20 × 1 + 1) × 2 = 42 | 2.83 | 128.05 |
3 | (20 × 1 + 1) × 3 = 63 | 3.32 | 150.23 |
5 | (20 × 1 + 1) × 5 = 105 | 4.93 | 223.08 |
8 | (20 × 1 + 1) × 8 = 168 | 7.70 | 348.42 |
10 | (20 × 1 + 1) × 10 = 210 | 9.62 | 435.29 |
5.3. Comparison between VMC Based Method and GPM Based Direct Collocation Method
To further evaluate the performance of the proposed algorithm, a recently developed optimal trajectory generation method, GPM based direct collocation method, is used for comparison, which is basically converting the original optimal control problem to a NLP problem via the GPM directly. In the following comparative experiments, Example 1 is simulated again using the two algorithms, respectively, performed on the same desktop with different numbers of nodes (Nc is set as 10, 15, 20, and 30, resp.). All the results are presented below.
Figures 9 and 10 show the comparative solutions attained by using the GPM and VMC based methods, respectively. It can be seen that the trajectories and the state and control time histories of these two algorithms are similar. The detailed performance comparison is summarized in Table 4. With comparison, when the number of nodes increases, the number of optimization variables in GPM is much more than VMC. As a result, the computational speed of VMC is more than an order of magnitude faster than GPM, while the optimization performance has only small loss.
Method | Nc | Number of optimization variables | Objective function value | Average planning time (s) | Ratio of the planning time (%) |
---|---|---|---|---|---|
GPM | 10 | (10 × 9 + 1) × 2 = 182 | 1.776 | 12.31 | — |
15 | (15 × 9 + 1) × 2 = 272 | 1.731 | 16.28 | — | |
20 | (20 × 9 + 1) × 2 = 362 | 1.698 | 20.31 | — | |
30 | (30 × 9 + 1) × 2 = 542 | 1.705 | 28.67 | — | |
VMC | 10 | (10 × 1 + 1) × 2 = 22 | 1.851 | 2.23 | 18.12 |
15 | (15 × 1 + 1) × 2 = 32 | 1.840 | 2.46 | 15.11 | |
20 | (20 × 1 + 1) × 2 = 42 | 1.769 | 2.83 | 13.93 | |
30 | (30 × 1 + 1) × 2 = 62 | 1.731 | 3.18 | 11.09 |


In order to compare the convergence rate and analyze the impacts of the number of UCAVs on the computation time between VMC and GPM further, the comparative experiment is run multiple times with different numbers of UCAVs (Nv is set as an integer from 1 to 10, resp.) and the same number of nodes (Nc = 20). Figure 11 shows the change of the average planning time required to solve the optimization problem as the number of UCAVs increases. It can be seen that the increase of the number of UCAVs results in an exponential increase of the required computation time for GPM. In contrast, the computation time for VMC based algorithm increases slowly. When the number of UCAVs increases to 10, the number of the optimization variables for GPM is up to 1810. However, for VMC, the number is only 210. Since the required computation time strongly depends on the number of optimization variables, the computational time required by VMC is more than an order of magnitude less than that required by GPM. Maybe it can be concluded that the larger the number of UCAVs is, the more obvious the advantage of VMC to GPM is.

6. Conclusions
This paper is devoted to explore the cooperative collision-free trajectory planning for multiple UCAVs in performing the CA/GTA missions. The main contributions of the paper are as follows. Firstly, the cooperative trajectory planning problem under consideration is mathematically formulated as a cooperative trajectory optimal control problem (CTOCP). In order to consider the weapon delivery constraints, an approximate allowable attack region model is proposed and integrated into the problem formulation. Secondly, a particular virtual motion camouflage approach combining with differential flatness theory, Gauss pseudospectral method, and nonlinear programming is used to develop the trajectory planner for solving CTOCP. The advantages of this method are that it can automatically satisfy all boundary conditions and use any models and any performance indexes, does not require numerical integration of differentiation of the state equations, and transforms the OCP into a NLP problem of very low dimension. Finally, in order to handle the temporal constraints, the notion of the virtual time is introduced into the virtual motion camouflage approach to realize the spatial-temporal cooperation.
The proposed approach is validated by numerical examples. The results show that the approach is able to generate both feasible and near-optimal attack trajectories with meeting the spatial and temporal constraints very efficiently. Moreover, the convergence rate and average planning time of the method and optimality of the generated trajectories are evaluated via a detailed comparison with GPM based direct collocation method. The results show that the computational speed of virtual motion camouflage approach is more than an order of magnitude faster than GPM, at small loss of optimality.
For the future work, we will analyze some uncertain factors in the true battlefield environment and carry out the research on the real-time cooperative trajectory planning. For the presence of a larger number of UCAVs, we will make further efforts to exploit more efficient trajectory planning algorithm and improve the time cooperative strategy. Moreover, we will try to study another important aspect about how to plan in the opposability battlefield environment.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
Great thanks are due to the editor for helpful comments and the reviewers for their valuable suggestions and comments on the revision of the paper. This work was supported by the Science and Technology on Electro-Optic Control Laboratory of China (no. 20125188006).