International Journal of Aerospace Engineering

Volume 2024, Issue 1 6643424

Research Article

Open Access

Heterogeneous UAV Swarm Collaborative Search Mission Path Optimization Scheme for Dynamic Targets

Kaixin Cheng,

Kaixin Cheng

orcid.org/0000-0001-8398-4655

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Tao Hu,

Corresponding Author

Tao Hu

[email protected]

orcid.org/0000-0002-7712-5851

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Di Wu,

Di Wu

orcid.org/0000-0001-8473-1553

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Tingli Li,

Tingli Li

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Shu Wang,

Shu Wang

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Kaiyue Liu,

Kaiyue Liu

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Zhifu Tian,

Zhifu Tian

orcid.org/0000-0002-6592-9844

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Dong Yi,

Dong Yi

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Kaixin Cheng,

Kaixin Cheng

orcid.org/0000-0001-8398-4655

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Tao Hu,

Corresponding Author

Tao Hu

[email protected]

orcid.org/0000-0002-7712-5851

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Di Wu,

Di Wu

orcid.org/0000-0001-8473-1553

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Tingli Li,

Tingli Li

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Shu Wang,

Shu Wang

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Kaiyue Liu,

Kaiyue Liu

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Zhifu Tian,

Zhifu Tian

orcid.org/0000-0002-6592-9844

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

Dong Yi,

Dong Yi

College of Data Target Engineering Strategic Support Force Information Engineering University Zhengzhou 450001 China

Search for more papers by this author

First published: 05 June 2024

https://doi.org/10.1155/2024/6643424

Academic Editor: Guillermo Valencia-Palomo

Share a link

Email
Wechat
Bluesky

Abstract

To solve the problem of collaborative search mission planning (CSMP) for a heterogeneous unmanned aerial vehicle (UAV) swarm for dynamic targets, this paper proposes a new CSMP scheme for a heterogeneous UAV swarm. First, a new polar coordinate system motion method is used to establish the UAV motion model and decision input solution model, which effectively solves the path-unified coding problem of a heterogeneous UAV swarm. Then, in the mission area, a dynamically updated multisearch situation map model is established. Finally, to improve the global searching capability of a UAV, an improved path optimization algorithm PSO-Rolling Horizon Control (PSO-RHC) is designed. Multiple Monte Carlo simulations are performed for three time-sensitive moving target types and three constraint types. The simulation results show that the task execution efficiency indexes of the proposed scheme for the decision input solution model, pheromone update mechanism, and optimization algorithm are improved by 188%, 72%, and 102%, respectively, and the overall performance is improved by 227%.

1. Introduction

Unmanned aerial vehicles are increasingly used in modern air combat to enhance the combat capabilities of manned aircraft [1]. Due to limited capacity and computing power, drones cannot obtain enough resources to complete tasks, and drones themselves do not have enough power to complete complex tasks. Efficient collaboration among multiple UAVs can effectively improve the task execution efficiency of a UAV swarm. Decentralization, autonomy, and robustness of UAV swarm cooperative control similar to the distributed and self-organizing properties of biological groups such as ant colonies, bee colonies, bird swarms, and fish swarms are required [2]. The UAV swarm is an autonomous system inspired by the self-organization mechanism of biological groups, with complementary functions and cooperative operations, which greatly improves execution efficiency. Therefore, cooperative control techniques of UAV swarms have been presented and widely investigated.

The multi-UAV cooperative mission planning problem (MCMPP) is a somewhat complicated and challenging NP-H optimization problem. The mission planning optimization algorithm optimizes a UAV swarm flight path with the constraints of a mission environment to improve the coordination efficiency and task execution efficiency of the UAV swarm. Due to the large scale of the UAV population, its computing and communication volumes are increasing exponentially, making it difficult to centralize decision-making within a useful time frame. Therefore, distributed decision-making schemes have been proposed for a variety of applications, such as swarm flocking [3], reconnaissance [4], surveillance [5], cargo transportation [6], search [7], assignment of tasks, and planning of routes [8–10].

At present, the MCMPP for a homogeneous UAV swarm has been extensively studied. However, in practical real-time search, rescue, and security systems, there is a clear trend that many different types of autonomous drones perform different functions, and each type is specialized and efficient to perform specific tasks. Autonomous UAVs with various architectures and characteristics provide strong teamwork capabilities while controlling system complexity and cost and are widely used in military and civilian fields. Heterogeneous UAVs can adapt to changing mission requirements during joint mission execution. Compared with similar drones, it can significantly improve system performance and reduce integration and maintenance costs [11]. In contrast to a homogeneous UAV swarm, UAV performances in a heterogeneous UAV swarm, including maneuverability, sensor performance, and task execution performance, are usually heterogeneous. The multifunctional heterogeneous UAV cooperative mission planning problem (MFHCMPP) has NP-hard complexity in path optimization [12], and its heterogeneity makes it difficult to find the optimal solution. To solve the MFHCMPP, in reference [13], the authors considered a sensor heterogeneous UAV swarm and the task scenarios with timing constraints and functional constraints. A multigroup fruit fly optimization algorithm (MFOA) was proposed to solve the MFHCMPP. In the simulation experiment, the 3D path planning results of the UAV swarm were generated based on the criterion of minimizing the completion time and the total task time at the same time, which verified the effectiveness of the method. In reference [14], the authors considered a heterogeneous UAV swarm with interchangeable recording sensors and established a mathematical model that integrated the sensor allocation and path planning problems. A comprehensive mission planning framework based on a two-level adaptive variable neighborhood search algorithm was proposed. The framework had high flexibility to solve complex NP-H optimization problems. In reference [15], a heterogeneous unmanned vehicle system, including a UAV, unmanned ground vehicle (UGV), unmanned surface vessel (USV), and other unmanned vehicles, was established to solve the problem of the search and track cooperative path planning for underwater targets. Then, in order to reduce the computational complexity of the combined search space between unmanned vehicles, a heterogeneous system framework, cooperative strategy, and path planning algorithm were developed, and the communication mechanism and information flow between unmanned vehicles were defined.

In the research field of the MCMPP, from the perspective of the task type, the research on path planning for static tasks is relatively mature. The main classical models include the multitrip salesman model (MTSP), mixed linear integer programming model (MILP), and vehicle scheduling and path planning model (VRP). These models are solved with heuristic algorithms [16] such as genetic algorithms (GAs) [17, 18], evolutionary algorithms (EAs) [19, 20], ant colony optimization (ACO) [21–24], and particle swarm optimization (PSO) [4, 25, 26].

Path planning for dynamic tasks [27, 28] is another research field of the task type in the MCMPP relative to static tasks. The cooperative search task of a UAV swarm for dynamic targets is a type of dynamic task. Due to the unknown location of the mission target, the path-planning method based on the static model cannot be directly applied. Therefore, in order to solve the cooperative search problem of a UAV swarm for dynamic targets, target probability map (TPM) [29–34], digital pheromone map (DPM) [35–38], and environment map [39] methods are widely used to help a UAV swarm perceive the surrounding environment. A TPM is a probability distribution model that is used to describe the probability distribution of unknown search targets in the given mission area. In reference [33], an improved multi-TPM model for multitype targets was proposed by establishing a TPM model that distinguished target types. At the same time, the TPM model was able to update the target probability distribution by adding time parameters in the model over time. To a certain extent, the algorithm solved the problem of the cooperative mission planning of UAV swarms for dynamic targets. However, the problem with this algorithm was that the model relied heavily on the prior information of the target, and the target type could not be modified during task execution. In reference [34], the two-dimensional normal random distribution was used to describe the initial TPM, and the authors designed a TPM update method during the task. However, the definition of the target’s prior information and the target’s motion type in the literature was not sufficient, and the definition of heterogeneous sensors in the UAV swarm was not considered. In this research, inspired by the ant colony pheromone indirect communication collaboration method, the DPM, which has physical characteristics such as secretion, propagation, and volatilization of bio pheromones, is established and updated in the established mission area to describe the search state of the UAV cluster for the mission area. In reference [33], the return visit of the UAV swarm to the searched area was implemented by using three operations of pheromone concentration volatilization, residue, and secretion to update the pheromone map. However, the problem with this method is that the oversimplified method of constant increment and threshold limitation was used to secrete pheromones. This problem leads to the low efficiency of a UAV’s return search. In reference [35], two types of pheromones, attraction and repulsion, were used to mark search areas that had not been visited for a long time and had been recently visited, respectively. Through this pheromone guidance mechanism, the selective search of the search area and the nonsearch area was realized, and the phenomenon of a UAV flying out of the search area could be avoided. However, the problem with this method was that the above two types of pheromone coordination and update algorithms needed to be optimized. Under special numerical conditions, the pheromone guidance mechanism would be wrong.

To improve the optimization efficiency of the MCMPP, a mathematical model is usually established to describe the flight path of a UAV swarm. The mathematical model can encode the UAV flight path to facilitate the heuristic optimization algorithm. This type of mathematical model is called a decision input solution model [33]. In the study of the MCMPP for dynamic targets, the existing literature [33–35, 39–43] often uses the method of a geometric model. The method based on the geometric model focuses on describing the necessary constraints and seeks feasible and safe paths that meet the preknown flight environment through the establishment of mathematical models to solve the problem. Almost all graph-based exploration methods, such as the Grid method [33–35, 39], A ^∗ [40], Phi ^∗ [41], Dijkstra search method [42], and Voronoi graph method [41], have been classified as model-based methods. The algorithm based on the geometric model is easy to implement and can efficiently provide the correct path for each drone through the static roadmap so that all obstacles and threats can be detected early. For example, in reference [41], the incremental Phi ^∗ method is proposed to solve the path planning problem of UAVs at any angle on the grid, which can obtain the shortest path close to the optimal. In references [33–35, 39], the Grid method, a UAV motion model that discretizes the direction of UAV motion, was used as a decision input solution model. To simplify the calculation, the decision input solution was defined as three grid points at different distances near the current motion direction of the UAV. Geometric model-based methods have shown significant advantages in reducing computational complexity and increasing practical efficiency, but given the complete availability of global environmental information, the scope of these methods is limited. Furthermore, the geometric models of these methods may yield suboptimal results in terms of accuracy and practicality due to the performance limitations of the geometric models. First, the high maneuverability of a UAV is limited by this method. Since the flight path of a UAV is connected by discrete grid points, the UAV can only move in the given direction, which greatly limits the flight freedom of the UAV. This problem causes the UAV path generated by the optimization algorithm to meander, which is inconsistent with the actual flight trajectory of a UAV. Second, this method is not suitable for a uniform UAV swarm. The research on UAV flight speed and energy consumption modeling in reference [44] shows that a UAV has higher energy utilization efficiency and a longer flight distance when it moves at a constant speed. Therefore, in the cooperative mission planning problem of a UAV swarm, it is usually assumed that the UAV is moving at a constant speed. However, in the Grid method, the motion distance of the UAV depends on the distance between the changing grids. Third, this method is not suitable for heterogeneous UAV swarm. The maneuverability factors of different types of UAVs, such as speed, step size, minimum turning radius, and maximum heading angle, are usually different. This makes it impossible to use the same size grid to describe the path of different types of drones. Therefore, the current Grid method has some problems and cannot be used as an appropriate input solution model to solve the collaborative search mission of a heterogeneous UAV swarm for dynamic targets.

In summary, heterogeneous UAV swarm cooperative search mission planning for dynamic targets is a very complicated and challenging NP-hard optimization problem. This problem is an assembly of three complicated problems: dynamic target search path planning, a heterogeneous UAV swarm, and mission planning optimization, which need to be solved by a new system architecture and each module algorithm. The goal of this research is to design a heterogeneous UAV swarm cooperative search mission planning scheme for dynamic targets. Distinct from the results in the literature, the main contributions of this research are as follows:

1.
Based on the polar coordinate system motion method, an improved UAV motion model and a decision input solution model are established. Therefore, the high maneuverability of each heterogeneous UAV can be fully released in path optimization.
2.
A dynamic updating multisearch situation map model is established, including an environmental search map, an environmental pheromone map, and a TPM. A numerical update algorithm based on a normal distribution integral is used to update the pheromone concentration in the environmental pheromone map. The multisearch situation graph model improves the search performance of UAVs and the task information sharing performance among UAVs in the UAV swarm.
3.
The improved path optimization algorithm PSO-Rolling Horizon Control (PSO-RHC) is designed. Based on the traditional PSO algorithm, the RHC method is used to improve the global optimization performance. In addition, the optimal solution rolling mechanism is designed to improve the convergence speed of the PSO-RHC algorithm. Compared with the traditional heuristic algorithm, the UAV swarm path obtained with the PSO-RHC algorithm is smoother and the optimization performance index is higher.
4.
Three types of motion and time-sensitive targets that distinguish known partial motion state information are considered, including unknown static targets, unknown dynamic targets, and dynamic targets with known partial prior information. In the task planning problem, heterogeneous UAV maneuverability constraints, sensor constraints, and flight energy constraints are considered.

The rest of this article is organized as follows. The second section describes the cooperative mission planning problem of a heterogeneous UAV swarm and establishes an improved UAV motion model, a dynamically updated multisearch situation graph model, and a collaborative search mission optimization model. In the third section, the PSO-RHC algorithm is proposed to achieve the cooperative search path optimization of a UAV swarm. In the fourth section, a Monte Carlo simulation is carried out to verify the effectiveness of the proposed method. Finally, the fifth section offers the conclusions.

2. Modeling of the Cooperative Mission Planning Problem for a Heterogeneous UAV Swarm

In this section, the heterogeneous UAV model, multisearch situation map model, and collaborative search mission optimization model are established.

2.1. Heterogeneous UAV Model

The UAV swarm considered in this research is composed of heterogeneous fixed-wing UAVs; that is, each UAV has different functional and performance constraints, including different maximum turning angles, different motion speeds, and different target detection performance. The mission area S ∈ R² is assigned, in which there are n ∈ N multitype UAVs and i ∈ I unknown targets. In the task, three types of motion and time-sensitive targets with known partial motion state information are considered, including unknown static targets, unknown dynamic targets, and dynamic targets with known partial prior information. Among these targets, the dynamic target with known partial prior information refers to the fact that the trajectory of the target has been observed for a long time before the start of the mission, and some position data for the target is obtained by sampling as the prior information of the mission. The mission of the UAV group is to search for moving targets in an unknown environment through an effective decision-making method based on limited prior information. The total mission time is k_max.

Taking two-dimensional space as an example, as shown in Figure 1, the mission area is discretized into a grid map with the size of Lx × Ly, and the length and width of each grid are recorded as grid_x and grid_y, respectively. As shown in Figure 1, the gray sector represents the optional path of n at the next moment with the maneuver constraint and the possible position of n at the next moment is any point on the arc l_n(k). The orange dotted line represents the motion path of the UAV n. In the two-dimensional space, the UAV n is considered as a particle in uniform motion, and its motion state at k is determined by three variables: heading angle θ_n(k), turning angle φ_n(k), and motion step size path_n. Among them, the heading angle θ_n(k) refers to the included angle between the motion direction vector

and the positive direction of the horizontal coordinate axis. The turning angle φ_n(k) refers to the included angle between

and

. The motion step size refers to the fixed displacement distance with the fixed motion speed

and motion time T. To simplify the UAV motion model, the specific turning process of the UAV is not considered. The motion model of a UAV based on the polar coordinate system is established, as shown in Equation (1).

()

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Mission area model and UAV motion model.

In Figure 1, the red star represents the target. The radius of the projection of the detection range of n on the plane of the mission area is assumed to be R_n. It is assumed that the N swarm is equipped with a specific radar detection system that can image the detection range as the detection interval T and judge whether there is a target in the field of view according to the image. Usually, the closer the UAV is to the target, the more shots are taken, and the greater the probability of finding the target is. It is assumed that the probability model for the target i detected by the detection image of n is as shown in Equation (2).

()

where d_ni represents the distance of n and i and α_n represents the radar image detection index of n.

It is assumed that during the search process, N images the target H times, and the imaging distances are d₁, d₂, ⋯d_H, respectively. If the target is detected for any image within the H images, it indicates that the target has been found. According to the Bayesian criterion, the probability that N finds the target based on the H images is shown in Equation (3).

()

2.2. Multisearch Situation Map Model

A multisearch situation map is established, and each grid in the task area is assigned, including the environment search map, environment pheromone map, and TPM.

2.2.1. Environment Search Map

In the environment search map, the environment search values of different grids reflect the search coverage of the grid by N. The definition is as follows:

()

where E(k) represents the environment search graph matrix at k. Once any UAV detects the grid at k_x,y, e_x,y(k) is assigned a value of 1 and maintains the value until the end of the task.

2.2.2. Environment Pheromone Map

In the environment pheromone map, the pheromone concentration of different grids reflects the degree of attraction of the grid to N. The higher the pheromone concentration is, the more the UAV tends to transfer to the grid. Therefore, N can reasonably update the local pheromone structure according to the search situation and it can realize the task coordination of the swarm. The environmental pheromone map is composed of all grid pheromone concentration values in the task area, reflecting the search of the mission area by N. The environment pheromone map matrix A(k) of the mission area is defined in Equation (5).

()

A coordination mechanism between drones should be established to prevent swarms from scanning certain areas. To this end, rules have been developed for updating environmental pheromone maps to ensure coordinated searches of areas among drones. After n completes the one-step detection of the surrounding area on grid (x_n, y_n), it volatilizes a certain amount of pheromones around (x_n, y_n) to reduce the pheromone concentration of the searched grid and prevent the swarm from repeatedly searching for a certain area. The degree of volatilization of the pheromones by n should depend on the radar detection system of n. Therefore, based on the radar detection model of n, shown in Equation (2), the amount of pheromone volatilization at grid (x, y) can be calculated, as shown in Equation (6).

()

where d_n represents the distance from n to grid (x, y) and Δa_x,y(d_n)(k) represents the amount of pheromone volatilization on grid (x, y) at k.

A new target may appear in the area that has been searched, which will lead to increased uncertainty and the degree of attraction to the UAV. Therefore, it is necessary to correspondingly enhance the pheromone concentration of each grid. Moreover, the longer the time interval between UAV swarms searching the grid is, the faster the pheromone concentration on the grid should increase. The definite integral value of the normal distribution function is used to describe the growth process of the global pheromone. The following equations, Equations (7)–(10), represent the formula derivation of the global pheromone update mechanism. First, the updated model of the global pheromone is defined as follows in Equation (7):

()

where k_last represents the time of the last volatile pheromone of the grid (x, y). a₀(x, y) represents the concentration of the residual pheromone after the last volatilization. ϕ(τ) represents the normal distribution function and ϕ(τ) ~ N(μ, σ). τ_x,y represents the time gap after the last volatilization.

In the global pheromone update mechanism shown in Equation (7), using a₀(x, y) as the reference value and τ_x,y as the upper limit of the integral of the normal distribution function, the growth from reference value to a_x,y(k) is .

The three-sigma principle is shown in Equation (8).

()

Letting σ = μ/3 and then substituting Equation (8) into Equation (7), Equation (9) is obtained.

()

Letting μ = λ × k_max, the growth rate of the pheromone concentration after grid pheromone volatilization can be adjusted using the parameter λ.

Based on Equations (7)–(9), the global pheromone update mechanism is derived, as shown in Equation (10).

()

2.2.3. TPM

It can be seen from reference [33] that the TPM matrix describes the distribution of targets in the mission area, with the goal of improving the possibility of finding targets. In the discretization mission area, tpm_x,y is the normalized target existence probability of the grid (x, y), tpm_x,y ∈ [0, 1]. The target existence probability of all grids in the mission area S constitutes a probability distribution matrix.

()

For the three types of targets in the task problem description, the method in reference [33] is used to generate a TPM matrix based on prior information before the task starts. It is assumed that the numbers of the three types of goals are I_a, I_b, and I_c and the total number I = I_a + I_b + I_c.

a.
For unknown static targets and unknown dynamic targets, the probability of the targets being in any grid in the task area is the same, so the target probability distribution density function can be described by a uniform distribution.

()

b.
For the dynamic target with known partial prior information, the prior information data is defined as a set of grid coordinates, including Ε × I_c grid coordinates.

()

From the calculation method of the distribution probability of these types of targets in reference [33], Equation (14) can be obtained as follows:

()

where δ₀ ≥ 0 is the normal distribution variance, which reflects the accuracy of the prior information.

From Equation (14), the TPM matrix can be computed as shown in Equation (15).

()

Although the TPM matrix objectively describes the target’s existence probability based on prior information, in the actual search process, considering the different detection capabilities of different UAVs, there should be corresponding different target existence probability maps. Therefore, based on the radar detection model of n, as shown in Equation (2), the following Equation (16) is further derived, which is called the heterogeneous target detection probability map P_n. The matrix is a normalized matrix, which is defined as follows:

()

where

represents the distance between (x, y) and (x^∗, y^∗) and p_n(x, y) represents the probability that n detects the target at grid (x, y) based on prior information.

2.3. Collaborative Search Mission Optimization Model

The purpose of UAV swarm cooperative search is to cover the task area under the given constraints and detect more targets. According to the multisearch situation map model proposed in Section 2.2, the corresponding revenue evaluation subfunctions are established, namely, environmental search revenue J_S(k), environmental pheromone revenue J_A(k), and target existence probability revenue J_P(k).

J_S(k) is calculated using the total number of searched grids in the task area, as shown in Equation (17).

()

where ΔJ_S(k)_n and ΔJ_S(k), respectively, are the number of grids that are first searched in the mission area at k by n and N. J_S(k) represents the total environmental search revenue of N at k.

J_A(k) is calculated using the amount of volatilization of the environmental pheromones, as shown in Equation (18).

()

where ΔJ_A(k)_n and ΔJ_A(k) represent the sum of the pheromone concentration volatilization on each grid in the environmental pheromone map at k using n and N. J_A(k) represents the total environmental pheromone income of N at k.

The goal of the UAV swarm cooperative search attack task is to obtain decision input to maximize the overall performance index under certain constraints. Therefore, a collaborative search attack task optimization model P1 is constructed, which is expressed as follows:

()

where C_m represents the maneuverability constraint of n. C_k represents the total task duration constraint. U(k) represents the decision input solution space that N satisfies C_m and C_k at k. U^∗(k) represents the optimal decision input solution of N at k. J(k) represents the overall performance index of N at k.

At k, N selects the optimal solution U^∗(k) from U(k) as the optimal decision input that can maximize J(k + 1) and generates the search path for each UAV according to U^∗(k).

3. Design of PSO-RHC Algorithm

In this section, the PSO-RHC algorithm for collaborative search mission planning is proposed, including the coding method of the decision input solution, the solution process of the PSO algorithm, and the design of the rolling horizon control method.

3.1. Coding Method of Decision Input Solution

As shown in Figure 1 and Equation (1), the search path U_n of n during the mission can be represented as shown in Equation (20).

()

where u(k)_n represents the decision input solution space of n at k under the constraints.

Because the radar detection interval of N is Τ and the motion speed v_n of n is constant, therefore, the only variable in the decision input of n is φ_n(k). Then, the input decision solution space of N at k is defined as follows:

()

3.2. Solution Process of PSO Algorithm

The optimal decision input problem shown in Equation (21) can be defined as an optimization problem of N dimensional solution space, which can be solved with the PSO algorithm.

First, the particle swarm PS consists of M initialized particles ps_m(m = 1, 2, ⋯, M) that are randomly generated. Secondly, PS performs multiple iterations to optimize the objective function J, and p_bestm and g_best are recorded as the historical optimal solutions of ps_m(κ) and g_best, respectively. Then, the moving speed psv_m(κ) of ps_m(κ) is calculated and generated. Finally, after PS iterates to the maximum number Κ, g_best is the output.

According to the above calculation process and the decision input solution space and overall performance index of N at k, as defined in Equations (19) and (21), problem P1 can be transformed into problem P2, and the particle swarm with the following structure can be established, as shown in Equation (22) and Figure 2.

()

where ω is the inertia factor, c₁ and c₂ are the learning factors, and r₁, r₂ ∈ [0, 1] are random numbers.

Each ps_m is independently taken as the decision input solution in Equation (19) to obtain p_bestm and g_best to complete an iterative calculation of the particle swarm. After PS iterates to Κ, g_best is the optimal decision input solution U^∗(k) of N at k.

3.3. Rolling Horizon Control Method

Referring to the rolling horizon control method in model predictive control theory,

is taken, which is the sum of the overall performance benefits of the next Q times UAV swarm motion as the optimization objective function to generate U^∗(k). Therefore, the problem P3 can be established and defined as follows:

()

Therefore, it is necessary to expand the dimension of the solution from N to D = N × Q in the particle swarm structure, as shown in Figure 2. The particle swarm structure controlled by the rolling horizon is shown in Figure 3.

With the above method, the search path and overall performance gain of N in [k, k + Q] can be calculated at k. The optimal decision input U^∗(k), which can make , can be generated at k. In this way, the UAV swarm can consider further path optimization and jump out of the local optimum to improve the overall search mission performance.

However, the cost of doing this is more computational resource consumption and a slower particle swarm convergence speed. In the PSO algorithm, the factor of whether the initial solution is good enough has a great influence on the convergence speed of the particle swarm. Therefore, the following method, called the optimal solution rolling mechanism, is used to provide a suboptimal solution for the PSO algorithm as an initial value.

At k, as shown in Figure 3, the global optimal solution of the particle swarm obtained with the PSO-RHC algorithm is defined as follows:

()

The dimensions of g_best(k) are D = N × Q, in which the pre-N dimension represents U^∗(k), and the [N + 1, Q × N] dimension represents {U(k + q)_gbest|q = 2, ⋯, Q}. g_best(k) is a suboptimal solution for [k + 1, k + Q]. The structure of l_best(k + 1) is shown in Equation (25) and Figure 4.

()

As the inheritance of the optimal solution at k, l_best(k + 1) can effectively improve the convergence speed of the PSO-RHC algorithm.

The algorithm flow of the PSO-RHC algorithm is shown in Figure 5.

4. Simulation Experiments

To verify the superiority of this method in the optimization of the UAV swarm search, a simulation based on MATLAB is carried out. This section describes how the performance of the proposed method is studied based on four aspects, including the decision input solution model, the pheromone update mechanism, the optimization algorithm, and the overall improvement. Compared with the corresponding four algorithms, the performance of the proposed method is studied and its superiority in the search optimization of a UAV swarm is proven.

The mission execution flowchart of the UAV swarm N to perform the collaborative search mission is shown in Figure 6. After the mission starts, first, as shown in the blue process of Figure 6, the relevant parameters and matrix of the mission area algorithm are initialized according to Equations (4) and (5), and the target detection probability graph matrix P_n is initialized according to Equations (11)–(16). Then, as shown in the green process of Figure 6, according to the collaborative search task optimization model and the solution process of the PSO-RHC algorithm, shown in Equations (17)–(25), the optimal decision input solution g_best(k) is solved. Finally, as shown in the orange process in Figure 6, the UAV swarm N motion state is updated according to the UAV motion model shown in Equation (1), and the environment-search and pheromone structure is updated according to Equations (4) and (6)–(10), respectively. If k > k_max, the mission ends, and all the algorithm pops out. Otherwise, k = k + 1 and proceed to the next iteration until k > k_max.

4.1. Mission Scenarios and Parameter Settings

The mission area is 200 km × 200 km discretized into 200 × 200 grids. There are I = 8targets in the area, including I_a = 2 unknown static targets, I_b = 2 unknown dynamic targets, and I_c = 4 dynamic targets with known partial prior information. It is assumed that the dynamic target is a ground patrol vehicle and moves randomly around point C_i in the mission area as radius d_i. Based on the artificial potential field (APF) method and the Wiener random process, as shown in Equation (26), a dynamic target random motion model is established to describe the motion trajectory of the dynamic target. All the motion state parameters of the targets are shown in Table 1. Targets 1 and 2 are static targets, targets 3 and 4 are unknown dynamic targets, and targets 5–8 are dynamic targets with partial prior information. All the motion states and the initial coordinates of the dynamic targets are randomly generated with the Monte Carlo method.

()

where (x_C, y_C) represents the coordinate of C_i, (x_i(k), y_i(k)) represents the coordinate of target i at k, and d_Ci represents the Euclidean distance between them.

represents the APF force vector from C_i to target i, and λ_i is a parameter related to d_Ci and d_i.

, and path_i(k) represents the heading vector, random turning vector, and motion step size of target i at k, respectively. T represents the system iteration step.

represents the displacement vector of target i at k.

Table 1. Target movement information.

Target label i	Target type	Coordinate C_i	Speed v_i(km/h)	Radius d_i(km)
1	I_a	(70.59, 80.51)	0	0
2	I_a	(164.48, 161.02)	0	0
3	I_b	(105.1, 110.23)	[50.2, 100.4]	20.3
4	I_b	(70.34, 92.56)	[50.2, 100.4]	23.6
5	I_c	(72.26, 126.38)	[50.2, 100.4]	29.66
6	I_c	(124.4, 63.76)	[50.2, 100.4]	32.51
7	I_c	(119.68, 130.13)	[50.2, 100.4]	34.34
8	I_c	(80.32, 59.06)	[50.2, 100.4]	30.9

It is assumed that the UAV swarm N enters the mission area at k = 1 and the system iteration step is 20 s, that is T = 20. In addition, it is assumed that the UAV swarm flies at different altitudes, so the collision between UAVs is not considered in the research and simulation of this paper. During each iteration of the system, the UAV swarm has a displacement and performs detection for the surrounding environment. The UAV swarm N contains four UAVs with different performances, as shown in Table 2. In Table 2, the UAV swarm is divided into the three levels of A, B, and C based on the aspects of maneuver performance and detection performance. The take-off position coordinates of each UAV are given. The specific performance parameters of a heterogeneous UAV are shown in Table 3. The other relevant parameters in this method are shown in Table 4.

Table 2. Heterogeneous UAV swarm performance rating and take-off positions.

UAV label	Whole performance	Maneuver performance	Detection performance	Take-off position
1	A	A	A	(10, 10)
2	B	A	B	(10, 190)
3	B	B	A	(190, 190)
4	C	B	B	(190, 10)

The performance rating order is A > B > C.

Table 3. Performance parameters of heterogeneous UAV swarm (different φ_n max).

UAV label	Maximum turning angle φ_n max	Speed v_n(m/s)	Maximum detection radius R_n(km)	Target detection parameter α_n(km)
1	75°	216	10	4.35
2	75°	216	6	2.61
3	45°	180	10	4.35
4	45°	180	6	2.61

Table 4. Description and value of the proposed method algorithm parameters.

Parameter name	Parameter value	Parameter description
T	20	Step of system iteration (Equation (1))
λ	0.2	Update constant of global pheromone (Equation (10))
E	200	Number of I_c type target prior data (Equation (13))
δ₀	50	Normal distribution variance (Equation (14))
M	10	Number of particles in Equation (22)
Q	6	Rolling times in Equation (23)
K	24	Maximum iterations in Equation (22)
D	24	Dimension of decision input solution in Equation (23)

4.2. Simulation and Analysis

The task execution efficiency of the heterogeneous UAV swarm search mission is affected by multiple modules in the mission planning scheme. This research improves the mission planning scheme based on three aspects: decision input solution model, pheromone update mechanism, and optimization solution algorithm. Therefore, in the simulation, the proposed algorithm is compared with four other different algorithms, which have different variables mentioned above for the three aspects. The algorithm list and the differences between the algorithms in the simulation are shown in Table 5.

Table 5. List of algorithms and the difference of each algorithm in simulation.

Algorithm label	Decision input solution model	Pheromone update mechanism	Algorithm of optimization
Algo. 1	Polar coordinate	Integral method	PSO-RHC
Algo. 2	Grid method [33]	Integral method	PSO-RHC
Algo. 3	Polar coordinate	Constant increment [33]	PSO-RHC
Algo. 4	Polar coordinate	Integral method	PSO [4]
Algo. 5	Grid method [33]	Constant increment [33]	PSO [4]

4.2.1. Path Optimization Performance Analysis

As shown in Figure 7, the Grid method [33–35, 39] shown in Table 5 is a UAV motion model that discretizes the direction of UAV motion. In the Grid method, the decision input solution is defined as three grid points at different distances near the current motion direction of the UAV, which leads to the change of the motion step of the UAV. However, many references often consider the UAV as a uniform motion when using the Grid method for simulation and ignore the problem of the change of the UAV motion step size, which causes the simulation results to have large errors. In the simulation experiment of this research, as shown in Table 3, the UAV is moving at a constant speed. To prove and eliminate this error, the simulation method is as follows:

First, the Grid method with different motion distances is used for simulation. Then, the decision input solution U_Grid, as shown in Equation (26), as well as the path optimization results with errors, are obtained. In Equation (26), path_n[θ_n(k)] refers to the corresponding motion distance of the UAV in the direction. Equation (26) is shown as follows:

()

Then, θ_n(k) and the uniform motion step path_n are combined into

, as shown in Equation (27), which refers to the decision input solution of the uniform motion UAV without errors. Therefore, the path optimization results of the UAV swarm based on the Grid method after eliminating errors can be obtained. Equation (27) is shown as follows:

()

In addition, due to the discrete and fixed motion direction of the UAV in the Grid method, the Grid method cannot solve the path optimization problem of the UAV swarm with different motion performances. Therefore, in the simulation of the Grid method, the maximum deflection angle of the UAV is set to φ_n max = 45^∘.

In summary, for the two problems of the Grid method in the path optimization problem of uniform heterogeneous UAV swarm, in this research, the above solutions are used to control variables for more objective algorithm performance analysis and comparison. The simulation analysis is given below.

Before the path optimization simulation of the UAV swarm collaborative search mission, the random motion trajectories of targets 5–8 are generated according to Equation (26) and Table 1. Then, the motion trajectory of each target is randomly sampled E times to obtain the prior information of the I_c type target. The 2000 times random motion tracks of targets 5–8 are shown in Figure 8(a), and the E times sampling prior coordinates of targets 5–8 are shown in Figure 8(b).

As shown in Figure 8(a), the star represents the position of C_i, which is the center of the random motion trajectories of the dynamic target. With C_i as center and d_i as radius, the bold dark circle represents the reference range of the random movement of the dynamic target. The light-colored lines represent the random trajectory of the dynamic targets. It can be seen from Figure 8(a) that the dynamic target moves randomly around a circle with C_i as the center and d_i as the radius. As shown in Figure 8(b), the star represents the position of C_i, and the points represent the target positions sampled from the random motion trajectories of the dynamic targets.

Using Equations (11)–(15) to process the target data in Table 1, as shown in Figure 8(b), the generated TPM matrix image is shown in Figure 9, where Figure 9(a) is a three-dimensional image and Figure 9(b) is a two-dimensional image. Figure 9 reflects the target distribution probability in the mission area obtained from prior information. The TPM matrix initialization information is substituted into each algorithm for the UAV swarm path optimization simulation.

The paths of the UAV swarm after 500 iterations obtained in the five experiments of Algo. 1–Algo. 5 are shown in Figures 10–14. The black triangle in the figure represents the target, the red hexagon star represents the detected target, and the blue dot represents the take-off position of the UAV. Figure 15 shows the comparison of the number of detected targets between five algorithms. Figure 16 shows the comparison of the coverage efficiency of area searching between the five algorithms.

As shown in Figure 15, the number of detected targets is increased with iterations and the number of Algo. 1 first reaches eight. As shown in Equation (2), the UAV has a certain probability of detecting a target during searching the mission area. Use the Russian roulette method to determine whether a target has been detected. With iterations, if a target has been detected, then the detected target stops moving and the number of detected targets increases 1.

As shown in Figure 16, the coverage efficiency of area searching increases with iterations. To improve the searching and detecting efficiency for dynamic targets, three area searching map models including environment search map, environment pheromone map, and TPM have been established in the research of this paper. Therefore, the coverage efficiency of the area searching, in Figure 16, is calculated by the collaborative search mission optimization model as shown in Equation (19). In addition, Figure 16 shows that the coverage efficiency of Algo. 3 is higher than that of Algo. 1 due to the constant increment used in Algo. 3, which would increase the value of the environment pheromone fast. This problem is analyzed in detail in the following sections, as shown in Figure 17. Therefore, Algo. 3 should not participate in coverage efficiency comparison. Similarly, Algo. 2 (with errors) and Algo. 5 (with errors) should not participate in comparison, too. In conclusion, as shown in Figure 16, the coverage efficiency of Algo. 1 is higher than the other three algorithms and this conclusion is the same as that shown in Figure 15.

It can be seen from the UAV swarm path shown in Figures 11 and 14 that the Grid method has path errors in Algo. 2 and Algo. 5, which increases with the iteration of the algorithm. Taking Algo. 2 as an example, the variation curve of the error of Algo. 2 with the iteration of the algorithm is described in Figure 18. Figure 18(a) shows the variation curve of the motion step sizes of the UAVs in Algo. 2 (with errors) with the iteration of the algorithm, and Figure 18(b) shows the variation curve of the coordinate error of each UAV of Algo. 2 with the iteration of the algorithm. The UAV coordinate error refers to the Euclidean distance between the UAV coordinate (with errors) shown in Figure 11(a) and the UAV coordinate (without errors) shown in Figure 11(b). The variable step size of the UAV, which is contrary to the assumed conditions, leads to an error between the actual coordinates and the expected coordinates, which can reach up to 61.69 km.

It can be seen from Figure 15 that the number of targets detected by Algo. 2 (with errors) and Algo. 5 (with errors) is eight, but those of Algo. 2 (without errors) and Algo. 5 (without errors) are five and four, respectively. The number of errors for the detected target is three and four. The 30 simulation results displayed in Figure 19 show that the use of the Grid method leads to large errors in path generation and target detection. Therefore, in the next simulation and analysis, Algo. 2 (with errors) and Algo. 5 (with errors) are only used as reference data rather than as a basis for algorithm performance evaluation. Henceforward, when referring to Algo. 2 or Algo. 5, the terms refer to Algo. 2 (without errors) and Algo. 5 (without errors).

According to the target distribution probability diagram shown in Figure 9 and the path optimization results of each algorithm shown in Figures 10–14, the differences and advantages between Algo. 1 and Algo. 2–Algo. 5 are compared and analyzed as follows. In the following, the generated path by Algo. x is abbreviated as Path x.

It can be seen from Figure 10 that Path 1 has the characteristics of cluster linkage, regular cruising, and a smooth trajectory. In addition, the UAV swarm focuses on searching areas with a high probability of target existence. In line with expectations, Path 1 detects all the targets.

It can be seen from Figure 11(b) that Path 2 is more convoluted and not smooth enough. It does not give full play to the high mobility of UAVs and cannot give full play to the search performance of the heterogeneous UAV swarm. Taking UAV 1 as an example, the motion states of the Algo. 1 and Algo. 2 UAVs are compared in Figure 20. Figure 20(a) shows the change curve of the UAV turning angle with the algorithm iteration, and Figure 20(b) shows the change curve of the UAV heading angle with the algorithm iteration. Taking the period k ∈ [200, 300] as an example, two subgraphs are added to Figures 20(a) and 20(b) to observe the change in the UAV motion state. It can be seen from Figure 20 that due to the limitations of the solution space in the Grid method, the motion state of the UAV in Path 2 is discrete, and the frequency of the oblique motion of the UAV is higher. The oblique motion refers to the motion with a 45° angle between the motion direction and the x-axis or y-axis. The motion step of the oblique motion is longer and the search income is higher. It can be seen from Figure 15 that Path 2 only detects five targets due to the error caused by the Grid method and the limitations of the discrete motion state.

It can be seen from Figures 12 and 15 that the cooperative search efficiency of each UAV in Path 3 is lower than in Path 1, and seven targets are detected in Path 3. The pheromone update mechanisms of Algo. 1 and Algo. 3 are compared in Figure 17, where Figure 17(a) represents the pheromone concentration growth curve with different initial values and Figure 17(b) represents the pheromone concentration growth slope curve with different initial values of the pheromone concentration. It can be seen from Figure 17 that the pheromone update mechanism of Algo. 1 based on Equation (10) can adjust the slope of pheromone concentration according to different initial values of the pheromone concentration. However, when the grid in Path 3 is searched, the growth rate of the pheromone concentration based on the constant increment method is faster and independent of the initial value of the pheromone concentration. Therefore, in Path 3, each UAV conducts a patrol search in its adjacent area, which causes Path 3 to not fully cover the mission area with a high probability of target existence.

It can be seen from Figures 13 and 15 that the path continuity of Path 4 is lower than that in Path 1, and only six targets are detected in Path 4. The optimization function performance of Algo. 1 and Algo. 4 is compared in Figure 21. Figure 21(a) shows the curve of the overall performance index of the UAV swarm with the iteration of the algorithm, and Figure 21(b) shows the average convergence curve of the optimization function performance index during the search mission. It can be seen from Figure 21 that the mean value of the optimization performance index of the PSO-RHC algorithm converges to 81.87, while that of the traditional PSO algorithm is 72.55, which is an increase of 12.8%. The comparison of the simulation results shows that the PSO-RHC algorithm can jump out of the local optimum and consider the path optimization of longer distances, thereby generating a better cooperative search path for the UAV swarm.

As shown in Table 5, Algo. 5 is composed of three other algorithm modules. It can be seen from Figure 14 that the path of Path 5 is convoluted, the cooperative search efficiency of each UAV is lower than that in Path 1, and the UAVs are more inclined to oblique motion, which combines the shortcomings of Paths 2–4. The performances of the Algo. 1 and Algo. 5 algorithms are compared in Figure 22. Taking UAV 1 as an example, Figure 22(a) shows the change curve of the UAV heading angle with the algorithm iteration, and Figure 22(a) shows the detailed change of period k ∈ [200, 300]. Figure 22(b) shows the change curve of the overall performance index of the UAV swarm with the algorithm iteration. Figure 22(c) shows the average convergence curve during the search mission. Figure 22(d) shows the average convergence rate curve of the optimization function during the search mission. It can be seen from Figures 22(b) and 22(c) that the overall performance index of Path 1 is better than that of Path 5, and the index increases by 77.5%. It can be seen from Figure 22(d) that the convergence rate of the optimization function of Algo. 5 reaches 100% at the 6th iteration, while the convergence rate of the optimization function of Algo. 1 reaches 92% at the 10th iteration and 97.3% at the 14th iteration. It can be seen from Figure 15 that the worst path in the five paths optimized by five algorithms is Path 5, in which only four targets have been detected. The above simulation results show that Algo. 1 has fully improved the mission planning scheme based on the three aspects of the decision input solution model, pheromone update mechanism, and optimization algorithm. Moreover, the additional computational complexity caused by the increase of the solution space is within the acceptable range.

4.2.2. Task Execution Efficiency Analysis

The five algorithms are simulated with Monte Carlo simulation 30 times, and the number of iterations of each algorithm is 500. The comparison of the five algorithms based on three metrics is shown in Figure 19.

Figure 19(a) is the line graph of Metric 1, the number of target detections. It can be seen from the graph that the simulation results of Algo. 1 are the best, and all the target detection tasks are completed in 30 simulations. However, the simulation results of the other algorithms are relatively poor. Most of the simulation results cannot complete the detection task of all targets. The worst simulation results only detect three targets by eliminating the error of Algo. 5.

Figure 19(b) is a line chart of Metric 2, the number of task execution steps, which refers to the number of iterations of the algorithm when completing the detection task of all eight targets, but if the task is not completed when the algorithm is iterated 500 times, the corresponding value of Metric 2 is 500. As can be seen from Figure 21(b), Algo. 1 has the best simulation results. All 30 simulations complete the task before the algorithm iterates 500 times. The best simulation results only use 189 steps; that is, when the algorithm has iterated 189 times, all eight targets have been detected. However, most of the simulation results of the other algorithms fail to complete the task before the algorithm iterates 500 times. The best simulation result uses 429 steps with Algo. 3, that is, the algorithm completes the task with 429 iterations.

Figure 19(c) is the line chart of Metric 3, task execution efficiency, which refers to the number of targets that can be detected theoretically when the algorithm iterates 500 times. The calculation of the equation is shown in Equation (28). If the index value of the simulation result is less than eight, it means that the task cannot be completed before the algorithm iterates 500 times. Otherwise, it means that the task is completed in advance. As can be seen from Figure 19(c), the simulation results of Algo. 1 are the best, 30 simulations have completed the task in advance, and the task execution efficiency can reach 21.07. However, most of the values of Metric 3 corresponding to the simulation results of other algorithms are less than eight, and the lowest value is three with Algo. 5 (without errors).

()

The three metrics of the five algorithms in 30 simulations, as shown in Figure 19, are averaged to compare the performance of each algorithm. The results are shown in Table 6.

Table 6. Three average task execution efficiency metrics of five algorithms in 30 simulations.

Algorithm label	Average value of Metric 1	Average value of Metric 2	Average value of Metric 3
Algo. 1	8	313.50	12.76
Algo. 2 (with errors)	8	382.23	10.46
Algo. 2 (without errors)	4.43	500	4.43
Algo. 3	7.26	489.98	7.42
Algo. 4	6.33	500	6.33
Algo. 5 (with errors)	7.90	409.97	9.63
Algo. 5 (without errors)	3.90	500	3.90

It can be seen from Table 6 that the performance of Algo. 1 is the best, followed by the performances of Algo. 3 and Algo. 4, and the performances of Algo. 2 and Algo. 5 are poor. By comparing the average value of Metric 3 of Algo. 1 and the other algorithms, the improvement of this research can be evaluated based on four aspects: the decision input solution model, pheromone update mechanism, optimization solution algorithm, and overall optimization, as shown in Table 7:

Table 7. The comparisons between the proposed method and the other methods based on four aspects.

Mission planning scheme modules	Performance improvement degree	Algorithm comparison
Decision input solution model	188%	Algo. 1 and Algo. 2
Pheromone update mechanism	72%	Algo. 1 and Algo. 3
Optimization algorithm	102%	Algo. 1 and Algo. 4
Entire scheme	227%	Algo. 1 and Algo. 5

As shown in Tables 5–7, the conclusions can be drawn as follows. The simulation results show that under the assumed task conditions in this research, the method proposed in this paper improves the mission planning scheme based on the three aspects of the decision input solution model, pheromone update mechanism, and optimization solution algorithm, which are improved by 188%, 72%, and 102%, respectively. The overall performance is improved by 227% compared with other methods.

5. Conclusion

To solve the problem of cooperative search task planning for a heterogeneous UAV swarm for dynamic targets, this paper proposes an improved cooperative search mission planning algorithm for a heterogeneous UAV swarm. Heterogeneous UAV maneuverability constraints and detection performance constraints, as well as three target types that distinguish the motion and time sensitivity of known partial motion state information, are considered in this problem.

First, based on the polar coordinate motion method, an improved UAV motion model and decision input solution model are established. Therefore, the high mobility performance of each UAV can be fully released in the path optimization under the condition of heterogeneous performance in the UAV swarm. Then, a dynamically updated multisearch situation map model is established, including the environment search map, environment pheromone map, and TPM. The numerical update algorithm based on normal distribution integral is used to update the pheromone concentration in the environmental pheromone map. The multisearch situation graph model improves the search performance of UAVs and the task information sharing performance among UAVs in the UAV swarm. Finally, an improved path optimization algorithm PSO-RHC is designed. Based on the traditional PSO algorithm, the RHC method is used to improve the global optimization performance. Compared with the traditional heuristic algorithm, the UAV swarm path obtained with the PSO-RHC algorithm is smoother and the optimization performance index is higher. In addition, by using the optimal solution rolling mechanism, the convergence speed of the PSO-RHC algorithm is effectively improved in order to solve the problem of slow convergence speed caused by the increase of the solution space. Therefore, the heterogeneous UAV swarm can achieve efficient collaborative task execution for the assumed task conditions in this study.

To study the efficiency of task execution and optimize the performance of the algorithm, several simulations are carried out. The simulation results show that under the assumed task conditions in this study, the method proposed in this paper improves the mission planning scheme based on three aspects: the decision input solution model, pheromone update mechanism, and optimization solution algorithm for the cooperative search task planning problem of a heterogeneous UAV swarm with dynamic targets, with improvements of 188%, 72%, and 102%, respectively.

However, in the collaborative search path optimization problem in this study, the obstacle avoidance and collision avoidance of UAVs and the dynamics of the actual mission area are not fully considered. Therefore, the environmental modeling of the dynamic mission area (such as the dynamic wind field and obstacles) is more in line with the actual task scenario, which still needs to be studied in future work.

Conflicts of Interest

The authors declare no conflicts of interest.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant No. 42201472 and No. 62101599.

Open Research

Data Availability Statement

The data used in this study have been mainly obtained in the simulation experiments.

References

1 Danoy G., Brust M. R., and Bouvry P., Connectivity stability in autonomous multi-level UAV swarms for wide area monitoring, Proceedings of the Fifth ACM International Symposium on Development and Analysis of Intelligent Vehicular Networks and Applications, 2015, Shanghai, China, 1–8.
Google Scholar
2 Mavrovouniotis M., Li C., and Yang S., A survey of swarm intelligence for dynamic optimization: algorithms and applications, Swarm and Evolutionary Computation. (2017) 33, 1–17, https://doi.org/10.1016/j.swevo.2016.12.005, 2-s2.0-85011876502.
10.1016/j.swevo.2016.12.005
Web of Science® Google Scholar
3 Chen M., Dai F., Wang H., and Lei L., DFM: a distributed flocking model for UAV swarm networks, IEEE Access. (2018) 6, 69141–69150, https://doi.org/10.1109/ACCESS.2018.2880485, 2-s2.0-85056606422.
10.1109/ACCESS.2018.2880485
Web of Science® Google Scholar
4 Wang Y. B., Bai P., Liang X. L., Wang W., Zhang J., and Fu Q., Reconnaissance mission conducted by UAV swarms based on distributed PSO path planning algorithms, IEEE Access. (2019) 7, 105086–105099, https://doi.org/10.1109/ACCESS.2019.2932008.
10.1109/ACCESS.2019.2932008
Web of Science® Google Scholar
5 Hu C. F., Zhang Z. L., Yang N., Shin H. S., and Tsourdos A., Fuzzy multi-objective cooperative surveillance of multiple UAVs based on distributed predictive control for unknown ground moving target in urban environment, Aerospace Science and Technology. (2019) 84, 329–338, https://doi.org/10.1016/j.ast.2018.10.017, 2-s2.0-85055987690.
10.1016/j.ast.2018.10.017
Web of Science® Google Scholar
6 Shirani B., Najafi M., and Izadi I., Cooperative load transportation using multiple UAVs, Aerospace Science and Technology. (2019) 84, 158–169, https://doi.org/10.1016/j.ast.2018.10.027, 2-s2.0-85055874988.
10.1016/j.ast.2018.10.027
Web of Science® Google Scholar
7 Li P. and Duan H. B., A potential game approach to multiple UAV cooperative search and surveillance, Aerospace Science and Technology. (2017) 68, 403–415, https://doi.org/10.1016/j.ast.2017.05.031, 2-s2.0-85020450557.
10.1016/j.ast.2017.05.031
Web of Science® Google Scholar
8 Zhao X. Y., Zong Q., Tian B. L., Zhang B., and You M., Fast task allocation for heterogeneous unmanned aerial vehicles through reinforcement learning, Aerospace Science and Technology. (2019) 92, 588–594, https://doi.org/10.1016/j.ast.2019.06.024, 2-s2.0-85068186841.
10.1016/j.ast.2019.06.024
Web of Science® Google Scholar
9 Yao W. R., Qi N. M., Wan N., and Liu Y. B., An iterative strategy for task assignment and path planning of distributed multiple unmanned aerial vehicles, Aerospace Science and Technology. (2019) 86, 455–464, https://doi.org/10.1016/j.ast.2019.01.061, 2-s2.0-85061043005.
10.1016/j.ast.2019.01.061
Web of Science® Google Scholar
10 Wu W. N., Wang X. G., and Cui N. G., Fast and coupled solution for cooperative mission planning of multiple heterogeneous unmanned aerial vehicles, Aerospace Science and Technology. (2018) 79, 131–144, https://doi.org/10.1016/j.ast.2018.05.039, 2-s2.0-85047819262.
10.1016/j.ast.2018.05.039
Web of Science® Google Scholar
11 Chen J., Zhang Y., Wu L., You T., and Ning X., An adaptive clustering-based algorithm for automatic path planning of heterogeneous UAVs, IEEE Transactions on Intelligent Transportation Systems. (2022) 23, no. 9, 16842–16853, https://doi.org/10.1109/TITS.2021.3131473.
10.1109/TITS.2021.3131473
Web of Science® Google Scholar
12 Szczerba R. J., Galkowski P., Glicktein I. S., and Ternullo N., Robust algorithm for real-time route planning, IEEE Transactions on Aerospace and Electronic Systems. (2000) 36, no. 3, 869–878, https://doi.org/10.1109/7.869506, 2-s2.0-0034226296.
10.1109/7.869506
Web of Science® Google Scholar
13 Luo R., Zheng H., and Guo J., Solving the multi-functional heterogeneous UAV cooperative mission planning problem using multi-swarm fruit fly optimization algorithm, Sensors. (2020) 20, no. 18, https://doi.org/10.3390/s20185026, 32899674.
10.3390/s20185026
PubMed Web of Science® Google Scholar
14 Zheng H. and Yuan J., An integrated mission planning framework for sensor allocation and path planning of heterogeneous multi-UAV systems, Sensors. (2021) 21, no. 10, https://doi.org/10.3390/s21103557, 34065325.
10.3390/s21103557
PubMed Web of Science® Google Scholar
15 Wu Y., Low K. H., and Lv C., Cooperative path planning for heterogeneous unmanned vehicles in a search-and-track mission aiming at an underwater target, IEEE Transactions on Vehicular Technology. (2020) 69, no. 6, 6782–6787, https://doi.org/10.1109/TVT.2020.2991983.
10.1109/TVT.2020.2991983
Web of Science® Google Scholar
16 Tang J., Liu G., and Pan Q., A review on representative swarm intelligence algorithms for solving optimization problems: applications and trends, IEEE/CAA Journal of Automatica Sinica. (2021) 8, no. 10, 1627–1643, https://doi.org/10.1109/JAS.2021.1004129.
10.1109/JAS.2021.1004129
Google Scholar
17 Tian J., Shen L., and Zheng Y., Genetic algorithm based approach for multi-UAV cooperative recon-naissance mission planning problem, Lecture Notes in Computer Science, 2006, Springer, Berlin, Heidelberg, https://doi.org/10.1007/11875604_13.
Google Scholar
18 Price I. C. and Lamont G. B., GA directed self-organized search and attack UAV swarms, Proceedings of the 2006 Winter Simulation Conference, 2006, Monterey, CA, USA, 1307–1315, https://doi.org/10.1109/WSC.2006.323229, 2-s2.0-46149118252.
10.1109/WSC.2006.323229
Google Scholar
19 Shi K., Zhang X., and Xia S., Multiple swarm fruit fly optimization algorithm based path planning method for multi-UAVs, Applied Sciences. (2020) 10, no. 8, https://doi.org/10.3390/app10082822.
10.3390/app10082822
Google Scholar
20 Chai X., Zheng Z., Xiao J., Yan L., Qu B., Wen P., Wang H., Zhou Y., and Sun H., Multi-strategy fusion differential evolution algorithm for UAV path planning in complex environment, Aerospace Science and Technology. (2022) 121, article 107287, https://doi.org/10.1016/j.ast.2021.107287.
10.1016/j.ast.2021.107287
Web of Science® Google Scholar
21 Qi D., Zhang Z., and Zhang Q., Path planning of multirotor UAV based on the improved ant colony algorithm, Journal of Robotics. (2022) 2022, 9, 2168964, https://doi.org/10.1155/2022/2168964.
10.1155/2022/2168964
Web of Science® Google Scholar
22 Liu W. and Zheng X., Three-dimensional multi-mission planning of UAV using improved ant colony optimization algorithm based on the finite-time constraints, International Journal of Computational Intelligence Systems. (2021) 14, no. 1, 79–87, https://doi.org/10.2991/ijcis.d.201021.001.
10.2991/ijcis.d.201021.001
Web of Science® Google Scholar
23 Gao C., Zhen Z. Y., and Gong H. J., A self-organized search and attack algorithm for multiple unmanned aerial vehicles, Aerospace Science and Technology. (2016) 54, 229–240, https://doi.org/10.1016/j.ast.2016.03.022.
10.1016/j.ast.2016.03.022
Web of Science® Google Scholar
24 Zhen Z., Xing D., and Gao C., Cooperative search-attack mission planning for multi-UAV based on intelligent self-organized algorithm, Aerospace Science and Technology. (2018) 76, 402–411, https://doi.org/10.1016/j.ast.2018.01.035, 2-s2.0-85042369612.
10.1016/j.ast.2018.01.035
Web of Science® Google Scholar
25 Shin J.-J. and Bang H., UAV path planning under dynamic threats using an improved PSO algorithm, International Journal of Aerospace Engineering. (2020) 2020, 17, 8820284, https://doi.org/10.1155/2020/8820284.
10.1155/2020/8820284
Web of Science® Google Scholar
26 Huang C., A novel three-dimensional path planning method for fixed-wing UAV using improved particle swarm optimization algorithm, International Journal of Aerospace Engineering. (2021) 2021, 19, 7667173, https://doi.org/10.1155/2021/7667173.
10.1155/2021/7667173
Web of Science® Google Scholar
27 Zhao C., Liu Y., Chen L., Li F., and Man Y., Research and development trend of multi-UAV path planning based on metaheuristic algorithm, Control and Decision. (2022) 37, no. 5, 1102–1115.
Google Scholar
28 Tang J., Chen X., Zhu X., and Zhu F., Dynamic reallocation model of multiple unmanned aerial vehicle tasks in emergent adjustment scenarios, IEEE Transactions on Aerospace and Electronic Systems. (2023) 59, no. 2, 1139–1155, https://doi.org/10.1109/TAES.2022.3195478.
10.1109/TAES.2022.3195478
Web of Science® Google Scholar
29 Polycarpou M. M., Yang Y., and Passino K. M., A cooperative search framework for distributed agents, Proceeding of the 2001 IEEE International Symposium on Intelligent Control (ISIC ′01) (Cat. No.01CH37206), 2001, Mexico City, Mexico, 1–6, https://doi.org/10.1109/ISIC.2001.971475.
10.1109/ISIC.2001.971475
Google Scholar
30 Pan L.-G., Lu Q., Xie X., Wang J., and Wang J., A probability distribution based cooperative search approach for stochastic source localization, 2018 IEEE 27th International Symposium on Industrial Electronics (ISIE), 2018, Cairns, QLD, Australia, 585–590, https://doi.org/10.1109/ISIE.2018.8433772, 2-s2.0-85052377282.
10.1109/ISIE.2018.8433772
Google Scholar
31 Bertuccelli L. F. and How J. P., Search for dynamic targets with uncertain probability maps, 2006 American Control Conference, 2006, Minneapolis, MN, USA, https://doi.org/10.1109/ACC.2006.1655444.
10.1109/ACC.2006.1655444
Google Scholar
32 Millet T., Casbeer D., Mercker T., and Bishop J., Multi-agent decentralized search of a probability map with communication constraints, AIAA Guidance, Navigation and Control Conference, 2013, Toronto, Ontario, Canada, https://doi.org/10.2514/6.2010-8424, 2-s2.0-84867792498.
10.2514/6.2010-8424
Google Scholar
33 Zhen Z., Chen Y., Wen L., and Han B., An intelligent cooperative mission planning scheme of UAV swarm in uncertain dynamic environment, Aerospace Science and Technology. (2020) 100, article 105826, https://doi.org/10.1016/j.ast.2020.105826.
10.1016/j.ast.2020.105826
Web of Science® Google Scholar
34 Wang N., Li Z., Liang X., Li Y., and Zhao F., Cooperative target search of UAV swarm with communication distance constraint, Mathematical Problems in Engineering. (2021) 2021, 14, 3794329, https://doi.org/10.1155/2021/3794329.
10.1155/2021/3794329
Web of Science® Google Scholar
35 Ao W. U., Rennong Y. A., Xiaolong L. I., and Yueqi H. O., Cooperative search algorithm based on pheromone decision for UAV swarm, Journal of Beijing University of Aeronautics and Astronautics. (2021) 47, no. 4, 814–827.
Google Scholar
36 Paradzik M. and İnce G., Multi-agent search strategy based on digital pheromones for UAVs, 2016 24th Signal Processing and Communication Application Conference (SIU), 2016, Zonguldak, Turkey, 233–236, https://doi.org/10.1109/SIU.2016.7495720, 2-s2.0-84982839403.
10.1109/SIU.2016.7495720
Google Scholar
37 Yang F., Ji X., Yang C., Li J., and Li B., Cooperative search of UAV swarm based on improved ant colony algorithm in uncertain environment, 2017 IEEE International Conference on Unmanned Systems (ICUS), 2017, Beijing, China, 231–236, https://doi.org/10.1109/ICUS.2017.8278346, 2-s2.0-85046680533.
10.1109/ICUS.2017.8278346
Google Scholar
38 Erignac C., An exhaustive swarming search strategy based on distributed pheromone maps, Proceedings of AIAA Infotech @Aerospace 2007 Conference and Exhibit, AIAA, 2007, California, 1–16.
Google Scholar
39 Zhang Z. X., Long T., and Xu G. T., Revisit mechanism driven multi-UAV cooperative search planning method for moving targets, Acta Aeronautica Astronautica Sinica. (2020) 41, no. 5, article 323314.
Google Scholar
40 Li J., Deng G., Luo C., Lin Q., Yan Q., and Ming Z., A hybrid path planning method in unmanned air/ground vehicle (UAV/UGV) cooperative systems, IEEE Transactions on Vehicular Technology. (2016) 65, no. 12, 9585–9596, https://doi.org/10.1109/TVT.2016.2623666, 2-s2.0-85012949959.
10.1109/TVT.2016.2623666
Web of Science® Google Scholar
41 Nash A., Koenig S., and Likhachev M., Incremental Phi∗: incremental any-angle path planning on grids, Proc. 21st Int. Jont Conf. Artif. Intell, 2009, San Francisco, CA, USA: Morgan Kaufmann, 1824–1830.
Google Scholar
42 Soltani A. R., Tawfik H., Goulermas J. Y., and Fernando T., Path planning in construction sites: performance evaluation of the Dijkstra, A∗, and GA search algorithms, Advanced Engineering Informatics. (2002) 16, no. 4, 291–303, https://doi.org/10.1016/S1474-0346(03)00018-1, 2-s2.0-0041427756.
10.1016/S1474-0346(03)00018-1
Web of Science® Google Scholar
43 Pehlivanoglu Y. V., A new vibrational genetic algorithm enhanced with a Voronoi diagram for path planning of autonomous UAV, Aerospace Science and Technology. (2012) 16, no. 1, 47–55, https://doi.org/10.1016/j.ast.2011.02.006, 2-s2.0-84855910036.
10.1016/j.ast.2011.02.006
Web of Science® Google Scholar
44 Zeng Y. and Zhang R., Energy-efficient UAV communication with trajectory optimization, IEEE Transactions on Wireless Communications. (2017) 16, no. 6, 3747–3760, https://doi.org/10.1109/TWC.2017.2688328, 2-s2.0-85020943053.
10.1109/TWC.2017.2688328
Web of Science® Google Scholar

All articles

Heterogeneous UAV Swarm Collaborative Search Mission Path Optimization Scheme for Dynamic Targets

Abstract

1. Introduction

2. Modeling of the Cooperative Mission Planning Problem for a Heterogeneous UAV Swarm

2.1. Heterogeneous UAV Model

2.2. Multisearch Situation Map Model

2.2.1. Environment Search Map

2.2.2. Environment Pheromone Map

2.2.3. TPM

2.3. Collaborative Search Mission Optimization Model

3. Design of PSO-RHC Algorithm

3.1. Coding Method of Decision Input Solution

3.2. Solution Process of PSO Algorithm

3.3. Rolling Horizon Control Method

4. Simulation Experiments

4.1. Mission Scenarios and Parameter Settings

4.2. Simulation and Analysis

4.2.1. Path Optimization Performance Analysis

4.2.2. Task Execution Efficiency Analysis

5. Conclusion

Conflicts of Interest

Funding

Open Research

Data Availability Statement

References

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Heterogeneous UAV Swarm Collaborative Search Mission Path Optimization Scheme for Dynamic Targets

Abstract

1. Introduction

2. Modeling of the Cooperative Mission Planning Problem for a Heterogeneous UAV Swarm

2.1. Heterogeneous UAV Model

2.2. Multisearch Situation Map Model

2.2.1. Environment Search Map

2.2.2. Environment Pheromone Map

2.2.3. TPM

2.3. Collaborative Search Mission Optimization Model

3. Design of PSO-RHC Algorithm

3.1. Coding Method of Decision Input Solution

3.2. Solution Process of PSO Algorithm

3.3. Rolling Horizon Control Method

4. Simulation Experiments

4.1. Mission Scenarios and Parameter Settings

4.2. Simulation and Analysis

4.2.1. Path Optimization Performance Analysis

4.2.2. Task Execution Efficiency Analysis

5. Conclusion

Conflicts of Interest

Funding

Open Research

Data Availability Statement

References

Figures

References

Related

Information