International Journal of Aerospace Engineering

Volume 2024, Issue 1 5876393

Research Article

Open Access

Multi-UAV Cooperative Search for Moving Targets With Impaired Communication Using Improved Gray Wolf Optimizer

Jiaqi Zhan,

Jiaqi Zhan

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Chaoyang Niu,

Corresponding Author

Chaoyang Niu

[email protected]

orcid.org/0000-0001-5426-7833

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Wei Liu,

Wei Liu

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Shiyu Wang,

Shiyu Wang

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Xuanshen Wan,

Xuanshen Wan

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Yanyun Wang,

Yanyun Wang

orcid.org/0009-0009-9473-5713

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Jiaqi Zhan,

Jiaqi Zhan

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Chaoyang Niu,

Corresponding Author

Chaoyang Niu

[email protected]

orcid.org/0000-0001-5426-7833

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Wei Liu,

Wei Liu

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Shiyu Wang,

Shiyu Wang

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Xuanshen Wan,

Xuanshen Wan

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

Yanyun Wang,

Yanyun Wang

orcid.org/0009-0009-9473-5713

Data and Target Engineering College , Information Engineering University , Zhengzhou , 450000 , China , plaieu.edu.cn

Search for more papers by this author

First published: 16 September 2024

https://doi.org/10.1155/2024/5876393

Academic Editor: Pengyun Chen

Share a link

Email
Wechat
Bluesky

Abstract

The study investigates the problem of collaborative target search by multiple unmanned aerial vehicles (UAVs) under the condition of communication information corruption. The problem involves the influence of target movement and environmental obstacles on the update of the UAV’s local target probability map (TPM), as well as communication information corruption among UAVs due to packet loss and resulting local information inconsistency. The objective is to efficiently plan paths for the UAV swarm to discover more targets. To address the above issues, this paper proposes the following methods. Firstly, an information diffusion model is introduced to simulate the TPM changes caused by target movement and the discovery of obstacles based on a two-dimensional Gaussian distribution, thereby correcting the real-time target information held by UAVs. Secondly, a radial basis function neural network (RBFNN) is employed to recover the corrupted information, and a weighted averaging method is used to fuse the information from multiple UAVs, achieving consensus among the UAVs. Lastly, the original gray wolf optimizer (GWO) is enhanced by incorporating strategies such as Levy flight and the nonlinear convergence factor. Subsequently, the improved GWO is integrated with the model predictive control (MPC) for UAV trajectory planning, enabling a more efficient search for optimal paths. Experimental results demonstrate that the proposed information recovery method outperforms the compared methods by 24.7% in terms of performance. The proposed comprehensive search approach shows higher efficiency in searching for more targets within a specified time, and it also exhibits advantages in terms of area coverage and other aspects.

1. Introduction

With the continuous advancement of technology, unmanned aerial vehicles (UAVs), as emerging aircraft, are playing an increasingly important role in tasks such as target search, tracking, interference, firepower, and effectiveness evaluation [1–3]. The technology of UAVs searching for multiple moving targets has wide applications in the military, security, rescue, environmental protection, and other fields. In the military domain, UAVs can provide real-time monitoring and tracking of enemy moving targets, offering precise information support for combat command. In the security field, UAVs can be used to monitor public areas, promptly detect suspicious moving targets, and enhance public safety. In the rescue field, UAVs can swiftly locate disaster areas, search for missing persons, and improve rescue efficiency. In the environmental protection sector, UAVs can monitor wildlife, contributing to biodiversity conservation. In target search missions in high-risk areas, using reconnaissance means such as satellites and radar may lead to resource wastage, while using vehicles may not be efficient for reconnaissance. In such scenarios, UAV target search holds significant value. Therefore, the main focus of this paper is on the collaborative search of moving targets by multiple UAVs.

For multi-UAV cooperative target search tasks, path planning based on environmental information held by the UAV is an important research topic. Path planning algorithms for UAV cooperative search are presented in detail in [4]. These algorithms are classified into three categories based on (1) global probability maps (GPMs), (2) model predictive control (MPC), and (3) intelligent optimization algorithms (IOAs). Among them, GPM algorithms have higher requirements on the a priori information of the environment, and MPC algorithms have higher computational costs and higher requirements on the UAV hardware. In contrast, IOA algorithms need modest prior data, effectively reducing the computational complexity through information updating rules. However, IOAs might lag in convergence and precision due to update limitations, risking local optimality. To address this issue, relevant enhancements have been made as follows:

Sun et al. [5] proposed an improved intelligent water drop optimization algorithm by introducing heterogeneous water drop populations and a coevolutionary mechanism with a stronger ability to jump out of local optima. Shao et al. [6] optimized the initial distribution of particles and acceleration coefficients and proposed a mutation strategy that improved the performance of the original algorithm. The immune genetic algorithm proposed by Zhou et al. [7] utilized an immune strategy to enable UAVs to search for targets more efficiently in uncertain environments. The above methods perform well when searching for unknown stationary targets; however, in real search tasks, the targets faced by UAVs are often moving targets in unknown environments. Therefore, path planning algorithms also need to have strong real-time performance. Pérez-Carabaza et al. [8] used an ant colony algorithm to plan feasible paths for UAVs in real time to minimize the search time, which considered constraints in terms of communication and obstacles. However, the method required more a priori information. Zhen et al. [9] considered four types of time-sensitive targets and proposed a hybrid artificial potential field and ant colony optimization method. To search for dynamic targets more efficiently, the target probability map (TPM) update was related to the motion of the target, but the obstacle information was known a priori. Reference [10] integrates the gray wolf algorithm with the differential evolution algorithm, resulting in a better balance between the exploration and exploitation phases. Compared to the original algorithm, the improved algorithm plans trajectories with shorter and smoother total lengths. Fei et al. [4] combined MPC with an improved sparrow search algorithm to establish a multi-UAV cooperative search algorithm whose planned implementation paths have high search accuracy and stability. In most previous research, the initial state probability distribution of the target in the updating process of the TPM has been used without considering the influence of obstacles. However, actual TPM updates should take into account the target motion and positions of obstacles. Additionally, the UAV swarm lacks real-time collaboration during the target search process, resulting in low search efficiency. To address these issues, a real-time path planning algorithm needs to be designed. This algorithm should be able to detect obstacle information in the environment in real time and consider the probability changes caused by target motion. Furthermore, it should improve the environmental adaptability and overall search efficiency of the multi-UAV target search. The issues laid out above are the primary motivation for the research work described in this paper.

During collaborative UAV missions, information inconsistency arises due to differing UAV detections. Factors such as enemy interference, adverse weather, and ambient noise can degrade communication channels, leading to packet loss in transmitting environmental data among UAVs. Therefore, information inconsistency is more common among multiple UAVs. Consistency among UAVs is also known as reaching group consensus, which can avoid many problems such as repeated searches of the same area due to information inconsistency among multiple UAVs. A group consensus can generally be reached through information fusion, which integrates the environmental information of each UAV to make the information consistent [11–13]. Khan, Yanmaz, and Rinner [14] considered a variety of constraints, analyzed the characteristics of transmitted information, and investigated the effect of multisource information fusion on search time in path planning. Basso et al. [15] used information filtering to merge environmental information between multiple UAVs and further processed the fused information to obtain coherent information regarding the region of interest. Saadaoui et al. [16, 17] employed the LPSO algorithm to perform collaborative search tasks with multiple UAVs. In the global communication method, each UAV sends local information and fuses the information received at each time step. Zhou et al. [18] identified UAVs by their range and fused feasible ones. Zhou et al. [19] considered constraints such as energy consumption and investigated the reasonable allocation of UAV computational resources in information transmission and fusion processes. However, real environments introduce challenges such as adverse weather or other factors not addressed in the previously mentioned studies. Li et al. [20] considered the issue of lost information in transmission and used an improved least squares method to compensate for the lost information and fuse the information from each UAV. However, the influence of obstacles was not accounted for in the recovery of lost data [20]. Additionally, previous researchers have rarely considered inconsistent environmental information among UAVs due to communication degradation in the transmission of environmental information in obstacle environments. Therefore, a method that takes into account the obstacle information during the process of recovering damaged data, which is then fused with the recovered information, must be designed. This topic was the second motivation for the research work reported in this paper.

Inspired by the above two motivations, this study proposes a real-time path planning algorithm for the cooperative search of dynamic targets by multiple UAVs, which considers the effect of target motion on the local TPM of UAVs, revisiting the searched area by UAVs, and recovering the lost transmission information in the environment with the avoidance of obstacles by UAVs. The contributions of this study are as follows:

1.
The problem of inconsistent UAV information owing to poor communication and other factors is addressed. This study proposes a data recovery and information fusion method using a radial basis function neural network (RBFNN) to unify the environmental information held by the UAVs and enable multiple UAV systems to reach a group consensus. The details of the RBFNN are covered in Section 4.1.
2.
The problem of target motion and obstacle detection affecting the local TPM distribution of UAVs is addressed. In this study, we utilized an information diffusion mechanism to model the changes in the TPM distribution at each time step for the aforementioned reasons.
3.
An improved gray wolf optimizer (IGWO) combined with MPC is proposed, which can solve the current optimal path for UAVs at each time step and solve the problem of real-time path planning optimization for multiple UAVs.

The remainder of this paper is organized as follows: Section 2 describes the problem and path-planning model. In Section 3, the mechanism for updating the UAV environmental information is described; we propose a new consistency strategy for environmental information and describe the real-time path planning algorithm of the UAV. Simulation results are presented in Section 4. Finally, Section 5 summarizes this study.

2. Problem Formulation and Models

The main focus of this study is the problem of cooperative target search by multiple UAVs in adverse communication environments. The problem can be described as follows. Each UAV has knowledge of the initial position of the target, but during the search process, the target is moving and the UAVs encounter obstacles in the environment. Both of these situations cause changes in the UAVs’ local TPMs, necessitating the design of a method to compensate for these changes. Additionally, the adverse communication environment can result in damaged information transmission between the UAVs. Therefore, it is necessary to employ certain methods to recover the damaged information and perform information fusion to achieve consensus within the UAV cluster. Once consensus is reached, the UAVs update their local environmental information through their own sensors and communication, make decisions, and execute actions based on the currently available environmental information. This ensures a low-cost search while maintaining efficient path planning.

2.1. Environmental Model

The rasterized method [21] is used to divide the search area into L_x × L_y cells; that is, the search area has L_x cells in each row and L_y cells in each column. The coordinates of the g^th cell are denoted by g = (x_g, y_g), where x_g ∈ {1, 2, ⋯L_x} and y_g ∈ {1, 2, ⋯L_y}. It is assumed that multiple UAVs take off from the airport with the same flight attitude into the search environment for target search, that each UAV flies at a uniform speed at each time step, and that at each time step, the UAVs can only move one cell to an adjacent cell.

2.2. Environment-Aware Map Model

The perception of the environment by a UAV is described using the environment perception mapping model from [22], where the environment perception map consists of a TPM and an uncertainty map, which describes the probability of the presence of a target in all partitioned grids. The uncertainty map describes the degree of certainty that the UAV has about the information of the partitioned grids. The k-moment UAV_i in grid g, using the sensors to acquire the target probability, can be denoted as p_ig(k) ∈ [0, 1] and the uncertainty as χ_ig(k) ∈ [0, 1]. Together, the two forms of environmental perception information of the UAV can be represented as follows:

()

The updating of the local sensor information during the search for a target by a UAV is called detection updating, and according to the Bayesian information criterion, the TPM updating formula of UAV_i can be expressed as follows:

()

where z_ig(k) = 1 denotes that at k-moments of UAV_i, the presence of a target is observed in grid g, and vice versa; z_ig(k) = 0 denotes that no target is observed; p_d is the probability that a target is present in grid g and is correctly detected by the UAV; and p_f is the probability that a target is not present in grid g but is detected by the UAV.

To update the uncertainty, we use the following update method from [20]:

()

where η ∈ (0, 1) is the coefficient factor, d_ig(k) = 1 indicates that grid g was searched at moment k by UAV_i; d_ig(k) = 0 indicates that grid g has not been searched; χ_ig(k) = 1 indicates that the target probability for grid g is completely unknown at moment k to UAV_i; χ_ig(k) = 0 indicates that the target probability for grid g is completely known. This χ updating method generally gives a small value of η, which results in a rapid decrease in the uncertainty of the grid visited by the UAV and a slow decrease in the uncertainty of the grid not visited by the UAV. This kind of uncertainty update method is in line with the real-life pattern of sensor detection of unknown regions and simultaneously facilitates the efficiency of updating environment-aware maps.

2.3. UAV Model

The multi-UAV swarm used in this study comprises N isomorphic UAVs. The UAV model is appropriately simplified and can be regarded as a two-dimensional prime. UAVs within the swarm maintain constant flight heights, ensuring no interaircraft collision problem. The state S_i(k) of UAV_i at moment k can be described as (x_i(k), y_i(k)). The UAV heading and turn constraints can be described as follows. Each UAV has eight possible heading angles at moment t = 1, φ_i ∈ {0^∘, 45^∘, 90^∘, 135^∘, 180^∘, 225^∘, 270^∘, 315^∘}, and these headings are numbered as {1, 2, 3, 4, 5, 6, 7, 8}, whereas at subsequent moments, to enhance the range of the UAV and to reduce the computational burden, the UAV turn radius at moments t = 2, 3, 4⋯ is defined as Δφ_i(k) ∈ {−45^∘, 0^∘, 45^∘}. To standardize the UAV flight path, the UAV can only move from the current grid to its neighboring grids in each time step so that it can be located in the center of the grid in each time step when the heading angle of the UAV is φ_i ∈ {45^∘, 135^∘, 225^∘, 315^∘} and the velocity v_i is √2 times φ_i ∈ {0^∘, 90^∘, 180^∘, 270^∘}. The turn selectable direction of the UAV is shown in Figure 1; its control input (action) is u_i(k) = [v_i(k), Δφ_i(k)], and its motion model can be represented as follows:

()

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Turning directions of UAV.

In this study, the sensor detection radius of the UAV is D_r, the TPM of the grids within the detection range, and χ is updated locally according to the detection update rule in Section 2.2. The TPM of the undetected grids is updated using information diffusion, which is described in detail in Section 3.1. As shown in Figure 2, the detection area of the UAV at each moment is indicated by the surrounding green square.

2.4. Target Model

The targets considered in this paper are moving targets with known initial positions and unknown directions of motion. There are multiple targets in the search area, and their positions and motions are independent of each other. The probability of the existence of the above targets in the search area grid can be described by a two-dimensional normal distribution model as follows:

()

where (x_t(0), y_t(0)) is the initial known position of the target and

and

are the variances of the target. In addition, the motion of the target must satisfy the following rules.

1.
The maximum velocity of the target motion is less than or equal to the maximum velocity of the UAV motion;
2.
The motion of the target is independent of the UAV motion;
3.
The heading angle of the target at each moment can be selected from any of the eight directions described in Section 2.2.

2.5. Obstacle Avoidance Model

In the search area, impenetrable higher obstacles or enemy-controlled areas exist, and these unreachable grids must be avoided by the UAV during trajectory planning to prevent collisions. In this study, the UAV does not have any a priori information about the search area; thus, it needs to sense nearby obstacles through sensors during flight, store the locations of these discovered obstacles, and use the prediction of its own flight path for collision avoidance during flight. That is, the UAV must predict the m-step path after the current moment (within the detection range of UAVs); if the predicted path does not intersect with an obstacle, then this path is a feasible flight path. A feasible flight path is described as follows:

()

where (x_g(k + q), y_g(k + q)) is the predicted position of the UAV at moment k + q, Obs_detection is the obstacle region found by the UAV, feasible path = true denotes a feasible flight path, and feasible path = false denotes an infeasible path. Figure 3 shows the obstacle avoidance model with a predicted step size of m = 3, where the dark area represents the obstacle and the black path represents the feasible path.

3. Method

3.1. Collaborative Search Framework in Poor Communication Environments

In the aforementioned problem, the search area is divided into multiple grids using the grid-based approach. Assuming the initial position of the target is known but its movement direction is unknown, a normal distribution is used to describe the initial distribution of the target’s position. This distribution affects the update of the TPMs. During the actual search process, the target is moving and obstacles may be encountered. This is reflected in the TPM by increasing the target existence probability in the grid where the target is located and reducing the target existence probability to zero in the grids where obstacles are detected. Therefore, the initial target distribution cannot be used continuously to update the TPM. This paper adopts an information diffusion approach to compensate for the impact of the aforementioned issues and dynamically adjust the target information acquired by the UAVs.

The effectiveness of UAV decisions can be determined by a reward function, where the reward function J_i(S_i(k), u_i(k)) represents the reward obtained by UAV_i when it is in the state S_i(k) and takes control input u_i(k). A higher value of J_i(S_i(k), u_i(k)) indicates better performance of u_i(k). However, in adverse communication environments, when UAVs package and transmit their locally sensed environmental information to other UAVs, some data may be lost, leading to significant errors in the information fusion results between UAVs. As a result, consensus cannot be reached among the UAVs, and the decisions made may lead to mutual repetition or conflicts, thereby reducing the overall efficiency of cooperative search. Therefore, it is necessary to address the issue of inconsistent information among UAVs in a degraded communication environment. In this paper, we employ RBFNN to recover the damaged information and utilize a weighted averaging method to perform a multisource fusion of information between UAVs, ultimately achieving consensus among all UAVs in the group.

After addressing the aforementioned issues, UAVs make decisions based on the acquired environmental information, specifically in the context of path planning. In this paper, we propose the use of the MPC-based IGWO algorithm for path planning of the UAV cluster. MPC considers future states during the search process, enabling UAVs to adapt better to the environment and tasks. Additionally, compared to other algorithms, IGWO incorporates strategies such as Levy flight and nonlinear convergence factors, which enhance its ability to escape local optima. Figure 4 illustrates the framework of the multi-UAV cooperative search under adverse communication conditions.

3.2. Update of Environment-Aware Map

As described above, the process of updating the TPM is influenced by the distribution of the target’s position and the presence of obstacles detected in the environment. In this section, an information diffusion model is used to simulate the effects of target movement, and the discovered obstacle grid cells are subjected to specific operations to reduce the probability of the target’s existence to zero. Additionally, a revisiting mechanism is incorporated into the update process to address the issue of the target potentially moving into areas that have already been searched due to the extended search duration.

3.2.1. Update of TPM

In the process of the UAV searching for a target, the probability distribution of the target UAV will change owing to the target movement, so we cannot always use the initial distribution of the target to update the local TPM of the UAV. Thus, we must make a reasonable prediction of the target movement to improve accuracy when the TPM is updated.

For the initial distribution of information on the target, we adopted the information diffusion mechanism [23] to predict the change in the probability distribution of the target. This approach is similar to the gas diffusion mechanism, which mainly describes the phenomenon in which information will diffuse from the current grid to the surrounding neighboring grids. The information of the neighboring grids will also diffuse into the current grid, the information will diffuse from the high-probability grids to the low-probability grids, and the probability distribution of the target will converge to a uniform distribution after a long period of time.

For example, the basic diffusion model of the TPM for grid g can be expressed as follows:

()

where λ_d is the diffusivity and

is the set of neighboring grids of grid g. In general, the number of neighboring grids is an integer; it was set to eight to ensure the local characterization of the information.

To ensure that the information diffusion process is stable, the diffusion rate must meet the following requirements:

()

Using the initial distribution information of the target and the above probability diffusion model, any grid target probability can be obtained at any time, and all UAVs have the same initial information and probability diffusion rules, which avoids the problem of inconsistent information among UAVs. In addition, using the predicted target probability distribution to update the target probability of the corresponding grids results in a higher level of confidence.

The above diffusion process corresponds to the case in which there are no obstacles in the search area. However, after detecting an obstacle through the sensors, the UAV needs to change the way in which the information diffuses in the obstacle area and its neighboring areas.

No possible target is assumed to exist in the obstacle region. Prior to identifying this zone, information is diffused in the same manner as in a normal grid. Once the obstacle is discovered, the normal (nonobstacle) grid around the obstacle does not diffuse the probability information to the obstacle grid, and the target probability that already exists in the obstacle grid needs to be diffused to the normal grid around it. This update method is used instead of directly dropping to zero. This approach ensures that the TPM of the overall region will not change abruptly and that the local characteristics of the region near the obstacle can be retained. The probability decrease using the diffusion mechanism is very fast; thus, the target probability of the obstacle region is reduced quickly and eventually approaches zero. The diffusion method can be expressed as follows:

()

where

is the normal grid adjacent to the obstacle.

The direction of the diffusion of grid probabilities is shown in Figure 5, where g₁ is the obstacle grid, g₂ is the normal grid adjacent to the obstacle, and g₃ is the normal grid. The red and green arrows show the direction of the probability of spreading outward and inward, respectively.

3.2.2. Uncertainty-Related Revisit Mechanisms

Because of the random movement of the target in the search area, it may travel back to the area that was previously detected by the UAV, and the UAV does not have an effective reobservation method for these already detected grids. Thus, a grid-revisit method is needed to solve the problem.

The revisit mechanism used in this study is related to the uncertainty χ:

()

where

is the uncertainty of grid g that has already been detected, χ_s is a fixed increasing uncertainty constant, τ_g is the last visited moment of grid g, and T_h is the time threshold. The above equation states that if the time since grid g was last detected has exceeded the set threshold, that is, k − τ_g ≥ T_h, then the uncertainty of grid g will continuously increase subsequently, thus increasing the interest of the UAV in the grid and achieving the purpose of detecting it again.

3.3. Group Consensus Among UAVs

The transmission of environmentally aware information in poor communication environments leads to partial data loss. As described in this section, we first compensate for the lost data and then fuse the UAV local information with the processed received information to make the environment-aware information consistent among all UAVs.

3.3.1. Information Recovery

Suppose that all information about grid g in the transmission message E_j(k) from UAV_j is lost. The following methods were adopted to recover lost information from other UAVs. Regarding the target probability at grid g, firstly, owing to the local correlation of the transmission information, the target probability of the grids around grid g can be collected using the k-nearest neighbors algorithm [24] (KNNA), and then the collected data are fitted to the lost probability of grid g using the RBFNN [25]. Regarding the uncertainty of the loss χ of grid g, χ_ig(k) can be set to one because the transmission information has been lost and the UAV knows nothing about the grid.

When using the KNNA to collect the neighbor grid information of grid g, the number of neighboring grids collected is limited to ensure that the collected information is relevant to the missing information. The distance threshold, which is the distance between grid g and its neighboring grids, is defined as follows:

()

If L ≤ L_d, then the position information

and target probability of g₋neighbor are recorded. In addition, because the target probability between the obstacle and nonobstacle regions has obvious differences, the acquisition information of the two must be distinguished to improve the accuracy of the subsequent operation, selected as follows:

()

Figure 6 illustrates the neighborhood grid information collection at L_d = 1, with the loss probability grid in red and the neighboring grids within the distance threshold in green.

After collecting sufficient valid information using the KNNA, this information must be used to fit the missing values. Assuming the target probability in grid g from UAV_j’s TPM is lost, because the information contained in the TPM consists of only the coordinates of the grid and the target probability, the fitting function can be designed as p_jg(k) = f_recover(x_g, y_g), where p_jg(k) represents the target presence probability of grid g recovered at time k and f_recover is a recovery function fitted using the coordinates and target presence probabilities of neighboring grids g₋neighbor in UAV_j. Because the RBFNN has the characteristics of a simple structure, fast learning speed, excellent approximation performance, and generalization ability, it was employed in this study to solve the above-fitting function.

The RBFNN is a commonly used three-layer feedforward network, the three layers are the input, implicit, and output layers. The implicit layer has multiple radial basis activation functions, which are used to map the input vectors from low-dimensional m to high-dimensional n; thus, an indistinguishable low-dimensional linear space is converted into a distinguishable high-dimensional linear space. The RBFNN is a local approximation network in which only a few connection weights in the local region of its input space affect the output. Compared with backpropagation and other global approximation networks, an RBFNN has a faster learning speed, which is conducive to meeting the real-time requirements of searching for targets in UAVs. The input of the RBFNN consists of the coordinates of neighboring grids and the target probability, whereas the output consists of the target probability of the damaged grid. Gaussian functions are used as the radial basis functions. The structure of the RBFNN is illustrated in Figure 7.

The parameter that must be determined in the RBFNN training process is the spread speed d, which mainly serves to adjust the sensitivity of the hidden layer to the data. The lower the spread speed d, the higher the sensitivity of the implicit layer to the input, and the more accurate the function approximation; however, the approximation process will be poorly smoothed, resulting in average network performance. The higher the d is, the smoother the function fitting will be, but the approximation error will be correspondingly larger.

3.3.2. Information Fusion

To ensure that the UAV reaches a group consensus in terms of environment-aware information, we fused the UAV local information with the recovered received information using a weighted average. The fusion probability of grid g can be expressed in terms of the TPM as follows:

()

where

is the fusion target probability of grid g at moment k and ω_jg(k) denotes the fusion weights of UAV_j at moment k with respect to grid g. To improve the accuracy of the fusion process and utilize all the information efficiently, the weights ω_jg(k) are updated using the uncertainty χ_jg(k):

()

The pseudocode for information recovery and fusion is shown in Algorithm 1.

Algorithm 1: Pseudocode for information recovery and merge.

1: Input Distance threshold for KNAA L_d, Spread speed of RBFNN d, Number of UAVs N
2: Restore the lost information of grid g
3: for n =1:N do
4: for x =1: L_x do
5: for y =1: L_y do
6: if L <L_dthen
7: Collect, and p_{jg_neighbor}(k)
8: end if
9: end for
10: end for
11: Input , , p_{jg_neighbor}(k) and (x_g, y_g) to RBFNN
12: Obtain the fitted value p_jg(k)
13: end for
14: for each grid do
15: calculate the weighted matrix ω_jg(k)
16: end for
17: calculate the final recovered target probability

3.4. Collaborative Target Search Algorithm

Section 3.3 presents UAV environment perception information with collective consensus. This section describes the combination of the MPC strategy with IGWO to find the optimal path for each UAV. Firstly, the UAVs exchange their respective environmental information such as position and target probability. Then, MPC is used to generate a set of feasible predicted paths for each UAV. Subsequently, IGWO evaluates the quality of each path using a path evaluation function and selects a path with a higher evaluation in each iteration. This newly generated path can be one of the original paths or a path generated by the algorithm. Through multiple iterations, the UAVs can achieve collaboration and obtain more efficient search paths.

3.4.1. Description of MPC

The basic idea of MPC is to predict the future dynamics of the UAV at each sampling moment, solve a finite-time open-loop optimization problem online, and consider the first element of the obtained control sequence as the decision of the UAV. This method of predicting the future state is helpful for enhancing the perception performance and adaptability to the environment of UAVs and for ensuring the optimality of the planned path to a certain extent; therefore, it is widely used in path planning problems.

The basic process of MPC can be described as follows: At moment k, UAV_i predicts that the control input sequence for the next m-steps as u[k : k + m − 1] = [u(k), ⋯, u(k + m − 1)], and J_i(S_i(k), u_i[k : k + m − 1]) can be used to evaluate the effect of the control input sequence. If J_i(S_i(k), u_i[k : k + m − 1]) is the highest, it means that the effect is the best among all the control input sequences, and the optimal control sequence can be expressed as follows:

()

Considering the accuracy of the prediction, only the first term, that is, , is executed. When the moment steps to k + 1, the above process is repeated, and the optimal control input, , is obtained at moment k + 1. Figure 8 shows the three-step MPC prediction process for m = 6, with the dashed line indicating the predicted trajectory, and the bold solid line indicating the actual action taken.

3.4.2. IGWO

3.4.2.1. Gray Wolf Optimizer (GWO) Algorithm

The GWO [26] is a classical and effective IOA that simulates the hunting behavior of gray wolves in nature, with the advantages of a simple structure and high solution accuracy. Gray wolves hunt in a hierarchy, as shown in Figure 9, where the lower ranked wolf ω is led by higher ranked wolves α, β, and δ to surround, approach, and capture prey.

Assuming that the number of gray wolves in the pack is Pop_size and the position of the gray wolf i at moment k is S_i(k) = (x_i(k), y_i(k)), the gray wolf with the highest fitness value is denoted as α (i.e., the one closest to the prey). The gray wolves with the second- and third-highest fitness values are denoted as β and δ, respectively, and the remaining gray wolves are denoted as ω. The mathematical description of the GWO is as follows:

()

In Equation (16), D denotes the bearing and unit distance moved by a gray wolf toward the target, k is the number of iterations, S_p(k) is the position of the prey at iteration to k generations, and C is a coefficient, expressed as:

()

where r₁ is a random value in the interval [0, 1]. Equation (17) indicates that the distance between the gray wolves and the target decreases and, eventually, the gray wolf pack forms an encircling posture on the target. The convergence factor A can be expressed as follows:

()

where r₂ in A is a random value in the interval [0, 1], a is a control parameter for the convergence factor A, and MAX is the maximum number of iterations.

3.4.2.2. Improvement Strategies

1.
Levy flight strategy: Levy flight is stochastic wandering with a heavy-tailed probability distribution of the step length, which has a high probability of jumping out of the current search area during the wandering process. It then explores a larger search space and thus is helpful for solving the problem of falling into the local optimal solution, which can be mathematically described as follows:

()

where S(k) is the position of an individual gray wolf at the k^th iteration.

()

where p is the step size of the Levy flight; η is a random number belonging to [0,2], usually 1.5; S_b is the current optimal individual; and μ and v follow a normal distribution.

2.
Nonlinear convergence factor strategy: The convergence factor A determines the search strategy of the gray wolf, and the control parameter a in the convergence factor A is optimally set to regulate the relationship between global and local searches more effectively throughout the iteration process.

()

As can be seen from Equation (23), we propose that the convergence factor decreases more gently in the preiteration period, which is conducive to the algorithm conducting a global search in a larger search space. The latter period decreases more sharply, which is conducive to the local search of the algorithm and can find the optimal solution in the candidate space more quickly.

3.
Optimal individual information fusion strategy: To improve the ability of the community to find the optimal value in the global range further, the following strategy is designed such that the optimal individual in the k^th iteration is α_k, β_k, and δ_k; α, β, and δ are the individuals with the optimal fitness values up to this iteration, which is briefly referred to as the global optimal individual. Because the risk of falling into local optimality increases in the late iterations of the algorithm, it is considered that the optimal individuals α, β, and δ (e.g., in the k^th generation) and other stochastic individuals of this generation are integrated into the global optimal individuals in the form of differential evolution during the iteration process, which not only enhances the stochasticity and diversity of the community but also improves the solving efficiency of the algorithm. The mathematical description of this strategy is as follows

()

where S_α,β,δ denotes the positions of the global optimal individuals α, β, and δ; S_α(k), S_β(k), and S_δ(k) are the positions of the k^th iteration optimal individuals α_k, β_k, and δ_k, respectively; and S_rand i(k), i = 1, 2, 3 denotes the three randomly selected individuals in the k^th iteration. In Equation (25), K₁ is the positional update weight.

The pseudocode for IGWO is given in Algorithm 2.

Algorithm 2: Pseudocode for IGWO.

1: Input Population size Pop_size, Number of iterations MAX, Search area
2: Calculate the fitness of all gray wolves in the k^th iteration, and denote the top three individuals in fitness as α_k, β_k and δ_k
3: Update the global optimal individual position by using (24) and (25)
4: Update a and A by applying (23) and (19), update C according to Ref. [27]
5: Update all the individual positions by utilizing (16) and (17), apply Levy flight strategy to the individuals, and retain the optimal solution

3.4.3. Path Evaluation Function

The path evaluation function in this study primarily considers three aspects: energy cost, search reward, and uncertainty. It defines the evaluation function J_i(S_i(k), u_i(k)) for the m-step prediction path of the UAV generated by MPC as follows:

()

where ω₁, ω₂, and ω₃ are the weights of the energy cost, search reward, and uncertainty, respectively, and ω₁ + ω₂ + ω₃ = 1. The energy cost L_i(k) is defined as follows:

()

where (x_ir(k), y_ir(k)) is the next time-step coordinate of (x_is(k), y_is(k)) in UAV_i on the m-step prediction path, that is, r = s + 1. The equation indicates the desire for the UAV to take shorter paths during the process of searching for the target. The search reward R_i(k) is defined as

()

where N_D denotes the set of grids detected by UAV_i in the MPC prediction path. The uncertainty evaluation function U_i(k) is defined as

()

The higher the uncertainty, the more likely a potential target exists in that grid.

4. Experiments

The simulation experiments were conducted in the MATLAB 2020a environment. Table 1 lists the symbols used in this study and their corresponding values. The number of grids that lost information at each moment when communication was performed between the UAVs was random, the initial placement position of the UAVs was on the boundary of the search area, and the number of UAVs was three. In Figure 10, the white, magenta, and purple lines are used to denote the motion path of the UAVs in their searching process, and the black line denotes the target movement path. The higher the target probability in the simulation results, the brighter the color of the grid, where blue and green represent the lower target probability and yellow represents the higher target probability. The obstacle region settings used in this study are shown in Figure 11.

Table 1. Descriptions of parameters.

Notation	Description	Value
χ_ig(0)	Initial uncertainty of the grid g	1
Pop_size	Population size	30
MAX	Maximum number of IGWO iterations	500
N	Number of UAVs	3
P_is	Initial position of UAV_i	/
λ_d	Diffusivity	0.1
V_i	Velocity interval of UAV_i	1 grid
d	Spread speed of RBFNN	10
L_d	Distance threshold for KNAA	1 grid
T_h	Time threshold for revisit	50 s
m	MPC predicted steps	6
S_MAX	Maximum number of exercise steps for UAV_i	/
p_d	Sensor detection probability for UAV_i	0.9
p_f	Sensor false alarm probability for UAV_i	0.1
D_r	Detection radius of UAV_i	/

4.1. Information Recovery and Fusion Performance Testing

The search area was a rectangular area of 9.5 km × 9.5 km, which was divided into 19 × 19 grids, each with a length and width of 0.5 km, in which five targets existed with initial positions of 3.25 km and 2.25 km, 4.75 km and 4.75 km, 6.75 km and 4.75 km, 3.75 km and 8.25 km, and 6.25 km and 6.75 km; in this area, the speed was 50 m/s for horizontal or vertical flight, the speed was √2 times the horizontal for oblique flight, the decision time interval was 10 s (i.e., the UAV was at the center of any grid each time it made a decision), and the maximum number of movement steps was S_MAX = 30; the information was lost during the transmission process, and the number of lost data grids was 10% of the total number of grids in the area.

Figure 12 shows a vertical view of TPM fusion for each UAV. Comparing Figures 12(a) and 12(b) demonstrates that numerous grids have significant missing data in the received information of UAV1, and the loss of these data hinders the subsequent decision-making process of UAV1. Figures 12(c) and 12(d) show the fused TPMs after recovering the target probabilities of the missing grids based on the strategies of RBFNN and [20] for UAV1, respectively. The RBFNN method of this study evidently has a better information recovery effect because [20] uses the least squares method for regression fitting, and the form of the fitting function used is generally adapted to the distribution of the target probability, whereas the RBFNN does not need to consider the form of the distribution of the fitting function. In addition, Li et al. [20] have a minimum restriction on the number of grids collected by the KNNA, but some of the grids that are at the boundaries of the obstacles are unable to collect enough information about neighboring grids. Figures 12(c) and 12(e) show the respective fused TPMs of the two UAVs, which are essentially the same as those of the fused TPM of the unlost data in Figure 12(a). The maximum value of the information error for the same grids in Figures 12(c) and 12(e) is 6.06 × 10⁻⁴, so that the information fusion strategy proposed in this study can enhance the consistency of information among UAVs and thus reach a better group consensus state.

To compare the two methods further, the recovery error of each lost data grid is shown in Figure 13, where the total number of lost data grids is 40. The maximum recovery error of the strategy in [20] is 0.00223, and the maximum recovery error of the proposed strategy is 0.00168. Moreover, among the 40 grids with lost data, the recovery results of the strategy used in this study are better than those of [20] in 28 out of 40 grids. These recovery results are better than those of [20], and in most of the grids with better recovery results, the error is less than 3 × 10⁻⁴; thus, the recovered fusion TPM can be assumed to be equal to the state of the unlost data. Therefore, the information recovery method adopted in this study can be assumed to reduce the information error caused by the data loss significantly.

4.2. Effect of UAV Parameters on Search Performance

To test whether the algorithm in this study has high requirements for the model parameters, this section explores whether IGWO is sensitive to the parameters of the UAV detection radius D_r, number of UAVs N, and sensor performance (p_d and p_f).

•
Experiment 1: The UAV detection radius D_r was set to {0.5, 1, 1.5, and 2 km}. Figure 14(a) shows the number of search targets and the detection grid coverage for different UAV detection radii.
•
Experiment 2: The number of UAVs was set to {2, 3, 4, 5}. Figure 14(b) shows the effect of the number of UAVs on the number of search targets and detection grid coverage.
•
Experiment 3: The UAV sensor detection parameters were set to {p_d = 0.6, p_f = 0.4}, {p_d = 0.7, p_f = 0.3}, {p_d = 0.8, p_f = 0.2}, and {p_d = 0.9, p_f = 0.1}. Figure 14(c) demonstrates the sensitivity of the IGWO to the sensor performance.

For all three experiments, each set of experiments was run independently 20 times, and the results were averaged.

From Figure 14(a), when D_r = 0.5 or 1 km, all targets cannot be found at the end of the search, and the UAV cluster finds all targets when D_r > 1.5 km. Only one target is found in the whole search process when D_r = 0.5 km. In addition, the grid coverage increases owing to the increase in the detection radius of the UAV. When D_r is small, the cluster search efficiency is poor; therefore, the algorithm in this study is more sensitive to the detection radius of the sensors used, and sensors with smaller detection radii cannot be used.

From Figure 14(b), when N = 2, the UAV cluster finally searches for four targets; when N > 2, the UAV cluster can find all targets, and the grid coverage also improves with an increase in the number of UAVs.

From Figure 14(c), the UAV cluster, only the sensor with {p_d = 0.9, p_f = 0.1}, finds all the targets, and the search efficiency is higher. The sensors with smaller p_d and larger p_f all increase the grid coverage in the middle and late iterations to jump out of the local optimum to find more targets. However, they are not successful. The {p_d = 0.9, p_f = 0.1} sensors can find more targets by covering fewer grids, indicating that the sensor performance significantly affects the search results, and UAVs can achieve good search results only when the sensor performance is excellent.

4.3. Performance Comparison of Different Search Algorithms

To test the performance of the search algorithm proposed in this study, simulation experiments were conducted using six search algorithms, including the algorithm proposed in this study: GWO, distributed gray wolf optimizer (DGWO) [28], motion-encode particle swarm optimization (MPSO) [29], greedy [19], distributed MPC (DMPC) [30], and IGWO. GWO, DGWO, and MPSO are popular IOAs that apply their respective rules for reasoning and solving to predict the optimal target grids for the next moment. DGWO divides all individuals in the wolf pack into several islands, with each island containing multiple candidate solutions. The islands update their positions according to the updating rules of GWO, and they exchange their candidate solutions with each other at a certain frequency; MPSO encodes the search trajectory into motion paths of the UAV, which vary with the particle positions in the PSO algorithm, preserving group cognition and global consistency. The greedy method favors grids with the largest target probability in the next moment. DMPC uses MPC for the next m-steps of the UAV, seeking optimal, high-efficiency paths, which is computationally costly.

The simulation scenario is the multitarget scenario described in Section 4.1, with three UAVs with initial departure positions at 0.25 km and 1.25 km, 0.25 km and 4.25 km, and 0.25 km and 7.25 km, respectively. The search radius D_r was 1.5 km. The number of targets searched for, time used for searching, and coverage of detection grids were used to analyze the strengths and weaknesses of the method. In addition, the overlap of the UAV search path with the grid of high-probability regions can aid in analyzing the performance of the algorithm.

The comparison of the number of searched targets and the coverage of detected grids during the search process of the six methods is shown in Figure 15, which demonstrates that the IGWO algorithm outperforms the other algorithms in terms of the number of targets found, area coverage, and search efficiency. The performance of the IGWO is average in the presearch period, whereas the DMPC and DGWO have advantages in both the number of searched targets and coverage of detected grids because in the presearch period, UAVs have less information about the environment. In the DMPC method, the local information can be updated quickly by traversing all the predicted paths, whereas IOAs use their own updating rules for inference. Owing to the small difference in information between the grids, the degree of certainty of the grid information is low, which leads to a large error in the inference results. DGWO provides an additional exploration mechanism where the island model increases the probability of transforming low-quality candidate solutions on each island into better solutions. Compared to GWO, the computational complexity is significantly reduced, which is the reason for its efficient early-stage search. In the early stages of the search, IGWO directs UAVs toward high-probability areas. As the search progresses, each UAV begins to consider factors such as probability distribution and uncertainty, and based on their respective positions, they disperse their actions. The speed and quantity of target discoveries steadily increase, with all five moving targets found by 220 s. In contrast, other methods only found four targets by their maximum search time, demonstrating that the approach proposed in this paper has a stronger ability to escape local optima.

For the detection grid coverage, the proposed method maintains a low level in the first and middle periods. This occurs because the next moment of UAV travel to the grid predicted by this method has a higher reward value, which can accurately search for the target in a smaller detection area. This finding proves the feasibility of the strategy proposed in this study. At the late stage of the search, as the other methods get stuck in local optima, they fly around the local optimum for a long time, failing to find the last target, which slows the growth of coverage. However, the method proposed in this study promptly escapes local optima. After all the targets were found in 220 s, the UAV was unable to find the target each time it updated its position because the target disappeared; therefore, it continued to expand the search area to find the target, and the coverage rate increased rapidly in the process.

To demonstrate the superiority of the proposed method more intuitively, the comparison data of the six methods at different time periods are listed in Table 2. At t = 100 s, the number of targets searched by DGWO was two, while the other methods were all one, and the coverage of DMPC was higher than that of the remaining algorithms; at t = 150 s, the number of targets searched by DGWO was one, while the other methods were all one; at t = 200 s, IGWO and GWO can search four targets with lower coverage, whereas the greedy algorithm and DMPC only searched three targets, DGWO still only found four targets; at t = 250 s, only the IGWO searched all targets and the growth rate of its coverage was accelerated; and at t = 300 s, only the IGWO found all targets and achieved a coverage of 0.7875, which is higher than those of the other algorithms.

Table 2. Comparison of the performance of six algorithms in different phases.

Search time t (s)	100	150	200	250	300
GWO
Number of targets searched	1	3	4	4	4
Coverage rate	0.3375	0.4525	0.5450	0.6400	0.7625
MPSO
Number of targets searched	1	3	4	4	4
Coverage rate	0.3825	0.5025	0.6075	0.6225	0.7125
Greedy
Number of targets searched	1	3	3	3	4
Coverage rate	0.3350	0.4425	0.4975	0.5700	0.7225
DMPC
Number of targets searched	1	3	3	4	4
Coverage rate	0.4100	0.5325	0.6425	0.7275	0.7475
DGWO
Number of targets searched	2	4	4	4	4
Coverage rate	0.3900	0.5400	0.5775	0.5800	0.7600
IGWO
Number of targets searched	1	3	4	5	5
Coverage rate	0.3375	0.4525	0.5100	0.6175	0.7825

The overall search process is shown in Figure 10, with the final TPM of UAV1 in the bottom panel (Section 4.1 demonstrates that the TPMs of all UAVs are consistent), where the darker areas are the obstacles detected by the UAVs and the red and violet dots are the starting points of the UAV and targets, respectively. Notably, the trajectories of IGWO and GWO were the same in the early stage; however, the advantages of the strategy developed in this study became effective after the UAV gathered a certain amount of environmental information and avoided the local optima to locate all the targets. DGWO shows high efficiency in the early stage of search but tends to get trapped in local optima in the mid to late stages. The MPSO and greedy methods focused on high-probability regions, leading to local optima entrapment and hampering search efficiency. DMPC can escape local optimum; however, it has limitations with updating rules, impacting the overall information utilization and resulting in inferior search efficiency compared to that of IGWO.

5. Conclusions and Future Works

In this study, we investigated collaborative target search methods for multiple UAVs in adverse communication environments. To recover damaged transmission information, a strategy based on KNNA, RBFNN, and multisource information fusion was developed. The simulation results indicated that this strategy effectively maintained consistency among all the information of all UAVs. Additionally, to mitigate the impacts of target motion and obstacles on the local TPMs of the UAVs, we employed an information diffusion mechanism for corresponding corrections. For UAV path planning, an algorithm combining the MPC and IGWO was designed. The simulation results demonstrate that IGWO outperforms the popular IOA and other algorithms in terms of optimization performance, significantly improving the search success rate and efficiency. The use of neural networks in recovering corrupted data in this study imposes a heavier computational burden on UAVs compared to classical methods. Additionally, IGWO requires UAVs to possess strong computational capabilities. Therefore, in scenarios with high real-time demands, corresponding improvements may be necessary. We did not consider factors such as flight altitude and adjustable speed and acceleration of the UAV, which will be the focus of our future research.

Conflicts of Interest

The authors declare no conflicts of interest.

Funding

The authors received no specific funding for this work.

Acknowledgments

The authors have nothing to report.

Open Research

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

1 Duan H., Zhao J., Deng Y., Shi Y., and Ding X., Dynamic discrete pigeon-inspired optimization for multi-UAV cooperative search-attack mission planning, IEEE Transactions on Aerospace and Electronic Systems. (2021) 57, no. 1, 706–720, https://doi.org/10.1109/TAES.2020.3029624.
10.1109/TAES.2020.3029624
Web of Science® Google Scholar
2 De Moraes R. S. and De Freitas E. P., Multi-UAV based crowd monitoring system, IEEE Transactions on Aerospace and Electronic Systems. (2019) 56, no. 2, 1332–1345.
10.1109/TAES.2019.2952420
Web of Science® Google Scholar
3 Xia Z., Du J., Wang J., Jiang C., Ren Y., Li G., and Han Z., Multi-agent reinforcement learning aided intelligent UAV swarm for target tracking, IEEE Transactions on Vehicular Technology. (2022) 71, no. 1, 931–945, https://doi.org/10.1109/TVT.2021.3129504.
10.1109/TVT.2021.3129504
Web of Science® Google Scholar
4 Fei B., Bao W., Zhu X., Liu D., Men T., and Xiao Z., Autonomous cooperative search model for multi-UAV with limited communication network, IEEE Internet of Things Journal. (2022) 9, no. 19, 19346–19361, https://doi.org/10.1109/JIOT.2022.3165278.
10.1109/JIOT.2022.3165278
Google Scholar
5 Sun X., Cai C., Pan S., Zhang Z., and Li Q., A cooperative target search method based on intelligent water drops algorithm, Computers & Electrical Engineering. (2019) 80, article 106494, https://doi.org/10.1016/j.compeleceng.2019.106494, 2-s2.0-85073555822.
10.1016/j.compeleceng.2019.106494
Google Scholar
6 Shao S., Peng Y., He C., and Du Y., Efficient path planning for UAV formation via comprehensively improved particle swarm optimization, ISA Transactions. (2020) 97, 415–430, https://doi.org/10.1016/j.isatra.2019.08.018, 2-s2.0-85070395947, 31416619.
10.1016/j.isatra.2019.08.018
PubMed Web of Science® Google Scholar
7 Zhou Z., Luo D., Shao J., Xu Y., and You Y., Immune genetic algorithm based multi-UAV cooperative target search with event-triggered mechanism, Physical Communication. (2020) 41, article 101103, https://doi.org/10.1016/j.phycom.2020.101103.
10.1016/j.phycom.2020.101103
Google Scholar
8 Pérez-Carabaza S., Schere J., Rinner B., López-Orozco J., and Besada-Portas E., UAV trajectory optimization for minimum time search with communication constraints and collision avoidance, Engineering Applications of Artificial Intelligence. (2019) 85, 357–371, https://doi.org/10.1016/j.engappai.2019.06.002, 2-s2.0-85068265397.
10.1016/j.engappai.2019.06.002
Web of Science® Google Scholar
9 Zhen Z., Che Y., Wen L., and Han B., An intelligent cooperative mission planning scheme of UAV swarm in uncertain dynamic environment, Aerospace Science and Technology. (2020) 100, article 105826, https://doi.org/10.1016/j.ast.2020.105826.
10.1016/j.ast.2020.105826
Web of Science® Google Scholar
10 Yu X. B., Jiang N. J., Wang X. M., and Li M., A hybrid algorithm based on grey wolf optimizer and differential evolution for UAV path planning, Expert Systems With Applications. (2023) 215, article 119327, https://doi.org/10.1016/j.eswa.2022.119327.
10.1016/j.eswa.2022.119327
Google Scholar
11 Cabreira T. M., Brisolara L. B., and Ferreira P. R., Survey on coverage path planning with unmanned aerial vehicles, Drones. (2019) 3, no. 1, https://doi.org/10.3390/drones3010004.
10.3390/drones3010004
Web of Science® Google Scholar
12 Kappel K. S., Cabreira T. M., Marins J. L., Brisolara L. B., and Ferreira P. R., Strategies for patrolling missions with multiple UAVs, Journal of Intelligent & Robotic Systems. (2020) 99, no. 3-4, 499–515, https://doi.org/10.1007/s10846-019-01090-2, 2-s2.0-85073820579.
10.1007/s10846-019-01090-2
Google Scholar
13 Saadaoui H. and Bouanani F. E., Information sharing in UAVs cooperative search based on calculating the minimum time, Proceedings of the ACM International Conference Processing Series, 2017, Rabat, Morocco, 168–173.
Google Scholar
14 Khan A., Yanmaz E., and Rinner B., Information merging in multi-UAV cooperative search, Proceedings of the IEEE International Conference on Robotics and Automation, 2014, HK, China, 3122–3129.
Google Scholar
15 Maik B., Diego S., Pedro S., and Edison F., Merging three-dimensional occupancy grid maps to support multi-UAVs cooperative navigation systems, Journal of Field Robotics. (2023) 40, no. 3, 483–504, https://doi.org/10.1002/rob.22143.
10.1002/rob.22143
Google Scholar
16 Saadaoui H. and El Bouanani F., Information sharing based on local PSO for UAVs cooperative search of unmoved targets, Proceedings of the 2018 international Conference on Advanced Communication Technologies and Networking, 2018, Marrakech, Morocco.
Google Scholar
17 Saadaoui H., Bouanani F. E., and Llli E., Information sharing based on local PSO for UAVs cooperative search of moved targets, IEEE Access. (2021) 9, 134998–135011, https://doi.org/10.1109/ACCESS.2021.3116919.
10.1109/ACCESS.2021.3116919
Google Scholar
18 Zhou W., Li J., Liu Z., and Shen L., Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning, Chinese Journal of Aeronautics. (2022) 35, no. 7, 100–112, https://doi.org/10.1016/j.cja.2021.09.008.
10.1016/j.cja.2021.09.008
Web of Science® Google Scholar
19 Zhou L., Leng S., Liu Q., and Wang Q., Intelligent UAV swarm cooperation for multiple targets tracking, IEEE Internet of Things Journal. (2022) 9, no. 1, 743–754, https://doi.org/10.1109/JIOT.2021.3085673.
10.1109/JIOT.2021.3085673
Web of Science® Google Scholar
20 Li L., Zhang X., Yue W., and Liu Z., Cooperative search for dynamic targets by multiple UAVs with communication data losses, ISA Transactions. (2021) 114, 230–241, https://doi.org/10.1016/j.isatra.2020.12.055, 33423767.
10.1016/j.isatra.2020.12.055
PubMed Web of Science® Google Scholar
21 Zhang Y., Zhou D., and Xia H., Cooperative search algorithm for multi-UAV in uncertain environment, Electronics Optics & Control. (2012) 19, no. 2, 5–25.
Google Scholar
22 Hu X., Liu Y., and Wan G., Optimal search for moving targets with sensing capabilities using multiple UAVs, Journal of Systems Engineering and Electronics. (2017) 28, no. 3, 526–535, https://doi.org/10.21629/JSEE.2017.03.12, 2-s2.0-85024500985.
10.21629/JSEE.2017.03.12
Google Scholar
23 Peng H., Huo M., Liu Z., and Xu W., Simulation analysis of cooperative target search strategies for multiple UAVs, Proceedings of the 2015 27th Chinese Control and Decision Conference, 2015, Qingdao, China, 4855–4859.
Google Scholar
24 Kashvi T., Sanjukta D., Srishti V., and Aleena S., A brief review of nearest neighbor algorithm for learning and classification, Processing of the 2019 International Conference on Intelligent Computing and Control Systems, 2019, Madurai, India, 1255–1260.
Google Scholar
25 Adnan E., Ahmed E., Zaher Y., Haitham A. A., and Mohammed A. F., RBFNN-based model for heavy metal prediction for different climatic and pollution condition, Neural Computing and Applications. (2017) 28, 1991–2003.
10.1007/s00521-015-2174-7
Google Scholar
26 Seyedali M., Seyed M., and Andrew L., Grey wolf optimizer, Advances in Engineering Software. (2014) 69, 46–61, https://doi.org/10.1016/j.advengsoft.2013.12.007, 2-s2.0-84893010002.
10.1016/j.advengsoft.2013.12.007
Web of Science® Google Scholar
27 Mittal N., Singh U., and Sohi B. S., Modified grey wolf optimizer for global engineering optimization, Applied Computational Intelligence and Soft Computing. (2016) 2016, no. 1, 7950348, https://doi.org/10.1155/2016/7950348.
10.1155/2016/7950348
Google Scholar
28 Abed-alguni B. H. and Alawad N. A., Distributed grey wolf optimizer for scheduling of workflow applications in cloud environments, Applied Soft Computing Journal. (2021) 102, article 107113, https://doi.org/10.1016/j.asoc.2021.107113.
10.1016/j.asoc.2021.107113
Google Scholar
29 Manh D. P. and Quang P. H., Motion-encoded particle swarm optimization for moving target search using UAVs, Applied Soft Computing. (2020) 97, article 106705.
Google Scholar
30 Xiao D., Jiang J., Zhou J., and Yu C., Multi-UAV cooperation search for moving targets under limited communication, Journal of Harbin Engineering University. (2018) 39, no. 11, 1823–1829.
Google Scholar

All articles