Volume 2025, Issue 1 9959245
Review Article
Open Access

Advancing Capabilities With UAVs: A Comprehensive Review of Sensor Fusion, Path Planning, Energy Management, and Deep Learning Algorithms

Suresh Kumar P

Suresh Kumar P

Automotive Research Centre , Vellore Institute of Technology , Vellore , 632014 , India , vit.ac.in

Search for more papers by this author
Aryan Shyam S

Aryan Shyam S

School of Mechanical Engineering , Vellore Institute of Technology , Vellore , 632014 , India , vit.ac.in

Search for more papers by this author
Rammohan A.

Rammohan A.

Automotive Research Centre , Vellore Institute of Technology , Vellore , 632014 , India , vit.ac.in

Search for more papers by this author
Ramesh Kumar C

Ramesh Kumar C

Automotive Research Centre , Vellore Institute of Technology , Vellore , 632014 , India , vit.ac.in

Search for more papers by this author
Saboor Shaik

Saboor Shaik

School of Mechanical Engineering , Vellore Institute of Technology , Vellore , 632014 , India , vit.ac.in

Search for more papers by this author
Qasem M. Al-Mdallal

Corresponding Author

Qasem M. Al-Mdallal

Department of Mathematical Sciences , United Arab Emirates University , P.O. Box 15551 , Al Ain , Abu Dhabi, UAE , uaeu.ac.ae

Search for more papers by this author
Mohammad Mukhtar Alam

Mohammad Mukhtar Alam

Department of Industrial Engineering , College of Engineering , King Khalid University , Abha , 61421 , Saudi Arabia , kku.edu.sa

Center for Engineering and Technology Innovations , King Khalid University , Abha , 61421 , Saudi Arabia , kku.edu.sa

Search for more papers by this author
Fayaz Hussain

Fayaz Hussain

Department of Biological and Agricultural Engineering , Faculty of Engineering , Universiti Putra Malaysia , Selangor , Malaysia , upm.edu.my

Search for more papers by this author
First published: 23 June 2025
Academic Editor: Mohamed Louzazni

Abstract

The current engineering and construction of buildings has led to an increase in building height, which has created difficulties for search and rescue teams working with fire departments. These challenges are exacerbated by the limited reach of the fire ladder and water hose restrictions. Furthermore, traffic jams might make it difficult for the fire brigade to get to the site quickly, which could lead to the loss of life and property. To combat that issue, unmanned aerial vehicles (UAVs) can be employed for environment mapping in buildings during fire emergencies which will help detect fire and identify trapped humans. That is possible by equipping UAVs with path-planning algorithms, control strategies, and image-processing algorithms coupled with a robust sensor fusion algorithm. Well-designed UAVs can help overcome the limitations of traditional firefighting methods. This paper presents an elaborate review of the development of UAVs in terms of their data handling, control management, and energy storage. For the data handling and control management systems, the paper discusses effective sensor fusion algorithms, optimal path-planning strategies, control techniques, and fire detection algorithms. The use of hybrid energy storage for UAVs along with its energy management is elaborated. Through this review, researchers will acquire knowledge about the existing development in the application of UAVs in firefighting. Finally, the paper highlights advancements and limitations of various sensor fusion, path planning and energy management strategies to increase the flight time.

1. Introduction

Unmanned aerial vehicles (UAVs) have gained immense popularity because of the advancement of artificial intelligence (AI) [1]. It is an airborne robot that does not require a human pilot on board. The most common forms of UAVs are drones whose operation is controlled remotely by pilots situated at a ground station [2]. These UAVs can have different levels of automation ranging from total manual operation to complete autonomy with the help of AI. UAVs were first created in the twentieth century for military purposes that were too dangerous for humans. By the twenty-first century, however, they had become indispensable assets for many industries [38]. Control techniques gradually developed over the years. Drones became more affordable, making them cost-effective in a variety of applications such as surveillance of fires in forests, firefighting, aerial photography, package deliveries, and agriculture. Today, drones are deployed to inspect infrastructure, monitor defense activities, and assist in construction. Interestingly, their popularity has even led to its penetration in the entertainment industry, with drone racing entering mainstream media [913]. Scientific studies on drones, both technical and nontechnical, have been the focus of the last 10 years. Depending on how they were used, the public’s perception of drones varied, including support for their usage in battling fires. Other factors to consider were logic and economic viability. To ensure fire safety, technical aspects included fire suppressants, remote sensing, simulation studies, and the application of fire detection algorithms with unmanned aerial systems (UASs) [14]. The use of drones has been increasing in firefighting [1520]. The sensor fusion algorithm methodologies for accurate localization are given in [2125]. Human injuries are frequently the consequence of inadequate information regarding the fire front. Here’s where UAVs come into play, saving money, and taking over dangerous fire tracking duties [2630]. The algorithms can be implemented on a drone along with multiple algorithms designed to accomplish the functions of take-off and landing autonomy, planning of trajectory, and fire tracking [3134]. The UAVs equipped with infrared and RGB cameras, communication modules, and temperature sensors can provide complete details on the fire as well as the UAV position and velocity. The UAV then delivers the collected data to the ground station in real-time [35, 36].

Research assessing the employment of a path-planning algorithm by a group of UAVs to prevent collisions with obstacles discovered that the UAVs were able to reach their mission region successfully [37]. This demonstrates the use of path-planning algorithms in the deployment of autonomous drones, which can be used to provide virtual reality support to first responders and aid in the fight against spreading forest fires. The advancement of technology that allows autonomous drone operation is not only applicable to drones, but also to marine, space, and service robots, as well as unmanned vehicles [38]. Experimental research with the informed RRT algorithm revealed that it outperformed the classic RRT algorithm under diverse scenarios [39]. The informed RRT algorithm shows benefits such as less computation time, the capacity to generate paths through difficult situations and less dependency on the planning problem’s state dimension and spectrum. Furthermore, research has been conducted on the use of light detection and ranging (LiDAR)-mounted UAVs for remote sensing, which is important in forest management [40].

Precision agriculture (PA) has progressed because of the advent of UAV–based remote sensing systems [41]. When compared to older methods, using UAVs to monitor crops provides a gateway to gather data from agricultural fields in a simple, quick, and cost-effective manner. PA using IoT technology based on UAVs is the future of remote sensing [42]. Another aspect of UAVs is their capability to have low-altitude flights which are advantageous for taking very-detailed photographs of crops. The efficiency of the monitoring system and its ability to collect usable and accurate data has been greatly improved [4347]. Several sectors and businesses, such as agriculture, construction, mining, and infrastructure inspection, benefit from drone mapping. The maps are very useful in catastrophe management through monitoring, charting, and assessing postdisaster damage [48]. The use of drones for search and rescue missions has increased recently, particularly for missions in inhospitable areas. A case study highlighted the deployment of drones to locate a person in a slot canyon that was rugged and hard to access by foot [49]. The study mentioned that the drones sent the search and rescue team images of difficult-to-access places. That shows that drones add incredible advantages to human efforts. Drones substantially speed up rescue operations while keeping the rescuers out of significant danger. UAVs detected an at-risk human across a larger area in less time than first responders and rescuers without assistance, according to a prospective randomized simulation investigation of the effectiveness of drone technology [50]. Drones using UAV technology in combination with the global navigation satellite system (GNSS), deep learning, and real-time computer vision could offer error-free data to rescuers and personnel in the ground station in a short duration of time [48].

The paper provides a comprehensive review of the advancements in UAVs, focusing on their data handling, control management systems, and hybrid energy storage for self-powering. The data handling and control management systems encompass an efficient sensor fusion algorithm, an optimized path planning strategy, and various control and fire detection algorithms for detecting and extinguishing a fire. The concept of utilizing hybrid energy storage for UAVs along with its energy management is also elaborated. The paper highlights the numerous possible solutions to make UAVs smart and reliable with intelligent algorithms developed to date. The rest of this paper is structured as follows: Section 2 explains the qualitative comparison of sensor fusion algorithms of the firefighting UAV and the advantages and limitations of different path planning algorithms. Section 3 compares various control algorithms and discusses the pros and cons of image processing deep learning algorithms. Section 4 evaluates energy storage to power the UAVs along with the various hybrid energy storage combinations. Sections 5 and 6 summarize the insights after reviewing the literature and conclude with major outcomes and areas for future research.

2. Firefighting UAV Architecture, Sensor Fusion, and Path Planning

2.1. General Architecture of Firefighting UAVs

The general architecture of a smart UAV is illustrated in Figure 1, which shows the on-board hybrid energy storage, the on-board computer that runs the various algorithms, flight controller, global positioning system (GPS), a variety of sensors (thermal imaging camera (TIC), LiDAR, temperature, inertial measurement unit (IMU), and ultrasonic) for fire detection and tracking and the motors. The data collected by the TIC, depth camera, temperature sensor, IMU sensor, and ultrasonic distance sensor is processed in the onboard computer. It houses the SLAM, path-planning, and proprietary fusion algorithms, which calculate and transfer the results to the flight controller. Additionally, data from the GPS sensor is sent directly to the flight controller, which processes data and gives signals to the drone about motor speed. Stabilization and tracking algorithms are used to keep drones on their predicted pathways. With the limited power supplied by the energy source on board, power distribution coupled with energy management is utilized to predict flying time for the given battery state of charge.

Details are in the caption following the image
General architecture of firefighting UAVs.

Four pillars make up the brain of a firefighting UAV—sensor fusion, path planning, flight control, and fire detection.

2.2. UAV Sensor Fusion

For the fire-tracking and path-finding algorithms to work, it is necessary to have sensor fusion algorithms [51]. Figure 2 illustrates the types of sensor fusion algorithms. The algorithms fuse data from the sensors and other sources to provide crucial information to the processor [52, 53]. If data from those sources were processed separately instead of being fused, the communication of information would not be as effective, increasing computational time and reducing accuracy. In the UAV’s sensor fusion, GPS, IMU, the visual camera, LiDAR, and radar sensors play very crucial roles. For example, the visual camera can provide information about the kind of object it sees. The radar and LiDAR sensors will provide the distance of the object to the drone. With sensor fusion, their information is combined, allowing the computer to make the appropriate calculation for the speed and angle of the rotor blades [54, 55]. Apart from that, other types of data are fused to give meaningful input to the controller.

Details are in the caption following the image
Different types of sensor fusion algorithms.

The Brovey transform (BT) is a simple technique that improves the visual contrast at both ends of the histogram of satellite data. Each moderate-resolution band is multiplied by the ratio of the higher-resolution panchromatic band divided by the sum of all three multispectral bands in this method [5658]. Filtering algorithms find their use in applications that require some degree of segmentation from the preferred and nonpreferred types of data. This separation can be done by setting a certain threshold. This can be applied to various parameters such as the voltage output of the sensor, temperature reading from analog sensors, and sound frequency. There are different types of filtering algorithms such as high pass filter (HPF), low pass filter (LPF), Kalman filter, and extended Kalman filter [59], developed a 2D median filtering algorithm for image processing and it has the limitation of in preserving edges in highly textured areas [60], introduced a constraint satisfaction problem, which can handle difference constraints. The collaborative filtering algorithm with an evaluation metric that closely models user experience, enhancing recommendation accuracy was proposed in [61], adaptive filtering algorithm, and a model-based filtering algorithm was presented for to provide better performance in noisy environments and gives better accuracy [62, 63]. A filtering algorithm was introduced for enhancing the detection of variable stars in [64], multidirectional ground filtering for LiDAR to improve the mapping accuracy was proposed in [65]. A median filtering algorithm of better noise reduction [66] and UAVs safety authentication, behavior modeling, and safety supervision was developed in [67]. The practice of integrating two or more photos into a single image while preserving certain important features from each is known as wavelet image fusion for effective multiresolution analysis [6870], which is a widely used technology in a variety of domains, including remote sensing, robotics and medical applications [71, 72], introduced image fusion technique using neutrosophic-based wavelet transform, improving image clarity and focus. Principal component analysis (PCA) is an unsupervised learning method used for reducing complexity in machine learning. Correlated attributes are converted into a set of linearly uncorrelated properties through the statistical method of orthogonal transformation [73, 74]. By integrating standard PCA with rotated wavelet transform to fuse multispectral images to improve the image resolution is developed in [7577].

The values of parameters between data points are predicted using geostatistical analysis techniques. Geostatistical methods are only applicable to data that is spatially dependent, that is, not random [7880]. The fusion technique is one of the most widely used procedures for sharpening the HIS approach. Color enhancement, feature enhancement, spatial resolution improvement, and the merging of different data sets have become common procedures in image analysis [8183]. A hybrid algorithm mixes two or more different algorithms to solve the same problem and is commonly used in programming languages such as C++, picking one (depending on the input) or switching between them throughout the procedure [84]. The elaborative qualitative comparison of the above types of sensor fusion algorithms is illustrated in Table 1.

Table 1. Qualitative analysis of sensor fusion algorithms.
Algorithm Algorithm types Advantages Disadvantages
Brovey transform [5658]
  • • The spatial resolution can be increased
  • • Development is simple and clean
  • • Preserves spectral information
  • • It improves visual appearances
  • • Produces minimal color distortion
  • • It is compatible with various sensors
  • • The Brovey transform can be sensitive to noise present in the original images
  • • It preserves multispectral band spectra but ignores panchromatic spectral variability
  • • It can cause contrast issues in fused images
  • • Brovey transform prioritizes panchromatic details, overshadowing multispectral information
  • • It cannot be used on a broad scale
  
Filtering algorithm [5967]
  • HPF
  • LPF
  • Kalman filter
  • • Filtering algorithms reduce sensor noise, improve estimates via multisensor fusion, enhance data quality, and boost system performance
  • • These enhance accuracy and precision via multisensor fusion, compensating for limitations and biases for robust estimates
  • • It improves system robustness and fault tolerance by using redundant information from multiple sensors, ensuring reliable estimates even in the presence of sensor failures
  • • These offer a comprehensive understanding of the system or environment by integrating data from multiple sensors
  • • Filtering algorithms are adaptable and flexible, allowing customization to match sensor characteristics, system dynamics, and application needs. This optimization improves fusion performance
  • • Filtering algorithms can be computationally demanding, limiting real-time processing and necessitating significant computing resources for large or high-dimensional datasets
  • • Model assumptions and uncertainties in filtering algorithms can result in inaccuracies due to mismatches with real-world conditions or errors in model parameters and sensor measurements
  • • These are sensitive to initial conditions, impacting their convergence and performance if poorly determined or with errors in initial estimates
  • • Filtering algorithms assuming linearity and Gaussian noise may struggle with nonlinear and complex relationships, resulting in suboptimal estimates and incomplete system representation
  
Wavelet-based fusion [6872] Multivariate wavelet denoising (MWD)
  • • Wavelet-based fusion uses a multiresolution representation to capture details at different scales, preserving global and local information. It combines low-frequency components for global structures and high-frequency components for local details
  • • It excels at representing localized features, capturing both frequency and spatial information simultaneously. This enhances spatial localization in wavelet-based fusion, preserving the spatial details of the input images
  • • Wavelet-based fusion adapts to image characteristics by selecting suitable wavelet functions and scales, making it versatile for various image types and applications
  • • It is robust to noise and artifacts by selectively fusing less noisy regions while suppressing noisy or corrupted regions. It is suitable for scenarios with noisy or distorted input images
  • • Wavelet-based fusion can be computationally intensive, especially for large or high-resolution images, potentially leading to increased processing time and resource requirements. Real-time or near-real-time processing may not always be possible
  • • It requires careful selection of wavelet functions and scales, which can be challenging. Different images may require specific choices to achieve optimal fusion. Incorrect selection may result in suboptimal outcomes, including information loss or unwanted artifacts
  • • It primarily emphasizes local details and frequency information, potentially neglecting higher-level contextual information and semantics in the input images. This limitation can lead to a fused image that lacks certain global or contextual features, which may be crucial for certain applications or analysis tasks
  • • Wavelet-based fusion may struggle with complex scenes containing intricate structures, textures, or overlapping objects, potentially resulting in a loss of information or compromised visual quality in the fused image
  
Statistical fusion [73, 74]
  • STARFM/ESTARFM
  • STAARCH
  • STDFM/ESTDFM
  • • Statistical fusion uses mathematical models to estimate optimal fusion weights or parameters. By exploiting the statistical properties of the data, it achieves an optimal combination of information, improving fusion results
  • • It is versatile, adaptable to various data types and domains like remote sensing, medical imaging, finance, etc. Customizable statistical models ensure efficient fusion by accommodating specific data characteristics and requirements
  • • Statistical fusion is robust to noise and outliers, filtering them out to focus on reliable components and improving the quality and reliability of the fused information
  • • It incorporates uncertainty and confidence measures, enabling decision-makers to assess the quality of fused data and make informed judgments. This is particularly valuable when dealing with data from diverse sources with varying levels of reliability or uncertainty
  • • Statistical fusion methods assume specific data distributions or models. If data does not adhere to these assumptions, fusion results may be suboptimal, introducing errors or biases
  • • These are sensitive to data quality. Noisy, outlier-ridden, or artifact-laden data can significantly impact fusion outcomes, potentially leading to erroneous or biased results
  • • Statistical-based fusion methods primarily focus on statistical properties and may not fully exploit contextual or higher-level information present in input sources. This limitation can result in fused outputs lacking comprehensive contextual understanding
  • • These methods may struggle to adapt to dynamic or rapidly changing data. Fixed statistical models may not be suitable if data properties vary over time or in different scenarios, leading to suboptimal fusion results and an inability to capture data evolution
  
PCA [7577]
  • EOF
  • PCA
  • • Dimensionality reduction: PCA reduces the dimensionality of data while retaining important information, improving computational efficiency, and mitigating the curse of dimensionality
  • • Feature extraction: PCA identifies the most significant features in the data, aiding in pattern recognition, data interpretation, and identifying key variables
  • • Noise reduction: PCA can filter out noise and highlight the underlying signal in the data, enhancing the signal-to-noise ratio and improving data quality
  • • Visualization: PCA facilitates data visualization by projecting high-dimensional data onto lower-dimensional space, allowing for easier interpretation and understanding of data patterns
  • • Linearity assumption: PCA assumes a linear relationship between variables, limiting its effectiveness for nonlinear data where important relationships may be missed
  • • Sensitivity to outliers: Outliers can strongly influence PCA results and distort the principal components, leading to a biased representation of the data
  • • Interpretability challenges: Interpreting principal components may be difficult due to the linear combination of original variables, making it challenging to assign meaningful interpretations to each component
  • • Loss of information: PCA discards some variance in the data during dimensionality reduction, potentially losing important details or subtle patterns present in the original high-dimensional space
  
Geostatistical based fusion [7880] Kringing BME
  • • Spatial coherence: Geostatistical methods consider the spatial relationships between neighboring data points, preserving spatial coherence and ensuring smooth transitions in the fused outputs
  • • Uncertainty quantification: Geostatistics provides measures of uncertainty, allowing for the quantification and propagation of uncertainty in the fusion process, leading to more reliable and informative results
  • • Incorporation of auxiliary data: Geostatistical fusion can effectively integrate auxiliary data sources, such as geophysical measurements or expert knowledge, enhancing the accuracy and completeness of the fused outputs
  • • Spatial interpolation: Geostatistical techniques can perform spatial interpolation to estimate values at unobserved locations, enabling the creation of continuous and spatially comprehensive fused maps
  • • Data stationarity assumption: Geostatistical methods assume stationarity of the data, meaning that the statistical properties do not change across the entire study area. Deviations from this assumption can affect fusion accuracy
  • • Sensitivity to input data quality: Geostatistical fusion is sensitive to the quality and accuracy of input data. Biased or erroneous data can propagate into the fused outputs and compromise the reliability and representativeness of the results
  • • Computational complexity: Geostatistical algorithms can be computationally demanding, especially for large datasets or high-resolution data, requiring substantial computational resources and time for processing
  • • Interpretation challenges: Geostatistical fusion may produce complex models or output representations, making the interpretation of the fused results challenging for non-experts and hindering the understanding and usability of the outputs
  
HIS [8183]
  • HIS
  • FIHS
  • AIHS
  • • Enhanced color representation: HIS fusion preserves the color information while improving the intensity and contrast, resulting in visually appealing fused images with enhanced color representation
  • • Perceptual quality: HIS fusion can improve the perceptual quality of the fused image by effectively combining the hue, intensity, and saturation components, resulting in images that are visually pleasing and easy to interpret
  • • Flexible fusion control: HIS fusion provides separate control over the fusion process for each component (hue, intensity, and saturation), allowing for fine-tuning and adjustment based on specific application requirements
  • • Compatibility with human perception: HIS fusion considers the human visual system’s response to different color components, aligning with how humans perceive and interpret color information
  • • Information loss: HIS fusion may lead to the loss of certain image details due to the nature of the fusion process, which prioritizes color enhancement over preserving all fine-grained information present in the input images
  • • Sensitivity to input quality: HIS fusion can be sensitive to the quality of the input images, particularly in terms of color balance and saturation. Poor-quality input images may result in fused outputs that exhibit color artifacts or inconsistencies
  • • Lack of spatial detail preservation: HIS fusion primarily focuses on color enhancement and may not adequately preserve the spatial details present in the input images. This can result in fused images that lack fine spatial information, which could be important in certain applications
  • • Limited adaptability: HIS fusion techniques often have fixed fusion rules and may not adapt well to different data characteristics or application scenarios. This lack of adaptability may result in suboptimal fusion results for specific datasets or requirements
  
Hybrid [84] MQQA-BME
  • • Improved fusion quality: Modified quantile–quantile adjustment-Bayesian maximum entropy (MQQA-BME) utilizes multiquality and multiresolution analysis to achieve high-quality fusion results, preserving important details from multiple sensors and enhancing the overall image quality
  • • Adaptability to diverse datasets: MQQA-BME can handle different types of data and adapt to varying sensor characteristics, making it suitable for fusion in diverse application domains
  • • Objective quality assessment: MQQA-BME incorporates objective quality assessment measures to evaluate the fused images, allowing for quantitative analysis and comparison of fusion outcomes
  • • Efficient computational complexity: MQQA-BME optimizes the fusion process to achieve a balance between fusion quality and computational efficiency, making it feasible for real-time or near-real-time applications
  • • The complexity of implementation: Implementing MQQA-BME requires expertise in multiquality and multiresolution analysis techniques, which can be challenging and time-consuming
  • • Sensitivity to input data quality: The performance of MQQA-BME is dependent on the quality of the input data. Poor-quality or noisy data may adversely affect the fusion results and lead to suboptimal outcomes
  • • Limited handling of complex scenes: MQQA-BME may struggle to handle complex scenes with intricate structures, highly textured regions, or overlapping objects, as the fusion process may not fully capture the complexity and richness of such scenes
  • • Lack of interpretability: While MQQA-BME produces high-quality fused images, the fusion process itself may not provide explicit interpretability or insights into the fusion decisions, making it difficult to understand the underlying fusion mechanisms

The fusion algorithms are compared based on statistical parameters. Prediction accuracy measures such as correlation (R), standard deviation (STD), mean absolute difference (MAD), and root mean square error (RMSE) are among them. These terms can be used to evaluate the overall accuracy of the anticipated data by comparing the predicted results with real outcomes at each specified pixel at specific times. In addition to the R, STD, MAD, and RMSE, the error relative global dim synthesis (ERGAS) and the spectral angle mapper (SAM) are considered. The ERGAS can be used to determine the spectral similarity of merged data to actual values, whereas the SAM may be utilized to extract the merged or fused image’s amount of distortion [85].

The equations of R, MAD, STD, RMSE, ERGAS, and SAM are as follows:
(1)
(2)
(3)
(4)
(5)
(6)
where Xfused,i represents the value of fused data, Xfine,i represents the value of the fine image pixel data, represents the mean value of fused data, i is the pixel number corresponding to the fine image and j corresponds to the band number of course image, n represents the number of data points, Npixel represents the number of pixels, NP represents the number of bands, Xcourse,j represents the value of course image pixel data, h represents the high spatial resolution image, l represents the low spatial resolution image, and represents the mean value of course data.

Based on the comparison, the accuracy of the fusion algorithms was evaluated and the algorithms were sorted on the bases of the RMSE accuracy. The lower the RMSE score, the better the fused data produced, that is, the accuracy of the fused data is inversely proportional to the value of the RMSE value.

Figure 3 illustrates the application of sensor fusion algorithms for each sensor and how it is interlinked in a firefighting UAV (drone).

Details are in the caption following the image
Application of the various sensors and fusion algorithms in firefighting UAVs.

2.3. Path Planning

The process of constructing a path from a starting point to a destination is known as path planning. It allows an autonomous robot or vehicle to discover the best path to its goal [8688]. The map needed for this process can be in the form of a grid map, state space, or topological roadmap. Grid-based search and sampling-based search algorithms are the two main types of algorithms that can obtain an optimum planned path [89].

Grid-based search algorithms seek and produce a path in a grid map based on the lowest travel cost which is the amount of energy required to travel from point A to point B. In a 2D environment, they can be used for path planning for mobile robots. However, the amount of memory required to perform grid-based algorithms rises as the dimension number increases, such as in the case of a six-degree-of-freedom robot [9092]. With the aid of randomly selecting new nodes or robot configurations from a state space, sampling-based search algorithms develop a tree structure to produce a searchable tree. They can be used in low- and high-dimensional search spaces [9395].

An effective path-planning algorithm should:
  • be capable of always finding an optimal or near-optimal path in real-time static environments;

  • be able to adapt and be implemented in dynamic environments;

  • remain compatible with and enhance the chosen self-referencing approach; and

  • minimize computation time, complexity, and data storage.

The survey suggests that rapidly exploring random tree (RRT), RTT, and informed RTT are the most effective. In RRT, points are produced at random and linked to the nearest accessible node. When a vertex is created, it must be checked to ensure that it is not in the path of an obstacle. In addition, the vertex must be chained to its nearest neighbor while avoiding impediments. The method terminates when a node inside the desired region is created or a limit is reached. A design decision is made while choosing a random position [96]. If there is an obstruction between the random node and the nearest neighbor node, it will not link to the random node. Otherwise, if the random node is closer to the intended target, it will connect to the random node. In unexplored state spaces, the likelihood of random node selection rises. Simple methods, including using internal random number generators, can be used. We can determine the path from the starting node to the end objective using this fast-exploring random trees approach. This method, however, will not produce an ideal route. Such techniques are adequate only for simple applications [97].

RRT is a streamlined type of RRT [98]. The RRT method identifies the shortest path to the destination when the number of nodes approaches infinity. RRT has the same fundamental premise as RRT, but two major changes to the method result in drastically different outcomes. RRT begins by measuring the separation between each vertex and its parent vertex. That is referred to as the vertex’s cost function. A neighborhood of vertices within a certain radius of the new node is then investigated after the nearest node in the graph has been identified. If a node is found that is less expensive than the proximal node, the proximal node is replaced with the cheaper node. That feature’s effect is illustrated by the addition of fan-shaped twigs to the tree structure. The second distinction made by RRT is the rewiring of the tree. The neighbors are reverified when a vertex is connected to the least expensive neighbor. RRT then examines whether rewiring to the newly added vertex will reduce the cost of neighboring nodes. The neighbor is connected to the newly added vertex if the price drops. This feature smoothens the route.

Informed RRT is an enhancement to the RRT technique that accelerates the convergence of the discovered solution to the optimum [99]. That approach, developed expressly for path-length issues, directly samples a subset of a planning problem that contains all feasible improvements to a given solution. To discover a better solution, one should confine the search to the entire state space, which can increase the pace of convergence and locate the best path in a finite time. Informed RRT samples the subset directly (inside the ellipse) and concentrates on the path between start and target. As a result, with each sample, the answer will improve. Even in a huge state–space, it is computationally cheap. Table 2 illustrates the advantages and limitations of informed RRT compared with RRT and RRT in terms of their path-finding factors. It was found that for a given number of nodes, informed RRT was 88% faster than RRT [99].

Table 2. Advantages and limitations of different path planning algorithms [100103].
Advantages of informed RRT
Factors Informed RRT RRT RRT
Convergence rate [96, 97] Excellent convergence rate with the improved and feasible initial solution Better convergence rate than RRT but still suffers from a costly initial solution Satisfactory, but often takes time to reach convergence. Plus requires a costly initial solution
Computational cost [96, 98] Consumes the least amount of computational resource as it samples the subset directly unlike RRT and RRT Consumes a lesser amount of computational resources compared to RRT as path finding is more confined toward the finish Consumes a lot of computational resources as it calculates all the paths from the starting nodes regardless of their direction toward the finish
Ability to find difficult paths [97, 98] It can find the feasible path in the least amount of time as it samples the subset directly As it samples paths based on the nearest nodes to the start, the path finding is quicker than RRT As it does its sampling randomly, it takes a long time to determine a feasible path
Optimization speed [97, 98] Optimization speed is the fastest provided it calculates an optimum initial solution Optimization is faster than RRT but slower than informed RRT Slowest compared to RRT and informed RRT as it finds paths randomly
  
Limitations of informed RRT
Factors Informed RRT RRT RRT
  
Dependency on the initial solution [99] Because the search is intrinsically relying on the present solution, it cannot focus when the related prolate hyper-spheroid is bigger than the planning problem It is dependent on the initial solution thus if it deviates, the whole solution could bring on increased lapse time It is not dependent on the initial solution as it goes about calculating paths randomly

3. UAV Control and Deep Learning Algorithms

3.1. UAV Control Algorithms

A control algorithm is a strategy that receives inputs from various sensors and feedback (if the system is closed loop) to calculate an optimal control signal and send it to the actuator. It is important to employ the right type of control algorithm for the UAVs. There are various types of control algorithms, some of the popular ones are illustrated in Figure 4.

Details are in the caption following the image
Types of control algorithms in a flight controller.

A proportional integral derivative (PID) controller comprises proportional, integral, and derivative terms that work in harmony to calculate a control signal. The controller calculates an error value e (t) in real-time as the difference between a desired setpoint (SP) and a measured process variable (PV), then, corrects the error using proportional, integral, and derivative terms. The proportional term amplifies or reduces the control value by multiplying a gain to the error value. The gain can either be an integer or a rational number. With every iteration, the error will get closer to zero as the measured value will reach closer to the desired value. However, with only the proportional term, the controller will never get the system to the appropriate set point. Thus, an integral term is added to keep the memory of the previous iteration to add it to the control value. The PI controller may overshoot the desired set point for certain conditions of the system. To monitor the rate of change, a derivative term is added that reduces the control value after a certain threshold of the rate of change. PID can be further classified into normal PID and intelligent PID [104, 105].

Model predictive controller (MPC) is a feedback control algorithm that uses a mathematical model to predict future outputs of the process. It can accept multiple inputs and produce multiple outputs (MIMO) to accommodate interactions among different control loops. This prevents a complex design for a multi-PID controller. The principle of MPC is to predict the output for a set of time steps. To predict, the controller receives an input (e.g., the position of the drone) as a reference value. Based on this reference and the various input and output constraints, it calculates various output predictions. The prediction that provides the least cost function, that is, the error value associated with the shortest path between the set point and the PV, would be chosen as the final output. After having the prediction output chosen, the set of inputs that were used to acquire the output is supplied in real-time [106, 107]. MPC can be further classified into deterministic MPC, adaptive MPC, telematics MPC, and stochastic MPC.

Sliding mode control (SMC) is a nonlinear control strategy that changes the dynamics of a nonlinear system by providing a discontinuous control signal. It is a set-valued control signal that compels the system to “slide” along a cross section of its usual behavior. The feedback control law is not a time-reliant function. Instead, depending on where it is in the state space, it can move from one continuous structure to another. As a result, SMC is a way of variable structure control. As a legion of control structures is built in such a manner that trajectories always advance toward an adjacent region with a different control structure, the end trajectory will not reside wholly inside one control structure. Rather, it will glide along the control structures’ limits. The system’s movement as it slides along those limits is referred to as a sliding mode and the boundaries formed by the geometrical locus are referred to as the sliding surface. SMC can be classified into integral SMC, terminal SMC, fractional order SMC, higher order SMC, SMC with feed-forward disturbance compensation, boundary layer SMC, SMC with reaching law modification, and SMC with AI [108, 109].

Fuzzy logic (FL) control is a popular type of control strategy. The term “fuzzy” alludes to how the logic involved may cope with things that cannot be represented as “true” or “false,” but rather as “partially true” or “partially false.” With traditional controls, the inputs and outputs are binary, that is, they are communicated in 0 and 1. FL, however, considers decimal values between 0 and 1, allowing judgments to be made based on estimations of how partially true or partially false an event is. FL has the benefit of allowing the solution to the issue to be expressed in words that humans can understand, allowing for the use of their expertise in the controller’s design. That simplifies the automation of tasks humans do well. The tasks consist of input, processing, and output stages. The input stage transforms inputs from sensors or other sources, including switches and thumbwheels, into the appropriate membership functions and truth values. Each applicable rule is put through its paces in the processing stage, which produces a result for each one before combining them. The combined result is then reconverted into the desired control output value in the output stage. FL control can be classified into adaptive FL and predictive FL [100, 101, 110]. Table 3 tabulates the pros and cons of each algorithm.

Table 3. Comparison of various control algorithms.
Control algorithm Advantages Disadvantages
PID [104, 105]
  • • Versatility: PID control is widely applicable and can be used in various systems and processes, making it a versatile control technique
  • • Fast response: PID control can provide a fast response to changes in system conditions, allowing for quick adjustments and maintaining desired setpoints
  • • Stability: PID controllers can achieve system stability by continuously adjusting control outputs based on error feedback and maintaining a balanced control response
  • • Easy implementation: PID controllers are relatively simple to implement and require minimal computational resources, making them suitable for real-time control applications
  • • Sensitivity to parameter tuning: PID controllers rely on the appropriate tuning of their proportional, integral, and derivative parameters. Poorly tuned PID controllers may lead to unstable control, oscillations, or slow response times
  • • Limited adaptability: PID control is not well-suited for systems with time-varying or nonlinear dynamics, as the fixed parameters may not adequately adapt to changing conditions
  • • Lack of robustness: PID control may be sensitive to disturbances or uncertainties in the system and it may struggle to maintain stable control in the presence of significant disturbances or changes in operating conditions
  • • Inability to handle complex systems: PID control is less effective for complex systems with multiple interacting variables or intricate dynamics, where more advanced control strategies or techniques may be required for optimal performance
  
Intelligent PID [104, 105]
  • • Adaptive tuning: Intelligent PID controllers can automatically adjust their parameters based on system conditions, allowing for adaptive tuning and improved control performance in varying operating conditions
  • • Enhanced robustness: Intelligent PID control techniques, such as fuzzy logic or neural network-based approaches, can improve the robustness of control by handling nonlinearities, uncertainties, and disturbances more effectively than traditional PID controllers
  • • Complexity: Implementing intelligent PID control techniques may require additional computational resources and expertise compared to traditional PID control, making it more complex to design, implement, and maintain
  • • Parameter identification: Intelligent PID controllers often rely on accurate parameter identification, which can be challenging and time-consuming. Obtaining accurate system models and tuning the intelligent control parameters can be a demanding task, especially in complex systems
  
MPC [106, 107]
  • • Handling constraints: MPC can explicitly handle input and state constraints, ensuring that the system operates within safe limits and satisfies operational constraints
  • • Multivariable control: MPC can handle multivariable systems, considering the interactions between different variables and optimizing the overall system performance
  • • Future prediction: MPC utilizes a model of the system to predict its future behavior, allowing for proactive control actions and improved tracking of setpoints and reference trajectories
  • • Robustness to disturbances: MPC incorporates disturbance models and actively accounts for them in the control strategy, making it more robust against disturbances and uncertainties
  • • Computational complexity: MPC involves solving optimization problems repeatedly in real-time, which can be computationally demanding and require powerful hardware or efficient algorithms
  • • Model uncertainty: Accurate system models are crucial for effective MPC and uncertainties in the model can lead to suboptimal control performance or even instability
  • • Sensitivity to model errors: Small errors or mismatches between the actual system behavior and the model used in MPC can have significant impacts on control performance
  • • Implementation challenges: Implementing MPC in real-world applications may require extensive modeling effort, system identification, and calibration to achieve desired control performance. Maintenance and retuning can also be challenging over time
  
Adaptive MPC [106, 107]
  • • Adaptability to changing dynamics: Adaptive MPC can adjust the control strategy and model parameters in real-time to account for variations in the system dynamics, ensuring optimal control performance even in the presence of changing operating conditions
  • • Improved robustness: Adaptive MPC can handle uncertainties and disturbances by continuously updating the model and control parameters. It can adapt to unforeseen changes in the system, enhancing robustness and maintaining control performance in dynamic environments
  • • Increased complexity: Adaptive MPC involves more complex algorithms and parameter adaptation schemes compared to traditional MPC, which can result in increased computational burden and implementation complexity
  • • Sensitivity to modeling errors: Errors or inaccuracies in the adaptive model identification process can affect the effectiveness of the control strategy. Robust estimation and identification techniques are needed to minimize the impact of modeling errors on control performance
  
Stochastic MPC (SMPC) [106, 107]
  • • Handling uncertainties: Stochastic MPC explicitly considers probabilistic uncertainties in the system dynamics and disturbance models. It enables robust control by incorporating uncertainty information, resulting in improved performance and stability in the presence of uncertainties
  • • Optimal decision-making under uncertainty: Stochastic MPC optimizes control actions over a finite time horizon while considering the probabilistic nature of uncertainties. This allows for more informed and optimal decision-making, considering the uncertainty distribution and associated risks
  • • Increased computational complexity: Stochastic MPC involves solving optimization problems over a range of possible scenarios, which requires additional computational resources and may increase the computation time compared to deterministic MPC
  • • Uncertainty modeling challenges: Accurate representation of uncertainties in the system dynamics and disturbances can be challenging and the choice of appropriate probabilistic models and parameter estimation techniques can significantly impact the performance and reliability of the stochastic MPC controller
  
SMC [108, 109]
  • • Robustness: SMC is inherently robust to parameter variations and external disturbances, ensuring reliable performance even in the presence of uncertainties
  • • Fast response: SMC exhibits fast response times due to its discontinuous control action, allowing for rapid tracking of reference signals and rejection of disturbances
  • • Chattering phenomenon: SMC often exhibits chattering, which is a high-frequency oscillation around the sliding surface. This can introduce mechanical wear, increase control effort, and degrade system performance
  • • Sensitivity to model mismatches: SMC performance heavily relies on accurate knowledge of system dynamics. Model mismatches between the actual system and the control design can lead to suboptimal control performance
  • • Difficulty in choosing sliding surface parameters: Designing an appropriate sliding surface is crucial for achieving desired control objectives. Selecting suitable sliding surface parameters can be challenging, requiring expert knowledge and extensive tuning
  • • Lack of continuous control signal: SMC relies on discontinuous control actions, which may not be desirable for systems with physical constraints or limitations. The abrupt switching between control modes can introduce mechanical stress or instability in certain applications
  
Integral SMC [108, 109]
  • • Improved steady-state accuracy: ISMC includes an integral action that eliminates steady-state errors, ensuring accurate tracking of desired setpoints even in the presence of disturbances or parameter uncertainties
  • • Robustness to system uncertainties: ISMC provides robustness against model uncertainties and disturbances, making it suitable for applications where accurate system modeling is challenging or where disturbances are prevalent
  • • Complexity in parameter tuning: ISMC requires careful tuning of multiple parameters, including sliding surface gains and integral action parameters, which can be time-consuming and challenging. Poor tuning can lead to performance degradation or even instability
  • • Sensitivity to noise and measurement errors: ISMC can be sensitive to measurement noise and errors due to the integral action. Inaccurate measurements can affect the control signal, leading to suboptimal control performance or even instability. Proper filtering and noise reduction techniques are necessary
  
Fractional order SMC [108, 109]
  • • Enhanced control performance: FOSMC allows for more flexibility and improved control accuracy by incorporating fractional-order operators, which capture the system dynamics more accurately than traditional integer-order controllers
  • • Robustness to uncertainties: FOSMC exhibits robustness to parameter uncertainties, disturbances, and nonlinearity, making it suitable for systems with complex dynamics or uncertain models
  • • Increased complexity in design and implementation: FOSMC requires a deeper understanding of fractional calculus and additional mathematical tools for controller design. Implementation may involve complex numerical algorithms, potentially increasing computational burden
  • • Limited theoretical analysis and guidelines: Compared to traditional integer-order control approaches, FOSMC lacks well-established theoretical frameworks and comprehensive guidelines for system analysis and controller design, making it challenging to determine optimal tuning strategies and performance bounds
  
Terminal SMC [108, 109]
  • • Finite-time convergence: TSMC ensures finite-time convergence to the sliding surface, providing fast and robust tracking or stabilization performance in a finite time interval
  • • Chattering reduction: TSMC reduces chattering, a phenomenon commonly associated with traditional sliding mode control, by incorporating a terminal sliding surface that allows for smoother control actions
  • • Terminal conditions sensitivity: TSMC’s performance heavily depends on the accuracy of the terminal conditions. Deviations or errors in the terminal conditions can lead to suboptimal or even unstable control behavior
  • • Complexity in design: The design of TSMC often requires the solution of complex optimization problems to determine the terminal conditions and control gains, adding computational complexity and potentially increasing implementation challenges
  
Higher order SMC [108, 109]
  • • Enhanced robustness: HOSMC can provide improved robustness against uncertainties, disturbances, and parameter variations compared to traditional first-order sliding mode control, enabling better tracking or regulation performance in the presence of these factors
  • • Reduced chattering: Higher-order sliding mode control techniques can effectively reduce chattering, which is a common drawback of first-order sliding mode control. This leads to smoother control actions and improved system behavior
  • • Increased complexity: HOSMC methods often involve more complex mathematical formulations and control algorithms compared to first-order sliding mode control, making their design and implementation more challenging
  • • Controller tuning difficulty: Tuning the parameters of a higher-order sliding mode controller can be more intricate and time-consuming due to the increased number of control gains and additional design considerations, requiring a deeper understanding of the control system and its dynamics
  
Boundary layer SMC [108, 109]
  • • Reduced chattering: BL-SMC introduces a boundary layer around the sliding surface, which helps to smooth out control actions and reduce chattering, resulting in improved control performance and system stability
  • • Enhanced robustness: BL-SMC provides increased robustness against uncertainties and disturbances by confining the sliding motion within the boundary layer, allowing for better tracking or regulation performance even in the presence of external disturbances
  • • Increased complexity: BL-SMC involves additional design considerations and parameter tuning compared to traditional sliding mode control, making it more complex and challenging to implement, especially for complex systems with nonlinear dynamics
  • • Potential performance trade-off: While BL-SMC reduces chattering, it introduces a trade-off between chattering reduction and control performance. In some cases, the boundary layer may smooth out control actions too much, leading to slower system response or degraded tracking accuracy
  
SMC with reaching law modification [108, 109]
  • • Improved control performance: The reaching law modification enhances the convergence speed and accuracy of the sliding mode control, leading to improved tracking or regulation performance of the controlled system
  • • Reduced chattering: By modifying the reaching law, SMC with reaching law modification can effectively reduce chattering, which is a common issue in traditional sliding mode control, resulting in smoother control actions and improved system stability
  • • Increased complexity: The modification of the reaching law adds complexity to the control design process, requiring additional analysis and tuning of the control parameters. This complexity can make the implementation and fine-tuning of the control system more challenging
  • • Sensitivity to parameter selection: The performance of SMC with reaching law modification is highly sensitive to the selection of control parameters, such as the reaching law gains. Improper parameter selection can lead to degraded control performance or instability, requiring careful parameter tuning for optimal system operation
  
SMC with feed-forward disturbance compensation [108, 109]
  • • Improved disturbance rejection: The feed-forward disturbance compensation in SMC allows for the effective rejection of external disturbances or uncertainties, enhancing the system’s ability to maintain desired performance even in the presence of disturbances
  • • Reduced tracking error: By incorporating feed-forward disturbance compensation, SMC can minimize the tracking error by actively compensating for disturbances, resulting in more accurate tracking of reference signals
  • • Sensitivity to disturbance model accuracy: The performance of SMC with feed-forward disturbance compensation relies heavily on the accuracy of the disturbance model. If the model does not accurately capture the disturbance characteristics, the compensation may not be effective, leading to suboptimal control performance
  • • Complexity in disturbance estimation: Estimating the disturbances accurately for feed-forward compensation can be challenging, particularly in practical applications with complex and uncertain disturbance sources. This complexity can introduce additional design and implementation difficulties, requiring robust estimation techniques to ensure accurate compensation
  
SMC with AI [108, 109]
  • • Improved adaptability: AI algorithms in SMC can adapt to varying system dynamics, uncertainties, and disturbances, enhancing control performance and robustness in real-time applications
  • • Enhanced control accuracy: AI techniques can learn complex nonlinear relationships and optimize control actions, leading to improved tracking accuracy and performance compared to traditional SMC approaches
  • • Increased complexity: Incorporating AI algorithms into SMC adds complexity to the control system design, implementation, and tuning process. This complexity may require specialized knowledge and resources, making the control system more challenging to develop and maintain
  • • Data dependency and generalization: AI-based SMC relies on training data to learn and generalize control strategies. The performance of AI models can be affected by the availability, quality, and representativeness of training data, limiting their effectiveness in scenarios with limited or biased data
  
FLC [100, 110]
  • • Linguistic representation: FLC allows for intuitive control design using linguistic variables, rules, and fuzzy membership functions, making it suitable for systems with complex or vague dynamics
  • • Robustness to uncertainties: FLC can handle uncertainties, noise, and imprecise measurements, providing robust control performance even in the presence of system variations and disturbances
  • • Nonlinear control capability: FLC can effectively handle nonlinear systems and nonlinear control objectives, allowing for more flexible and adaptive control strategies
  • • Human-like decision-making: FLC mimics human decision-making by incorporating expert knowledge, making it suitable for systems where expert intuition and qualitative reasoning are important
  • • Rule base complexity: Designing an optimal rule base can be challenging, especially for complex systems, as it requires expert knowledge and manual rule tuning, which can be time-consuming and subjective
  • • Lack of formal mathematical analysis: FLC lacks a rigorous mathematical framework for analysis and optimization, making it difficult to guarantee stability and performance guarantees compared to other control methods
  • • Difficulty in system identification: FLC may require substantial data or expert input to identify accurate fuzzy membership functions, making the process more complex and potentially limiting its applicability in data-scarce scenarios
  • • Computational complexity: Implementing FLC in real-time systems can be computationally demanding, especially for large rule bases or high-dimensional systems, which can impact the control system’s response time and real-time performance
  
Adaptive FLC [100, 101]
  • • Enhanced robustness and adaptability: Adaptive FLC can adjust its control parameters based on system dynamics and varying operating conditions, providing improved performance and robustness in the face of uncertainties, changes, and disturbances
  • • Automatic parameter tuning: Adaptive FLC can automatically tune its fuzzy membership functions, rules, and other control parameters, reducing the reliance on manual tuning and expert knowledge, and enabling more efficient control design
  • • Increased complexity: Adaptive FLC introduces additional complexity in terms of parameter adaptation algorithms and online learning mechanisms, which can make the control system design and implementation more challenging
  • • Sensitivity to modeling errors: Adaptive FLC relies on accurate system modeling and identification, and errors or inaccuracies in the model can lead to suboptimal adaptation and reduced control performance. Proper modeling and identification techniques are crucial for the effectiveness of adaptive FLC

3.2. Deep Learning Algorithms

To effectively determine the location of a fire, detection devices have been combined with AI, which is beneficial for UAV pilots. A system was designed with one IR camera that would work with the Raspberry Pi platform and detect images using various image detection algorithms. Some of the algorithms are discussed in detail below.

The Harris Corner and Canny detector can be defined as a metric that defines corner quality N as the difference between a matrix’s determinant and trace that is expressed in terms of an image’s gradients. If N is greater than zero, there is a corner present. A canny detector works on a similar principle but instead of finding corners, it finds edges [102, 103]. Scale-invariant feature transform (SIFT) uses the difference of Gaussians (DoGs), which is determined by first convolving a picture with a Gaussian kernel. Using a series of reference pictures, the SIFT key points of objects are extracted and kept in a database. An object is detected by comparing each feature in the new image to this database and discovering matching features [111113].

SIFT, when coupled with the speeded up robust feature (SURF) algorithm, becomes closely related descriptors. SIFT and SURF are founded on the same concepts, but each stage of each algorithm has its own set of specifics. Square-shaped filters are employed for SURF, which is several times faster than SIFT, as well as being more durable [114116]. Oriented features from accelerated segment test (FAST) and rotated binary robust independent elementary features (BRIEF; ORB) creates a binary string for image description using two algorithms—FAST and BRIEF. ORB compares pixels to differentiate between an object and its surroundings. FAST declares a corner if it finds a pixel with a higher or lower position than its center pixel. The BRIEF algorithm uses that information as well, to perform a series of threshold tests on the intensities of pixels in a given patch image [117]. The binary robust invariant scalable key (BRISK) points algorithm uses an optimized variant of FAST called adaptive and generic accelerated segment test (AGAST). To ensure invariance against scale variation, BRISK uses a pyramid-based technique [118]. Histograms of oriented gradient (HOG) is a gradient-based descriptor made up of local histograms of picture grid gradient orientations. It divides the image into many grids and then checks local gradient directions. The HOG descriptor is created by normalizing the collected histograms from all grids [119121]. And last, a convolutional neural network (CNN) is a composition of convolutional layers, pooling, and rectified linear unit (ReLU) which are fully connected layers between an input and output layer [122, 123].

A qualitative comparison of the advantages and disadvantages of the types of object detection algorithms appears in Table 4.

Table 4. Qualitative analysis of object detection algorithms.
Algorithm Advantages Disadvantages
Harris Corner [102, 103]
  • • Rotation and scale invariance: Harris Corner can detect corners regardless of their orientation or size, making it suitable for applications where objects may undergo different transformations
  • • Robustness to image noise: Harris Corner is resistant to noise in the image, allowing for accurate corner detection even in noisy environments
  • • Sensitivity to parameter selection: The performance of Harris Corner relies on appropriately selecting parameters, such as the threshold and window size, which can be challenging and impact the accuracy of corner detection
  • • Limited to corner detection: Harris Corner is primarily designed for detecting corners and may not be effective in identifying other types of image features, such as edges or texture patterns
  • • Computationally intensive: The computational complexity of Harris Corner can be high, especially for large-scale images, which can limit its real-time applicability or require optimization techniques
  • • Susceptible to variations in lighting and contrast: Harris Corner may produce inconsistent results when there are variations in lighting conditions or contrast, affecting the reliability of corner detection in challenging imaging scenarios
  
SIFT and SURF [111116]
  • • Robustness to scale and rotation changes: SIFT and SURF can detect and match image features across different scales and orientations, making them suitable for applications with varying image transformations
  • • Distinctiveness and repeatability: SIFT and SURF produce highly distinctive feature descriptors that enable accurate matching even under changes in viewpoint, illumination, and partial occlusion
  • • Computational complexity: Both SIFT and SURF involve computationally intensive processes, which can limit their real-time performance, especially in applications with large-scale image datasets
  • • Sensitivity to image transformations: While SIFT and SURF are designed to be robust to scale and rotation changes, they may struggle with other image transformations, such as affine transformations or severe viewpoint changes
  • • Patent and licensing issues: SIFT was patented, and its usage may require licensing for commercial applications, which can limit its accessibility. SURF is an alternative that offers similar capabilities without patent restrictions
  • • Limited to 2D image data: SIFT and SURF are primarily designed for 2D image analysis and may not be as effective in scenarios that involve 3D data or video sequences where temporal information is crucial
  
ORB [117]
  • • Fast feature extraction: ORB combines the speed of the FAST keypoint detector with the efficiency of the BRIEF descriptor, making it computationally efficient for real-time applications
  • • Rotation and scale invariance: ORB is designed to be robust to scale and rotation changes, allowing for reliable feature matching across different viewpoints and orientations
  • • Limited invariance to affine transformations: ORB is less effective in handling affine transformations compared to scale and rotation changes
  • • Less distinctive descriptors: ORB’s binary feature descriptors are less distinctive compared to more complex descriptors, such as SIFT or SURF, which may lead to lower matching accuracy in certain scenarios
  • • Limited application to textured scenes: ORB performs better in scenes with sufficient texture, but its performance may degrade in textureless or low-texture environments
  • • Limited support for 3D data: ORB is primarily designed for 2D image analysis and may not be as effective in scenarios involving 3D data or video sequences where temporal information is crucial
  
BRISK [118]
  • • Efficiency and speed: BRISK features are designed to be computationally efficient, making it suitable for real-time applications
  • • Robustness to scale and rotation: BRISK can handle scale and rotational changes, providing reliable feature matching across different viewpoints
  • • Low memory usage: BRISK utilizes compact binary descriptors, resulting in lower memory requirements compared to methods using floating-point descriptors
  • • Robustness to image noise: BRISK features are designed to be robust against image noise, improving the accuracy of keypoint detection and matching
  • • Limited invariance to affine transformations: BRISK is less effective in handling affine transformations compared to scale and rotational changes
  • • Limited descriptor distinctiveness: The binary descriptors used in BRISK may be less distinctive compared to more complex descriptors, leading to lower matching accuracy in challenging scenarios
  • • Limited support for 3D data: BRISK is primarily designed for 2D image analysis and may not be as effective in scenarios involving 3D data or video sequences where temporal information is crucial
  • • Sensitivity to lighting changes: BRISK features can be sensitive to lighting variations, which can affect their reliability and matching performance in different lighting conditions
  
HOG [119121]
  • • Robustness to geometric and photometric transformations: HOG features are invariant to changes in object position, scale, and orientation, making them suitable for object detection tasks in various conditions
  • • Efficient computation: HOG features can be computed quickly, enabling real-time or near-real-time applications
  • • Interpretable features: HOG descriptors capture local image gradient information, providing meaningful representations that are interpretable by humans
  • • Generalizability: HOG features have been widely used and proven effective in various computer vision tasks, such as pedestrian detection and object recognition
  • • Limited spatial information: HOG features do not explicitly encode detailed spatial information, which can limit their performance in tasks where precise object localization is required
  • • Sensitivity to occlusion and clutter: HOG features may struggle to handle occluded or cluttered scenes where local gradient patterns are disrupted, leading to decreased detection accuracy
  • • Lack of scale invariance: HOG features are not inherently scale-invariant, which can make them less effective when dealing with objects at different scales
  • • Difficulty in handling complex object deformations: HOG features may not capture complex deformations or variations in object shape, limiting their ability to handle highly deformable objects or non-rigid motions
  
CNN [122, 123]
  • • Hierarchical feature learning: CNNs automatically learn hierarchical representations of data, enabling them to extract complex features at different levels, leading to superior performance in tasks like image classification and object recognition
  • • Spatial invariance: CNNs are inherently spatially invariant, making them robust to object translations and allowing for effective detection and recognition across different regions of an image
  • • Parameter sharing: CNNs use weight sharing, which reduces the number of parameters, enabling efficient training and inference on large-scale datasets
  • • Transfer learning: Pre-trained CNN models can be used as a starting point for new tasks, allowing for the effective transfer of knowledge and reducing the need for extensive training on limited datasets
  • • High computational requirements: Training and running CNN models can be computationally expensive, especially for large networks and high-resolution images, requiring significant computational resources
  • • Need for large labeled datasets: CNNs typically require large labeled datasets for effective training, making them less suitable for domains with limited annotated data or specialized tasks
  • • Lack of interpretability: The inner workings of CNNs can be challenging to interpret, making it difficult to understand the specific features or patterns that drive their predictions
  • • Sensitivity to adversarial attacks: CNNs are vulnerable to adversarial attacks, where carefully crafted input perturbations can lead to misclassification, raising concerns about their robustness in security-critical applications

4. Energy Storage Solutions for UAVs

The preceding sections primarily discussed the data handling and control management systems of a UAV. Along with understanding the methods to control a UAV, it is equally important to address the source that provides the necessary amount of energy to the UAV. Energy storage coupled with the other systems in the UAV determines the device’s overall weight and distribution. Moreover, finding the right type of energy storage is necessary as it dictates flight endurance. Factors to keep in mind while selecting energy storage for the UAVs are: (a) provide a long duration of charge, (b) have a good energy to weight ratio, (c) be environmentally friendly, (d) be readily available, and (e) produce little to no noise and vibration.

4.1. Single-Source Energy Storage

Some popular single-source energy storage solutions are explained below.

Batteries—Batteries are a very popular form of energy storage. Their applications can be seen in various consumer electronics, electric vehicles, lawnmowers, et cetera. They work on the principle of reduction oxidation (REDOX) reaction where chemical energy is converted into electric energy. Using batteries to store energy improves a propulsion system’s functionality and capabilities. Battery-powered platforms can meet a variety of hobbyist needs in terms of flight duration and cost-effectiveness. Many small UAVs, particularly quadcopters, are powered by batteries. Typical tiny UAVs powered by batteries have restricted endurance because of battery pack weight constraints. Using lithium polymer (LiPo) batteries, they can have a flight time of up to 90 min [124].

Hydrogen fuel cells (HFCs)—HFCs generate electricity by combining hydrogen (H2) with oxygen (O2) through a spontaneous chemical reaction where water is eliminated as a byproduct. Constant supplies of hydrogen are needed to run the fuel cell. HFCs have several advantages. They do not emit direct pollution, and they do not make noise. They are fueled by an energy-dense fuel that is the most prevalent molecule on the planet. HFCs are more energy efficient than other UAV power sources. They can power the UAVs for hours in the air. HFCs perform well in cold temperatures, unlike batteries whose chemical activity weakens at low temperatures. The refueling time of a hydrogen-fueled UAV is quick (can be refueled in under a minute). While HFCs may look promising, it is important to address their disadvantages as well. The cost of using HFCs is quite high. In addition, the generation, storage, and transportation of hydrogen is complicated and expensive [125]. Another problem with HFCs is that they generate a lot of heat. Given the widespread usage of plastic in UAVs such as drones, the generation of heat could melt some of the drone’s parts.

Super capacitors (SCs)—SCs have a high specific power ranging from 10 W/g which is much higher than a lithium-ion battery 1–3 W/g, thus, allowing for fast discharge and charge [126]. This is especially useful when a rapid recharge is necessary to recover boost capability from the secondary power source. In addition, supercapacitors have the advantage of providing power over the entire voltage range from 0 V to the maximum nominal voltage, unlike batteries that must operate within a specific range. Unlike an electrochemical battery, which has a fixed cycle life, cycling a supercapacitor causes little wear and tear which makes it age better. Under typical conditions, a supercapacitor’s capacity degrades from 100% to 80% after 10 years. Another advantage is that the supercapacitor is forgiving in both hot and cold climates, which batteries cannot match. Though they are advantageous in some respects, supercapacitors have their cons. First, a supercapacitor has low specific energy which means the flight time of the UAV is significantly low. Second, a full energy spectrum cannot be achieved because of its linear discharge. A supercapacitor also suffers from high self-discharge. And last, it is expensive to manufacture, thus, will raise the cost of the UAV [127].

4.2. Hybrid Energy Storage

Because of its dynamic load response capabilities and energy efficiency, a fuel cell and battery hybrid energy solution (Figure 5) for UAVs has lately gained interest. Backed by multiple studies, a fuel cell and battery hybrid provide an average energy efficiency of 45%, which is extremely effective for operating UAVs such as drones [128130]. A disadvantage, however, concerns safety-related issues with storing hydrogen in UAVs used for firefighting applications as hydrogen is very flammable and can catch fire easily in the hot environment it will be subjected to. In addition, the UAVs will have to be equipped with multiple accessories to keep hydrogen in its optimal state, which will add weight to the UAV and downgrade its maneuverability and flight time along with increasing its price.

Details are in the caption following the image
Fuel cell and battery energy storage.

Another disadvantage fuel cells have is that they have low instantaneous current supply capacity. Coupling a supercapacitor with the fuel cell (Figure 6) can overcome this disadvantage. A couple of tests and analyses were performed on fuel cells and supercapacitor hybrid, which report that the performance of the UAVs significantly improves. By not depleting the fuel cell, its longevity can be extended [131, 132].

Details are in the caption following the image
Fuel cell and supercapacitor energy storage.

Batteries can store huge amounts of energy, but they require a significant amount of time to charge. Whereas capacitors recharge quickly but only store a little energy. Combining the properties of a lithium battery with the fast charging of a capacitor, supercapacitor hybrid UAVs greatly boost the efficiency levels and power of Li batteries. They guarantee to save a huge amount of weight while doubling the distance of flight and flight time of the UAVs [133] (Figure 7).

Details are in the caption following the image
Battery and supercapacitor energy storage.

To make the energy delivery operation more efficient and reliable, the battery, fuel cell, and supercapacitor can be coupled in the order of two or three. The HFC excels at low power and long endurance, whereas the supercapacitor excels at brief bursts of high power and rapid recharging. Because the supercapacitor has limited energy storage, a battery is necessary to meet high power demand. The battery supplies the additional power essential for flight phases like takeoff and ascent and it is sized to last for the desired length of boost power [134]. This combination is reported to be extremely efficient and reliable [135] (Figure 8).

Details are in the caption following the image
Fuel cell, battery, and supercapacitor hybrid energy storage.

A solar-powered UAV requires it to be equipped with a vast span of solar cells along its surface. Such UAVs can fly without issue during sunny conditions. However, at night or on a cloudy day, a solar-powered UAV becomes inoperable. A solar battery hybrid drone is equipped with a battery backup along with solar panels. Thus, during sunny hours, the solar panels would harness energy to power the drone and charge the battery. This stored energy can then be used during nonsunny hours [136] (Figure 9).

Details are in the caption following the image
Battery and solar cell hybrid energy storage.

The attributes of different available energy sources are discussed in Table 5.

Table 5. Comparison of different energy storage systems.
Energy storage Advantages Disadvantages
Battery-powered UAVs [124]
  • • Versatility: Battery-powered UAVs offer versatility in terms of maneuverability, payload capacity, and flight duration, making them suitable for various applications such as aerial photography, surveillance, mapping, and package delivery
  • • Environmental friendliness: Battery-powered UAVs produce zero emissions, making them environmentally friendly compared to their fossil fuel-powered counterparts
  • • Quiet operation: Battery-powered UAVs operate with lower noise levels, making them suitable for applications where noise pollution needs to be minimized, such as in urban areas or wildlife monitoring
  • • Cost-effectiveness: Operating and maintaining battery-powered UAVs can be more cost-effective compared to traditional aircraft, as they require less fuel and have fewer mechanical components
  • • Limited flight time: Battery-powered UAVs have limited flight durations due to the energy constraints of batteries, which can restrict their operational range and require frequent battery replacements or recharging
  • • Payload capacity: Battery weight limits the payload capacity of UAVs, restricting the types of equipment or sensors that can be carried, which may limit their applications in certain industries or research fields
  • • Charging infrastructure: The need for charging infrastructure and access to power sources can be a logistical challenge, especially in remote or inaccessible areas where power outlets may be scarce
  • • Battery degradation: Over time, batteries used in UAVs may experience degradation, leading to reduced flight times and overall performance. Battery maintenance and replacement can add to the operational costs of battery-powered UAVs
  
Fuel cell-powered UAVs [125]
  • • Extended flight time: Fuel cell-powered UAVs offer longer flight durations compared to battery-powered UAVs, allowing for extended missions and increased operational range
  • • Quick refueling: Fuel cell systems can be refueled more quickly than recharging batteries, reducing downtime between missions and enabling rapid turnaround
  • • High energy density: Fuel cells have a high energy density, providing a greater amount of energy storage in a smaller space, allowing for larger payloads or increased flight efficiency
  • • Lower environmental impact: Fuel cells produce lower emissions compared to internal combustion engines, making fuel cell-powered UAVs environmentally friendly and suitable for sensitive areas or emission-restricted zones
  • • Limited infrastructure: Fuel cell refueling infrastructure is not as widespread as battery charging infrastructure, limiting the availability of refueling stations and potentially restricting the operational areas for fuel cell-powered UAVs
  • • Complex design and integration: Fuel cell systems require careful integration and additional components such as hydrogen storage and delivery systems, increasing the complexity and weight of the UAV, which can affect maneuverability and payload capacity
  • • Hydrogen storage challenges: Storing and handling hydrogen, the fuel for fuel cells, can be challenging due to its low density and flammability. Adequate safety measures and proper storage facilities are necessary
  • • Higher cost: Fuel cell systems are currently more expensive than battery systems, resulting in higher initial costs for fuel cell-powered UAVs. However, as technology advances and production scales up, costs are expected to decrease
  
Supercapacitor as an auxiliary power source [126, 127]
  • • Rapid charging: Supercapacitors can be charged quickly, allowing for shorter turnaround times between flights and reducing downtime during operations
  • • High power density: Supercapacitors offer high power density, enabling them to deliver bursts of power quickly, which is beneficial for applications that require sudden increases in power, such as takeoff and maneuvering
  • • Extended battery life: By serving as an auxiliary power source, supercapacitors can alleviate the demand on the main battery, extending its lifespan and improving overall battery management
  • • Enhanced performance in extreme temperatures: Supercapacitors can perform reliably in a wide range of temperatures, making them suitable for UAV operations in both hot and cold environments
  • • Limited energy density: Supercapacitors have lower energy density compared to batteries, resulting in limited energy storage capacity. This can restrict the flight time and payload capacity of UAVs using supercapacitors as an auxiliary power source
  • • Voltage drop: As supercapacitors discharge, their voltage drops linearly, which can affect the performance of the UAV and potentially lead to power interruptions if not properly managed.
  • • Cost: Supercapacitors are generally more expensive than traditional batteries, which can increase the overall cost of UAV systems utilizing supercapacitors as an auxiliary power source
  • • Limited availability: Supercapacitors with high capacity and performance may have limited availability in the market, making it challenging to source suitable supercapacitors for UAV applications
  
Fuel cell and battery [128130]
  • • Extended flight time: Fuel cells provide longer flight durations compared to batteries alone, enabling UAVs to operate for extended periods without frequent recharging or battery replacement
  • • High power density: Batteries offer high power density, allowing for quick bursts of power during maneuvers, while fuel cells provide continuous and steady power output for sustained operations
  • • Reduced environmental impact: Fuel cells produce electricity through the chemical reaction of hydrogen and oxygen, resulting in clean and emission-free power generation. Batteries also offer a greener alternative to internal combustion engines
  • • Versatile power options: The combination of fuel cell and battery systems provides versatility in power management, allowing for optimal utilization of each power source based on specific mission requirements
  • • Weight and size: Both fuel cell and battery systems add weight and occupy space in UAVs, potentially reducing payload capacity and affecting maneuverability
  • • Infrastructure Requirements: Fuel cells require a supply of hydrogen fuel and appropriate storage and refueling infrastructure, which may be limited or not readily available in some areas
  • • Cost: Fuel cell systems are generally more expensive compared to battery systems, contributing to the overall cost of UAVs. Additionally, fuel cells may require maintenance and periodic replacement of components
  • • Limited energy storage: While fuel cells provide longer flight times, the energy density of the fuel is generally lower than that of batteries, resulting in limited energy storage capacity for longer missions or heavy payload requirements
  
Fuel cell and supercapacitor [131, 132]
  • • Extended flight time: Fuel cells offer longer flight durations, while supercapacitors provide quick energy discharge for high-power maneuvers, allowing UAVs to balance between sustained operations and bursts of power
  • • Lightweight: Supercapacitors have a high power-to-weight ratio, contributing to weight reduction in UAVs. Fuel cells also offer a relatively lightweight power source compared to traditional combustion engines
  • • Fast charging: Supercapacitors can be rapidly charged, reducing downtime between flights. Fuel cells can be refueled relatively quickly compared to battery charging times
  • • Environmental friendliness: Fuel cells generate electricity through a clean chemical reaction, producing no harmful emissions. Supercapacitors are also considered environmentally friendly as they do not contain hazardous chemicals or heavy metals
  • • Limited energy storage: Fuel cells have limited energy density compared to traditional fuel sources, which can restrict flight duration. Supercapacitors also have lower energy density compared to batteries, limiting their ability to store large amounts of energy
  • • Cost: Fuel cell systems are generally more expensive compared to traditional combustion engines or battery systems. Supercapacitors can also be costlier than batteries, adding to the overall cost of UAVs
  • • Infrastructure requirements: Fuel cells require a supply of hydrogen and appropriate storage and refueling infrastructure, which may not be widely available. Supercapacitors require specialized charging infrastructure that may not be easily accessible in certain locations
  • • Complexity: Both fuel cell and supercapacitor systems require additional components and control systems, adding complexity to the UAV design and increasing maintenance and operational considerations
  
Battery and supercapacitor [133]
  • • Energy storage capacity: Batteries offer high energy density, allowing for longer flight durations. Supercapacitors provide quick energy discharge for high-power maneuvers and can handle a large number of charge–discharge cycles
  • • Lightweight: Supercapacitors have a high power-to-weight ratio, contributing to weight reduction in UAVs. Batteries also offer a relatively lightweight power source compared to traditional combustion engines
  • • Cost-effectiveness: Batteries are generally more cost-effective compared to fuel cells, making them a popular choice for UAV applications. Supercapacitors also offer a good balance between cost and performance
  • • Accessibility and Infrastructure: Batteries are readily available and widely used, with established charging infrastructure. Supercapacitors can be charged quickly and do not require complex infrastructure, making them more accessible and convenient for UAV operations
  • • Limited energy storage: Both batteries and supercapacitors have limitations in energy storage capacity compared to traditional fuel sources. This can restrict the flight duration and operational range of UAVs, requiring frequent recharging or swapping of power sources
  • • Charging time: Batteries require longer charging times compared to refueling or swapping fuel cells. Supercapacitors, although fast-charging, may not have sufficient energy density for extended flight durations
  • • Lifespan and durability: Batteries degrade over time and have limited charge–discharge cycles, requiring periodic replacement. Supercapacitors can handle more cycles but may have shorter overall lifespans compared to batteries
  • • Environmental impact: Both batteries and supercapacitors contain materials that can have environmental impacts if not disposed of properly. The production, recycling, and disposal processes of these energy storage systems need to be managed carefully to minimize environmental consequences
  
Battery, fuel cell, and supercapacitor [134, 135]
  • • Enhanced energy management and flexibility in power delivery
  • • Increased overall energy storage capacity for extended flight durations
  • • Redundancy and fault tolerance for improved reliability
  • • Optimal utilization of different power sources based on their strengths and characteristics
  • • Increased complexity and weight due to the integration of multiple power sources
  • • Higher cost compared to using a single power source
  • • Challenges in managing power distribution and coordination between different sources
  • • Additional maintenance and monitoring requirements for each power source increase operational complexity
  
Solar cell and battery [136]
  • • The use of PVCs may help in up to 59% of fuel savings in addition to reducing the weight of the UAV
  • • Renewable energy: Solar cells provide a sustainable and renewable power source, reducing dependency on fossil fuels
  • • Extended flight time: Continuous charging from sunlight extends the UAV’s flight endurance
  • • Environmental-friendly: Solar power reduces carbon emissions and minimizes environmental impact
  • • Silent operation: Solar-powered UAVs operate quietly, making them suitable for noise-sensitive applications
  • • Needs a large wingspan to maximize the amount of light energy captured (which increases the weight of UAVs)
  • • Limited power generation: Solar cells are dependent on sunlight availability, limiting power generation in low-light or cloudy conditions
  • • Weight and space constraints: Solar panels add weight and occupy space, potentially affecting the UAV’s payload capacity
  • • Charging time: Charging from solar energy is slower compared to traditional battery charging methods
  • • Reliance on battery: UAVs still require a battery for power storage, adding complexity and weight to the system

4.3. Power Management Strategies for UAVs

Hybridization is the ideal design for powering a UAV propulsion system. It combines the benefits and capabilities of several power sources while balancing their drawbacks. Power must be distributed as required from a variety of sources to produce an optimal result of energy efficiency and good performance while increasing lifespan as much as possible. As a result, a power management strategy (PMS) must be implemented. The essentials to consider while implementing PMS in real-time are good efficiency, quick response time, and fuel usage along with power requirements and flight circumstances.

Table 6 presents a short comparison between two PMSs—rule-based strategies and FL strategies.

Table 6. Comparison of PMSs.
PMSs Advantages Limitations
Rule-based strategies [137, 138]
  • • Simplicity: Rule-based strategies are relatively simple to implement and understand, requiring minimal computational resources
  • • Real-time responsiveness: Rule-based systems can react quickly to changing power demands and conditions, providing immediate power management decisions
  • • Transparency and interpretability: The decision-making process in rule-based strategies is transparent and interpretable, allowing easy validation and adjustment of rules
  • • Low computational overhead: Rule-based approaches have low computational requirements, making them suitable for resource-constrained UAVs
  • • Lack of adaptability: Rule-based strategies may struggle to adapt to dynamic and uncertain operational conditions, potentially leading to suboptimal power management decisions
  • • Limited complexity: Rule-based approaches may not capture the complexity of power management problems, resulting in suboptimal performance compared to more sophisticated algorithms
  • • Manual rule design: Designing effective rules requires expert knowledge and manual rule tuning, which can be time-consuming and challenging
  • • Limited optimization: Rule-based strategies often rely on heuristics and predefined rules, limiting their ability to optimize power management based on specific performance criteria
  
Fuzzy logic strategies [139, 140]
  • • Flexibility: Fuzzy logic allows for flexible and adaptive decision-making, accommodating uncertainties and imprecise inputs in power management systems
  • • Multivariable control: Fuzzy logic can handle multiple input variables simultaneously, enabling comprehensive power management considering various factors
  • • Expert knowledge Incorporation: Fuzzy logic allows the incorporation of expert knowledge and linguistic rules, capturing domain-specific insights for improved decision-making
  • • Real-time adaptation: Fuzzy logic systems can adapt and adjust power management decisions in real time, responding to changing operational conditions
  • • Complexity in rule design: Designing an optimal fuzzy logic system requires expert knowledge and careful selection of linguistic variables and membership functions
  • • Computational overhead: Fuzzy logic computations can be computationally intensive, requiring more processing power and memory compared to simpler algorithms
  • • Interpretability challenges: Fuzzy logic systems can be difficult to interpret and validate due to the complex relationships among linguistic variables and rules
  • • Limited optimization: Fuzzy logic strategies may not provide the same level of optimization as more advanced optimization techniques, potentially leading to suboptimal power management outcomes

5. Applications of Deep Learning Algorithms in UAVs

Future directions of promising research in UAVs hybrid energy management systems and deep learning applications: AI driven reinforcement leaning algorithms can be used to establish energy management strategies to improve flight time for different applications like agriculture, disaster management, surveillance by optimizing the required energy sources like batteries, fuel cells, and solar panels, and SCs. The deep learning techniques like long short-term memory (LSTM) and other ML models can be used for state-of-charge (SOC) and state-of-health (SOH) estimation and prediction of system failures, UAVs endurance. CNN algorithms can be used for network optimization in swarm UAVs applications like logistics and surveillance and disaster management. The recent technology in advanced deep learning algorithms in communication of swarm robotics and key findings are given in Table 7. Many industries are focusing on the development of digital twin technology to improve the prediction accuracy of deep learning algorithms about the system faults, lifetime by analyzing the sensors data in simulations.

Table 7. Deep learning algorithms applications in UAVs.
Research focus Methodology Key findings and contributions Year of publication Applications
Optimization using multi access edge computing (MEC) [141] Convex and nonconvex optimization algorithms The iterative algorithm improves 5G network experience by 19%–34% compared to benchmark 2024 5G communication, and latency sensitive applications
Wireless coverage optimization with constraints with multiple UAVs [142] Multi and improved multiobjective gray wolf optimizer for wireless coverage optimization Optimized energy and balanced coverage with energy optimization is developed 2024 Disaster management applications and wireless communication multi-UAV based applications
Multi access edge computing in IOT networks [143] Deep reinforcement learning and Lyapunov optimization in IOT networks An online learning-based algorithm to reduce service delay 2023 Future transportation, smart cities, and IOT networks
Assisted data collection using multi-UAVs for optimization [144] Three step ant colony optimization algorithm is used based on density Proposed a strategy to improve the network performance by data accuracy 2023 Real time monitoring with UAV, sensor, and IOT networks
Reduce mission time and cost using multi-UAVs for real-time data collection [145] Heuristic approximations and successive convex optimizations algorithms Optimized UAV trajectory for optimal performance of the allocated missions 2023 UAV based surveillance and monitoring applications and IOT networks

6. Discussion

For an effective operation and functioning of a UAV in fire disaster management, a suitable sensor fusion algorithm and path planning algorithm are necessary along with control strategies and a precise fire detection algorithm. Those can be considered the four prongs of the control management of the UAV. Numerous types of each prong are compared based on the merits and demerits they offer in the tables. A UAV requires an efficient energy storage and management system of which there are multifarious types. This section summarizes the various aspects of each type of sensor fusion algorithms, path planning algorithms, control strategies, fire detection algorithms, and hybrid energy storage solutions.

6.1. Sensor Fusion Algorithms

Optimal data fusion and its computational time are the primary aspects one must study when analyzing sensor fusion algorithms. BT offers advantages such as better spatial resolution and simple development. Though it suffers from distortion of color and cannot be used on a broad scale. It also reduces the amount of spectral data while combining RGB images. Unlike BT, filtering algorithms such as HPF, LPF, and Kalman decrease spectral asymmetry. They eliminate the accelerometer spike and frequency smoothing during the fusion of sensor data. Those algorithms can be relied upon for good calculation because of their iterative nature. But the iterative method produces a huge amount of useless data. This may lead to the formation of blurred images as output. Wavelet-based fusion can be considered for the fusion of images if the spectrum must be changed before the combining process. One major advantage it holds is that it retains the initial color and statistical parameters. However, it suffers from insufficient spatial resolution and is limited to only the linear environment. If the application demands a high spatial resolution fusion, one must use PCA. It can be used in a nonlinear environment, unlike wavelet-based fusion. It, however, suffers from spectral distortion and low accuracy if the number of sensors is low. Another type of fusion is statistical-based. Spatial and temporal adaptive reflectance fusion model (STARFM), spatialtemporal adaptive algorithm for mapping reflectance change (STAARCH), and spatiotemporal data fusion method (STDFM) are adaptable to nature. The issue with those algorithms, however, is that they avoid BRDF, which compromises accuracy in heterogeneous scenarios. If robustness and simplicity are demanded, geostatistical-based fusion such as Kringing and BME must be used. But they are limited to hard data. BME can be combined with MQQA to form a hybrid that will provide an integrated fusion algorithm. However, that requires in-depth knowledge about using hybrid or it provides erroneous results. HIS algorithm is proven to provide satisfying spatial augmentation, but at the cost of spectral distortion.

6.2. Path Planning Algorithms

The path planning algorithm must be fast, efficient, and accurate to provide the shortest path in the least amount of time without being excessively computationally expensive. The three important path-planning algorithms are RRT, RRT, and informed RRT. Factors used to compare the three are convergence rate, computational cost, ability to locate difficult paths, and optimization speed. In terms of convergence rate, RRT is slow compared to RRT and informed RRT. Along with that, it requires a high-cost initial solution. RRT offers a better convergence rate compared to RRT, but suffers from a costly initial solution, an issue found in RRT. However, that is solved in informed RRT, which acquires a feasible solution and enjoys an excellent convergence rate. Regarding computational cost, the RRT algorithm consumes a huge amount of computational resources as it calculates all the paths from the starting nodes regardless of their direction toward the finish point. Unlike RRT, RRT’s path-finding method is more confined toward the finish point because it needs to evaluate less data. Informed RRT, however, consumes the least amount of computational resources as it samples the subset directly, unlike RRT and RRT. RRT takes a long time to determine a feasible path as it does its sampling randomly. Because of that, its optimization speed is slow. RRT samples paths based on the nearest nodes to the start point, making the pathfinding and time to optimization quicker than RRT. Informed RRT samples the subset directly and finds the feasible path in the least amount of time with the fastest optimization speed. However, informed RRT’s search intrinsically relies on the present solution, hence, it cannot focus when the related prolate hyper spheroid is bigger than the planning problem. Figure 10 shows the comparison of cost and run time of different path planning algorithms, and in Table 8, the comparison of these path planning algorithms is given.

Details are in the caption following the image
Path cost and run time of different path planning algorithms.
Table 8. Comparison of path planning algorithms [146, 147].
Path planning algorithms Path cost (m) Run time (s) Number of nodes (tree density)
RRT 56 150 2954
RRT 50 621 2972
Informed RRT 41 210 2000
Bidirectional Informed RRT 40.95 125.54 1195

6.3. Control Algorithms

Algorithms that control UAVs should be fast, accurate, and reliable along with being robust to uncertainties in the system and environment. PID controllers are advantageous if the application cannot handle a high computational effort. It is a simple structured strategy codified based on the difference between a desired SP and a measured PV. Prior knowledge of the plant is not required and it does not depend on complex mathematical models. However, setting up the PID controller is time-consuming as the tuning parameter can only be determined by trial and error. It does not hold any robustness against uncertainties and external disturbances. It does not support the MIMO system. To combat some of the issues with PID, intelligent PID is developed which has better robustness for control systems. But its adaptability and parameter handling capacity are low. Unlike PID, MPC controllers can support MIMO systems along with having good parameter handling capacity. They provide zero state error and decrease peak overshoot. However, operating an MPC demands intensive computational resources along with being tedious to integrate as it requires a nonlinear optimization problem to be solved online. Also, its ability to handle uncertainties is poor. Compared to the offset-free MPC’s static disturbance correction components, adaptive MPC may be able to handle time-varying disturbances and address plant-model mismatch across larger prediction horizons. Another type of control algorithm is SMC which is high-speed and robust. It offers a simple architecture with immense accuracy as it does not depend on a complex mathematical model and is easy to implement. It is suitable for plants that work in the presence of parametric uncertainties and MIMO systems. Although SMC is robust, it is vulnerable to plant parametric instability and sliding mode disturbances. It also suffers from the chattering phenomenon along with having zero constraint handling capacity. There are several variations of SMCs developed such as integral SMC, terminal SMC, fractional order SMC, higher order SMC, SMC with feed-forward disturbance compensation, boundary layer SMC, SMC with reaching law modification, and SMC with AI. Additionally, a FL controller (FLC) can improve transient and steady-state performance over a PI controller. It is robust toward speed control applications along with being compatible with MIMO systems. But to use FLC, prior knowledge of the plant is necessary. Like PID, it does not have good constraint-handling capacity. It requires a huge amount of computational power.

6.4. Fire Detection Image Processing Algorithms

Fire detection algorithms are expected to detect the location of fire effectively. The Harris Corner algorithm is known for not being influenced by disturbances during rough flights because of its simple and independent rotation and motion. The algorithm is limited to a single scale and the results of detection overly depend on the threshold. For an under-the-scale scenario and rotation changes, SIFT, SURF, and BRISK algorithms are proven to be effective. However, their performance is not up to the mark when subjected to an illumination change. BRISK has many outliers as it uses a binary feature and matching approach. The ORB algorithm offers advantages such as high computational efficiency and stability. It is, however, ineffective in detecting scale invariance in real-time feature points. Another algorithm that is popular amongst developers is CNN. It holds the ability to recognize relevant features without requiring human intervention. It also learns unique characteristics for each class on its own. But to implement CNN, a huge amount of training data is required. Additionally, it cannot position and orient an object. The HOG algorithm is subjected to low susceptibility to external influences as environmental changes. However, the gears used to operate HOG are expensive and disturbances on the surface erode the method’s statistical dependability.

6.5. Energy Storage Systems and Management

Hybrid energy storage systems can increase the flight time of a UAV. There are several types of hybrid energy systems. Solar cell and battery is a much-discussed combination that can be used. Photovoltaic cells (PVCs) can help reduce the weight of the UAV and achieve up to 59% of fuel savings. However, the UAVs should have a large wingspan to maximize the amount of light energy captured which will increase weight. Coupling fuel cells with batteries can provide excellent dynamic load response. The combination is energy efficient and offers a longer flight time. Because the battery is coupled with fuel cells, the amount of hydrogen required is reduced which improves space efficiency. The remaining hydrogen is still very flammable. Also, the onboard H2 generation system makes the system complex. If the UAV requires frequent impulses of power, the fuel cell can be coupled with a supercapacitor. It improves the performance of the UAV while increasing the longevity of the fuel cell. But the whole system is costly and an increased amount of complexity is involved in its implementation. To eliminate the risks associated with hydrogen as fuel, the fuel cell can be replaced by a battery-supercapacitor hybrid. With this combination, the efficiency and power of UAVs can be increased significantly. Along with that, the supercapacitor hybrid is lighter compared to other hybrid energy solutions. However, complex EMS is required. To acquire the best of everything, a tri-energy storage system can be used which comprises the battery, fuel cell, and supercapacitor. It is highly efficient, fulfilling the energy requirement for all operating conditions effectively. Though it is expensive because of its increased complexity. In Table 9, the endurance time with different energy sources is discussed with various energy source combinations.

Table 9. Comparison of endurance time with hybrid energy systems [148, 149].
Type of energy source Fuel cell (kW) Battery (Ah) Super capacitor (F) Endurance time (min)
Fuell cell + battery 0.55 5.3 140
Fuel cell + super capacitor 0.5 3500 100
Battery + super capacitor 4.5 120 60
Battery + solar cell 4.3 150 80
Battery + fuel cell + super capacitor 12.5 40 88 1800

Figure 11 shows the endurance times of some UAVs with hybrid energy system. Power must be distributed as required to produce an optimal result of energy efficiency and good performance with a lifespan of as long as possible. Rule-based strategies offer advantages such as low computational cost because of their simplistic nature, prolonged performance, and ease of accessibility. However, they are not the best performers in terms of prediction quality. One can also consider FL strategies as they use less memory, and thus, offer high energy efficiency. Though the rules of FL require constant monitoring and need to be updated frequently.

Details are in the caption following the image
Endurance time of UAV with hybrid energy storage systems.

7. Conclusions and Future Work

An elaborate review was conducted on the advancement in the technology of UAVs for firefighting applications. The paper initially dealt with the current scenario of firefighting techniques where various issues like delay in fire extinguishment, risk of injury for victims and firefighters, et cetera, were exposed. The review focused on the following aspects of the development of UAVs: control management systems involved the discussion of an effective sensor fusing algorithm, optimal path planning strategy, and various control and hybrid energy storage solutions, fire detection algorithms to detect and extinguish a fire. Successively, various path planning algorithms were reviewed and RRT, RRT, and informed RRT were extensively discussed. Informed RRT required the least amount of time for the convergence of the discovered solution to the optimum which made it 88% faster than RRT and RRT. Regarding the control algorithms, several of them were discussed based on their working methodology and the advantages and disadvantages they concurred. It was found that simple algorithms such as PID required less computational energy compared to the more sophisticated control strategies though at a cost to the robustness of the system. Complex algorithms such as SMC with AI offered enhanced performance and robustness, but required a large amount of computational resources. Concerning the fire detection algorithms, the review of various methods was discussed. The HOG algorithm was proven to be the most effective in discerning the fire in the most optimal and precise manner. Following this, the hybrid energy storages for UAVs were discussed and compared to understand which type of energy storage combination is most optimal for increasing flight time and to distribute the energy effectively across all the components, efficient energy management was discussed.

In the future, one can expect to see even more widespread use of these techniques, particularly in applications where the system dynamics are complex or hard to model. Adaptive control algorithms can adjust their parameters in real time based on the system’s current behavior. This makes them well-suited for use in systems where the operating conditions change frequently or are hard to predict. In the future, there is a potential to see more research on developing control algorithms that can handle distributed systems, including techniques for coordinating the actions of the different components. Hybrid energy storage systems, which combine multiple types of storage technologies (such as batteries, supercapacitors, and fuel cells), have the potential to greatly improve the performance and endurance of UAVs. It would be a crucial step towards the development of long-duration UAVs that can be deployed for a variety of tasks such as surveillance, disaster management, transportation, and scientific research.

Disclosure

We certify that the submission is original work and is not under review at any other publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Author Contributions

All coauthors have seen and agree with the contents of the manuscript.

Funding

This work was funded by Vellore Institute of Technology (VIT), Vellore. The Seed Grant Financial Committee (SGEC) has given the project under process/product development File No: SG 20210132. The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through Large Research Project under Grant RGP2/239/46. All authors would like to express their gratitude to the United Arab Emirates University, Al Ain, UAE, for providing financial support with Grant 12R283.

Acknowledgments

This work was funded by Vellore Institute of Technology (VIT), Vellore. The Seed Grant Financial Committee (SGEC) has given the project under process/product development File No: SG 20210132. The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through Large Research Project under Grant RGP2/239/46. All authors would like to express their gratitude to the United Arab Emirates University, Al Ain, UAE, for providing financial support with Grant 12R283.

    Data Availability Statement

    Data sharing is not applicable to this article as no new data were created or analyzed in this study.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.