Recent Advances in Automatic Modulation Classification Technology: Methods, Results, and Prospects
Abstract
As an essential technology for spectrum sensing and dynamic spectrum access, automatic modulation classification (AMC) is a critical step in intelligent wireless communication systems, aiming at automatically recognizing the modulation schemes of received signals. In practice, AMC is challenging due to the influence of communication environment and signal parameters, such as unknown channels, noise, symbol rate, signal length, and sampling frequency. In this survey, we investigate a series of typical AMC methods, including key technologies, performance comparisons, advantages, challenges, and future key development directions. According to the methodology and processing flow, AMC methods are divided into three categories: likelihood-based (Lb) methods, feature-based (Fb) methods, and deep learning methods. The technical details of various types of methods are introduced and discussed, such as likelihood distributions, artificial features, classifiers, and network structures. Then, extensive experimental results of state-of-the-art AMC methods on public or simulated datasets are compared and analyzed. Despite the achievements that have been made, individual methods still have limitations in generalization capability, inference efficiency, model complexity, and robustness. Finally, we summarize the severe challenges faced by AMC and key future research directions.
1. Introduction
With the development of the sixth-generation (6G) wireless communication system, more diversified and complex application scenarios are gradually being considered [1, 2], in which drone [3] or vehicle [4] communication systems can provide real-time and reliable network access for highly mobile users. Drones or vehicles can provide hotspot coverage for mobile users, compensate for the blind areas not covered by ground communication base stations, and improve the overall coverage of the network. By deploying communication nodes in the air to enhance and supplement the ground communication network [5–7], wireless communication systems can effectively improve the network capacity, reduce communication congestion, and ensure the smooth operation of communication services under special scenarios, such as dense urban areas and natural disaster sites. In addition, the high-speed wireless communication system can be combined with the ground communication system [8], satellite communication system [9], and other communication modes [10, 11] to realize a highly integrated network architecture, which is able to flexibly meet the requirements of various application scenarios and provide users with efficient and reliable communication services. For example, the combination of self-driving vehicles and intelligent traffic management systems helps to realize real-time and efficient information transmission and exchange, improving the safety, efficiency, and sustainability of road traffic. In the fields of virtual reality (VR) [12], augmented reality (AR) [13], and the Internet of Things (IoT) [14], wireless communication systems can provide flexible and efficient network access to meet the demanding requirements for communication performance in new applications.
In the wireless communication system, the modulation schemes of transmitted signals may need to be adjusted in real time according to the channel condition due to the signal transmission distance, terrain, buildings, and other factors. As one of the key technologies of wireless communication systems, the AMC technique [15] aims to monitor and recognize the signal modulation scheme in real time, so as to help the wireless communication system adaptively adjust the reception parameters and improve the signal reception quality. This is crucial to ensure stable communication between users and ground control centers in complex environments [16]. On the other hand, wireless communication systems face various non-cooperative interference in complex environments, e.g., cities and mountains. By suppressing and filtering the non-cooperative interference signals, the AMC technique is of great significance in improving the anti-interference capability of wireless communication systems in strong interference environments. Through integrating AMC technology, the wireless communication system can monitor the channel state and modulation scheme, which helps to complete dynamic spectrum allocation (DSA) and resource management, further improve spectrum utilization, and optimize communication quality. Specifically, in the case of poor channel quality, a configuration with lower order modulation and higher coding rate can be used to reduce bit error rates (BERs). In the case of better channel quality, higher order modulation and lower coding rate can be used to improve spectrum utilization and communication rate. Besides, the AMC technique can simplify the design of wireless communication systems by eliminating the need to design separate receivers for different modulation schemes [17], which helps to reduce the system costs and design complexity and improve the practicability and feasibility of wireless communication systems.
In practical wireless communication environments, signals can be easily affected by adverse factors such as multipath propagation [18], interference [19], and noise [20], resulting in complex signal waveforms and posing formidable challenges to AMC. In a multipath propagation environment, channel estimation becomes more complex and difficult [21]. The received signal copies may have different amplitudes, phases, and time delays due to multipath propagation, resulting in signal fading and distortion at the receiving end [22]. The time delay causes the signal to spread out in the time domain, leading to misaligned symbol boundaries between the received and transmitted signals. At low signal-to-noise ratios (SNRs), the signal waveform may be severely disturbed by noise and interference. Circuit elements in wireless communication systems (e.g., oscillators, mixers, and amplifiers) generate phase noise during operation, including thermal noise, flicker noise, and random noise. The Doppler effect [23] caused by the high-speed movement of drones or cars causes the frequency of the received signal to differ from the frequency of the transmitted signal, thus causing frequency offset. Antennas may be affected by environmental factors like wind and ice loads, and changes in orientation and shape can also affect the propagation characteristics of signals [24]. Therefore, the quality of the dataset used for signal modeling determines the actual performance of AMC algorithms. On the other hand, the design and application of increasingly complex modulation schemes, such as SQPSK, π/4-QPSK, and CPFSK, further increase the difficulty of recognition, since the differences between higher order intraclass modulation schemes are usually difficult to capture. Some modulation schemes even involve multilevel modulation or mixed modulation [25]. In wireless communication systems, the AMC technique requires high adaptability and the ability to track and recognize changes in the modulation schemes of signals in real time, as the modulations of communication signals may change dynamically over time [26], which places requirements on the complexity and inference speed of AMC methods.
At present, AMC methods are mainly divided into the following three categories: likelihood-based (Lb) methods, feature-based (Fb) methods, and deep learning (DL) methods. The foundational work is shown in Figure 1. The Lb methods achieve the automatic classification of modulation schemes through Bayesian estimation, including the average likelihood ratio test (ALRT) [27], generalized likelihood ratio test (GLRT) [28], hybrid likelihood ratio test (HLRT) [29], and some other variants [30, 31]. Although Lb methods can adapt to various types of modulated signals, their analytical solutions are usually difficult to compute, and they require accurate estimation of prior probabilities and likelihood functions. The Fb methods first extract features from the received signals and then design a machine learning model to fit the features and complete the AMC. Common features include zero-crossing rate [32], peak [33], peak interval [34], high-order cumulants [35], cyclic spectrum [36], etc. These features are usually easy to implement and possess a low computational complexity, but feature extraction is highly susceptible to noise and interference. Machine learning models, such as support vector machine (SVM) [37], random forest (RF) [38], and decision tree (DT) [39], can learn effective feature representations from large amounts of signals, but there are requirements for the quality and quantity of the training dataset. DL methods utilize overparameterized deep neural networks (e.g., convolutional neural networks (CNNs) [40–44], recurrent neural networks (RNNs) [45–49], graph neural networks (GNNs) [50–52], transformers [53–55], and hybrid neural networks [56–59]) to perform end-to-end feature extraction and classification of signals without feature designing and engineering, which exhibit strong nonlinear fitting capabilities and can achieve excellent AMC accuracy in complex wireless communication environments. However, DL models usually have a high structural complexity and thus require large computational resources and storage, which limits their practical applications.

Based on the stacking of nonlinear connections with large-scale parameters, DL models have demonstrated excellent feature learning capabilities and achieved remarkable success in many research fields, including computer vision [60], intelligent manufacturing [61], and signal analysis [62]. In recent years, DL has been gradually developed for AMC, has shown great potential, and remains an important research direction for the future. Unlike traditional methods that rely on manually designed features and domain expertise, DL models are able to automatically learn feature representations of signals and thus can be more easily generalized to a variety of application scenarios without the requirements for extensive feature engineering for specific situations. DL methods utilize large amounts of data and complex network structures to achieve strong generalization capabilities and adapt to multiple modulation schemes and channel conditions. Moreover, DL models can adjust the model structure and parameters to achieve better performance according to actual needs. Although the internal structures of DL models are more complex, the learned features can be explained through visualization techniques, such as feature maps [63], attention weights [64], and heatmaps [65]. This provides DL methods with a degree of interpretability, helps analyze the working principle of the model, and further improves AMC performance. It can be said that the development of DL has brought new opportunities and breakthroughs to the AMC technique.
In this paper, we provide a comprehensive survey on the recent advancements in AMC techniques from the perspective of traditional methods and deep learning methods, with emphasis on the important link between signal modeling and AMC. We first introduce the specific mainstream AMC methods and discuss both their advantages and disadvantages. Then, we compare and analyze their AMC performance in terms of classification accuracy and inference speed. Finally, we summarize the critical challenges faced by AMC and future development trends. Although a number of reviews have been published, there is currently a lack of joint reviews of methods and results in the AMC field. In this survey, we focus on inspiring and innovative work from the following four aspects: signal representations, model structures, performance evaluation, and robustness to channel conditions.
The rest of the survey paper is organized as follows. In Section 2, we formulate the problem of AMC. Section 3 and Section 4 introduce the Lb and Fb methods, respectively. In Section 5, we describe the popular deep learning methods for AMC. The experimental results and analysis are reported in Section 6. Current challenges and future trends are discussed in Section 7. Finally, the conclusions are summarized in Section 8.
2. Problem Formulation
2.1. Communication Model
The typical wireless communication system with AMC is shown in Figure 2. First, the encoded information is modulated onto the wireless carrier signals, which are then transmitted through an antenna to other users or ground stations. During the channel transmission, the signal may be affected by multipath fading [66], unknown noises [67], frequency offset [68], phase offset [69], timing error [70], and other interferences. In the receiver, the modulated signals are demodulated from the wireless carrier signals, and then, the original information can be recovered to execute subsequent commands, such as real-time control of devices and video/image transmission. During the communication process, AMC can avoid manual judgment of modulation schemes and reduce labor cost, especially in large-scale telecommunication networks.

Most existing communication models consider only single-channel scenarios, such as Rayleigh fading channels and Rician channels, while ignoring the influence of multipath channels. However, multiple channel scenarios and interference factors may exist simultaneously in practical communication environments [73], so the assumption of a single scenario may limit the generalization capability of AMC methods. In addition, signal transmissions can be subject to various hardware limitations, such as nonlinear effects [74] and filter characteristics [75]. At present, the simplified channels in most studies can hardly reflect the complexity of the actual communication environments accurately, which affects the performance evaluation of various methods [76, 77]. To overcome these limitations, future communication models need to comprehensively consider various factors in practical communication environments to more accurately explore and evaluate AMC methods.
Besides, the multiple-input multiple-output (MIMO) system makes it challenging to accurately model the wireless channels. The diversity of transmission signals increases the number of candidate modulation schemes and increases the difficulty of AMC. Multiple users transmitting data simultaneously can cause complex interference, resulting in the received signal containing information from multiple users. Similarly, the orthogonal frequency division multiplexing (OFDM) technique divides the frequency band into multiple subcarriers, each of which can independently modulate and transmit different data, thereby achieving higher bandwidth utilization. However, the occurrence of multichannel interference in the frequency band gradually increases the difficulty of signal reception, making automatic modulation recognition more complex. In OFDM systems, both time-domain and frequency-domain jitter may occur. Time-domain jitter can cause subcarrier insertion and deletion, while frequency-domain jitter can cause the frequency of subcarriers to deviate from the center frequency, resulting in instantaneous frequency offset and phase rotation of received signals, increasing the difficulty of AMC.
2.2. Signal Representation
In AMC tasks, the representation of signals is directly related to the method design and AMC performance. The most popular signal representations include temporal signals, spectrum, and constellation. In most work, a single signal representation is used as input for driving the model. Even if some signal representations are attempted to be fused, they remain in a single time or frequency domain [78], resulting in fragmented time-frequency analysis and a lack of effective means to fuse information from multiple domains. Some typical signal representation examples are shown in Figure 3.

2.2.1. Temporal Representation
The in-phase (rI) and quadrature (rQ) components can be regarded as the real and imaginary parts of the received signal. In many works [66–68], I/Q signals are directly fed into the model in the form of a matrix to learn features and drive the model to complete AMC.
To fully utilize the information hidden in the received signals, I/Q components and A/P information are combined as the signal representation for model training in some works [45, 79].
The quality of temporal signal representations determines subsequent feature learning and establishment of mapping relationship between received signals and modulation schemes. Appropriate signal representation can provide sufficient information to improve the accuracy and robustness of AMC. The fusion of multiple signal representations is an important direction to enhance model learning capability. The contemporary signal representation fusion methods include serial fusion [80] and parallel fusion [81]. The serial fusion refers to the sequential input of multiple representation vectors into the classifier, while the parallel fusion is the combination of multiple representation vectors into a new high-dimensional matrix, which is then fed into the classifier.
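For illustration, the following minimal NumPy sketch contrasts the two fusion styles for I/Q and A/P representations; the frame length and array shapes are illustrative assumptions rather than settings from any specific work.

```python
import numpy as np

# Toy received signal: 128 complex baseband samples (illustrative length).
rng = np.random.default_rng(0)
r = rng.standard_normal(128) + 1j * rng.standard_normal(128)

# Two signal representations: I/Q components and amplitude/phase (A/P).
iq = np.stack([r.real, r.imag])             # shape (2, 128)
ap = np.stack([np.abs(r), np.angle(r)])     # shape (2, 128)

# Serial fusion: representations are concatenated into one long vector
# that is fed to the classifier as a single input.
serial_input = np.concatenate([iq.ravel(), ap.ravel()])   # shape (512,)

# Parallel fusion: representations are stacked as separate channels of a
# higher dimensional matrix, so the classifier sees them side by side.
parallel_input = np.stack([iq, ap])         # shape (2, 2, 128)

print(serial_input.shape, parallel_input.shape)
```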
2.2.2. Spectrum
It is necessary to pay attention to the trade-off between the temporal and frequency resolution in specific application scenarios. On the other hand, the size of the spectrum directly determines the amount of information contained, which indirectly affects the computational complexity and inference speed of AMC methods. In addition, a larger spectrum size may lead to model overfitting, especially when the number of training data is limited. A smaller spectrum size helps reduce the risk of overfitting, but may lead to information loss and degradation of AMC accuracy. Some studies have proposed alternative frequency-domain analysis methods, such as the wavelet transform (WT) with local analysis and multiscale analysis capabilities [83]. The WT can adapt to the frequency and time characteristics of various signals, thereby improving the robustness to varying SNRs. For certain specific wavelet bases, such as the Daubechies wavelet and Haar wavelet, specific fast methods [84, 85] can significantly reduce the computational complexity and improve analysis efficiency.
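As a concrete example of this resolution trade-off, the following sketch computes spectrogram inputs with two different window lengths using SciPy's short-time Fourier transform; the sampling rate and the toy signal are illustrative assumptions.

```python
import numpy as np
from scipy.signal import stft

fs = 1e6                               # assumed sampling rate (Hz)
t = np.arange(1024) / fs
x = np.exp(2j * np.pi * 1e5 * t)       # toy complex baseband tone

# A long window gives fine frequency resolution but coarse time resolution;
# a short window does the opposite.
for nperseg in (64, 256):
    f, tt, Zxx = stft(x, fs=fs, nperseg=nperseg, return_onesided=False)
    spectrogram = np.abs(Zxx) ** 2     # power spectrum image fed to the model
    print(nperseg, spectrogram.shape)  # (frequency bins, time frames)
```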
Compared to temporal signals, spectra are more amenable to data augmentation to help improve the generalization of AMC models, such as spectrum interference [86] and sample generation based on generative adversarial networks (GANs) [87]. A certain level of interference to frequency-domain information can help expand the sample space and improve decision boundaries, but the interference degree needs to be empirically set based on the complexity of the task. GANs mine sample features to make new samples and help models learn robust modulation knowledge, but their stability and reliability are still being explored. Although GANs have been successfully applied to natural and facial images [88], there is a lack of theoretical basis for whether the modulation properties of spectral images can be truly learned.
2.2.3. Constellation
In the above equations, the signal r is J times decimated with offset equal to all the integer symbol timings, and rj(k) denotes the kth symbol of the decimated signal of offset j.
Actually, some other constellation variants have also been studied to improve AMC performance, including the Cauchy-score constellation [90], constellation density [91], and slotted constellation [92]. However, most of the current studies on constellation representations focus on how to cover the modulation information comprehensively, neglecting the design and training process of corresponding classifiers. On the other hand, classifiers like AlexNet [93] and ResNet [81] transferred from image processing tasks are challenging to adapt to the constellation diagrams, as the distribution or structure of symbols rather than pixel values is more noteworthy. In addition, constellation diagrams usually contain a large number of blank areas due to their sparsity, which has uncertain adverse effects on classifiers. How to reduce the dimensions of constellation diagrams and refine modulation knowledge is also an important future research direction.
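For reference, a constellation-style density image can be rasterized from received symbols with a simple two-dimensional histogram, as in the following sketch; the bin count, axis extent, and toy QPSK data are illustrative choices.

```python
import numpy as np

def constellation_image(symbols, bins=64, extent=1.5):
    """Rasterize complex symbols into a 2D density image (constellation diagram)."""
    edges = np.linspace(-extent, extent, bins + 1)
    img, _, _ = np.histogram2d(symbols.real, symbols.imag, bins=(edges, edges))
    return img / img.max()              # normalize counts to [0, 1]

# Toy QPSK symbols with additive noise (illustrative only).
rng = np.random.default_rng(0)
bits = rng.integers(0, 4, 1024)
qpsk = np.exp(1j * (np.pi / 4 + np.pi / 2 * bits))
noisy = qpsk + 0.1 * (rng.standard_normal(1024) + 1j * rng.standard_normal(1024))

img = constellation_image(noisy)        # e.g., an input image for a CNN classifier
print(img.shape)
```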
2.3. Problem Description
The likelihood function is represented by the probability density function (PDF) of the communication model and observation data, representing the probability of received data under a given modulation scheme. Different communication models require the use of corresponding likelihood functions. The design of likelihood functions directly determines the AMC accuracy and the complexity of the system. To reduce the computational complexity of Lb methods in practical AMC applications, statistical features (e.g., mean, variance, and cumulants) have been proposed as substitutes for likelihood functions [31].
In the Fb and DL-based AMC methods, designing appropriate objective functions to measure model performance is a key issue. The optimization of the objective function is prone to falling into local minima, especially when the number of training samples is insufficient. The optimal trade-off between overfitting and underfitting also needs to be considered to support the generalization of the classifier in different application scenarios and communication environments. Although a large number of stochastic gradient descent (SGD) optimization algorithms [94, 95] and their variants [78, 96] have been developed, the theoretical basis associated with ensuring a lower bound on the modulation information is inadequate.
3. Lb Methods
In Lb methods, the likelihood function is computed and updated for the selected communication model to meet the complexity requirements or to be usable in a non-cooperative environment. Afterward, a threshold is used to determine the fit between the likelihood ratio test results of signals and the modulation candidates. The overall process of Lb methods is shown in Figure 4. In this section, we focus on introducing and analyzing typical Lb methods, including ALRT [27], GLRT [28], HLRT [29], and some variants [30, 31].
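As an illustration of this processing flow, the following sketch implements an ALRT-style classifier for an idealized AWGN setting with perfect synchronization and known noise variance; the candidate set and parameters are toy assumptions.

```python
import numpy as np

# Candidate constellations (unit average power). Perfect synchronization and a
# known noise variance are assumed -- the idealized setting in which ALRT applies.
CANDIDATES = {
    "BPSK": np.array([1, -1], dtype=complex),
    "QPSK": np.exp(1j * np.pi / 4) * np.array([1, 1j, -1, -1j]),
}

def alrt_classify(r, sigma2):
    """Pick the modulation maximizing the likelihood averaged over equiprobable symbols."""
    scores = {}
    for name, s in CANDIDATES.items():
        d2 = np.abs(r[:, None] - s[None, :]) ** 2            # (N, M) squared distances
        per_sample = np.mean(np.exp(-d2 / sigma2), axis=1)    # average over symbols
        scores[name] = np.sum(np.log(per_sample + 1e-300))    # total log-likelihood
    return max(scores, key=scores.get), scores

rng = np.random.default_rng(0)
tx = rng.choice(CANDIDATES["QPSK"], 512)
noise = np.sqrt(0.05 / 2) * (rng.standard_normal(512) + 1j * rng.standard_normal(512))
print(alrt_classify(tx + noise, sigma2=0.05)[0])              # expected: "QPSK"
```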

3.1. ALRT
3.2. GLRT
It can be seen that the GLRT-based classifier can handle high SNR, but is sensitive to low SNR. Once there are too many candidate modulation schemes, especially high-order modulation schemes like 64QAM, the AMC accuracy loss of GLRT is obvious. Moreover, its real-time performance is still difficult to meet the requirements in large-scale MIMO systems.
3.3. HLRT
Compared to ALRT and GLRT, HLRT is adept at distinguishing different types of modulation (e.g., AM and FM) and is robust to noise. However, HLRT requires the selection of appropriate statistics and mixing strategies, which is challenging in practical applications. The AMC accuracy of the hybrid likelihood ratio degrades if strong correlation exists between the statistics. For complex signals, HLRT faces the same problem as ALRT and GLRT, i.e., it requires significant computational costs and long processing time.
3.4. Variants and Analysis
For specific cases, some variants of the likelihood function are used to improve AMC performance, such as the randomized likelihood ratio test (RLRT) [30], which randomizes the training samples and features to generate statistics, and the decision-directed likelihood ratio test (DLRT) [31], which incorporates search into the LRT.
The Lb AMC methods rely on the estimation of modulation parameters and usually require manual setting of multiple hyperparameters, such as search range and iteration number, which exhibit a significant impact on AMC accuracy. In practical applications, selecting appropriate hyperparameters according to communication scenarios is a challenging problem. RLRT has strong robustness by simulating noise distribution, providing an important research direction. By modeling the noise distribution and introducing prior knowledge to guide the design of likelihood functions, the generalization capability of the algorithm can be improved. Therefore, how to further utilize unstructured expert knowledge is crucial. Search guidance in DLRT can reduce the space for candidate modulation schemes and improve the accuracy in classifying M-PSK. In fact, the most popular aspect in the Lb methods is to design appropriate likelihood functions to capture complex features and potential relationships hidden in the signals, such as nonlinear likelihood functions that are more suitable for nonlinear modulation recognition.
4. Fb Methods
In the Fb methods, the signals are first preprocessed through denoising, filtering, and downsampling, and then, the corresponding features are extracted. The key features are selected and used to train machine learning models, i.e., to complete feature learning and modulation classification by minimizing the objective function. The processing flow is shown in Figure 5.

4.1. Feature Extraction
4.1.1. Spectrum-Based Features
(1) The maximum spectral power density $\eta_{sp}$ of the normalized and centered instantaneous amplitude of the received signal is calculated by the following equation:

$$\eta_{sp} = \frac{1}{K}\max\left|\mathrm{DFT}\left(r_{AK}(k)\right)\right|^{2}$$

where $\mathrm{DFT}(\cdot)$ represents a discrete Fourier transform and $K$ is the number of samples. The variable $r_{AK}$ is the normalized and centered instantaneous amplitude of the received signal $r$, and $\mu_{A}$ is the mean of the instantaneous amplitude of the signal, in which the amplitude normalization is proposed to compensate for the attenuation caused by unknown channel effects. They can be computed by the following equation:

$$r_{AK}(k) = \frac{r_{A}(k)}{\mu_{A}} - 1, \qquad \mu_{A} = \frac{1}{K}\sum_{k=1}^{K} r_{A}(k)$$

(2) The standard deviation $\eta_{nc}$ of the normalized and centered instantaneous amplitude is calculated by the following equation:

$$\eta_{nc} = \sqrt{\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} r_{AK}^{2}(k) - \left(\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} r_{AK}(k)\right)^{2}}$$

where $K_{c}$ represents the number of symbols that satisfy the constraint of $r_{K}(k) > r_{t}$, and $r_{t}$ is a threshold that filters out the symbols with low amplitudes.

(3) The standard deviation $\eta_{as}$ of the absolute value of the segmented instantaneous amplitude of the normalized and centered signal is calculated by the following equation:

$$\eta_{as} = \sqrt{\frac{1}{K}\sum_{k=1}^{K} r_{AK}^{2}(k) - \left(\frac{1}{K}\sum_{k=1}^{K} \left|r_{AK}(k)\right|\right)^{2}}$$

(4) The kurtosis $\eta_{ia}$ of the normalized and centered instantaneous amplitude is calculated by the following equation:

$$\eta_{ia} = \frac{\frac{1}{K}\sum_{k=1}^{K} r_{AK}^{4}(k)}{\left(\frac{1}{K}\sum_{k=1}^{K} r_{AK}^{2}(k)\right)^{2}}$$

(5) The standard deviation $f_{as}$ of the absolute value of the normalized and centered instantaneous frequency is calculated by the following equation:

$$f_{as} = \sqrt{\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} f_{AK}^{2}(k) - \left(\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} \left|f_{AK}(k)\right|\right)^{2}}$$

where $f_{AK}$ is the normalized and centered instantaneous frequency.

(6) The symmetry measurement $f_{ss}$ of the spectrum around the carrier frequency is calculated by the following equation:

$$f_{ss} = \frac{P_{L} - P_{U}}{P_{L} + P_{U}}$$

where

$$P_{L} = \sum_{k=1}^{f_{ak}} \left|R(k)\right|^{2}, \qquad P_{U} = \sum_{k=1}^{f_{ak}} \left|R\left(k + f_{ak} + 1\right)\right|^{2}, \qquad R(k) = \mathrm{DFT}\left(r(k)\right)$$

In the above equations, $f_{ak} + 1$ denotes the number of symbols corresponding to the carrier frequency $f_{c}$, and $f_{s}$ is the sampling rate.

(7) The kurtosis $f_{ia}$ of the normalized and centered instantaneous frequency is calculated by the following equation:

$$f_{ia} = \frac{\frac{1}{K}\sum_{k=1}^{K} f_{AK}^{4}(k)}{\left(\frac{1}{K}\sum_{k=1}^{K} f_{AK}^{2}(k)\right)^{2}}$$

(8) The standard deviation $\varphi_{id}$ of the instantaneous direct phase of the nonlinear component is calculated by the following equation:

$$\varphi_{id} = \sqrt{\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} \varphi_{NL}^{2}(k) - \left(\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} \varphi_{NL}(k)\right)^{2}}$$

where $\varphi_{NL}(k)$ denotes the nonlinear component of the instantaneous phase.

(9) The standard deviation $\varphi_{as}$ of the absolute value of the instantaneous phase of the nonlinear component is calculated by the following equation:

$$\varphi_{as} = \sqrt{\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} \varphi_{NL}^{2}(k) - \left(\frac{1}{K_{c}}\sum_{r_{K}(k) > r_{t}} \left|\varphi_{NL}(k)\right|\right)^{2}}$$
The above features are applicable to single-carrier modulation methods, such as BPSK, QPSK, 8PSK, AM, and FM. These modulation schemes mainly use phase information for information transmission, so the amplitude and frequency characteristics are relatively fixed. In OFDM, different subcarrier frequencies and phases are usually used for modulation. Therefore, the characteristics of amplitude, frequency, and phase are all of great significance. It should be noted that different communication scenarios and modulation methods may have certain differences. For specific communication scenarios and modulation methods, more or other types of features may be involved, or more complex feature engineering methods may be required. Therefore, it is very important to select and extract features targeted at specific tasks and scenarios.
The manually designed spectral features have good interpretability, which clearly reflects the key characteristics of the signal and helps the researcher to understand the relationship between features and modulation schemes and thus improve the model performance. Artificial features can be easily and flexibly implemented and deployed in toolkits without the need for complex optimization. On the other hand, artificial features are difficult to adapt to changes in different modulated signals, resulting in poor generalization performance. The process of manual design and feature engineering is easily influenced by subjective factors. It is worth noting that artificial features cannot automatically learn implicit relationships in the data, which limits the performance improvement of the model. Besides, as the signal dimension increases, artificial features may suffer from the curse of dimensionality.
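To make the computation of such instantaneous-statistics features concrete, the following sketch derives a few of them (the maximum spectral power density and the kurtosis of the normalized and centered instantaneous amplitude and frequency) from a complex baseband frame; the simplified definitions and the toy BPSK signal are illustrative assumptions.

```python
import numpy as np

def spectral_features(r, fs):
    """Sketch of a few instantaneous amplitude/frequency features for one frame."""
    K = len(r)
    a = np.abs(r)                              # instantaneous amplitude
    a_cn = a / a.mean() - 1.0                  # normalized and centered amplitude (r_AK)
    # Maximum spectral power density of the normalized-centered amplitude.
    eta_sp = np.max(np.abs(np.fft.fft(a_cn)) ** 2) / K
    # Kurtosis of the normalized-centered instantaneous amplitude.
    eta_ia = np.mean(a_cn ** 4) / np.mean(a_cn ** 2) ** 2
    # Instantaneous frequency from the unwrapped phase (Hz), then normalize/center.
    phase = np.unwrap(np.angle(r))
    f_inst = np.diff(phase) * fs / (2 * np.pi)
    f_cn = (f_inst - f_inst.mean()) / fs
    f_ia = np.mean(f_cn ** 4) / np.mean(f_cn ** 2) ** 2
    return np.array([eta_sp, eta_ia, f_ia])

rng = np.random.default_rng(0)
bpsk = rng.choice([1.0, -1.0], 1024) + 0.05 * rng.standard_normal(1024)
print(spectral_features(bpsk.astype(complex), fs=1e6))
```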
4.1.2. Cumulants
Cumulants [35] are measures that describe the characteristics of a data distribution and can be used to measure its skewness and kurtosis. By studying the high-order statistical characteristics of data, cumulants can be used to classify different categories of data. First, cumulant analysis is performed on the original data to obtain a series of features, capturing the characteristics of the data distribution. Then, the original data are projected onto a low-dimensional space by selecting important high-order cumulants, thereby reducing computational complexity and storage requirements.
The high-order cumulants still have some limitations in practical applications, e.g., the computational complexity of cumulant features is high, especially when dealing with high-dimensional data. In addition, the cumulants are sensitive to noise and instability, so some antinoise and enhancement techniques need to be adopted. The choice of appropriate orders of cumulants is also a concern. Correlation algorithms based on information entropy [97], mutual information [98], or other specific metrics can be introduced to pick the most discriminative high-order cumulant features.
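As a concrete example, the following sketch estimates two widely used fourth-order cumulants from received samples; the moment-based formulas follow the standard definitions, and the toy BPSK/QPSK data are illustrative.

```python
import numpy as np

def fourth_order_cumulants(r):
    """Estimate |C40| and |C42|, two classic AMC features, from complex samples."""
    r = r / np.sqrt(np.mean(np.abs(r) ** 2))    # normalize to unit power
    M20 = np.mean(r ** 2)
    M21 = np.mean(np.abs(r) ** 2)               # equals 1 after normalization
    M40 = np.mean(r ** 4)
    M42 = np.mean(np.abs(r) ** 4)
    C40 = M40 - 3 * M20 ** 2
    C42 = M42 - np.abs(M20) ** 2 - 2 * M21 ** 2
    return np.abs(C40), np.abs(C42)

rng = np.random.default_rng(0)
bpsk = rng.choice([1.0 + 0j, -1.0 + 0j], 4096)
qpsk = rng.choice(np.exp(1j * np.pi / 2 * np.arange(4)), 4096)
print(fourth_order_cumulants(bpsk))   # theoretical |C40| = 2 for BPSK
print(fourth_order_cumulants(qpsk))   # theoretical |C40| = 1 for QPSK
```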
4.1.3. Cyclostationarity
The cyclostationarity features describe the modulation characteristics of received signals by analyzing their statistical characteristics on a periodic basis, which can effectively capture the periodicity, spectral characteristics, and phase information. At present, the commonly used cyclostationarity features include the periodogram [36], cyclic autocorrelation function (CAF) [99], cyclic cross-correlation function (CCCF) [100], cyclic spectral correlation coefficient (CSCF) [101], and cyclic spectral correlation graph (CSCG) [102]. The periodogram describes the frequency distribution of signals for AMC by calculating their power spectral density (PSD) in the frequency domain. The CAF and CCCF calculate the autocorrelation and cross-correlation of signals under different time lags to capture the phase information. The CSCF further calculates the correlation between CAF and CCCF to characterize the modulation relationship for AMC. The CSCG is an extension of CSCF in the frequency domain, describing the periodic similarity of signals in frequency.
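For illustration, the cyclic autocorrelation function can be estimated directly from its definition, as in the following sketch; the rectangular-pulse BPSK signal, oversampling factor, and chosen cycle frequency are toy assumptions.

```python
import numpy as np

def cyclic_autocorrelation(x, alpha, lag, fs):
    """Estimate the cyclic autocorrelation R_x^alpha(lag) at cycle frequency alpha (Hz)."""
    n = np.arange(len(x) - lag)
    # Correlate the signal with its lagged copy, demodulated at the cycle frequency.
    return np.mean(x[n + lag] * np.conj(x[n]) * np.exp(-2j * np.pi * alpha * n / fs))

# Toy BPSK at 4 samples per symbol: the symbol rate appears as a cycle frequency.
rng = np.random.default_rng(0)
fs, sps = 1.0, 4
symbols = rng.choice([1.0, -1.0], 1024)
x = np.repeat(symbols, sps) + 0.1 * rng.standard_normal(1024 * sps)

print(abs(cyclic_autocorrelation(x, alpha=0.0, lag=1, fs=fs)))        # conventional autocorrelation
print(abs(cyclic_autocorrelation(x, alpha=1.0 / sps, lag=1, fs=fs)))  # symbol-rate cycle frequency
```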
In terms of computational complexity, fast time-frequency transformation methods such as the fast Fourier transform (FFT) can effectively improve computational efficiency. The WT and other types of methods can be used to reduce the sensitivity to noise. In addition, cyclostationarity features typically exhibit high dimensions, leading to the curse of dimensionality and overfitting. Traditional dimensionality reduction methods such as principal component analysis (PCA) [103] and linear discriminant analysis (LDA) [104] can reduce feature dimensions, but may degrade the AMC accuracy. Overall, it is necessary to select appropriate cyclostationarity features according to specific communication scenarios and even combine them with other features such as high-order cumulants to improve AMC accuracy.
4.1.4. Feature Fusion
Considering the limitations of a single feature, the feature fusion strategy can integrate feature information from different sources, levels, and representations to form a more comprehensive and discriminative feature representation. Various typical feature fusion strategies have been widely studied, including serial fusion [105], parallel fusion [106], cascade fusion [107], decision fusion [108], weighted fusion [109], and deep learning–based nonlinear fusion [110], but few are applied to the AMC.
Serial fusion connects different feature vectors one by one to construct a higher dimensional feature, which is easy to implement but increases computational complexity. This type of direct fusion rarely brings significant performance improvement and may even have adverse effects when there are too many feature modalities. It is also difficult for the model to contextualize the full concatenated vector, which reduces the efficiency of serial fusion, especially when the features are complex.
Parallel fusion concatenates various types of feature vectors in a multichannel manner to form a feature matrix. Different features achieve knowledge aggregation and joint reasoning through information exchange between channels. Compared to serial fusion, parallel fusion has higher efficiency in feature fusion. For example, Wu et al. [106] converted the modulated signal into two image representations, the cyclic spectrum and the constellation diagram, and fused them into a dual-channel image input.
Decision fusion inputs various feature vectors into independent classifiers and then fuses each decision result through voting, averaging, or other ensemble methods. It can fully utilize the advantages of multiple feature extractors and classifiers, avoid the failure of a single feature or classifier, and improve the robustness of the entire AMC system. Obviously, the decision fusion method has the highest computational cost as it requires the design of corresponding classifier for each feature.
In practical communication systems, signal transmission can be affected by many interference factors such as channel noise and multipath propagation, resulting in a decrease in signal quality. Therefore, the weighted fusion mechanism can be introduced to assign different weights to different features in the process of feature fusion. It can adaptively adjust feature weights based on the importance of features, reducing the sensitivity of a single feature to signal quality fluctuations. However, both feature weights and classifier parameters need to be optimized simultaneously, which may make the optimization process more complex.
Researchers have started to turn to end-to-end deep learning models to automatically learn the nonlinear relationships between different features and realize automatic weighted fusion and analysis of features. Considering the flexibility of deep learning model structures, feature fusion is highly scalable and can be adapted to application scenarios with different levels of complexity. Deep learning–based feature fusion alleviates the fragmented time-frequency-domain analysis of traditional methods and has achieved significant performance improvement in many modulation classification tasks.
4.2. Machine Learning Models
4.2.1. DT
The DT [39] composed of a series of decision nodes is the most suitable classifier for processing spectral features, as shown in Figure 6. In the tree structure, the input node is used to import all types of features. After the input node, a series of conditions or threshold-based judgments with specific individual features are used to identify the modulation scheme of the signal.

The structure of the DT is easy to understand and implement, and therefore, it can clearly display the signal features represented by each branch and leaf node. The high interpretability is helpful to analyze the working principle of the model and classification results. The DT-based methods can automatically select important features during the individual judgment process, which is beneficial for reducing the workload of feature designing and engineering. In addition, DTs possess strong robustness to outliers and noise, but are sensitive to signal parameters and typically require dynamic threshold adjustments.
In practical applications, the construction of DTs involves recursively partitioning features, resulting in high computational complexity, especially when dealing with large datasets, where model efficiency is difficult to meet requirements. The output of DTs may be affected by the order of data input, leading to model instability. In fact, improving the splitting criteria and pruning strategy of DT-based methods is an effective way to improve the generalization ability and computational efficiency of the model [111]. Besides, the most important thing is the efficient feature extraction, such as combining the powerful feature extraction ability of deep learning models and the interpretability of DT [112].
4.2.2. RF
On the basis of DTs, RFs [38] have been developed for AMC. By integrating multiple trees, the RF-based methods mitigate and prevent the overfitting problems of a single tree. By setting the number and maximum depth of trees, RFs train each tree using different feature subsets to reduce the prediction variance. In practical applications, the training and testing of RFs support parallelization, which can fully utilize modern hardware resources to accelerate the computational process.
Although the AMC accuracy of RFs is usually better than a single DT, their visualization and interpretability are poor. RFs may be affected by noise and redundant information during feature selection, leading to the prediction bias. Due to the voting mechanism of RFs, errors in individual DTs sometimes have a significantly detrimental impact on the final classification results. Therefore, the adaptability of RFs in feature selection is a research focus [113]. Moreover, how to make RFs more robust in dealing with changes in data distribution is also a key research direction [114].
4.2.3. SVM
By mapping the original data to a higher dimensional feature space through a mapping function, SVM [37] makes the nonlinear classification problem in the original space linearly separable in the feature space. Using optimization algorithms like the Lagrange multiplier method and sequential minimal optimization, SVM searches for a hyperplane in the feature space to distinguish samples of different categories while maximizing the margin, i.e., the distance from the nearest samples to the hyperplane. Finally, a new test sample is mapped to the feature space and its distance from the hyperplane is calculated to determine its category.
In the practical application of SVM for the AMC task, its classification performance highly depends on the selection of kernel functions, so identifying different modulation methods usually requires extensive experiments. When the sample distribution between categories is uneven, SVM may lean toward the majority class, resulting in a decrease in classification performance for the minority class. In addition, SVM is a binary classifier, requiring specialized methods (such as the “one vs all”) to handle the multiclassification problem, which increases both the computational complexity and difficulty in parameter adjustment. Once there are too many categories of candidate modulation schemes, the inference efficiency of the model can be significantly affected.
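A minimal usage sketch with scikit-learn is given below; the feature matrix, labels, and hyperparameters are purely illustrative placeholders rather than a recommended configuration.

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Hypothetical feature matrix: one row of handcrafted features (e.g., cumulants,
# spectral statistics) per received signal, with an integer modulation label.
rng = np.random.default_rng(0)
X = rng.standard_normal((600, 6))
y = rng.integers(0, 3, 600)          # 3 candidate modulation schemes (toy labels)

# RBF-kernel SVM with feature standardization; scikit-learn handles the
# multiclass decomposition internally.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=10.0, gamma="scale"))
clf.fit(X[:500], y[:500])
# With random placeholder data the accuracy is near chance; real features are needed.
print(clf.score(X[500:], y[500:]))
```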
4.2.4. Model Ensemble
Many types of traditional machine learning classifiers have been developed, such as k-nearest neighbor (KNN) [115] and hidden Markov model (HMM) [116]. To address the limitations of each classifier, the model ensemble strategy can be utilized to leverage the strengths of each model.
Model ensemble [117] is able to combine the prediction results of various models to improve the overall discriminative performance of the AMC system. Especially in some complex communication conditions, the model ensemble can make the discrimination system more stable. Due to the diversity of ensembled models and the possibility of converging to different local optima, the robustness of the AMC system can be improved and the risk of overfitting to specific types of modulated signals can be reduced. In addition, the model ensemble can better cope with outliers, as different models have varying degrees of sensitivity to various parameters.
However, compared with a single model, model ensemble requires higher computational resources and time to complete training and reasoning, especially on large-scale datasets. The selection of features has also become more complex, as different models require corresponding features to drive them. During the model ensemble process, multiple parameters need to be adjusted simultaneously, such as the importance of each model, which makes the joint optimization problem difficult. Therefore, it is necessary to choose appropriate base models and ensemble strategies (such as bagging [118], boosting [119], and stacking [79]) to improve the AMC accuracy.
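As a simple example of such an ensemble, the following sketch builds a weighted soft-voting combination of the classifiers discussed above using scikit-learn; the estimators and weights are illustrative assumptions.

```python
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Soft-voting ensemble of the base classifiers discussed above; the weights are
# illustrative and would normally be tuned for the communication scenario.
ensemble = VotingClassifier(
    estimators=[
        ("dt", DecisionTreeClassifier(max_depth=8)),
        ("rf", RandomForestClassifier(n_estimators=100)),
        ("svm", SVC(kernel="rbf", probability=True)),   # probabilities needed for soft voting
    ],
    voting="soft",
    weights=[1, 2, 2],
)
# ensemble.fit(X_train, y_train); ensemble.predict(X_test)  # with features as in the SVM sketch
```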
5. End-to-End Deep Learning Methods
Benefiting from the layer-by-layer stacking of dense nonlinear transformations, heavily overparameterized deep learning models have achieved superior feature learning and classification performance in AMC tasks, especially the ability to generalize across various communication scenarios. A wide variety of deep learning model structures, differing in aspects such as coding methods, connectivity, and activation functions, have been developed to deal with different signal representations and extract robust features. The entire processing flow of deep learning methods is shown in Figure 7. First, the deep learning model is initialized, and then, the objective function of the model is trained to converge. Finally, the converged model is used for testing to evaluate its AMC performance.

5.1. CNNs
CNNs, as the representative models of deep learning, use convolutional layers to extract features from inputs and use pooling layers to reduce the feature dimensionality. Finally, modulation schemes are classified through fully connected layers with softmax. The mechanisms of weight sharing, local connection, and pooling in CNNs make them highly efficient in extracting spatial features. CNNs can extract and fuse features from both the original time-domain signal and spectrum, forming higher level representations, thereby improving the AMC accuracy. Some typical CNN models specifically designed for AMC are shown in Figure 8.
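To fix ideas, the following minimal PyTorch sketch follows this generic convolution–pooling–fully connected pattern for raw I/Q frames; the layer sizes, frame length, and number of classes are illustrative assumptions and do not correspond to any specific published model.

```python
import torch
import torch.nn as nn

class SimpleAMCCNN(nn.Module):
    """Minimal CNN for raw I/Q input of shape (batch, 2, 128); sizes are illustrative."""
    def __init__(self, num_classes=11):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(2, 64, kernel_size=7, padding=3), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(64, 64, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool1d(2),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(), nn.Dropout(0.5),
            nn.Linear(64 * 32, 128), nn.ReLU(),
            nn.Linear(128, num_classes),       # logits; softmax is applied in the loss
        )

    def forward(self, x):
        return self.classifier(self.features(x))

model = SimpleAMCCNN()
logits = model(torch.randn(4, 2, 128))         # 4 toy I/Q frames of length 128
print(logits.shape)                            # torch.Size([4, 11])
```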




In [40], MCNet analyzes multiscale spatiotemporal correlations to improve the AMC accuracy under poor conditions at a low computational cost. The first convolutional layer is used to extract generic features and reduce the spatial dimension. Then, two layers containing asymmetric convolutional kernels organized in parallel are deployed to reduce the number of weights. Finally, the features of stacked convolutional encoding are output as the modulation scheme in the form of a probability distribution through the fully connected layer with softmax. It is worth noting that skip connections are adopted for block-wise association to prevent gradient vanishing and accordingly improve the AMC accuracy. In [41], the convolutional layer, dropout layer, and Gaussian noise layer are applied for regularization and reducing the overfitting in the system. The noise layer improves the model’s robustness to noise by simulating the noise distribution. In [42], additional dense layers and Gaussian noise layers are added to the traditional CNN structure. Hou et al. [44] designed a complex CNN, which is composed of residual units, max-pooling, flatten, and fully connected layer with softmax. Both frequency and phase characteristics of radio signals can be utilized to recognize the modulation schemes in each frequency band. Zhang et al. [110] proposed adding a feature fusion layer to the CNN to achieve end-to-end full-stage fusion of features. Using cuboidal convolution kernels, the three-dimensional CNN [120] captures underlying features such as intra- and interantenna correlations at multiscale signal representations.
To reduce the training costs of CNNs and improve inference efficiency, some lightweight structures have been designed. In [90], the bottleneck and shuffle unit are used to avoid the potential overfitting risk and lower the computational complexity. The model requires only 19.35M floating-point operations (FLOPs) in total, about one-tenth of a three-layer ResNet. Some special convolutional operations have also been developed, such as point-wise convolution (PWC) and depth-wise separable convolution (DWSC) [61]. The PWC adopts convolution kernels of 1 × 1 size, which maintains the size of the output feature maps. PWC only involves operations on a single pixel, so it has high computational efficiency and is suitable for large-scale convolution operations. Due to the independent operation of each channel by PWC, it can highlight the feature differences between channels and better express information between different channels. The DWSC divides the traditional convolution operation into a depth-wise convolution (DWC) and a PWC, where each DWC kernel is responsible for one channel. DWSC is able to greatly reduce the number of parameters, thereby reducing the risk of overfitting. DWSC can explore more different convolutional combinations to better capture features in signals and improve the model’s expressive power.
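The following sketch contrasts a standard 1D convolution with a depth-wise separable counterpart in PyTorch; the channel sizes are illustrative, and the parameter counts printed at the end show the reduction discussed above.

```python
import torch.nn as nn

class DepthwiseSeparableConv1d(nn.Module):
    """Depth-wise separable convolution: a per-channel (depth-wise) convolution
    followed by a 1x1 point-wise convolution, as a lightweight alternative to Conv1d."""
    def __init__(self, in_ch, out_ch, kernel_size):
        super().__init__()
        self.depthwise = nn.Conv1d(in_ch, in_ch, kernel_size,
                                   padding=kernel_size // 2, groups=in_ch)
        self.pointwise = nn.Conv1d(in_ch, out_ch, kernel_size=1)

    def forward(self, x):
        return self.pointwise(self.depthwise(x))

standard = nn.Conv1d(64, 128, kernel_size=5, padding=2)
separable = DepthwiseSeparableConv1d(64, 128, kernel_size=5)
print(sum(p.numel() for p in standard.parameters()),
      sum(p.numel() for p in separable.parameters()))   # far fewer weights in the separable form
```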
CNNs can extract a variety of features through multilayer stacking of convolution kernels, while also possessing excellent contextual correlation capabilities. However, due to their structural limitations, CNNs can hardly perform precise time-frequency transform operations (e.g., FFT), resulting in insufficient frequency-domain analysis [121]. In some studies, CNNs driven by multimodal data containing I/Q sequences and spectra are plagued by parameter uncertainty issues, such as the Fourier series [122] and window function size [123]. In addition, most of the CNN structures are transferred from visual tasks [93], and little consideration is given to the modulation characteristics of wireless communication signals by designing specialized modules to extract robust features and improve AMC performance.
5.2. RNNs
In the AMC problem, the input layer of the RNN usually accepts time-domain slices of one-dimensional signals as the input, the hidden layer uses RNN units (such as LSTM or GRU) to learn temporal information in sequence data, and the output layer provides predictions of the modulated signal. The cell state in the RNN unit is used to store the historical information of each sequence, which plays a crucial role in feature learning. The gating mechanism composed of the input gate, forget gate, and output gate controls the flow of information in the cell state. Some typical RNN models for AMC are shown in Figure 9.
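A minimal PyTorch sketch of this input–hidden–output arrangement is shown below; the hidden size, number of layers, and frame length are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LSTMAMC(nn.Module):
    """Minimal LSTM classifier: the I/Q signal is treated as a length-T sequence
    of 2-dimensional samples; hidden sizes are illustrative assumptions."""
    def __init__(self, num_classes=11, hidden=128):
        super().__init__()
        self.lstm = nn.LSTM(input_size=2, hidden_size=hidden,
                            num_layers=2, batch_first=True)
        self.fc = nn.Linear(hidden, num_classes)

    def forward(self, x):                 # x: (batch, T, 2)
        out, _ = self.lstm(x)             # out: (batch, T, hidden)
        return self.fc(out[:, -1, :])     # classify from the last time step

model = LSTMAMC()
print(model(torch.randn(4, 128, 2)).shape)   # torch.Size([4, 11])
```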




In [45], a single-layer LSTM based on the attention mechanism is proposed for processing long-sequence signal data. The attention mechanism is added to enable the LSTM model to capture temporal features of long-sequence data with faster convergence than the traditional LSTM. Moreover, signal embedding is introduced in the model to cover the modulation information more comprehensively and accurately. In [47], a novel one-shot neural network pruning method based on the weight magnitude and gradient momentum is proposed to produce sparse RNN structures. Experimental results demonstrate that it is crucial to retain nonrecurrent connections while pruning RNNs. Zang et al. [49] proposed a modified hierarchical RNN with grouped auxiliary memory (GAMHRNN), in which grouped auxiliary memory is attached to each layer of the hierarchical structure through fast connections. Li et al. [124] pointed out that RNNs can be used to learn the intrinsic regularity of temporal series, and the attention mechanism is helpful for RNNs to focus on the correct pulses and ignore the noise. Bhatti et al. [125] designed a bidirectional LSTM (BiLSTM) structure containing two parallel layers of LSTM for forward propagation and backward propagation, respectively. In the BiLSTM, the minibatch processing operation allows for anticausal behavior and effectively increases the amount of data accessible to the network at every time step, providing the model with a richer context. Liu et al. [126] have verified that BiLSTM layers can extract the contextual information of signals well and address long-term dependence problems. By adding the self-attention block [127], the BiLSTM is able to obtain both the historical and future knowledge from feature sequences of each domain and thus automatically mine crucial mappings to improve AMC accuracy.
The survey indicates that current research not only focuses on the structure of LSTM but also focuses more on signal characterization or feature analysis methods. Compared to CNNs, although LSTM has powerful temporal analysis capabilities, it also brings high computational complexity and difficult convergence. Due to the inclusion of multiple gating mechanisms and cell states in LSTMs, the optimization of objective functions becomes more complex, especially for communication scenarios containing a large number of candidate modulation schemes. Distributed computing and model compression techniques are effective ways to reduce the computational resources and memory requirements of LSTM in AMC tasks. In addition, one of the challenges faced by LSTM is the difficulty in leveraging a priori knowledge to help improve AMC performance in the same way that CNNs are able to simultaneously process temporal signals and spectrograms by means of 1D convolution and 2D convolution. How to simultaneously incorporate the knowledge from multiple domains into the learning process of LSTM is a key research direction.
5.3. GNNs
In recent years, the mapping of temporal series into graphs has been investigated through techniques such as visibility graphs (VGs), which can capture relevant aspects of local and global dynamics simultaneously, enabling specialized GNNs to mine knowledge in time series and obtain latent graph features. A graph is composed of nodes and edges, where nodes represent objects and edges represent relationships between objects. During the learning process, the features of nodes and edges are both extracted, and the feature vector of the central node is updated by aggregating the information of neighboring nodes. Furthermore, the transfer function is designed to calculate the information transfer between nodes in order to better capture the structural information of the graph.
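For illustration, a single graph-convolution step of this aggregate-and-update scheme can be written compactly with dense matrices, as in the following sketch; the toy adjacency matrix and feature dimensions are illustrative, and practical GNN libraries use sparse operations instead.

```python
import torch

def gcn_layer(adj, feats, weight):
    """One graph-convolution step: symmetrically normalized neighborhood
    aggregation followed by a linear transform (H' = D^-1/2 (A+I) D^-1/2 H W)."""
    n = adj.size(0)
    a_hat = adj + torch.eye(n)                      # add self-loops
    deg = a_hat.sum(dim=1)
    d_inv_sqrt = torch.diag(deg.pow(-0.5))
    a_norm = d_inv_sqrt @ a_hat @ d_inv_sqrt        # normalized adjacency
    return torch.relu(a_norm @ feats @ weight)      # aggregate neighbors, then project

# Toy graph built from a signal (e.g., a visibility graph): 5 nodes, 8 features each.
adj = torch.tensor([[0, 1, 1, 0, 0],
                    [1, 0, 1, 0, 0],
                    [1, 1, 0, 1, 0],
                    [0, 0, 1, 0, 1],
                    [0, 0, 0, 1, 0]], dtype=torch.float32)
feats = torch.randn(5, 8)
weight = torch.randn(8, 16)
print(gcn_layer(adj, feats, weight).shape)          # torch.Size([5, 16])
```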
In [50], I/Q signal data are mapped into two graphs, respectively, and then, a typical GNN model is designed to process the graphs. After concatenating the feature vectors extracted from the two graphs, the classification results can be obtained through a fully connected layer. In [51], each node in the GNN sends messages to its adjacent nodes and merges the messages received from its adjacent nodes. Then, an activation function is followed to enhance the ability of the GNN to fit the data distribution. Moreover, the softmax function is applied on each row of the adjacency matrices to normalize the input data. To take the relationships between multidomain features and deep features into account, Yao et al. [52] built a graph of multidomain features and deep features and classified the modulation types using the output of the GCN by way of a fully connected layer.
Under the same scale of parameters, the computational complexity of GNNs is higher than that of CNNs and LSTMs, especially when dealing with large-scale graphs. This is because GNNs need to extract and update features for each node and edge, and the computational effort grows quadratically with the size of the graph. GNNs also face the oversmoothing problem, which means that node representations become increasingly similar as the number of network layers increases. The oversmoothing leads to a decrease in the performance of GNNs when processing graphs with complex topological structures. The main advantage of GNNs lies in capturing the local structures and features of the graph, so they lack perception of global structural information. In practice, many graph structures are dynamic, i.e., the topology of the graph and the edges between nodes are constantly changing. However, most GNNs are designed for static graphs and are difficult to adapt to changes in dynamic graphs. Since GNNs are highly dependent on the structure and features of the graph, the generalization performance of GNNs may drop dramatically when the graph data change. How to design suitable GNNs that can cope with dynamic graphs is an open problem.
5.4. Transformers
The transformer model consists of an input embedding layer, positional encoding layer, self-attention, multihead attention, point-wise feedforward layer, normalization layer, and output layer. The input embedding layer converts the input modulation signal into a word vector representation. The positional encoding adds the position information to each word vector, so that the model can understand the order relationship between words. The self-attention blocks calculate the association degree between each word and other words in a sequence to associate contextual information. By parallel stacking of self-attention layers, the multihead attention is used to capture different levels of contextual information and improves the model’s expressive ability. The feedforward layer added after each attention layer is adopted to integrate attention information. The normalization helps to stabilize the training process of transformer. The transformer model can more effectively capture long-term dependencies while avoiding the order dependency issue that exists in RNNs. Compared to the fixed convolutional kernels used in CNNs, the self-attention mechanism in the transformer can adaptively learn the weights from different positions, thereby obtaining more accurate feature representations. Moreover, the multihead attention mechanism helps to simultaneously learn contextual information at different levels, facilitating more efficient fusion of different features.
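A minimal PyTorch sketch of such an encoder-based classifier for I/Q sequences is given below; the embedding dimension, number of heads and layers, and the learned positional embedding are illustrative design choices rather than a reference implementation.

```python
import torch
import torch.nn as nn

class TransformerAMC(nn.Module):
    """Minimal transformer encoder classifier for I/Q sequences; dimensions are illustrative."""
    def __init__(self, num_classes=11, d_model=64, seq_len=128):
        super().__init__()
        self.embed = nn.Linear(2, d_model)                         # per-sample embedding
        self.pos = nn.Parameter(torch.zeros(1, seq_len, d_model))  # learned positional encoding
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=4,
                                           dim_feedforward=128, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, num_classes)

    def forward(self, x):                      # x: (batch, seq_len, 2)
        h = self.encoder(self.embed(x) + self.pos)
        return self.head(h.mean(dim=1))        # pool over time, then classify

model = TransformerAMC()
print(model(torch.randn(4, 128, 2)).shape)     # torch.Size([4, 11])
```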
Cai et al. [128] proposed a transformer network consisting of a linear projection layer, encoder, and multilayer perceptron. The I/Q signals are embedded into linear sequences in the linear projection layer. Then, an additional learnable classification token with position embedding is added before being fed into the encoder. The output of the encoder then serves as the input of the multilayer perceptron head, which consists of fully connected layers and dropout layers. The MCformer structure [53] leverages convolution layers along with self-attention-based encoders to efficiently exploit temporal correlation between the embeddings produced by the convolution layer. A frame-wise embedding-aided transformer (FEA-T) [129] is designed to extract the global correlation feature of the signal to obtain higher classification accuracy as well as lower time cost. To enhance the global modeling ability of the transformer, the frame-wise embedding module is used to aggregate more samples into a token in the embedding stage to generate a more efficient token sequence. A signal spatial transformer structure based on the attention mechanism [130] is developed to eliminate communication interference factors by a priori learning of signal structure, such as time offset, symbol rate, and clock recovery. In [131], the self-supervised contrastive pretraining of the transformer is performed with unlabeled signals, and time warping–based data augmentation is introduced to improve generalization ability. Then, the pretrained transformer model is fine-tuned with labeled signals, in which hierarchical learning rates are employed to ensure convergence.
Despite the introduction of positional encoding, the self-attention mechanism in the transformer model does not explicitly consider the positional information, which may be crucial for learning modulation knowledge. The transformer needs to adapt to complex parameters such as signal bandwidth and SNRs in the AMC task. The current studies have demonstrated the disadvantage of the transformer in terms of computational complexity, especially dealing with long-sequence data, which limits the application of transformer model on edge devices with limited computational resources.
5.5. Hybrid Models
To break through the limitations of a single deep learning model and fully utilize the advantages of various structures, hybrid model structures [56–59, 132–136] have been designed for AMC. Actually, deep learning can flexibly select and combine different types of layers to improve AMC accuracy. For example, the number and configuration of layers can be adjusted according to the complexity of the actual AMC tasks. Compared to the single deep learning model, hybrid models can extract features from multiple perspectives to better cope with various disturbances and noises.
Zhang et al. [56] proposed to solve the AMC task using a dual-stream structure combining the advantages of CNN and LSTM, which efficiently explores the feature interaction and the spatial–temporal property of raw I/Q signals. Ke et al. [57] presented a deep learning framework based on an LSTM denoising autoencoder (AE) to automatically extract stable and robust features from noisy radio signals and infer modulation schemes from the learned features. Wang et al. [132] proposed a hybrid CNN–transformer–GNN (CTGNet) for AMC to uncover complex representations in radio signals. In [134], a hybrid feature extraction CNN combined with the channel attention mechanism is introduced for AMC. In [135], the ResNeXt model is utilized to extract the distinctive semantic features of the signal, and the GRU is employed to extract the time-series features.
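As a minimal sketch of the hybrid idea (not the exact architecture of [56]), the following PyTorch model fuses a convolutional branch and an LSTM branch over raw I/Q input; all layer sizes are illustrative assumptions.

```python
# Minimal dual-stream sketch: a CNN branch for local patterns and an LSTM branch
# for temporal dependencies, fused by a linear classifier (sizes are illustrative).
import torch
import torch.nn as nn

class CNNLSTMHybrid(nn.Module):
    def __init__(self, n_classes=11):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv1d(2, 32, kernel_size=7, padding=3), nn.ReLU(),
            nn.Conv1d(32, 32, kernel_size=7, padding=3), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.lstm = nn.LSTM(input_size=2, hidden_size=32, batch_first=True)
        self.fc = nn.Linear(32 + 32, n_classes)

    def forward(self, x):                               # x: (batch, 2, length)
        spatial = self.cnn(x).squeeze(-1)               # (batch, 32)
        _, (h, _) = self.lstm(x.transpose(1, 2))        # final hidden state
        temporal = h[-1]                                # (batch, 32)
        return self.fc(torch.cat([spatial, temporal], dim=1))

print(CNNLSTMHybrid()(torch.randn(4, 2, 128)).shape)    # torch.Size([4, 11])
```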
Although the hybrid models based on specific rules perform better than the single model, they usually need to carefully adjust the configuration of each network layer and parameter. The complex network structure implies high computational complexity, and the convergence of the model’s objective function is difficult to guarantee. In addition, the internal working mechanism of hybrid models is usually more complex than a single deep learning model, resulting in poor interpretability, which affects the application in practical communication scenarios. At present, a series of methods such as feature visualization [58], module decomposition [59], knowledge distillation [136], and attribution analysis [137] have been gradually developed to dissect the working mechanism of deep learning models similar to “black boxes.”
5.6. Optimization and Generalization Methods
In the AMC task, optimization aims to find the most suitable parameters so that the model can accurately classify the modulation schemes. The selection of optimization algorithms and hyperparameter settings are key aspects of training deep learning models. SGD based on error backpropagation is one of the most popular algorithms. A series of SGD variants and training schemes, such as momentum optimization [78] and federated learning [96], have also been developed for global optimization problems with highly nonconvex objective functions. The momentum optimization method accelerates gradient descent by accumulating historical gradient information, which helps to escape local minima; it increases optimization speed but may lead to oscillating results. Federated learning is more demanding in terms of information transfer between agents.
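As a brief illustration, the following sketch trains a placeholder classifier with momentum SGD; the learning rate and momentum values are illustrative rather than taken from a surveyed paper.

```python
# Sketch: SGD with momentum on a placeholder AMC classifier (values are illustrative).
import torch

model = torch.nn.Linear(256, 11)                       # placeholder classifier
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2, momentum=0.9)
criterion = torch.nn.CrossEntropyLoss()

x, y = torch.randn(32, 256), torch.randint(0, 11, (32,))
for step in range(100):
    optimizer.zero_grad()
    loss = criterion(model(x), y)
    loss.backward()                                    # backpropagate the error
    optimizer.step()                                   # v <- mu*v + g;  w <- w - lr*v
```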
Generalization refers to the ability of the model to apply knowledge learned from a training set to unseen test samples, and good generalization is critical to the success of the model in real-world applications. In the AMC task, generalization is reflected in the model's ability to accurately identify the modulation scheme of a signal across various communication conditions, such as frequency offset, phase offset, and noise. Currently, constraints on model structural complexity based on the sparsity principle, such as L2 regularization and dropout, are commonly used to curb overfitting [138]. In addition, as one of the representative regularization techniques, data augmentation aims to improve generalization by increasing the number and diversity of training samples [139, 140]. Based on the characteristics of modulated signals, three types of data augmentation are considered in [141], i.e., rotation, flip, and Gaussian noise interference, which are applied in both the training and testing stages of deep learning–based classifiers. GANs [142] have also been used to learn the features and distribution of radio signals to help expand the input space and improve generalization. For example, Tang et al. [143] proposed a programmatic data augmentation method using auxiliary classifier generative adversarial networks (ACGANs).
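A minimal sketch of the three augmentations described in [141] is given below; the rotation angle, flip axis, and noise power are illustrative assumptions.

```python
# Sketch of rotation, flip, and Gaussian noise augmentations for I/Q signals;
# parameter choices are illustrative, not the exact settings of [141].
import numpy as np

def rotate(iq, angle):
    """Rotate the constellation: iq has shape (2, N) holding the I and Q rows."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s], [s, c]]) @ iq

def flip(iq, axis):
    """Flip the I (axis=0) or Q (axis=1) component."""
    out = iq.copy()
    out[axis] = -out[axis]
    return out

def add_gaussian_noise(iq, snr_db):
    """Add white Gaussian noise at a target SNR relative to the average sample power."""
    p_sig = np.mean(iq ** 2)
    p_noise = p_sig / (10 ** (snr_db / 10))
    return iq + np.random.normal(0.0, np.sqrt(p_noise), iq.shape)

iq = np.random.randn(2, 128)
augmented = [rotate(iq, np.pi / 2), flip(iq, 0), add_gaussian_noise(iq, 10)]
```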
6. AMC Performance Evaluation
6.1. Experimental Datasets
At present, two popular public datasets, i.e., RadioML 2018.01A [144] and RadioML 2016.10A [145], are mainly used for evaluating the AMC performance of various methods. RadioML 2018.01A [144] contains a total of 2,000,000 I/Q signals of size 2 × 1024 covering 24 candidate modulation schemes {OOK, 4ASK, 8ASK, BPSK, QPSK, 8PSK, 16PSK, 32PSK, 16APSK, 32APSK, 64APSK, 128APSK, 16QAM, 32QAM, 64QAM, 128QAM, 256QAM, AM-SSB-WC, AM-SSB-SC, AM-DSB-WC, AM-DSB-SC, FM, GMSK, OQPSK}. RadioML 2016.10A [145] contains 220,000 signals of size 2 × 128 with 11 candidate modulation schemes, including BPSK, QPSK, 8PSK, 16QAM, 64QAM, GFSK, CPFSK, 4PAM, AM-DSB, AM-SSB, and WBFM. All the modulated signals are set to a 1-MHz bandwidth with SNRs ranging from −20 dB to +18 dB in steps of 2 dB and are uniformly distributed across categories.
There are also some publicly available datasets with more complex communication conditions that deserve further research, such as RDL 2021.12 [39], RadioML 2016.10B [41], and CSPB.ML.2023G1 [146]. Specifically, CSPB.ML.2023G1 is a challenge dataset with cochannel interference and frequency offset, containing the candidate modulation schemes {BPSK, QPSK, 8PSK, 4QAM, 16QAM, 64QAM, SQPSK, MSK, GMSK}. In studies tailored to specific wireless communication considerations, other candidate modulation schemes have been simulated to verify the effectiveness of AMC methods. For example, {4FSK, 8FSK, 16FSK} transmitted through the Rayleigh channel is considered in [82], and the fine-grained classification of high-order intraclass modulation schemes such as 128QAM and 256QAM is studied in [54].
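For illustration, the following sketch generates a simplified 2 × 128 QPSK I/Q sample with AWGN at a chosen SNR, mirroring the sample format of the public datasets; it omits pulse shaping, frequency offset, and other impairments present in the real datasets.

```python
# Sketch: a simplified 2 x 128 QPSK sample with AWGN (one sample per symbol,
# no pulse shaping or channel impairments).
import numpy as np

def qpsk_iq(n_samples=128, snr_db=10):
    bits = np.random.randint(0, 4, n_samples)
    symbols = np.exp(1j * (np.pi / 4 + np.pi / 2 * bits))      # unit-power QPSK symbols
    noise_power = 10 ** (-snr_db / 10)
    noise = np.sqrt(noise_power / 2) * (np.random.randn(n_samples)
                                        + 1j * np.random.randn(n_samples))
    rx = symbols + noise
    return np.stack([rx.real, rx.imag])                         # shape (2, n_samples)

sample = qpsk_iq(snr_db=0)
```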
6.2. Performance Evaluation
In Table 1, we summarize the state-of-the-art AMC performance of various methods on different datasets. It can be seen that most of the work is based on public datasets and considers only AWGN interference, while part of the work studies the classification of commonly used modulation schemes under specific communication conditions. In general, most methods achieve an AMC accuracy above 85% when SNR > 10 dB, whereas below −16 dB most methods perform close to random classification. In wireless communication systems, the position of users may change at any time, leading to fluctuations in SNR and greatly increasing the difficulty of AMC. Few works consider the effects caused by the actual communication channel, such as Doppler shift [42]. Therefore, the influence of complex noise or demanding wireless communication conditions on AMC performance, such as mixed noise [39, 99] and composite channels [67], needs to be further investigated.
Table 1: State-of-the-art AMC performance of various methods on different datasets.

| Reference | Method | Channel | Candidate modulation | SNR (dB) | Accuracy (%) | Complexity (ms) |
|---|---|---|---|---|---|---|
| Chen et al. 2019 [30] | Maximum-likelihood | Non-AWGN | BPSK, QPSK, 8PSK, and 16QAM | −3:2:25 | > 95 at 11 dB | — |
| Li et al. 2021 [35] | 4th cumulant + LB | AWGN | BPSK, QPSK, 8PSK, and 16QAM | −10:2:20 | > 95 at 10 dB | — |
| Yan et al. 2020 [99] | Cyclic spectrum | Non-AWGN | BPSK, QPSK, OQPSK, 2FSK, 4FSK, and MSK | −10:1:20 | — | — |
| Luan et al. 2022 [39] | DT | Non-AWGN | RDL 2021.12 | −20:2:20 | > 85 at 10 dB | — |
| Lin et al. 2018 [37] | SVM | AWGN | 4QAM, 16QAM, and 64QAM | 5:1:30 | 100 at all SNR | — |
| Huynh et al. 2020 [40] | MCNet | AWGN | RadioML 2018.01A | −20:2:18 | > 93 at 10 dB | 0.131 |
| Hermawan et al. 2020 [41] | IC-AMCNet | AWGN | RadioML 2016.10B | −20:2:18 | 91.7 at 10 dB | 0.29 |
| Zhang et al. 2020 [42] | CNN | Doppler shift: 5 Hz, 50 Hz, 100 Hz | MD 2020 | −6:4:30 | — | 0.402 |
| Zheng et al. 2021 [118] | CNN | AWGN | RadioML 2016.10A | −20:2:18 | 58.8 | 0.3 |
| Wang et al. 2020 [43] | DRCN | AWGN | BPSK, QPSK, 8PSK, and 16QAM | −10:2:10 | 100 at > 0 dB | — |
| Zheng et al. 2022 [82] | — | Rayleigh + AWGN | OOK, 4ASK, 8ASK, 16ASK, BFSK, 4FSK, 8FSK, 16FSK, BPSK, 4PSK, 8PSK, 16PSK, 8QAM, 16QAM, 32QAM, and 64QAM | −20:2:18 | — | — |
| Hou et al. 2022 [44] | Complex ResNet | AWGN | ASK, QPSK, 2FSK, OFDM, SSB, DSB, AM, and FM | −6:2:10 | 98.6 at 0 dB | 0.328 |
| Zheng et al. 2023 [109] | — | AWGN | RadioML 2016.10A | −20:2:18 | — | — |
| Chen et al. 2020 [45] | LSTM | AWGN | RadioML 2016.10A | −20:2:18 | 94.1 at 18 dB | 1.08 |
| Shi et al. 2022 [46] | LSTM-AE | AWGN | RadioML 2016.10A | −20:2:18 | 94.5 at 18 dB | 0.004 |
| Zang et al. 2020 [49] | 2-layer LSTM | AWGN | RadioML 2016.10A | 0:2:18 | 90.5 | — |
| Xuan et al. 2022 [50] | — | AWGN | RadioML 2016.10A | −20:2:18 | — | — |
| Liu et al. 2020 [51] | GCN | — | 2ASK, 4ASK, 2FSK, 4FSK, BPSK, QPSK, 16QAM, and 64QAM | −14:2:10 | < 80 at 10 dB | 0.78 |
| Cai et al. 2022 [128] | Transformer | AWGN | — | — | — | — |
| Zhang et al. 2022 [54] | Spatial transformer | AWGN | BPSK, QPSK, 8PSK, 16QAM, 32QAM, 64QAM, 128QAM, and 256QAM | 0:1:23 | 96.6 | 5.729 |
| Zheng et al. 2023 [7] | — | AWGN | — | −20:2:18 | — | — |
| Tang et al. 2018 [143] | GCN + CNN | — | BPSK, 4ASK, QPSK, OQPSK, 8PSK, 16QAM, 32QAM, and 64QAM | −6:2:14 | 95 at −2 dB | — |
| Li et al. 2023 [136] | ResNeXt-GRU | AWGN | RadioML 2018.01A | −20:2:30 | > 90 at > 10 dB | 0.21 |
| Zhou et al. 2022 [123] | Few-shot CNN | AWGN | RadioML 2016.10A | −20:2:18 | > 80 at > 0 dB | — |
| Khan et al. 2021 [138] | 3D CNN | AWGN | BPSK, QPSK, 16QAM, and 64QAM | — | 96.97 | — |
| Wang et al. 2021 [67] | Multiple CNN | Non-AWGN | FSK, PSK, and QAM | −5:1:5 | — | — |
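For reference, the accuracy-versus-SNR curves used in comparisons such as Table 1 can be computed with a sketch like the following; the model, data tensors, and SNR labels are placeholders.

```python
# Sketch: per-SNR classification accuracy for a trained AMC model.
import numpy as np
import torch

@torch.no_grad()
def accuracy_per_snr(model, signals, labels, snrs):
    """signals: (N, 2, L) tensor; labels: (N,) tensor of class indices;
    snrs: NumPy array of per-sample SNRs in dB."""
    model.eval()
    preds = model(signals).argmax(dim=1)
    results = {}
    for snr in np.unique(snrs):
        mask = torch.as_tensor(snrs == snr)
        results[float(snr)] = (preds[mask] == labels[mask]).float().mean().item()
    return results            # e.g. {-20.0: 0.09, ..., 18.0: 0.93}
```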
In addition to modulation classification accuracy, the inference efficiency of the model determines whether it can be applied in real-time wireless communication. In terms of model complexity and inference speed, most AMC methods can meet real-time requirements when deployed and tested on workstations with powerful computing hardware; most models complete inference within 1 ms, as the signal length is usually within 2 × 2048. Work based on spectral maps suffers lower inference efficiency [82], since the number of pixels in an image is much larger than the number of samples in the temporal signal. Only a few studies, such as [7, 147], have tested the computational efficiency of AMC methods on edge devices with limited power and memory, such as the NVIDIA Jetson Nano.
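A simple way to obtain per-sample inference latency figures comparable to the complexity column of Table 1 is sketched below; it times CPU forward passes, and on a GPU explicit synchronization around the timers would be required.

```python
# Sketch: average single-sample inference latency in milliseconds (CPU timing).
import time
import torch

def mean_latency_ms(model, input_shape=(1, 2, 1024), n_runs=100):
    model.eval()
    x = torch.randn(*input_shape)
    with torch.no_grad():
        for _ in range(10):                     # warm-up runs
            model(x)
        start = time.perf_counter()
        for _ in range(n_runs):
            model(x)
    return (time.perf_counter() - start) / n_runs * 1000.0
```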
6.3. Robustness Analysis
In practical wireless communication applications, the rapid mobility of communication terminals places high demands on the robustness of AMC methods. The model is required to be robust to variations in signal characteristics, channel types, and communication scenarios; otherwise, performance drops sharply in deployment. Li et al. [35] considered the impact of the power allocation ratio between the far and near users on AMC accuracy under different SNRs. Luo et al. [148] verified the effectiveness of the AMC method under different channel models. Li et al. [149] and Thameur et al. [150] observed the effect of signal length (from 128 to 4096) on the application of a sparse filtering CNN and a fully connected neural network to the AMC task, respectively. Lin et al. [151] discussed the role of denoising through moving-average and Gaussian filters in enhancing the expressiveness of the signal.
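A minimal sketch of the moving-average denoising discussed in [151] is shown below, applied separately to the I and Q rows; the window length is an illustrative choice.

```python
# Sketch: moving-average smoothing of an I/Q signal (window length is illustrative).
import numpy as np

def moving_average_denoise(iq, window=5):
    kernel = np.ones(window) / window
    return np.vstack([np.convolve(row, kernel, mode="same") for row in iq])

noisy = np.random.randn(2, 128)
smoothed = moving_average_denoise(noisy, window=7)
```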
The setting of hyperparameters during training is another aspect that needs to be considered, such as learning rate, batch size, weight decay, and momentum. Currently, most studies set hyperparameters empirically. Hyperparameter optimization methods such as grid search are difficult to apply to large-scale models, especially deep learning models, due to the expensive training cost. In addition, the impact of hyperparameter choices in the training and testing stages on AMC lacks qualitative analysis, and only some work [109, 118] has observed the impact on AMC accuracy through experiments.
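For small models, a basic grid search can still be practical; the sketch below assumes a user-supplied train_and_validate routine and illustrative hyperparameter grids.

```python
# Sketch: grid search over learning rate and batch size; train_and_validate is a
# placeholder for the user's own training routine returning validation accuracy.
from itertools import product

def grid_search(train_and_validate, lrs=(1e-2, 1e-3, 1e-4), batch_sizes=(64, 128, 256)):
    best = None
    for lr, bs in product(lrs, batch_sizes):
        acc = train_and_validate(lr=lr, batch_size=bs)
        if best is None or acc > best[0]:
            best = (acc, {"lr": lr, "batch_size": bs})
    return best
```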
7. Challenges and Future Directions
Although AMC has made a series of advances, it still faces many theoretical and practical challenges. Some key research directions are summarized in Figure 10.

The signal modality or representation directly determines the upper bound of the model's AMC accuracy, as the implicit expert knowledge it carries is the key to establishing the input–output mapping. A large number of signal representations are currently being studied for learning the expert knowledge behind each modulation scheme, with the main research direction being the efficient fusion of representations [152, 153]. Establishing the correlation between expert knowledge and the model learning process, and creating a knowledge base by means of knowledge graphs [154], can help to further solve AMC problems.
Current research focuses on blind AMC, which assumes that the parameters of the received signal, such as SNR, symbol rate, and channel gain, are unknown, which increases the difficulty of AMC. If prior information such as SNR estimates, carrier frequency compensation, and clock synchronization can be introduced to guide model learning, it is beneficial for extracting discriminative and robust features. Frequency-domain analysis–based methods and wavelet ridge–based techniques can be utilized for carrier frequency estimation [155]. However, further exploration is needed to introduce prior information into the learning process of the model. Adding prior information as a penalty term to the objective function [156] for joint optimization is a feasible approach, but the corresponding impact mechanism is still unclear and needs further study.
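One possible (hypothetical) realization of such a penalty term is sketched below, where a prior distribution over modulation classes regularizes the cross-entropy objective; both the prior and the penalty weight are assumptions, not the formulation of [156].

```python
# Sketch: cross-entropy plus a KL penalty toward a prior over modulation classes
# (e.g., a coarse prior derived from SNR estimation); weight is an assumption.
import torch
import torch.nn.functional as F

def loss_with_prior(logits, labels, prior_probs, weight=0.1):
    ce = F.cross_entropy(logits, labels)
    kl = F.kl_div(F.log_softmax(logits, dim=1), prior_probs, reduction="batchmean")
    return ce + weight * kl
```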
In addition, the generalization ability of AMC models across various communication scenarios directly determines their practical usability, so regularization methods such as data augmentation [137, 157] are key research directions. However, current data augmentation algorithms focus on expanding the number of samples and overlook theoretical guidance, and the impact of regularization techniques on the optimization of the model's objective function lacks an intuitive explanation, resulting in hyperparameters being set mainly by experience. When training deep learning models, techniques such as regularization and pruning can reduce the risk of overfitting, and reasonable model initialization and learning rate adjustment can also make the model more adaptable to new data. Moreover, by transferring knowledge from one domain to another, the generalization performance in the target domain can be improved; transfer learning can be achieved through fine-tuning, pretraining, and other methods.
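A minimal transfer learning sketch is given below, freezing a pretrained feature extractor and retraining only the classification head; the attribute names features and head are illustrative.

```python
# Sketch: fine-tuning a pretrained AMC model on a new condition by freezing the
# feature extractor and retraining only the classification head.
import torch

def prepare_finetune(model, lr_head=1e-3):
    for p in model.features.parameters():
        p.requires_grad = False                    # keep pretrained features fixed
    return torch.optim.Adam(model.head.parameters(), lr=lr_head)
```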
Another factor that affects the actual usability of AMC models is model complexity, which affects the deployment and inference of the model at the edge devices. In addition to some lightweight model structures being developed, some structural search algorithms [158] and redundant weight pruning strategies [147, 159] have also been studied. More studies are attempting to apply machine learning models to complete the AMC task in the field programmable gate array (FPGA) and embedded devices, especially in the wireless communication process with variable communication environments.
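As an illustration of weight pruning for edge deployment, the sketch below applies magnitude-based (L1) unstructured pruning to the convolutional and linear layers of a trained model; the 30% sparsity level is an arbitrary example.

```python
# Sketch: L1-magnitude unstructured pruning with torch.nn.utils.prune before
# deploying a trained AMC model on a resource-constrained device.
import torch
import torch.nn.utils.prune as prune

def prune_linear_and_conv(model, amount=0.3):
    for module in model.modules():
        if isinstance(module, (torch.nn.Conv1d, torch.nn.Linear)):
            prune.l1_unstructured(module, name="weight", amount=amount)
            prune.remove(module, "weight")         # make the pruning permanent
    return model
```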
In the future, AMC will gradually be applied to more complex communication scenarios, such as high-speed mobile communication. AMC can also be applied to the security monitoring and protection of mobile communication networks: by recognizing the various modulation schemes present in mobile communication, malicious intrusions, interference signals, and other security threats can be detected and prevented, improving the security and stability of the network. In the IoT, a large number of smart devices require stable wireless communication, and AMC can be used to manage and monitor these devices. By identifying the modulation scheme of the signal sent by a device, device recognition, tracking, and management can be performed, improving the intelligence level of the IoT.
8. Conclusions
A comprehensive survey of AMC based on Lb, Fb, and deep learning methods has been presented in this study, including key technologies, performance comparisons, advantages, and future key development directions. The Lb AMC methods rely strictly on the estimation of modulation parameters and usually require manual setting of multiple hyperparameters, such as search range and iteration number, which have a significant impact on AMC accuracy. The Fb methods require designing features and classifiers tailored to specific modulation schemes and rely on feature engineering to improve AMC accuracy. End-to-end deep learning models have shown significant advantages in AMC tasks, achieving superior performance without artificial feature design. Numerous studies have examined key factors including suitable signal representations, model structures, and optimization methods, and have considered various communication environments, including frequency offset, phase offset, and timing error, as well as signal parameters such as signal length and symbol rate. In terms of AMC performance, most methods have been verified on public or simulated datasets and achieve an accuracy of over 80% when SNR > 10 dB.
Although a series of progress has been made, more attention will be paid in the future to achieving fine-grained recognition of higher order intraclass modulation schemes on edge devices. Deep learning has shown great potential for AMC, so future work will focus on more suitable deep learning models and the corresponding optimization and generalization algorithms. Considering the adaptability and robustness of deep learning, AMC under various communication conditions, such as different channel parameters, will also be further studied. In addition, lightweight deep learning is a key research direction, as lightweight models are easier to deploy on edge devices.
Conflicts of Interest
The authors declare no conflicts of interest.
Author Contributions
Conceptualization: X.T. and Q.Z.; methodology: Q.Z., S.S., and A.E.; validation: Q.Z. and X.T.; data curation: X.T.; writing–original draft preparation: Q.Z.; writing–review and editing: L.Y. and X.T.; supervision: Q.Z.
Funding
This research was supported by Shandong Provincial Natural Science Foundation, Grant no. ZR2023QF125 and Program for Young Innovative Research Team in Higher Education of Shandong Province, Grant no. 2024KJH005.
Open Research
Data Availability Statement
No new data were created or analyzed in this study.