Volume 2025, Issue 1 9961530

Research Article

Open Access

Multitask Vehicle Signal Recognition With Dual-Speed Adaptive Weighting

Dianjing Cheng

orcid.org/0009-0000-1877-9620

Shanxi Key Laboratory of Big Data Analysis and Parallel Computing , Taiyuan University of Science and Technology , Taiyuan , 030024 , Shanxi, China , tyust.edu.cn

Search for more papers by this author

Xiangyu Shi,

Xiangyu Shi

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation , Beijing Jiaotong University , Beijing , 100044 , China , njtu.edu.cn

Search for more papers by this author

Zhihua Cui,

Zhihua Cui

orcid.org/0000-0001-7388-3180

Shanxi Key Laboratory of Big Data Analysis and Parallel Computing , Taiyuan University of Science and Technology , Taiyuan , 030024 , Shanxi, China , tyust.edu.cn

Search for more papers by this author

Xingyu Wu,

Xingyu Wu

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation , Beijing Jiaotong University , Beijing , 100044 , China , njtu.edu.cn

Search for more papers by this author

Wenjia Niu,

Corresponding Author

Wenjia Niu

[email protected]

orcid.org/0000-0003-4706-4266

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation , Beijing Jiaotong University , Beijing , 100044 , China , njtu.edu.cn

Search for more papers by this author

Dianjing Cheng,

Dianjing Cheng

orcid.org/0009-0000-1877-9620

Shanxi Key Laboratory of Big Data Analysis and Parallel Computing , Taiyuan University of Science and Technology , Taiyuan , 030024 , Shanxi, China , tyust.edu.cn

Search for more papers by this author

Xiangyu Shi,

Xiangyu Shi

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation , Beijing Jiaotong University , Beijing , 100044 , China , njtu.edu.cn

Search for more papers by this author

Zhihua Cui,

Zhihua Cui

orcid.org/0000-0001-7388-3180

Shanxi Key Laboratory of Big Data Analysis and Parallel Computing , Taiyuan University of Science and Technology , Taiyuan , 030024 , Shanxi, China , tyust.edu.cn

Search for more papers by this author

Xingyu Wu,

Xingyu Wu

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation , Beijing Jiaotong University , Beijing , 100044 , China , njtu.edu.cn

Search for more papers by this author

Wenjia Niu,

Corresponding Author

Wenjia Niu

[email protected]

orcid.org/0000-0003-4706-4266

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation , Beijing Jiaotong University , Beijing , 100044 , China , njtu.edu.cn

Search for more papers by this author

First published: 14 April 2025

https://doi.org/10.1155/atr/9961530

Academic Editor: Gen Li

Share a link

Email
Wechat
Bluesky

Abstract

In mixed traffic environments, the accurate identification of vehicular devices’ modulation schemes, communication protocols, and emitter device information directly affects perception capabilities toward surrounding vehicles and infrastructure. However, existing studies predominantly focus on single-dimensional information analysis, resulting in limited completeness and accuracy in signal feature interpretation. This paper proposes a multitask learning framework (DSR-CNN-LSTM) for collaborative identification of this information. Furthermore, to mitigate task conflicts and noise interference, a dual-rate adaptive weight adjustment strategy is developed to optimize model performance through dynamic balancing of task learning rates and gradient update speeds. Experimental results demonstrate the superior performance of the DSR-CNN-LSTM framework in complex communication environments: Modulation recognition accuracy shows improvements of 20.67%, 10.38%, and 9.96% on three open-source datasets, while the weighted average recognition accuracy for communication protocols and emitter device information achieves enhancements of 45.52%, 72.21%, and 11.11%, respectively. The proposed model outperforms existing methods in both recognition precision and anti-interference capabilities, providing novel technical insights and solutions for the advancement of intelligent connected vehicle technologies.

1. Introduction

In recent years, the rapid advancement of intelligent driving technologies has established wireless signal recognition as a critical enabler for vehicle control, road perception, and intelligent traffic management [1]. While 5G and 6G technology [2]—with its high data transmission rates, low latency, and high reliability [3]—has driven substantial technological innovation and economic growth [4], it has simultaneously intensified spectrum interference and complicated electromagnetic environments.

A complex traffic environment can lead to signal attenuation and interference [5], and different signal recognition techniques are required to adapt to these variations and ensure stable communication. Modulation recognition [6], protocol adaptation [7], and emitter identification [8] in vehicular signals enable vehicles to decode and process signals from other vehicles or infrastructure in real time. Specifically, modulation recognition ensures accurate decoding of diverse vehicular signal modulation schemes. Protocol recognition facilitates seamless communication between autonomous and conventional or other autonomous vehicles. Emitter identification supports the precise behavioral prediction of surrounding traffic participants. Consequently, effective wireless signal recognition is critical for enhancing communication efficiency, enabling real-time information sharing [9], improving traffic flow, promoting road safety, and supporting the deployment of emerging technologies [10]. However, current research on multitask signal recognition (MTSR) mainly focuses on automatic modulation recognition and signal classification tasks [11], where limited task dimensions result in imbalanced resource utilization and constrained feature interpretation.

To address the limitations in current research, this paper proposes a CNN-LSTM-based MTSR framework that achieves joint analysis of modulation types, communication protocols, and emitter device information. Task segmentation enhances recognition professionalism and accuracy, optimizes resource allocation, and improves anti-interference capability. Furthermore, this paper introduces a novel adaptive weight adjustment strategy that dynamically coordinates task weights through both learning rates and gradient update velocities. This approach alleviates task conflicts and interference impacts on system performance, effectively mitigating task competition while enhancing recognition capabilities.

The main contributions of this paper can be summarized as follows:

1.
Expansion of task dimensions: This study integrates multiple dimensions, including modulation schemes at the physical layer, communication protocols at the link layer, and emitter information at the device layer, into a unified analytical framework. This approach mitigates the limitations in signal feature analysis caused by insufficient task coverage and significantly improves signal recognition accuracy and real-time performance in complex traffic environments.
2.
Optimization of task conflicts: To address performance degradation caused by task conflicts and feature competition, this study proposes an adaptive weight adjustment strategy that balances learning velocity and gradient. This method mitigates training instability induced by task interference, effectively enhancing robustness and generalization capability in complex noise environments.

The organization of this paper is as follows. Section 2 reviews related research in the field of signal recognition. Section 3 introduces the multitask learning (MTL)–based signal recognition approach and adaptive weight adjustment technique. Section 4 discusses the effectiveness and advancement of the proposed methods through experimental comparative analysis. Finally, Section 5 summarizes the main contributions of this research and suggests future research directions.

2. Related Work

Wireless signal recognition technology enables the classification and characterization of device-specific information through multidimensional technical approaches. High-precision identification mechanism can effectively analyze the vehicle signal characteristics, providing technical support for accurate spectrum awareness and dynamic resource management. Current technological evolution trends demonstrate that modern wireless signal recognition systems primarily manifest in the following technical branches:

Automatic modulation recognition (AMR) [12]: Modulation techniques directly affect the transmission efficiency and interference resistance of signals. As an intermediary step between signal detection and demodulation, AMR allows for identifying the modulation type of a signal without prior knowledge of the system parameters [12]. This improves demodulation quality and communication efficiency and optimizes the signal reception process [13]. Reference [6] applied two different CNN models for modulation recognition on the RadioML2016.10a and an image dataset, achieving accuracies of 53.65% and 94.39%, respectively. Reference [14] proposed an AMC method based on deep residual neural networks and mask modeling techniques, significantly enhancing modulation types’ recognition performance under limited signal samples, especially in low SNR ratio environments.
Communication protocol identification (CPI): Communication protocols define the data format, content, and timing of information in wireless systems, closely related to their data structures, transmission systems, data types, and application types [7]. Therefore, this technology ensures the correct parsing of routed data and proper handling by network devices by determining the specific communication standards or protocols the signal uses. Reference [7] utilized multidimensional convolutional neural networks to enhance the accuracy and efficiency of wireless protocol identification and introduced a spatial convolutional pooling neural network to improve the flexibility and accuracy of variable–length protocol recognition. Reference [15] employed an incremental learning strategy to adapt to IoT devices’ continuous growth and operational changes, performing exceptionally well with limited signal samples and in low SNR ratio environments.
Specific emitter identification (SEI): The unique characteristics produced by transmitting devices’ physical or operational properties are often referred to as the “fingerprint” of a signal. SEI utilizes these fingerprint features to identify and distinguish specific signal sources, making this technology widely applied in fields such as military, security surveillance, and spectrum management to prevent illegal access and interference. Reference [16] introduced a small-sample SEI method using self-supervised learning and adversarial enhancement, suitable for situations with scarce signal samples and limited labeling. Reference [8] proposed a semisupervised SEI method based on metric adversarial training, which effectively extracts distinctive and generalizable semantic features of wireless signals, enhancing the accuracy and robustness of emitter identification [17].
MTSR: Advanced technologies in the field of wireless signal recognition often focus on independently researching these subtasks. When data are scarce, models that are not fully trained may perform poorly and can be prone to overfitting on specific tasks. Through the sharing of knowledge between tasks, MTL enhances the model’s ability to learn from limited data while reducing the total amount of parameters and computational resources required. Reference [18] marks the first use of deep neural networks and an MTL framework in wireless communications to study AMR and SEI simultaneously. Reference [19] focuses on classifying modulation modes and communication protocols; the proposed MTL architecture benefits from the interrelationships between these two tasks, improving classification accuracy and learning efficiency.

Research indicates that conflicts and interference between tasks, resulting in negative correlations, can lead to performance degradation, especially in noisy environments with low SNR ratios, where conflicts are more likely to obscure helpful information in the signals. By optimizing weight allocation strategies, it is possible to balance resources and mitigate task conflicts effectively [20]. However, inappropriate strategies can reduce learning efficiency and result in resource wastage.

Therefore, this study proposes a CNN-LSTM-MTL framework that can identify the modulation modes, communication protocol, and specific emitters of signals and introduces an adaptive weight adjustment strategy that considers learning difficulty and speed. This strategy effectively reduces conflicts and interference between tasks, enhancing task recognition performance.

3. Materials and Methods

3.1. MTL Model Based on CNN-LSTM

As a machine learning paradigm, MTL learns more robust and general representations by sharing model structures and data across different tasks. This approach not only builds more robust models but also facilitates knowledge sharing between tasks, significantly enhancing the performance of each task while reducing the risk of overfitting [21].

In wireless communications, primary efforts have been focused on AMR and signal classification [18, 19]. However, vehicular signals inherently contain integrated information encompassing modulation schemes, protocol specifications, and device fingerprints. The existing insufficient task coverage may constrain feature analysis capabilities and impede comprehensive data interpretation. To address this research gap, this study integrates AMR, CPI, and SEI tasks, proposing a CNN-LSTM-based MTL framework. As illustrated in Figure 1, this model employs a hard parameter sharing strategy [22], optimizing the model structure by sharing lower level network parameters while retaining task-specific parameters in higher layers. This facilitates knowledge transfer between different tasks and enhances the model’s generalization ability [23].

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

CNN-LSTM multitask learning framework based on dual-speed regulator.

The model includes two shared layers based on CNN and LSTM, respectively. Considering the correlation of subtasks in frequency patterns and amplitude variations, the input signal is treated as two-dimensional time-frequency spectral data. The CNN’s convolution operations effectively capture the signal’s local features, serving as the standard foundational features for all tasks. AMR and CPI often require handling signals’ continuity and contextual relationships. LSTM, adept at capturing long-term dependencies, is chosen as the shared layer for these two subtasks to understand the temporal patterns of the signals better. Moreover, AMR and CPI extract the most relevant information from shared features through specific fully connected layers for classification prediction. For SEI, given its typically fixed spectral or waveform characteristics and relative stability in the signal’s spatial domain, independent of temporal changes, CNN is used to extract advanced features further, focusing on the unique signal details of these task samples.

By extracting spatial features through CNN and capturing temporal dependencies with LSTM, the model is better equipped to handle complex wireless signal data. Moreover, regularization techniques are employed to enhance the model’s performance on the test set to prevent performance degradation caused by overfitting.

3.2. Dual-Speed Regulator (DSR) Adaptive Weight Adjustment Strategy

In MTL research, there is a potential for conflicts between different tasks, where improving the model’s performance on one task may impair the performance of another task that has different requirements [22]. This phenomenon is known as negative transfer or destructive interference [24].

Researchers have explored various adaptive weight adjustment strategies to mitigate the effects of interference. Dynamic weight averaging (DWA) [25], for instance, assigns more attention to tasks that have recently underperformed by observing the relative improvement in task losses [26].

Algorithm 1: DSR adaptive weight adjustment algorithm.

Input:
N: The total number of tasks;
ω_t: The weight for task t;
τ: Temperature parameter;
L_t(i): The loss for task t at epoch i;
α: Sensitivity parameters for gradient norm adjustment;
g_t: The gradients for each task;
G_avg: The average gradient norm across tasks;
β: The hyperparameter controls the strength of the weight;
Output: The optimized parameters
1. Initialize network parameters;
2. for n = 1 to MaxEpochs do
3. for t = 1 toNdo
4. Forward propagate to get the output for task t;
5. Compute the rate of loss change for task t using equation (1);
6. Compute the gradient equilibrium index for task t using equation (2);
7. Compute the weight influence factor for task t using equation (3) (for optimal task prioritization based on difficulty and performance);
8. Compute the weight for task t using equation (5);
9. end for
10. Compute the weighted total loss as
11. Update model parameters by backpropagation;
12. end for

However, excessive focus on underperforming tasks may lead to performance degradation in tasks performing well due to instability over long-term training. The GradNorm strategy [27] adjusts the gradient magnitudes of task losses to ensure that all tasks converge at similar rates during training. However, it lacks sensitivity to changes in task difficulty, leading to excessive balancing among tasks. Considering both factors, we proposed a hybrid adaptive weight adjustment method called DSR, illustrated in Algorithm 1.

DWA defines the learning difficulty of each task through the rate of change in losses. Thus, as shown in equation (1), we define the loss impact factor in DSR as ΔL.

()

where L_t(i) represents the training loss of task t in the i^th iteration and τ is the temperature parameter. The temperature parameter, denoted as τ, moderates the rate of change in the loss impact factor ΔL. This parameter helps in smoothing the variations in loss updates to prevent drastic changes in the learning process. ΔL reflects changes in task difficulty, which influences the model’s focus on various tasks.

In contrast, GradNorm determines the learning rate based on changes in the gradients. The DSR strategy defines the gradient impact factor as ΔG, with its formula shown in equation (2).

()

where ω_t represents the current weight of task t, G_i is the current task’s gradient norm, G_avg is the average gradient, and α is the sensitivity parameter for gradient adjustment. ΔG reflects changes in task gradients, influencing the learning speed of each task. These norms are measured in the same units as the gradients of the loss function they derive from, providing a quantitative measure of how task-specific gradients deviate from the network-wide average.

To prevent the effect of one influencing factor from excessively offsetting another, thereby reducing the effectiveness of the overall optimization strategy, a more balanced method of weight integration is employed.

()

wheres(t) represents the loss change sensitivity, calculated as the current loss relative to the historical average change, and

denotes the average rate of change across all task losses.

Meanwhile, g(t) is defined as a measure of how sensitive the method is to gradient consistency, which identifies tasks whose gradient norms significantly deviate from the average by normalizing against the gradients of other tasks. Therefore, the dynamic weight adjustment factor γ(t) is shown in equation (4):

()

where β is a balancing parameter used to adjust the relative influence of these two strategies in the task weight updates. It allows for a balanced integration of these factors, ensuring that neither dominates the optimization process unduly. By linear combination, the relative importance of the loss and gradient influencing factors can be controlled. Thus, the DSR weight adjustment strategy is presented in equation (5):

()

3.3. CNN-LSTM-MTL Based on Adaptive Weight Fusion

This paper combines the designed CNN-LSTM model with the DSR adaptive weight adjustment strategy into a new MTSR architecture, DSR-CNN-LSTM, as illustrated in Figure 2.

Preprocessed signal data are passed through a convolutional shared layer to extract foundational features for all tasks. An LSTM shared layer, designed explicitly for AMR and CPI, extracts the temporal characteristics of the samples, while the SEI task further utilizes convolutional operations to analyze advanced spatial features. The intertask correlations may affect the learning outcomes. The DSR strategy uses losses and model gradients to calculate weights for each task, with each task computing its specific loss function. These weights are then processed and used to update network parameters through backpropagation. This flexible and efficient approach allows the network to continuously adjust and optimize during training, enabling different tasks to mutually enhance each other, increase the positive impacts among related tasks, and reduce interference from irrelevant or negative impact tasks.

4. Results and Discussion

4.1. Experimental Environment and Parameter Setting

This study was conducted on an Ubuntu 18.04 server configured with the Pytorch1.11.0 framework and equipped with an NVIDIA RTX 3090 graphics card featuring 24 GB of video RAM.

Hyperparameter search was performed using the Tree-structured Parzen Estimator (TPE) algorithm based on Bayesian optimization. We adopted TPE because it adaptively constructs a model of promising hyperparameter regions, enabling efficient exploration of high-dimensional search spaces and faster convergence to optimal settings compared to grid or random searches. The study ultimately employed the Adam gradient descent algorithm for training the network over 40 epochs—a duration determined empirically through preliminary tests indicating that most models tended to converge in fewer than 40 epochs, thus balancing computational cost and performance gains—integrating a cosine annealing weight decay strategy to optimize the learning process, with an initial learning rate set at 0.0001. This relatively small initial learning rate helps stabilize training, especially when dealing with multiple tasks simultaneously. Additionally, based on the TPE strategy, the temperature parameter in the previously mentioned DSR weight adjustment strategy was set to 2, the weight sensitivity parameter was set to 1.5, and the weight balance parameter was set to 0.5. These values were selected to fine-tune the balance between each subtask’s loss and to control the responsiveness to fluctuations in training progress. The temperature parameter determines the sharpness of the weight distribution, the weight sensitivity parameter adjusts how quickly weights shift with changes in loss, and the weight balance parameter sets the baseline ratio among tasks.

4.2. Dataset Settings

References [18, 19] utilize GNURadio to generate communication signal data at various SNRs for MTSR research. The RadComAWGN and RadComDynamic datasets have a sampling rate of 10 MS/s, simulating scenarios where propagation effects and hardware uncertainties affect signal propagation and waveforms. The SNR ranges from −20 dB to 18 dB, with increments of 2 dB. The RadComOTA dataset simulates more challenging over-the-air propagation scenarios, including additional noise and interference. Here, the receiver samples IQ data at a central frequency of 2.45 GHz and a rate of 10 MS/s, with an SNR range from 0 to 32 dB in 4 dB steps, containing over 8 million IQ samples each length of 128.

As shown in Table 1, the RadComAWGN dataset includes nine types of modulation and 11 signal categories. In contrast, the RadComDynamic and RadComOTA datasets share the same label and comprise six types of modulation and eight signal categories.

Table 1. RadComAWGN data classification.

RadComAWGN dataset
Modulation classes	Signal class
BPSK	STACOM

ASK	Short-range

AM-DSB	AM Radio
AM-SSB	AM Radio

GFSK	Bluetooth

DSSS-CCK	IEEE802.11bg

DSSS-OQPSK	IEEE802.15.4

FMCW	Radar-Altimeter

PCW	Airborne detection
	Airborne range
	Ground mapping
	Air-Ground-MTI

RadComDynamic and RadComOTA dataset

BPSK	STACOM

ASK	Short-range

AM-DSB	AM Radio
AM-SSB	AM Radio

FMCW	Radar-Altimeter

PCW	Airborne detection
	Airborne range
	Ground mapping
	Air-Ground-MTI

Building on these three datasets, we further subdivided them to identify communication signal modulation, communication protocols, and specific emitter classes. As shown in Table 2, after removing signals that are neither protocol nor emitter categories, the new RadComAWGN dataset contains eight modulations, five protocols, and five emitter categories. The new RadComDynamic and RadComOTA datasets include five modulations, two protocols, and five emitter categories. Signals such as STACOM, AM Radio, Bluetooth, IEEE802.11bg, and IEEE802.15.4 focus on data transmission and communication standards to ensure compatibility and communication efficiency between different devices and are categorized under protocol types. On the other hand, signals such as Radar-Altimeter, Airborne-detection, Airborne-range, Ground mapping, and Air-Ground-MTI are more focused on performing specific tasks, usually associated with specific equipment applications and often used for precise detection and measurement tasks, thus categorized under specific emitter types. Considering that our protocol and emitter identification tasks are subdivisions of the original dataset’s signal classification tasks, the protocol and emitter labels in the new dataset are mutually exclusive. When a category label is missing, we set it to null.

Table 2. Multitask signal classification after processing.

RadComAWGN dataset
Modulation classes	Protocol classes	Emitter classes
BPSK	STACOM	Radar-Altimeter

AM-DSB	AM Radio	Airborne detection
AM-SSB	AM Radio	Airborne detection

GFSK	Bluetooth	Airborne range
DSSS-CCK	Bluetooth	Airborne range

DSSS-OQPSK	IEEE802.11bg	Ground mapping
FMCW	IEEE802.11bg	Ground mapping

PCW	IEEE802.15.4	Air-Ground-MTI

RadComDynamic and RadComOTA dataset

BPSK	STACOM	Radar-Altimeter
AM-DSB	STACOM	Airborne detection

AM-SSB	AM radio	Airborne range
FMCW		Ground mapping
PCW		Air-Ground-MTI

Finally, the processed signal samples are divided into training, validation, and test sets in a 7 : 2 : 1 ratio. They are standardized to reduce bias, enhance model performance, and reshaped into complex (IQ) tensors of 128 × 2 to match the network input.

4.3. Model Comparison Evaluation

To identify the most suitable model structure for the proposed MTSR, we used samples with SNRs of −10, −2, and 8 dB from the RadComAWGN dataset as the test set to assess the recognition capabilities of various models under low, medium, and high SNR conditions.

As shown in Table 3, all models reached their accuracy limits under high SNR conditions. Since samples at medium and low SNRs contain more noise interference and present higher challenges in feature extraction, these conditions better test the models’ recognition abilities. Therefore, the results of model recognition at low and medium SNRs serve as the basis for model selection in this study.

Table 3. Recognition accuracy of each model on different tasks at SNR = 8 dB.

Model	AMR accuracy (%)	CPI accuracy (%)	SEI accuracy (%)
CNN	99.87	99.77	100
LSTM	99.61	99.51	100
SNN	99.87	99.77	100
ResNet18	99.87	99.77	100
CNN-LSTM	99.87	99.77	100
SNN-LSTM	99.87	99.77	100

Note: The bold values represent the optimal recognition performance in all models.

Different models exhibit their respective strengths and limitations, as illustrated in Tables 4 and 5. CNNs excel in recognizing protocols and emitters by automatically learning the hierarchical spatial features of the data, but they are less effective when dealing with complex or abstract modulation features. LSTMs are proficient in processing time series data, yet they are less effective than image-based methods in recognizing emitter characteristics. SNNs demonstrate balanced high performance across various tasks, with their spike-timing processing being susceptible to subtle changes in communication signals, making them suitable for handling diverse tasks. However, their performance is slightly less than ideal due to the nature of updating parameters through surrogate gradients. ResNet18’s hierarchical feature extraction is particularly effective in deciphering complex modulation patterns and communication protocols, but it lacks the capability to capture unique features related to emitters.

Table 4. Recognition accuracy of each model on different tasks at SNR = −2 dB.

Model	AMR accuracy (%)	CPI accuracy (%)	SEI accuracy (%)
CNN	92.34	99.48	91.15
LSTM	97.79	98.96	79.43
SNN	97.79	98.96	90.62
ResNet18	98.44	98.70	82.55
CNN-LSTM	99.35	99.74	91.93
SNN-LSTM	98.44	99.48	89.06

Note: The bold values represent the optimal recognition performance in all models.

Table 5. Recognition accuracy of each model on different tasks at SNR = −10 dB.

Model	AMR accuracy (%)	CPI accuracy (%)	SEI accuracy (%)
CNN	66.10	84.58	39.63
LSTM	64.16	80.46	24.15
SNN	62.86	84.58	37.01
ResNet18	64.94	83.80	30.71
CNN-LSTM	67.66	84.83	40.16
SNN-LSTM	67.40	84.01	31.76

Note: The bold values represent the optimal recognition performance in all models.

Considering the above performance, different models and techniques have advantages when processing different aspects of communication signals and can serve as a basis for network design. Therefore, we integrate the models’ strengths to design a hybrid model structure suitable for simultaneously handling AMR, CPI, and SEI tasks. Among these, the CNN-LSTM model, which combines the spatial feature extraction of CNNs with the temporal processing capabilities of LSTMs, performs best across all tasks. In contrast, the combination of SNN and LSTM, while leveraging the advantages of spike processing and temporal capture, does not perform as well overall as the CNN-LSTM model, which optimizes both spatial and temporal features.

4.4. Ablation Contrast Analysis

To verify the effectiveness of the proposed DSR adaptive task weight adjustment strategy, we conducted an ablation comparative analysis using the RadComAWGN dataset, setting the SNR range from −20 to 18 dB and the learning rate at 0.0001. In the CNN-LSTM-MTL model, the weights for all tasks were set to 1, while the DSR-CNN-LSTM-MTL model employed the DSR weight adjustment strategy.

Although the recognition performance of our designed CNN-LSTM model is already approaching a high level without using a weight adjustment strategy, achieving accuracy rates close to 100% when SNR > 2 dB, the DSR weight adjustment strategy considers both task learning and gradient optimization speeds. This ensures that the model does not overoptimize one task at the expense of others, thereby helping to balance the learning needs of different tasks in MTL.

As shown in Figure 3, the DSR-CNN-LSTM model applying the weight adjustment strategy demonstrates higher accuracy across all tasks at various SNR levels compared to the CNN-LSTM model without this strategy. Particularly in low SNR environments (SNR < −4 dB), this strategy significantly enhances the model’s robustness, with a rapid improvement in accuracy, whereas the improvement with CNN-LSTM is more gradual. Moreover, the dashed line represents the SNR value at which all models simultaneously reach their performance limits. The figure shows that the DSR-CNN-LSTM reaches its peak performance more quickly as the SNR increases, whereas the CNN-LSTM requires a higher SNR to achieve similar accuracy rates. These findings indicate that our proposed DSR weight adjustment strategy helps the model learn and adapt faster in complex signal environments and enhances the model’s ability to decode signals.

Figure 4 presents a performance comparison of the two models under the RadComAWGN dataset, where our proposed DSR-CNN-LSTM model consistently outperforms the L-CNN network across all SNR levels. Specifically, Figure 4(a) shows the recognition performance for the AMR task, where the DSR-CNN-LSTM reaches convergence at −2 dB compared to 4 dB for L-CNN, demonstrating the higher learning efficiency of our model. Furthermore, in low SNR conditions for the modulation recognition task, the model also exhibits better adaptability and robustness. Calculating the weighted average accuracy across different SNR levels, the AMR average performance improvement is 20.67% when SNR < 0 dB.

Due to our subdivision of the original experiment’s signal classification task into CPI and SEI tasks, we conducted a comparative analysis of these tasks together, as shown in Figure 4(b). The subdivided CPI and SEI subtasks maintain recognition performance at all SNR levels that are not inferior to L-CNN, with CPI exhibiting a high recognition accuracy under low SNR conditions and beginning to converge at SNR = −2 dB. Although SEI is more challenging regarding feature extraction, its recognition accuracy rapidly increases when SNR exceeds −10 dB. Both subtasks significantly surpass the performance of the L-CNN model in the combined signal classification task under the same SNR conditions, with a weighted average performance improvement of 45.52% for both tasks when SNR < 0 dB.

The RadComDynamic dataset simulates conditions where signals are affected by waveform and hardware effects, containing more interference and noise. Figure 5 presents a performance comparison of various tasks under this dataset. When the SNR ranges from −20 to −10 dB, the L-CNN model exhibits considerable performance fluctuations across tasks. In contrast, the DSR-CNN-LSTM model, while not showing significant performance improvements, demonstrates better noise suppression and signal recovery capabilities. As shown in Figure 5(a), when the SNR exceeds −10 dB, the performance of the DSR-CNN-LSTM model rapidly improves and converges at 4 dB, whereas L-CNN shows performance lag. The average performance improvement under conditions where SNR < 0 dB is 10.38%. Furthermore, the DSR-CNN-LSTM performs exceptionally well in the CPI task, as depicted in Figure 5(b), achieving higher accuracy under low SNR conditions and converging significantly faster than L-CNN. Although the SEI task shows a performance improvement trend similar to L-CNN, its accuracy performance is notably superior to that model. Compared to the original study’s signal classification task, the weighted average performance improvement for CPI and SEI under SNR < 0 dB conditions is 72.21%.

The RadComOTA dataset simulates over-the-air propagation conditions, presenting more significant challenges and practical relevance. The authors collected samples under varying signal strengths by adjusting the transmission gain. Figure 6 displays the recognition performance under this dataset. Since the SNR range for the RadComOTA dataset spans from 0 to 32 dB, in increments of 4, and the DSR-CNN-LSTM achieves higher recognition accuracy under high SNR conditions, Figure 6(b) shows that the accuracy for CPI and SEI tasks is close to 100% across all SNR levels, significantly outperforming L-CNN with an average performance improvement of 11.11%. In Figure 6(a), although the accuracy of DSR-CNN-LSTM is lower than that for other tasks, it still exceeds 90%, while L-CNN remains below 80%, yielding an average performance improvement of 9.96%.

Based on the above comparative analysis, our proposed model effectively integrates the spatial feature extraction capabilities of CNN with the temporal processing abilities of LSTM, combined with the adaptive weight adjustment strategy. This model demonstrates exceptional noise suppression and signal recovery capabilities under low SNR conditions. Additionally, it exhibits superior learning speed, achieving performance convergence more quickly, and its recognition accuracy is significantly higher than that of other models.

5. Conclusions

This paper proposes an MTL architecture, DSR-CNN-LSTM, for joint recognition of modulation schemes, communication protocols, and specific emitter characteristics. Meanwhile, we introduce a novel adaptive weight adjustment strategy, the dual-velocity regulator, that dynamically coordinates specific task learning velocities and gradient update rates to mitigate performance degradation caused by task conflicts. Comparative analysis shows that on RadComAWGN, RadComDynamic, and RadComOTA datasets, DSR-CNN-LSTM achieves average accuracy enhancements of 20.67%, 10.38%, and 9.96%, respectively, for AMR compared to state-of-the-art approaches. For CPI and SEI, the weighted average performance improvements over the signal classification task are 45.52%, 72.21%, and 11.11%, respectively. The proposed model outperforms other models across various SNR levels, demonstrating enhanced robustness and anti-interference capabilities, particularly in low signal-to-noise ratio environments. Through the synergistic sharing of task features, the model significantly improves the reliability and real-time performance of intervehicle communication, providing efficient and reliable technical support for the future development of intelligent transportation systems.

However, the relatively complex weight adjustment strategy increases the computational burden, thereby affecting the real-time analysis of signals. In future work, we will explore more lightweight model frameworks or conflict mitigation techniques to reduce computational overhead and better adapt to resource-constrained edge device scenarios.

Conflicts of Interest

The authors declare no conflicts of interest.

Author Contributions

Conceptualization, D.C. and X.S.; methodology, D.C. and X.S.; validation, D.C., X.S., and Z.C.; formal analysis, D.C., X.S., and Z.C.; investigation, D.C., X.S., and X.W.; resources, D.C., X.S., and X.W.; data curation, D.C., X.S., and X.W.; writing – original draft preparation, D.C. and X.S.; writing – review and editing, D.C., X.S., Z.C., and W.N.; visualization, D.C.; supervision, Z.C. and W.N.; project administration, Z.C. and W.N.; funding acquisition, W.N. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Central Funds Guiding the Local Science and Technology Development under Grant No. 236Z0806G, the National Natural Science Foundation of China under Grant No. 62372021, and the Open Competition Mechanism to Select the Best Candidates in Shijiazhuang, Hebei Province, China.

Open Research

Data Availability Statement

The data that support the findings of this study are openly available in Github: “RadarCommDataset” at https://doi.org/10.1016/j.comnet.2021.108441 [28].

References

1 Guevara L. and Auat Cheein F., The Role of 5G Technologies: Challenges in Smart Cities and Intelligent Transportation Systems, Sustainability. (2020) 12, no. 16, https://doi.org/10.3390/su12166469.
10.3390/su12166469
Web of Science® Google Scholar
2 Nguyen D. C., Ding M., Pathirana P. N. et al., 6G Internet of Things: A Comprehensive Survey, IEEE Internet of Things Journal. (2022) 9, no. 1, 359–383, https://doi.org/10.1109/JIOT.2021.3103320.
10.1109/JIOT.2021.3103320
Web of Science® Google Scholar
3 Wang W., Yang Y., Yang X. et al., A Negative Binomial Lindley Approach Considering Spatiotemporal Effects for Modeling Traffic Crash Frequency With Excess Zeros, Accident Analysis & Prevention. (November 2024) 207, https://doi.org/10.1016/j.aap.2024.107741.
10.1016/j.aap.2024.107741
Web of Science® Google Scholar
4 Arjoune Y. and Faruque S., Artificial Intelligence for 5G Wireless Systems: Opportunities, Challenges, and Future Research Directions, Proceedings of the 2020 10th Annual Computing and Communication Workshop and Conference (CCWC), 2020.
Google Scholar
5 Schmidt M., Block D., and Meier U., Wireless Interference Identification With Convolutional Neural Networks, Proceedings of the 2017 IEEE 15th International Conference on Industrial Informatics (INDIN), July 2017, 180–185, https://doi.org/10.1109/indin.2017.8104767, 2-s2.0-85041167973.
10.1109/indin.2017.8104767
Google Scholar
6 Mohsen S., Ali A. M., and Emam A., Automatic Modulation Recognition Using CNN Deep Learning Models, Multimedia Tools and Applications. (January 2024) 83, no. 3, 7035–7056, https://doi.org/10.1007/s11042-023-15814-y.
10.1007/s11042-023-15814-y
Web of Science® Google Scholar
7 Ren J., Li S., Song Y., and Li M., Deep Learning Based Identification Method for Signal-Level Wireless Protocol, IEEE Access. (2022) 10, 118187–118197, https://doi.org/10.1109/ACCESS.2022.3220769.
10.1109/ACCESS.2022.3220769
Web of Science® Google Scholar
8 Fu X., Peng Y., Liu Y. et al., Semi-Supervised Specific Emitter Identification Method Using Metric-Adversarial Training, IEEE Internet of Things Journal. (2023) 10, no. 12, 10778–10789, https://doi.org/10.1109/jiot.2023.3240242.
10.1109/JIOT.2023.3240242
Web of Science® Google Scholar
9 Osman R. A. and Abdelsalam A. K., A Novel Adaptive Approach for Autonomous Vehicle Based on Optimization Technique for Enhancing the Communication Between Autonomous Vehicle-To-Everything Through Cooperative Communication, Applied Sciences. (2021) 11, no. 19, https://doi.org/10.3390/app11199089.
10.3390/app11199089
Google Scholar
10 Malik S., Khan M. A., and El-Sayed H., Collaborative Autonomous Driving—A Survey of Solution Approaches and Future Challenges, Sensors. (2021) 21, no. 11, https://doi.org/10.3390/s21113783.
10.3390/s21113783
Web of Science® Google Scholar
11 Wang T., Yang G., Chen P., Xu Z., Jiang M., and Ye Q., A Survey of Applications of Deep Learning in Radio Signal Modulation Recognition, Applied Sciences. (November 2022) 12, no. 23, https://doi.org/10.3390/app122312052.
10.3390/app122312052
Google Scholar
12 Xiao W., Luo Z., and Hu Q., A Review of Research on Signal Modulation Recognition Based on Deep Learning, Electronics. (September 2022) 11, no. 17, https://doi.org/10.3390/electronics11172764.
10.3390/electronics11172764
Web of Science® Google Scholar
13 Zhang R., He C., Jing L., Zhou C., Long C., and Li J., A Modulation Recognition System for Underwater Acoustic Communication Signals Based on Higher-Order Cumulants and Deep Learning, JMSE. (August 2023) 11, no. 8, https://doi.org/10.3390/jmse11081632.
10.3390/jmse11081632
Google Scholar
14 Peng Y., Guo L., Yan J. et al., Automatic Modulation Classification Using Deep Residual Neural Network with Masked Modeling for Wireless Communications, Drones. (June 2023) 7, no. 6, https://doi.org/10.3390/drones7060390.
10.3390/drones7060390
Web of Science® Google Scholar
15 Liu Y., Wang J., Li J., Niu S., and Song H., Class-Incremental Learning for Wireless Device Identification in IoT, IEEE Internet of Things Journal. (December 2021) 8, no. 23, 17227–17235, https://doi.org/10.1109/JIOT.2021.3078407.
10.1109/JIOT.2021.3078407
Web of Science® Google Scholar
16 Liu C., Fu X., Wang Y. et al., Overcoming Data Limitations: A Few-Shot Specific Emitter Identification Method Using Self-Supervised Learning and Adversarial Augmentation, IEEE Transactions on Information Forensics and Security. (2024) 19, 500–513, https://doi.org/10.1109/TIFS.2023.3324394.
10.1109/TIFS.2023.3324394
Web of Science® Google Scholar
17 Gupta P., Jain P., and Kakde O. G., Deep Learning Techniques in Radar Emitter Identification, Defence Science Journal. (2023) 73, no. 5, 551–563, https://doi.org/10.14429/dsj.73.18319.
10.14429/dsj.73.18319
Web of Science® Google Scholar
18 Jagannath A. and Jagannath J., Multi-Task Learning Approach for Automatic Modulation and Wireless Signal Classification, Proceedings of the ICC 2021-IEEE International Conference on Communications, 2021, IEEE, 1–7.
Google Scholar
19 Jagannath A. and Jagannath J., Multi-Task Learning Approach for Modulation and Wireless Signal Classification for 5g and Beyond: Edge Deployment via Model Compression, Physical Communication. (2022) 54, https://doi.org/10.1016/j.phycom.2022.101793.
10.1016/j.phycom.2022.101793
Web of Science® Google Scholar
20 Kim E., Adaptive Dynamic Search for Multi-Task Learning, Applied Sciences. (November 2022) 12, no. 22, https://doi.org/10.3390/app122211836.
10.3390/app122211836
Google Scholar
21 Zhang Y. and Yang Q., A Survey on Multi-Task Learning, IEEE Transactions on Knowledge and Data Engineering. (December 2022) 34, no. 12, 5586–5609, https://doi.org/10.1109/TKDE.2021.3070203.
10.1109/TKDE.2021.3070203
Web of Science® Google Scholar
22 Wang H., Jin X., Du Y., Zhang N., and Hao H., Adaptive Hard Parameter Sharing Method Based on Multi-Task Deep Learning, Mathematics. (2023) 11, no. 22, https://doi.org/10.3390/math11224639.
10.3390/math11224639
Web of Science® Google Scholar
23 Vandenhende S., Georgoulis S., De Brabandere B., and Van Gool L., Branched Multi-Task Networks: Deciding What Layers to Share, 2020, https://arxiv.org/abs/1904.02920.
Google Scholar
24 Sener O. and Koltun V., Multi-Task Learning as Multi-Objective Optimization, Advances in Neural Information Processing Systems, 2018, Curran Associates, Inc, https://proceedings.neurips.cc/paper_files/paper/2018/hash/432aca3a1e345e339f35a30c8f65edce-Abstract.html.
Google Scholar
25 Liu S., Johns E., and Davison A. J., End-to-End Multi-Task Learning With Attention, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, 1871–1880, https://doi.org/10.1109/cvpr.2019.00197.
10.1109/cvpr.2019.00197
Google Scholar
26 Gong T., Lee T., Stephenson C. et al., A Comparison of Loss Weighting Strategies for Multi Task Learning in Deep Neural Networks, IEEE Access. (2019) 7, 141627–141632, https://doi.org/10.1109/ACCESS.2019.2943604.
10.1109/ACCESS.2019.2943604
Web of Science® Google Scholar
27 Chen Z., Badrinarayanan V., Lee C.-Y., and Rabinovich A., GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks, https://arxiv.org/abs/1711.02257.
Google Scholar
28 Jagannath A. and Jagannath J., Dataset for Modulation Classification and Signal Type Classification for Multi-Task and Single Task Learning, Computer Networks (Elsevier). (2021) 199, 1389–1286.
10.1016/j.comnet.2021.108441
Web of Science® Google Scholar

All articles

Multitask Vehicle Signal Recognition With Dual-Speed Adaptive Weighting

Abstract

1. Introduction

2. Related Work

3. Materials and Methods

3.1. MTL Model Based on CNN-LSTM

3.2. Dual-Speed Regulator (DSR) Adaptive Weight Adjustment Strategy

3.3. CNN-LSTM-MTL Based on Adaptive Weight Fusion

4. Results and Discussion

4.1. Experimental Environment and Parameter Setting

4.2. Dataset Settings

4.3. Model Comparison Evaluation

4.4. Ablation Contrast Analysis

5. Conclusions

Conflicts of Interest

Author Contributions

Funding

Open Research

Data Availability Statement

References

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley