Journal of Cellular and Molecular Medicine

ORIGINAL ARTICLE

Open Access

Identification of biomarkers for abdominal aortic aneurysm in Behçet's disease via mendelian randomization and integrated bioinformatics analyses

Chunjiang Liu

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Contribution: Data curation (equal), Investigation (equal), Project administration (equal), Writing - original draft (equal)

Search for more papers by this author

Huadong Wu,

Huadong Wu

Department of vascular surgery, First affiliated Hospital of Huzhou University, Huzhou, China

Contribution: Data curation (equal), Investigation (equal), Methodology (equal), Validation (equal)

Search for more papers by this author

Kuan Li,

Kuan Li

Department of General Surgery, Kunshan Hospital of Traditional Chinese Medicine, Kunshan, China

Contribution: Data curation (equal), Investigation (equal), Software (equal)

Search for more papers by this author

Yongxing Chi,

Yongxing Chi

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Contribution: Software (equal)

Search for more papers by this author

Zhaoying Wu,

Zhaoying Wu

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Contribution: Software (equal)

Search for more papers by this author

Chungen Xing,

Corresponding Author

Chungen Xing

[email protected]

orcid.org/0000-0001-7865-1258

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Correspondence

Chungen Xing, Department of General Surgery, The Second Affiliated Hospital of Soochow University, No.1055, Sanxiang Rd, Suzhou 215004, China.

Email: [email protected]

Contribution: Conceptualization (lead)

Search for more papers by this author

Chunjiang Liu,

Chunjiang Liu

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Contribution: Data curation (equal), Investigation (equal), Project administration (equal), Writing - original draft (equal)

Search for more papers by this author

Huadong Wu,

Huadong Wu

Department of vascular surgery, First affiliated Hospital of Huzhou University, Huzhou, China

Contribution: Data curation (equal), Investigation (equal), Methodology (equal), Validation (equal)

Search for more papers by this author

Kuan Li,

Kuan Li

Department of General Surgery, Kunshan Hospital of Traditional Chinese Medicine, Kunshan, China

Contribution: Data curation (equal), Investigation (equal), Software (equal)

Search for more papers by this author

Yongxing Chi,

Yongxing Chi

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Contribution: Software (equal)

Search for more papers by this author

Zhaoying Wu,

Zhaoying Wu

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Contribution: Software (equal)

Search for more papers by this author

Chungen Xing,

Corresponding Author

Chungen Xing

[email protected]

orcid.org/0000-0001-7865-1258

Department of General Surgery, The Second Affiliated Hospital of Soochow University, Suzhou, China

Correspondence

Chungen Xing, Department of General Surgery, The Second Affiliated Hospital of Soochow University, No.1055, Sanxiang Rd, Suzhou 215004, China.

Email: [email protected]

Contribution: Conceptualization (lead)

Search for more papers by this author

First published: 24 May 2024

https://doi.org/10.1111/jcmm.18398

Chunjiang Liu, Kuan Li and Huadong Wu contributed equally to this work.

Share a link

Email
Wechat
Bluesky

Abstract

Behçet's disease (BD) is a complex autoimmune disorder impacting several organ systems. Although the involvement of abdominal aortic aneurysm (AAA) in BD is rare, it can be associated with severe consequences. In the present study, we identified diagnostic biomarkers in patients with BD having AAA. Mendelian randomization (MR) analysis was initially used to explore the potential causal association between BD and AAA. The Limma package, WGCNA, PPI and machine learning algorithms were employed to identify potential diagnostic genes. A receiver operating characteristic curve (ROC) for the nomogram was constructed to ascertain the diagnostic value of AAA in patients with BD. Finally, immune cell infiltration analyses and single-sample gene set enrichment analysis (ssGSEA) were conducted. The MR analysis indicated a suggestive association between BD and the risk of AAA (odds ratio [OR]: 1.0384, 95% confidence interval [CI]: 1.0081–1.0696, p = 0.0126). Three hub genes (CD247, CD2 and CCR7) were identified using the integrated bioinformatics analyses, which were subsequently utilised to construct a nomogram (area under the curve [AUC]: 0.982, 95% CI: 0.944–1.000). Finally, the immune cell infiltration assay revealed that dysregulation immune cells were positively correlated with the three hub genes. Our MR analyses revealed a higher susceptibility of patients with BD to AAA. We used a systematic approach to identify three potential hub genes (CD247, CD2 and CCR7) and developed a nomogram to assist in the diagnosis of AAA among patients with BD. In addition, immune cell infiltration analysis indicated the dysregulation in immune cell proportions.

1 INTRODUCTION

Behçet's disease (BD) is a persistent systemic inflammatory disorder characterised by an underlying chronic vasculitis, reflecting an inflammatory process affecting the blood vessels. Vascular involvement is observed in 15% to 40% of patients with BD; among these individuals, approximately 27.5% could present with vascular lesions as their initial manifestation.¹ Inflammation can lead to arterial aneurysms, thrombosis and endothelial dysfunction. A combination of BD with aneurysm involves true aneurysm, pseudoaneurysm and aortic dissection, with the abdominal aorta as the most common site of occurrence.²

Abdominal aortic aneurysm (AAA) is distinguished by the infiltration of immune cells, heightened proteolytic activity and persistent degradation of extracellular matrix constituents, including collagen, elastin, fibronectin and laminin, thus expanding the aortic wall.³ Although the involvement of AAA in BD is rare, it can be associated with severe consequences. AAA is characterised by progressive aortic dilation that may result in a potentially lethal rupture. Fortunately, endovascular therapy with proper medicines for the treatment of AAA in patients with BD has yielded promising results with low morbidity and mortality.^{4, 5} Therefore, early diagnosis and treatment of AAA in patients with BD are of utmost importance to prevent the rupture of AAA. Biomarkers can facilitate timely detection and medical intervention for conditions such as BD and asymptomatic AAA, which typically lack discernible clinical manifestations.

Microarray-based gene expression profiling has been recently and widely applied to biomedical and clinical research as a prominent biomarker tool.⁶ Multiple studies have successfully identified biomarkers associated with BD and AAA. For instance, CLEC12A, IFI27 and CLC are considered potential and valuable biomarkers for the diagnosis of BD.⁷ A study reported the association of miR-24 and CHI3L1 as biological markers with AAA.⁸ Another study reported the involvement of MEDAG and SERPINE1 genes in the pathogenesis of AAA.⁹ The aetiology of BD remains incompletely elucidated. However, infection-related, genetic, epigenetic and immunological factors are known to collectively contribute to its progression.¹⁰ The immune-mediated infiltration and subsequent destruction of the aortic wall have been implicated in the development of AAA,¹¹ with an impaired inflammatory response playing a significant role in inducing AAA in patients with BD.¹² Therefore, biomarkers linked to immune filtration can be useful in predicting the susceptibility of patients with BD to AAA and aid in its treatment. Nevertheless, the literature on the genetic mechanism underlying BD-induced AAA is little and warrants further investigation.

Mendelian randomization (MR) analysis, a promising epidemiological method, has been proposed to accurately evaluate the potential causal relationships. Moreover, MR analysis effectively mitigates potential confounding factors and reverses causality by leveraging the random allocation principle of alleles and employing instrumental variables (IVs) as genetic variants.¹³ In the present study, we used summary-level statistics from previous genome-wide association studies (GWASs) to conduct an MR analysis, facilitating a more feasible exploration of potential causal relationships between BD and AAA.

Significant advances have occurred in the fields of bioinformatics and machine learning over the past decade.^14-17 These advancements have facilitated the investigation of underlying mechanisms and the identification of potential biomarkers.^18-21 For instance, Limma analysis and weighted gene co-expression network analysis (WGCNA) were employed to identify differentially expressed genes (DEGs) in BD and AAA, as well as specific module genes significantly associated with both conditions. Previous studies have employed this method to identify shared risk genes associated with different phenotypes of disease.²² A combination of protein–protein interaction (PPI) network analysis, machine learning algorithms and evaluation of nomogram results was used to ascertain significant biomarkers associated with AAA and BD.

To the best of our current understanding, research investigating the genetic mechanism underlying BD-induced AAA is lacking. In the current study, we used integrated bioinformatics and machine learning techniques to identify significant biomarkers associated with BD-induced AAA.

2 METHODS

2.1 Mendelian randomization (MR) analysis

Figure 1 depicts the study flowchart. We used two-sample MR analysis to examine the causal link between BD and the likelihood of AAA.²³ The GWAS summary statistics for BD and AAA were acquired from the 9th FinnGen study. BD was treated as the exposure variable and AAA as the outcome measure. Three crucial assumptions must be met to conduct an MR study: Firstly, the selected single nucleotide polymorphisms (SNPs) should exhibit a significant correlation with the exposure (BD). Secondly, the SNPs should be independent of the potential confounding factors. Thirdly, the SNPs should be specifically related to the risk of the outcome (AAA) through BD. Genetic variants with genome-wide significant (p < 5 × 10⁻⁵) association with BD were selected as instrumental variables. The causal effect of BD on the risk of AAA was examined in an inverse variance-weighted (IVW) meta-analysis using the Wald ratio estimates.²⁴ The MR–Egger test was analysed to detect potential pleiotropy. A p-value exceeding 0.05 for the MR–Egger intercept was indicative of the absence of horizontal pleiotropy. The stability was evaluated using leave-one-out sensitivity analyses, wherein a single SNP was excluded in each iteration. A two-sided p-value below 0.05 was considered significant. Statistical analyses were conducted using the “two-sample-MR,” “MR-PRESSO,” and “mr. raps” packages.²⁵

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

Study flowchart.

2.2 Microarray data

Four microarray datasets (GSE17114, GSE209567,⁷ GSE57691 and GSE7084²⁶) were downloaded them from the NCBI Gene Expression Omnibus (GEO) database.²⁷ The search strategy used the terms “Behçet's disease” or “abdominal aortic aneurysm” in conjunction with “Homo sapiens” AND “expression profiling by array.” Datasets GSE17114 and GSE209567 included gene expression data on patients with BD and normal controls. Datasets GSE57691 and GSE7084 contained data on patients with AAA and controls. Table 1 presents comprehensive details on datasets, encompassing the microarray platform used, sample groups involved and their respective quantities. The datasets underwent preprocessing using the “afy” package in R, sourced from the Bioconductor project, for background calibration and normalisation. The median expression of multiple probes corresponding to the same gene was calculated. The probes were subsequently converted to gene symbols and organised into matrix files. The GSE17114 and GSE209567 datasets were merged using the R/Bioconductor inSilicoDb package. The R “surrogate variable analysis” package was used to eliminate batch effects and other undesired variations.

TABLE 1. Detailed information of the GEO datasets in the study.

ID	GSE series	Disease	Samples	Source types	Platform	Group
1	GSE17114	BD	15 BD patients and 14 normal controls	Peripheral blood	GPL570	Discovery cohort
2	GSE209567	BD	29 BD patients and 15 normal controls	Peripheral blood	GPL570	Discovery cohort
3	GSE57691	AAA	49 AAA patients and 10 donors	Aortic wall	GPL10558	Discovery cohort
4	GSE7084	AAA	9 AAA patients and 10 donors	Aortic wall	GPL2507 GPL570	Validation cohort

2.3 Differentially expressed gene (DEG) analysis using Limma

Linear models for microarray data (Limma) are a differential expression screening method that utilises generalised linear models to identify differentially expressed genes (DEGs) between different comparison and control groups. The method involves applying the lmFit function to conduct multiple linear regression on the acquired expression profile dataset. Subsequently, the eBays function is used to calculate moderated t-statistics, moderated F-statistics and log-odds of differential expression through empirical Bayes moderation of the standard errors toward a common value. The Limma package²⁸ was used to screen for DEGs between BD and the control group in the merged dataset (GSE17114 and GSE209567). Next, the same package was used to detect the DEGs between AAA and control groups in the GSE57691 dataset. The criteria for screening DEGs were a p-value below 0.05 and a fold change (FC) exceeding 1.2.

2.4 Significant module identification using weighted gene co-expression network analysis (WGCNA)

WGCNA has been extensively used to construct gene co-expression networks.²⁹ The method clusters genes exhibiting similar expression patterns and assesses the correlation between gene modules and specific phenotypes. We used WGCNA to ascertain significant module genes exhibiting a strong correlation with BD and AAA. To begin, the median absolute deviation (MAD) was calculated for each gene. Subsequently, 50% of genes with the lowest MAD values were excluded. Afterward, a scale-free co-expression network was established by applying the goodSamplesGenes function in WGCNA to filter the expression matrix of DEGs. Subsequently, the adjacency was determined using the pick-soft-threshold function, which uses the co-expression similarity to derive the soft thresholding power β. In our study, we set the soft thresholding power β to 8 for BD and 10 for AAA (Figures S1 and S2). Afterward, the adjacency was transformed into a topological overlap matrix (TOM), and the gene ratio and associated dissimilarity were computed. Genes were hierarchically clustered based on their dissimilarity degree (1-TOM) to group genes with similar expression patterns into gene modules. The minimum module size was defined as containing 100 genes. Ultimately, the identification of modules was achieved through hierarchical clustering and dynamic tree-cutting techniques.

2.5 Functional enrichment analysis

Functional enrichment analysis, which encompassed GO and KEGG analyses was conducted using the clusterprofiler package in R.³⁰ The GO analysis categorised functions into biological processes (BP), molecular functions (MF) and cellular components (CC), with the top 10 GO terms in each category visualised using the “ggplot2” package in R while applying the screening criteria of false discovery rate <0.05 and p-value <0.05.

2.6 Protein–protein interaction (PPI) network construction

The potential interplay of the identified DEGs was investigated by mapping them to the PPI network. The PPI network was established using the STRING 12.0 database,³¹ with a minimum required interaction score of 0.400. DEGs that were non-protein-coding and lacked interactions with other DEGs were excluded from the analysis. A visualisation of the PPI network was created using the remaining DEGs. The data in “tsv” format was acquired and imported into the Cytoscape software (version 3.9.1).³² The importance of nodes in biological networks and the identification of central elements can be determined by measuring their network features. Subsequently, three algorithms (betweenness, closeness and degree) were selected to evaluate the topological characteristics of each node in the interaction network.³³ They are crucial topological algorithms used to evaluate node significance within a network and determine whether a target protein serves as a fundamental basis for key targets. The DEGs underwent additional filtration using three algorithms in the CytoHubba plug-in within Cytoscape. Each algorithm assigned a score to each DEG, resulting in the ranking of DEGs according to their scores. The top 30 DEGs, determined by each algorithm, were designated as node DEGs. The interaction of each algorithm with the node DEGs was visually depicted using a Venn diagram.

2.7 Machine-learning algorithms

Machine learning techniques have successfully identified hub genes associated with various diseases. Prominent machine learning algorithms commonly employed to screen potential biomarkers include the Support Vector Machine-Recursive Feature Elimination (SVM-RFE), least absolute shrinkage, selection operator (Lasso) and random forest.³⁴ The risk of overfitting was avoided using the least absolute shrinkage and selection operator (Lasso)³⁵ regression analysis for variable filtration. The Support Vector Machine-Recursive Feature Elimination (SVM-RFE)³⁶ algorithm is suitable for datasets with limited samples as it eliminates redundant factors, retaining only relevant variables. Moreover, random forest³⁷ is advantageous in managing datasets with numerous dimensions, constructing predictive models and estimating the significance of individual variables. Consequently, the intersection of three algorithms was used to obtain DEGs, which were regarded as potential biomarkers.

2.8 Evaluation of receiver operating characteristic curve (ROC) and nomogram

Student's t-test was used to compare candidate gene expression between AAA and control groups. A ROC curve was constructed, and the corresponding area under the curve (AUC) with a 95% confidence interval (CI) was calculated to evaluate the diagnostic performance of each gene.³⁸ Furthermore, we developed a nomogram using the R package “rms.”³⁹ The nomogram converts the relative expression of each gene into a score, which is subsequently aggregated to form the total score to predict the incidence of BD with AAA. In addition, a ROC curve was generated to assess the performance of the nomogram. Only AUC >0.7 was considered significant in patients with BD having AAA.

2.9 Peripheral blood collection, validation of the expression of hub genes and evaluation of the predictive model

To further validate the hub genes we identified, we collected peripheral blood samples from patients diagnosed with BD (n = 8) and BD complicated with AAA (n = 4) at the First Affiliated Hospital of Huzhou University between 1 August 2022 and 1 March 2024. All BD patients were diagnosed by the International Criteria for BD.⁴⁰ All AAA patients were diagnosed by contrast-enhanced CT. The basic clinical characteristics of all participants are provided in Table S1. Approval for the sample collection protocol was obtained from the Ethics Committee of the First Affiliated Hospital of Huzhou University (Huzhou, China). After collecting peripheral blood, the MolPure® Blood RNA Kit (19241ES50, Yeasen, Shanghai, China) was used to extract total RNA, following the provided instructions. Subsequently, the concentration of RNA was assessed using Nanodrop (Thermofisher, USA), and cDNA synthesis was performed using the Hifair® II 1st Strand cDNA Synthesis Kit (11121ES60, Yeasen, Shanghai, China). The primers used in this study are listed in Table S2. Finally, the expression levels of the hub genes were detected between the two groups. In addition, a predictive nomogram model was constructed to distinguish BD patients with or without AAA.

2.10 Immune infiltration analysis, single-sample gene-set enrichment analysis (ssGSEA) and therapeutic agents screening

The composition of infiltrating immune cells from the normalised gene expression matrix was determined using the “Cibersort” algorithm. The R package “Cibersort” was utilised to quantify the proportions of 22 distinct types of immune cells between AAA and control groups. The proportions of immune cells were visualised using bar plots, and differences in immune cell expression between the two groups were measured using boxplots. The correlation between different immune cells in the development of AAA was illustrated using a heatmap generated with the R package “corrplot.”⁴¹ The relationship between immune cell infiltrations and characteristic genes was examined using ssGSEA. A p-value < 0.05 was considered statistically significant.

The association between potential biomarkers and hallmark gene sets was established using the ssGSEA method. Initially, a comprehensive set of 50 well-defined biological states or processes, referred to as hallmark gene sets, was acquired from MSigDB.⁴² Subsequently, the GSVA package⁴³ was used to conduct ssGSEA and determine the correlation between potential biomarkers and hallmark gene sets. A p-value < 0.05 was considered statistically significant.

Finally, Enrichr (https://maayanlab.cloud/Enrichr/) was employed to screen for therapeutic agents targeting the hub genes, with a threshold of p-value < 0.05 being utilised for the enrichment analysis.

2.11 Statistical analysis

Statistical analyses were carried out utilizing the R software (version 4.2.1), GraphPad Prism (version 9.4.0) and SPSS (version 26.0). The continuous variables between the two groups were compared using Student's t-test, with statistical significance considered when the p-value was below 0.05.

3 RESULTS

3.1 MR analysis of genetic susceptibility to BD and AAA

In the MR analysis, we excluded SNPs associated with confounding factors and outcomes as well as SNPs with incompatible alleles or exhibiting palindromic patterns with intermediate allele frequencies. Subsequently, we retained 16 BD-related SNPs that met the three crucial assumptions of the MR study for further analysis. Table S3 lists the comprehensive details of IVs for BD in MR analysis. The IVW analysis indicated a suggestive association between BD and the risk of AAA (OR: 1.0384, 95% CI: 1.0081–1.0696, p = 0.0126). The MR–Egger regression analysis was conducted to examine the presence of horizontal pleiotropy. The findings showed that pleiotropy is unlikely to introduce bias to the causal relationship (p > 0.05; Figure 2 and Table S4).

3.2 DEG identification via Limma in BD and AAA

A total of 482 DEGs were identified by comparing the BD and control groups. Among these, 134 genes displayed upregulation, whereas 348 genes were downregulated. Similarly, a comparison of AAA and control groups revealed 6088 DEGs, of which 2363 genes were upregulated and 3725 genes were downregulated. A heatmap was used to display the 20 most significant DEGs exhibiting either upregulation or downregulation. In addition, a volcano plot was used to represent all DEGs (BD: Figure 3A,B; AAA: Figure 4A,B).

3.3 Identification of significant module genes in BD and AAA via WGCNA

Next, WGCNA was used to identify the significant module genes associated with BD and AAA. The grey module did not successfully cluster the genes commonly considered irrelevant or uninformative (i.e., the “junk module”). The light yellow (r = 0.32, p = 5.3 × 10⁻³) module and cyan (r = −0.38, p = 7.8 × 10⁻⁴) module displayed the highest correlation with BD (Figure 3C). Figure 3D,E depict the relationship between module membership and gene significance in the cyan/light yellow module of BD. The light cyan module demonstrated the highest positive correlation with AAA (r = 0.45, p = 3.3 × 10⁻⁴), whereas the magenta module displayed the strongest negative correlation (r = −0.60, p = 5.7 × 10⁻⁷) (Figure 4C). Figure 4D,E depict the relationship between module membership and gene significance in the light cyan/magenta module of AAA. Consequently, 7004 genes were identified in the BD group, whereas 7036 genes were identified in the AAA group. Subsequently, the intersection of 482 DEGs and 7004 BD-associated module genes led to the identification of 384 BD-related DEGs (Figure 3F). Similarly, the intersection of 6088 DEGs and 7036 module genes associated with AAA led to the identification of 3666 AAA-related DEGs (Figure 4F). Figures S1 and S2 display the soft threshold selection and gene cluster tree, respectively.

3.4 Functional enrichment analysis of BD-related DEGs in AAA

The identification of 94 BD-related DEGs in AAA was achieved by intersecting 384 DEGs associated with BD and 3666 DEGs associated with AAA (Figure 5A). Ninety-four DEGs for BP displayed significant enrichment in GO terms such as “immune system process,” “immune response,” and “T cell activation.” In addition, CC was enriched with terms such as “receptor complex,” “plasma membrane receptor complex,” and “plasma membrane protein complex.” The MF of DEGs displayed significant associations with terms such as “protein kinase binding,” “protein tyrosine kinase binding,” and “non-membrane spanning protein tyrosine kinase activity” (Figure S3 A–C and Table S5). The functional pathway analysis of 94 DEGs, depicted in Figure S3D and Table S3 revealed significant enrichment in pathways such as “T cell receptor signaling pathway,” “PD-L1 expression and PD-1 checkpoint pathway in cancer,” and “primary immunodeficiency.”

3.5 PPI network construction and potential hub gene selection

A preliminary PPI network was constructed using 94 DEGs to identify hub BD- and AAA-associated DEGs. Interaction with others (Figure 5B) retained 57 DEGs, whereas 37 DEGs were excluded due to non-interaction. Furthermore, the CytoHubba plug-in was used along with three distinct algorithms (degree, betweenness and closeness) to identify intersecting DEGs. Figure 5C–E illustrates the top 30 node genes determined using the betweenness, closeness and degree algorithms. Figure 5F presents the overlap of 30 genes identified using three algorithms, among which 20 genes were identified as potential hub genes. Table S6 provides detailed information on these identified 20 genes.

3.6 Selection of candidate hub genes using machine learning techniques

We next identified seven potential hub genes that served as optimal biomarkers for diagnosing AAA in patients with BD by applying the Lasso regression algorithm. These candidate hub genes corresponded to the minimum point on the curve. Figure 6A illustrates the results of the Lasso regression. The significance of DEGs was determined using the random forest approach. The error in AAA was detected using the random forest algorithm, as shown in Figure 6B. Figure 6C displays a compilation of the 10 most significant DEGs. The SVM–RFE analysis identified the top seven DEGs, demonstrating the lowest error and highest accuracy in diagnosing AAA with BD, as depicted in Figure 6D,E. Consequently, four candidate hub genes (CD3G, CD2, CD247 and CCR7) were selected based on the intersection of genes identified by the three algorithms, as shown in Figure 6F.

3.7 Diagnostic value evaluation and nomogram construction

Compared with the control group, AAA exhibited upregulated four candidate hub genes, as shown in Figure 7A. Figure 7B presents the AUC and 95% CIs for each gene: CD2 (AUC: 0.916, 95% CI: 0.836–0.997), CCR7 (AUC: 0.939, 95% CI: 0.876–1.000), CD3G (AUC: 0.700, 95% CI: 0.550–0.850) and CD247 (AUC: 0.947, 95% CI: 0.892–1.000). The ROC curve analyses revealed three genes (CD2, CCR7 and CD247) that exhibited satisfactory diagnostic performance. Finally, the nomogram, as depicted in Figure 7C, yielded an AUC value of 0.941 (95% CI: 0.881–1.000), indicating a significant clinical diagnostic value, as demonstrated in Figure 7D. To further validate its diagnostic potential, the validation dataset GSE7084 was used to perform the ROC curve analysis. Compared with the control group, AAA exhibited the upregulation of the three hub genes, as shown in Figure S4A. The analysis of the nomogram in the validation dataset showed an AUC of 1.000, confirming its substantial clinical diagnostic value, as illustrated in Figure S4B,C.

3.8 Validation of the expression pattern of three hub genes and evaluation of the predictive value of the nomogram model

To further confirm the accuracy of the above-integrated bioinformatics analysis, we first examined the expression pattern of the three hub genes in the recruited patients from our external cohort. All three DEGs showed upregulated expression in BD complicated with AAA compared with the BD groups. Furthermore, a predictive nomogram model was constructed to distinguish BD patients with or without AAA. The nomogram, as depicted in Figure S5, yielded an AUC value of 1, confirming its substantial clinical value in predicting the possibility of AAA in BD patients.

3.9 Immune cell infiltration analysis, ssGSEA and therapeutic agent screening

A bar plot, presented in Figure S6A, was used to illustrate the percentage distribution of 22 distinct immune cell types in each sample after applying the CIBERSORT algorithm. The boxplot analysis revealed a higher prevalence of B naive cells and CD4 memory-activated T cells in AAA compared with the control group. Conversely, the proportion of M2 macrophages displayed a lower prevalence in AAA (Figure S6B). The correlation analysis demonstrated that follicular helper T cells were significantly and positively correlated with naive B cells (r = 0.62), whereas regulatory T cells demonstrated the highest negative correlation with CD4 memory resting T cells (r = −0.47), as depicted in Figure S6C. In conclusion, a mechanism involving regulating the macrophages could be a promising approach for treating AAA. Furthermore, immune cell infiltration analysis revealed a certain correlation with all three hub DEGs, as illustrated in Figure S6D.

The ssGSEA analysis revealed significant positive associations between all three hub DEGs and different biological processes, including “IL2-STAT5 signalling pathway,” “inflammatory response,” “IL6–JAK–STAT3 signalling pathway,” “epithelial–mesenchymal transition,” and “angiogenesis,” as shown in Figure S7. Conversely, the three DEGs displayed negative correlations with the biological processes of “myogenesis,” “NOTCH signalling,” and “cholesterol homeostasis.” These biological processes could be intricately associated with the onset and progression of AAA in individuals with BD. In addition, a PPI network was constructed using 20 genes identified by three algorithms (degree, betweenness and closeness). We discovered that the three hub genes interacted through intermediate molecules (Figure S8).

The Enrichr database was utilised to screen therapeutic agents targeting the three hub genes. The predicted results indicated that methotrexate, alpha-d-mannose, vitinoin, ivermectin and tacrolimus monohydrate could be potential effective agents for targeting the three hub genes associated with BD-induced AAA (Table S7).

4 DISCUSSION

The condition of patients with BD is complicated by abdominal aortic involvement, which can rarely exhibit risk factors for atherosclerosis. This generally occurs at a younger age. The diagnosis of patients is frequently delayed due to the asymptomatic nature of most cases. AAA represents a progressive dilation of the aorta that can ultimately result in a potentially fatal rupture. It is highly crucial to prevent, early detect and treat AAA due to the potential risk of abdominal aortic rupture in patients with BD.⁴⁴

To our knowledge, this study represents the pioneering attempt to study the application of MR and bioinformatics analyses to investigate these two diseases. MR uses genetic variants associated with a specific biological mediator to evaluate causal relationships. It has been widely applied to investigate causal associations among different biological or medical factors. However, MR has not yet been used to investigate the causal connection between BD and AAA.

Reliable biomarkers hold paramount importance in contemporary medicine. In addition, the implementation of bioinformatics and machine learning approaches has greatly facilitated the investigation of underlying mechanisms and the identification of potential biomarkers. Therefore, these approaches can accurately identify disease-related biomarkers, facilitate the investigation of disease occurrence and progression and enable the exploration of the underlying pathogenic mechanisms. The Limma package in R provides a comprehensive solution for analysing gene expression data. Thus, it is a widely used tool for differential gene expression analysis of microarray data. WGCNA is a systematic biological technique that analyzes gene association patterns across different samples, allowing to calculate the correlation between gene modules and phenotypes based on phenotypic information. This approach successfully identifies potential biomarkers.⁴⁵ We utilised the STRING database to generate a protein–protein interaction biochemical network to explore the interplay among differentially expressed genes (DEGs). Subsequently, three algorithms (degree, betweenness and closeness) were used to identify central elements and hub genes based on network centrality and connectivity. A previous study has demonstrated that machine learning methods, as flexible prediction algorithms, exhibit higher accuracy compared to conventional regression.^{46, 47} Furthermore, stacking ensemble learning algorithms have demonstrated better performance than individual machine learning models in identifying risk factors for diseases.⁴⁸ Thus, we employed prominent machine learning algorithms, including SVM-RFE, Lasso and random forest, to screen potential biomarkers. The diagnostic performance of the identified genes or models was evaluated using ROC analysis and nomogram construction, assessing sensitivity, specificity and the area under the curve (AUC). This study stands out for its integration of multiple analytical methods, providing a comprehensive and advanced analysis pipeline for identifying potential key biomarkers, developing predictive models and exploring potential mechanisms for BD-associated AAA.

Because the dataset used comprised peripheral blood samples from individuals diagnosed with BD, the assessment of hub gene expression in the peripheral blood of these patients offers valuable information to estimate the likelihood of AAA incidence in this specific population. Thus, this is a practical and efficacious clinical method. The nomogram allowed us to calculate the cumulative score of each gene and the total score. Consequently, our nomogram demonstrated substantial potential for use in clinical settings as it facilitates the identification and early intervention of patients with BD with elevated total scores, thereby improving the prognosis of this specific patient population.

Contrary to conventional observational studies, the MR analysis effectively mitigates the influence of confounding variables and reverse causation on outcomes.⁴⁹ The MR analysis indicated a strong association between BD and the risk of AAA (OR: 1.0384, 95% CI: 1.0081–1.0696, p = 0.0126). The application of integrated bioinformatics, machine learning techniques and ROC evaluation successfully identified three biomarkers, namely, CD2, CD247 and CCR7. In addition, we developed a nomogram and assessed its diagnostic efficacy for AAA in patients with BD. We conducted an external validation employing an additional dataset (GSE7084), revealing noteworthy correlations between the three key genes and AAA.

CD2 belongs to the transmembrane immunoglobulin superfamily. The CD2 protein acts as a costimulatory receptor located on the surfaces of T and natural killer (NK) cells, initiating an adaptive immune response through interaction with LFA-3/CD58 on antigen-presenting cells (APCs).⁵⁰ Several studies have implicated CD2 in immune responses and inflammatory disorders.^{51, 52} For instance, Pawlowski et al.⁵³ demonstrated that the absence of CD2 can alleviate the intestinal inflammatory damage caused by Toxoplasma gondii infection. Similarly, Inomata et al. used monoclonal antibodies targeting CD2 molecules in mice and demonstrated that these inhibited myocardial cell injury by reducing T-cell infiltration. Although the specific pathological mechanism contributing to the effect of CD2 on AAA is not well understood, numerous CD2⁺ T cells were detected in the cysts of patients with AAA.⁵⁴ We detected upregulated expression of CD2 in patients with AAA, suggesting the crucial role of immune dysfunction and inflammation in the development of AAA.

CD247 encodes the T-cell receptor (TCR) zeta, which is a critical component for assembling the TCR–CD3 complex.⁵⁵ In humans, CD247 has been linked to several autoimmune diseases, such as rheumatoid arthritis⁵⁶ and systemic lupus erythematosus (SLE).⁵⁷ For example, Rudemiller et al.⁵⁸ reported a positive correlation between CD247 and blood pressure in rats fed a high-salt diet. In addition, knockout of CD247 reduced the hypertension levels by decreasing immune cell infiltration into the kidneys, potentially serving as a therapeutic target to delay the progression of AAA.

CCR7, a G protein-coupled receptor, is known to be targeted by two specific ligands, namely the C-C motif chemokine ligand 19 (CCL19) and CCL21.⁵⁹ Similar to other chemokine receptors, CCR7 directs the immune cells towards lymphoid organs by recognising its specific ligands. This process is essential for the initiation and maintenance of adaptive immunity.⁶⁰ Recent research has reported a connection between CCR7, CCL19/CCL21 and several autoimmune and inflammatory diseases.⁶¹ For instance, Katrien et al.⁶² reported that elevated CCL21 levels in patients with rheumatoid arthritis (RA) induced the migration of CCR7⁺ monocyte macrophages into affected joints, thereby promoting the polarisation of Th17 cells and perpetuating bone erosion and vascularization. In atherosclerosis, the absence of CCR7 not only impedes the infiltration of inflammatory cells into the vascular wall but also retards the progression of atherosclerotic plaques.^{63, 64} We demonstrated an upregulation of CCR7 expression in individuals with AAA and BD, suggesting a potentially crucial role of CCR7 in the development of AAA in patients with BD.

Given its pathogenesis and symptomatology, BD occupies a unique position between autoimmune and autoinflammatory diseases. It is characterised by vasculitis and endothelial damage.⁶⁵ AAA is characterised by the presence of immune cell infiltration, increased proteolytic activity and ongoing degradation of extracellular matrix components such as collagen, elastin, fibronectin and laminin. These processes ultimately expand the aortic wall. The aetiology of AAA attributed to BD remains elusive. However, the predominant hypothesis suggests that genetically predisposed individuals could experience an autoimmune response triggered by exposure to environmental factors or an autoantigen, such as a heat shock protein, resulting in the development of vasculitis. Inflammation can lead to arterial aneurysms, thrombosis and endothelial dysfunction.⁶⁶ The GO analysis reported that the DEGs were predominantly linked to immune regulation, encompassing aspects such as immune system functioning, immune response and activation of T cells. Additionally, we analysed immune infiltration in AAA. The findings revealed an increased presence of B naive cells and CD4 memory-activated T cells in individuals diagnosed with AAA when compared to the control group, accompanied by a lower proportion of M2 macrophages in AAA. A previous study reported the significant role of M2 macrophages in resolving the inflammatory phase, promoting tissue remodelling and ultimately inhibiting the development of AAA.⁶⁷ Conversely, an elevated proportion of B naive cells and CD4 memory-activated T cells could contribute to the development and rupture of AAA.^{68, 69} These studies are consistent with our findings.

CD2 plays a critical role in T cell activation and adhesion. BD is associated with dysregulated immune responses and increased T-cell activation. A significant increase in peripheral blood NK cells (CD2⁺) was observed in patients with BD.⁷⁰ Similarly, numerous CD2⁺ T cells were detected in the cysts of patients with AAA.⁵⁴ CD247, as a component of the T cell receptor complex, regulates signal transduction in T cells.⁷¹ Research has demonstrated a strong association between CD2 and CD247 with immune infiltration and their association with the occurrence of AAA.⁷² High proportions of CCR7⁺ cells were observed in active patients with BD⁷³ and aortic aneurysm.⁷⁴ A bioinformatics study identified CCR7 as one of the hub genes in AAA.⁷⁵ In this study, we examined the interplay between the three hub genes we identified and the infiltration of immune cells. We observed a positive correlation between immune-related M1 macrophages and gamma delta T cells and the three identified hub genes (CD2, CD247 and CCR7). The interactions between CD247, CD2 and CCR7 in the context of BD and AAA remain to be completely elucidated. Nevertheless, considering their known functions and our results, it is plausible that CD2, CD247 and CCR7 contribute to immune dysregulation and the development or progression of AAA in patients with BD. Studies pertaining to the regulation of T cells and macrophages could provide promising therapeutic strategies.

Recently, non-coding RNAs (ncRNAs) have garnered significant attention for their functions and potential as diagnostic biomarkers in the modulation of inflammation and autoimmune diseases, including BD.⁷⁶ Similarly, ncRNAs have been demonstrated to participate in the progression of AAA.⁷⁷ Several important computational models, such as the graph convolutional neural (GCN) network and network distance analysis, facilitate the prediction of lncRNA–miRNA interactions.^{78, 79} However, the algorithms employed in this study, particularly machine learning methods, were deemed inadequate in predicting lncRNA–miRNA interactions. Future studies on lncRNA–miRNA interactions, utilizing the aforementioned important computational models, have the potential to enhance our understanding of the mechanisms underlying BD-associated AAA and identify diagnostic biomarkers.

However, our study had several limitations. First, because both the exposure and outcome originate from the same database, a certain degree of sample duplication could be present. While the sensitivity analysis did not identify any instances of horizontal pleiotropy, it remains possible that confounding and pleiotropic factors could be present within the MR analysis. Second, irrespective of using the validation dataset (GSE7084) and clinical samples for assessing the diagnostic value, conducting additional experimental investigations is imperative to validate and explore the underlying mechanisms. The validation samples in the current study were relatively small due to the challenges associated with sample acquisition. Therefore, further verification is necessary through multicentre studies involving larger sample sizes. Third, we utilised only three commonly machine learning algorithms, and in the future, more algorithms, such as the GCN network, could be added.

5 CONCLUSION

Our MR analyses revealed a higher susceptibility of patients with BD to AAA. We used a systematic approach to identify three potential hub genes (CD247, CD2 and CCR7) and developed a nomogram to assist in the diagnosis of AAA among BD patients. In addition, immune cell infiltration analysis indicated the dysregulation in immune cell proportions, suggesting a potential involvement of T cells and macrophages in AAA development.

AUTHOR CONTRIBUTIONS

Chunjiang Liu: Data curation (equal); investigation (equal); project administration (equal); writing – original draft (equal). Huadong Wu: Data curation (equal); investigation (equal); methodology (equal); validation (equal). Kuan Li: Data curation (equal); investigation (equal); software (equal). Yongxing Chi: Software (equal). Zhaoying Wu: Software (equal). Chungen Xing: Conceptualization (lead).

ACKNOWLEDGEMENTS

We are grateful to SangerBox for providing the data analysis platform. We extend our appreciation to Bullet Edits Limited for their assistance in linguistically editing and proofreading the manuscript.

FUNDING INFORMATION

No external funding was received.

CONFLICT OF INTEREST STATEMENT

The authors confirm that there are no conflicts of interest.

Open Research

DATA AVAILABILITY STATEMENT

Data derived from public domain resourcesThe data that support the findings of this study are available inthe 9th release of the FinnGen study at https://www.finngen.fi/en/access_results and public GEO database at https://www.ncbi.nlm.nih.gov/geo. These data were derived from the following resources available in the public domain: GSE17114: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE17114; GSE209567: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE209567; GSE57691: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE57691; GSE7084: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE7084.

Supporting Information

REFERENCES

1Fei Y, Li X, Lin S, et al. Major vascular involvement in Behçet's disease: a retrospective study of 796 patients. Clin Rheumatol. 2013; 32(6): 845-852. doi:10.1007/s10067-013-2205-7
10.1007/s10067-013-2205-7
PubMed Web of Science® Google Scholar
2Kwon TW, Park SJ, Kim HK, Yoon HK, Kim GE, Yu B. Surgical treatment result of abdominal aortic aneurysm in Behçet's disease. Eur J Vasc Endovasc Surg. 2008; 35(2): 173-180. doi:10.1016/j.ejvs.2007.08.013
10.1016/j.ejvs.2007.08.013
PubMed Web of Science® Google Scholar
3Lu S, Wang R, Fu W, Si Y. Applications of extracellular vesicles in abdominal aortic aneurysm. Front Cardiovasc Med. 2022; 9:927542. doi:10.3389/fcvm.2022.927542
10.3389/fcvm.2022.927542
CAS PubMed Web of Science® Google Scholar
4İscan HZ, Yigit G, Cetinkaya F, et al. Early and midterm outcomes of endovascular treatment in arterial manifestations of vascular Behcet disease. Ann Vasc Surg. 2023; 92: 71-81. doi:10.1016/j.avsg.2022.12.074
10.1016/j.avsg.2022.12.074
PubMed Web of Science® Google Scholar
5Nitecki SS, Ofer A, Karram T, Schwartz H, Engel A, Hoffman A. Abdominal aortic aneurysm in Behçet's disease: new treatment options for an old and challenging problem. Isr Med Assoc J. 2004; 6(3): 152-155.
PubMed Web of Science® Google Scholar
6Su Z, Fang H, Hong H, et al. An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era. Genome Biol. 2014; 15(12):523. 10.1186/s13059-014-0523-y
10.1186/s13059-014-0523-y
PubMed Web of Science® Google Scholar
7Oğuz AK, Oygür Ç, Taşır S, Özdağ H, Akar MN. Behçet syndrome: The disturbed balance between anti- (CLEC12A, CLC) and proinflammatory (IFI27) gene expressions. Immun Inflamm Dis. 2023; 11(4):e836. doi:10.1002/iid3.836
10.1002/iid3.836
CAS PubMed Web of Science® Google Scholar
8Maegdefessel L, Spin JM, Raaz U, et al. miR-24 limits aortic vascular inflammation and murine abdominal aneurysm development. Nat Commun. 2014; 5: 5214. doi:10.1038/ncomms6214
10.1038/ncomms6214
CAS PubMed Web of Science® Google Scholar
9Teng B, Xie C, Zhao Y, et al. Identification of MEDAG and SERPINE1 related to hypoxia in abdominal aortic aneurysm based on weighted gene coexpression network analysis. Front Physiol. 2022; 13:926508. doi:10.3389/fphys.2022.926508
10.3389/fphys.2022.926508
PubMed Web of Science® Google Scholar
10Al-Araji A, Kidd DP. Neuro-Behçet's disease: epidemiology, clinical characteristics, and management. Lancet Neurol. 2009; 8(2): 192-204. doi:10.1016/s1474-4422(09)70015-8
10.1016/S1474-4422(09)70015-8
PubMed Web of Science® Google Scholar
11Márquez-Sánchez AC, Koltsova EK. Immune and inflammatory mechanisms of abdominal aortic aneurysm. Front Immunol. 2022; 13:989933. doi:10.3389/fimmu.2022.989933
10.3389/fimmu.2022.989933
CAS PubMed Web of Science® Google Scholar
12Mattioli I, Bettiol A, Saruhan-Direskeneli G, Direskeneli H, Emmi G. Pathogenesis of Behçet's syndrome: Genetic, environmental and immunological factors. Front Med. 2021; 8:713052. doi:10.3389/fmed.2021.713052
10.3389/fmed.2021.713052
Web of Science® Google Scholar
13Skrivankova VW, Richmond RC, Woolf BAR, et al. Strengthening the reporting of observational studies in epidemiology using mendelian randomisation (STROBE-MR): explanation and elaboration. BMJ. 2021; 375: n2233. 10.1136/bmj.n2233
10.1136/bmj.n2233
PubMed Google Scholar
14Hu H, Feng Z, Lin H, et al. Gene function and cell surface protein association analysis based on single-cell multiomics data. Comput Biol Med. 2023; 157:106733. doi:10.1016/j.compbiomed.2023.106733
10.1016/j.compbiomed.2023.106733
CAS PubMed Web of Science® Google Scholar
15Sun F, Sun J, Zhao Q. A deep learning method for predicting metabolite-disease associations via graph neural network. Brief Bioinform. 2022; 23(4). doi:10.1093/bib/bbac266
10.1093/bib/bbac266
Web of Science® Google Scholar
16Wang T, Sun J, Zhao Q. Investigating cardiotoxicity related with hERG channel blockers using molecular fingerprints and graph attention mechanism. Comput Biol Med. 2023; 153:106464. doi:10.1016/j.compbiomed.2022.106464
10.1016/j.compbiomed.2022.106464
CAS PubMed Web of Science® Google Scholar
17Meng R, Yin S, Sun J, Hu H, Zhao Q. scAAGA: Single cell data analysis framework using asymmetric autoencoder with gene attention. Comput Biol Med. 2023; 165:107414. doi:10.1016/j.compbiomed.2023.107414
10.1016/j.compbiomed.2023.107414
CAS PubMed Web of Science® Google Scholar
18Liu C, Zhou Y, Zhou Y, Tang X, Tang L, Wang J. Identification of crucial genes for predicting the risk of atherosclerosis with system lupus erythematosus based on comprehensive bioinformatics analysis and machine learning. Comput Biol Med. 2023; 152:106388. doi:10.1016/j.compbiomed.2022.106388
10.1016/j.compbiomed.2022.106388
CAS PubMed Web of Science® Google Scholar
19Li Y, He X, Li Q, et al. EV-origin: Enumerating the tissue-cellular origin of circulating extracellular vesicles using exLR profile. Comput Struct Biotechnol J. 2020; 18: 2851-2859. doi:10.1016/j.csbj.2020.10.002
10.1016/j.csbj.2020.10.002
CAS PubMed Web of Science® Google Scholar
20Hu H, Feng Z, Lin H, et al. Modeling and analyzing single-cell multimodal data with deep parametric inference. Brief Bioinform. 2023; 24(1). doi:10.1093/bib/bbad005
10.1093/bib/bbad005
Web of Science® Google Scholar
21Gao H, Sun J, Wang Y, et al. Predicting metabolite-disease associations based on auto-encoder and non-negative matrix factorization. Brief Bioinform. 2023; 24(5). doi:10.1093/bib/bbad259
10.1093/bib/bbad259
Web of Science® Google Scholar
22Zhou Y, Liu C, Zhang Z, et al. Identification and validation of diagnostic biomarkers of coronary artery disease progression in type 1 diabetes via integrated computational and bioinformatics strategies. Comput Biol Med. 2023; 159:106940. doi:10.1016/j.compbiomed.2023.106940
10.1016/j.compbiomed.2023.106940
CAS PubMed Web of Science® Google Scholar
23Gallagher CS, Mäkinen N, Harris HR, et al. Genome-wide association and epidemiological analyses reveal common genetic origins between uterine leiomyomata and endometriosis. Nat Commun. 2019; 10(1): 4857. doi:10.1038/s41467-019-12536-4
10.1038/s41467-019-12536-4
CAS PubMed Google Scholar
24Hemani G, Zheng J, Elsworth B, et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife. 2018; 7:e34408 10.7554/eLife.34408
10.7554/eLife.34408
PubMed Web of Science® Google Scholar
25Bowden J, Del Greco MF, Minelli C, Davey Smith G, Sheehan N, Thompson J. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat Med. 2017; 36(11): 1783-1802. doi:10.1002/sim.7221
10.1002/sim.7221
PubMed Web of Science® Google Scholar
26Pahl MC, Erdman R, Kuivaniemi H, Lillvis JH, Elmore JR, Tromp G. Transcriptional (ChIP-Chip) Analysis of ELF1, ETS2, RUNX1 and STAT5 in human abdominal aortic aneurysm. Int J Mol Sci. 2015; 16(5): 11229-11258. doi:10.3390/ijms160511229
10.3390/ijms160511229
CAS PubMed Web of Science® Google Scholar
27Clough E, Barrett T. The gene expression omnibus database. Methods Mol Biol. 2016; 1418: 93-110. doi:10.1007/978-1-4939-3578-9_5
10.1007/978-1-4939-3578-9_5
PubMed Google Scholar
28Ritchie ME, Phipson B, Wu D, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015; 43(7): e47 10.1093/nar/gkv007
10.1093/nar/gkv007
CAS PubMed Web of Science® Google Scholar
29Li Y, Li Y, Yu S, et al. Circulating EVs long RNA-based subtyping and deconvolution enable prediction of immunogenic signatures and clinical outcome for PDAC. Mol Ther Nucleic Acids. Dec 3 2021; 26: 488-501. 10.1016/j.omtn.2021.08.017
10.1016/j.omtn.2021.08.017
CAS PubMed Web of Science® Google Scholar
30Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012; 16(5): 284-287. 10.1089/omi.2011.0118
10.1089/omi.2011.0118
CAS PubMed Web of Science® Google Scholar
31Szklarczyk D, Gable AL, Nastou KC, et al. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021; 49(D1): D605-d612. doi:10.1093/nar/gkaa1074
10.1093/nar/gkaa1074
CAS PubMed Web of Science® Google Scholar
32Doncheva NT, Morris JH, Gorodkin J, Jensen LJ. Cytoscape StringApp: Network analysis and visualization of proteomics data. J Proteome Res. 2019; 18(2): 623-632. 10.1021/acs.jproteome.8b00702
10.1021/acs.jproteome.8b00702
CAS PubMed Web of Science® Google Scholar
33Chen SJ, Liao DL, Chen CH, Wang TY, Chen KC. Construction and analysis of protein-protein interaction network of heroin use disorder. Sci Rep. 2019; 9(1): 4980 10.1038/s41598-019-41552-z
10.1038/s41598-019-41552-z
PubMed Google Scholar
34Xing L, Wu T, Yu L, et al. Exploration of biomarkers of psoriasis through combined multiomics analysis. Mediat Inflamm. 2022; 2022:7731082. doi:10.1155/2022/7731082
10.1155/2022/7731082
PubMed Web of Science® Google Scholar
35Mao B, Ma J, Duan S, Xia Y, Tao Y, Zhang L. Preoperative classification of primary and metastatic liver cancer via machine learning-based ultrasound radiomics. Eur Radiol. 2021; 31(7): 4576-4586. doi:10.1007/s00330-020-07562-6
10.1007/s00330-020-07562-6
PubMed Web of Science® Google Scholar
36Mi X, Zou B, Zou F, Hu J. Permutation-based identification of important biomarkers for complex diseases via machine learning models. Nat Commun. 2021; 12(1): 3008. doi:10.1038/s41467-021-22756-2
10.1038/s41467-021-22756-2
PubMed Web of Science® Google Scholar
37Dawkins JJ, Allegretti JR, Gibson TE, et al. Gut metabolites predict Clostridioides difficile recurrence. Microbiome. 2022; 10(1): 87 10.1186/s40168-022-01284-1
10.1186/s40168-022-01284-1
CAS PubMed Web of Science® Google Scholar
38Li Y, Zhao J, Yu S, et al. Extracellular vesicles long RNA sequencing reveals abundant mRNA, circRNA, and lncRNA in human blood as potential biomarkers for cancer diagnosis. Clin Chem. 2019; 65(6): 798-808. doi:10.1373/clinchem.2018.301291
10.1373/clinchem.2018.301291
CAS PubMed Web of Science® Google Scholar
39Dai W, Wang Y, Yang T, Wang J, Wu W, Gu J. Downregulation of exosomal CLEC3B in hepatocellular carcinoma promotes metastasis and angiogenesis via AMPK and VEGF signals. Cell Commun Signal. 2019; 17(1): 113. doi:10.1186/s12964-019-0423-6
10.1186/s12964-019-0423-6
PubMed Web of Science® Google Scholar
40 International Team for the Revision of the International Criteria for Behçet's Disease (ITR-ICBD). The International Criteria for Behçet's Disease (ICBD): a collaborative study of 27 countries on the sensitivity and specificity of the new criteria. J Eur Acad Dermatol Venereol. 2014; 28(3): 338-347. doi:10.1111/jdv.12107
10.1111/jdv.12107
PubMed Web of Science® Google Scholar
41Newman AM, Liu CL, Green MR, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015; 12(5): 453-457. doi:10.1038/nmeth.3337
10.1038/nmeth.3337
CAS PubMed Web of Science® Google Scholar
42Liberzon A, Birger C, Thorvaldsdóttir H, Ghandi M, Mesirov JP, Tamayo P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 2015; 1(6): 417-425. doi:10.1016/j.cels.2015.12.004
10.1016/j.cels.2015.12.004
CAS PubMed Web of Science® Google Scholar
43Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinform. 2013; 14:7 10.1186/1471-2105-14-7
10.1186/1471-2105-14-7
PubMed Web of Science® Google Scholar
44Bettiol A, Alibaz-Oner F, Direskeneli H, et al. Vascular Behçet syndrome: from pathogenesis to treatment. Nat Rev Rheumatol. 2023; 19(2): 111-126. doi:10.1038/s41584-022-00880-7
10.1038/s41584-022-00880-7
CAS PubMed Web of Science® Google Scholar
45Feng S, Xu Y, Dai Z, Yin H, Zhang K, Shen Y. Integrative analysis from multicenter studies identifies a WGCNA-Derived cancer-associated fibroblast signature for ovarian cancer. Front Immunol. 2022; 13:951582. doi:10.3389/fimmu.2022.951582
10.3389/fimmu.2022.951582
CAS PubMed Web of Science® Google Scholar
46Churpek MM, Yuen TC, Winslow C, Meltzer DO, Kattan MW, Edelson DP. Multicenter comparison of machine learning methods and conventional regression for predicting clinical deterioration on the wards. Crit Care Med. 2016; 44(2): 368-374. doi:10.1097/ccm.0000000000001571
10.1097/CCM.0000000000001571
PubMed Web of Science® Google Scholar
47Chen Z, Zhang L, Sun J, Meng R, Yin S, Zhao Q. DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction. J Cell Mol Med. 2023; 27(20): 3117-3126. doi:10.1111/jcmm.17889
10.1111/jcmm.17889
PubMed Web of Science® Google Scholar
48Xu X, Yu Z, Ge Z, et al. Web-based risk prediction tool for an individual's risk of HIV and sexually transmitted infections using machine learning algorithms: development and external validation study. J Med Internet Res. 2022; 24(8):e37850. doi:10.2196/37850
10.2196/37850
PubMed Web of Science® Google Scholar
49Xiang K, Zhang JJ, Xu YY, Zhong X, Ni J, Pan HF. Genetically predicted causality of 28 gut microbiome families and type 2 diabetes mellitus risk. Front Endocrinol. 2022; 13:780133. doi:10.3389/fendo.2022.780133
10.3389/fendo.2022.780133
PubMed Web of Science® Google Scholar
50Binder C, Cvetkovski F, Sellberg F, et al. CD2 immunobiology. Front Immunol. 2020; 11:1090. doi:10.3389/fimmu.2020.01090
10.3389/fimmu.2020.01090
CAS PubMed Web of Science® Google Scholar
51Leitner J, Herndler-Brandstetter D, Zlabinger GJ, Grubeck-Loebenstein B, Steinberger P. CD58/CD2 is the primary costimulatory pathway in human CD28-CD8+ T Cells. J Immunol. 2015; 195(2): 477-487. doi:10.4049/jimmunol.1401917
10.4049/jimmunol.1401917
CAS PubMed Web of Science® Google Scholar
52Shao T, Shi W, Zheng JY, et al. Costimulatory function of Cd58/Cd2 interaction in adaptive humoral immunity in a zebrafish model. Front Immunol. 2018; 9:1204. doi:10.3389/fimmu.2018.01204
10.3389/fimmu.2018.01204
PubMed Web of Science® Google Scholar
53Pawlowski NN, Struck D, Grollich K, et al. CD2 deficiency partially prevents small bowel inflammation and improves parasite control in murine Toxoplasma gondii infection. World J Gastroenterol. 2007; 13(31): 4207-4213. doi:10.3748/wjg.v13.i31.4207
10.3748/wjg.v13.i31.4207
CAS PubMed Web of Science® Google Scholar
54Pasquinelli G, Preda P, Gargiulo M, et al. An immunohistochemical study of inflammatory abdominal aortic aneurysms. J Submicrosc Cytol Pathol. 1993; 25(1): 103-112.
CAS PubMed Web of Science® Google Scholar
55Li Y, Chen S, Li X, et al. CD247, a potential T Cell-derived disease severity and prognostic biomarker in patients with idiopathic pulmonary fibrosis. Front Immunol. 2021; 12:762594. doi:10.3389/fimmu.2021.762594
10.3389/fimmu.2021.762594
CAS PubMed Web of Science® Google Scholar
56Teruel M, McKinney C, Balsa A, et al. Association of CD247 polymorphisms with rheumatoid arthritis: a replication study and a meta-analysis. PLoS One. 2013; 8(7):e68295. doi:10.1371/journal.pone.0068295
10.1371/journal.pone.0068295
CAS PubMed Web of Science® Google Scholar
57Takeuchi T, Suzuki K. CD247 variants and single-nucleotide polymorphisms observed in systemic lupus erythematosus patients. Rheumatology (Oxford). 2013; 52(9): 1551-1555. doi:10.1093/rheumatology/ket119
10.1093/rheumatology/ket119
CAS PubMed Web of Science® Google Scholar
58Rudemiller N, Lund H, Jacob HJ, Geurts AM, Mattson DL. CD247 modulates blood pressure by altering T-lymphocyte infiltration in the kidney. Hypertension. 2014; 63(3): 559-564. doi:10.1161/hypertensionaha.113.02191
10.1161/HYPERTENSIONAHA.113.02191
CAS PubMed Web of Science® Google Scholar
59Förster R, Davalos-Misslitz AC, Rot A. CCR7 and its ligands: balancing immunity and tolerance. Nat Rev Immunol. 2008; 8(5): 362-371. doi:10.1038/nri2297
10.1038/nri2297
PubMed Web of Science® Google Scholar
60Comerford I, Harata-Lee Y, Bunting MD, Gregor C, Kara EE, McColl SR. A myriad of functions and complex regulation of the CCR7/CCL19/CCL21 chemokine axis in the adaptive immune system. Cytokine Growth Factor Rev. 2013; 24(3): 269-283. doi:10.1016/j.cytogfr.2013.03.001
10.1016/j.cytogfr.2013.03.001
CAS PubMed Web of Science® Google Scholar
61Brandum EP, Jørgensen AS, Rosenkilde MM, Hjortø GM. Dendritic cells and CCR7 expression: an important factor for autoimmune diseases, chronic inflammation, and cancer. Int J Mol Sci. 2021; 22(15):8340 10.3390/ijms22158340
10.3390/ijms22158340
CAS PubMed Web of Science® Google Scholar
62Van Raemdonck K, Umar S, Shahrara S. The pathogenic importance of CCL21 and CCR7 in rheumatoid arthritis. Cytokine Growth Factor Rev. 2020; 55: 86-93. doi:10.1016/j.cytogfr.2020.05.007
10.1016/j.cytogfr.2020.05.007
CAS PubMed Web of Science® Google Scholar
63Schieffer B, Luchtefeld M. Emerging role of chemokine receptor 7 in atherosclerosis. Trends Cardiovasc Med. 2011; 21(8): 211-216. doi:10.1016/j.tcm.2012.05.012
10.1016/j.tcm.2012.05.012
CAS PubMed Web of Science® Google Scholar
64Luchtefeld M, Grothusen C, Gagalick A, et al. Chemokine receptor 7 knockout attenuates atherosclerotic plaque development. Circulation. 2010; 122(16): 1621-1628. doi:10.1161/circulationaha.110.956730
10.1161/CIRCULATIONAHA.110.956730
CAS PubMed Web of Science® Google Scholar
65Ahn JK, Kim J, Hwang J, Song J, Kim KH, Cha HS. Urinary metabolomic profiling to identify potential biomarkers for the diagnosis of Behcet's disease by gas chromatography/time-of-flight-mass spectrometry. Int J Mol Sci. 2017; 18(11). doi:10.3390/ijms18112309
10.3390/ijms18112309
Web of Science® Google Scholar
66Sezen Y, Buyukhatipoglu H, Kucukdurmaz Z, Geyik R. Cardiovascular involvement in Behçet's disease. Clin Rheumatol. 2010; 29(1): 7-12. doi:10.1007/s10067-009-1302-0
10.1007/s10067-009-1302-0
PubMed Web of Science® Google Scholar
67Song H, Yang Y, Sun Y, et al. Circular RNA Cdyl promotes abdominal aortic aneurysm formation by inducing M1 macrophage polarization and M1-type inflammation. Mol Ther. 2022; 30(2): 915-931. doi:10.1016/j.ymthe.2021.09.017
10.1016/j.ymthe.2021.09.017
CAS PubMed Web of Science® Google Scholar
68Galle C, Schandené L, Stordeur P, et al. Predominance of type 1 CD4+ T cells in human abdominal aortic aneurysm. Clin Exp Immunol. 2005; 142(3): 519-527. doi:10.1111/j.1365-2249.2005.02938.x
10.1111/j.1365-2249.2005.02938.x
CAS PubMed Web of Science® Google Scholar
69Lei C, Yang D, Chen S, et al. Patterns of immune infiltration in stable and raptured abdominal aortic aneurysms: A gene-expression-based retrospective study. Gene. 2020; 762:145056. doi:10.1016/j.gene.2020.145056
10.1016/j.gene.2020.145056
CAS PubMed Web of Science® Google Scholar
70Suzuki Y, Hoshi K, Matsuda T, Mizushima Y. Increased peripheral blood gamma delta+ T cells and natural killer cells in Behçet's disease. J Rheumatol. 1992; 19(4): 588-592.
CAS PubMed Web of Science® Google Scholar
71Christopoulos P, Chung I, Bozorgmehr F, et al. Deficient CD247 expression is a typical histopathological characteristic of thymomas with cortical features. Histopathology. 2018; 73(6): 1040-1043. doi:10.1111/his.13724
10.1111/his.13724
PubMed Web of Science® Google Scholar
72Li T, Wang T, Zhao X. Profiles of immune infiltration in abdominal aortic aneurysm and their associated marker genes: a gene expression-based study. Braz J Med Biol Res. 2021; 54(11):e11372. doi:10.1590/1414-431X2021e11372
10.1590/1414-431x2021e11372
CAS PubMed Web of Science® Google Scholar
73Islam SMS, Kim HA, Choi B, et al. Differences in expression of human leukocyte antigen class II subtypes and T cell subsets in Behçet's disease with arthritis. Int J Mol Sci. 2019; 20(20). doi:10.3390/ijms20205044
10.3390/ijms20205044
Web of Science® Google Scholar
74Cho MJ, Lee MR, Park JG. Aortic aneurysms: current pathogenesis and therapeutic targets. Exp Mol Med. 2023; 55(12): 2519-2530. doi:10.1038/s12276-023-01130-w
10.1038/s12276-023-01130-w
CAS PubMed Web of Science® Google Scholar
75Chen Y, Ouyang T, Fang C, et al. Identification of biomarkers and analysis of infiltrated immune cells in stable and ruptured abdominal aortic aneurysms. Front Cardiovasc Med. 2022; 9:941185. doi:10.3389/fcvm.2022.941185
10.3389/fcvm.2022.941185
PubMed Web of Science® Google Scholar
76Gu F, Huang X, Huang W, et al. The role of miRNAs in Behçet's disease. Front Immunol. 2023; 14:1249826. doi:10.3389/fimmu.2023.1249826
10.3389/fimmu.2023.1249826
CAS PubMed Web of Science® Google Scholar
77Knappich C, Spin JM, Eckstein HH, Tsao PS, Maegdefessel L. Involvement of myeloid cells and noncoding RNA in abdominal aortic aneurysm disease. Antioxid Redox Signal. 2020; 33(9): 602-620. doi:10.1089/ars.2020.8035
10.1089/ars.2020.8035
CAS PubMed Web of Science® Google Scholar
78Zhang L, Yang P, Feng H, Zhao Q, Liu H. Using network distance analysis to predict lncRNA-miRNA interactions. Interdiscip Sci: Comput Life Sci. 2021; 13(3): 535-545. doi:10.1007/s12539-021-00458-z
10.1007/s12539-021-00458-z
CAS PubMed Web of Science® Google Scholar
79Wang W, Zhang L, Sun J, Zhao Q, Shuai J. Predicting the potential human lncRNA-miRNA interactions based on graph convolution network with conditional random field. Brief Bioinform. 2022; 23(6). doi:10.1093/bib/bbac463
10.1093/bib/bbac463
Web of Science® Google Scholar

Volume28, Issue10

May 2024

e18398

Identification of biomarkers for abdominal aortic aneurysm in Behçet's disease via mendelian randomization and integrated bioinformatics analyses

Abstract

1 INTRODUCTION

2 METHODS

2.1 Mendelian randomization (MR) analysis

2.2 Microarray data

2.3 Differentially expressed gene (DEG) analysis using Limma

2.4 Significant module identification using weighted gene co-expression network analysis (WGCNA)

2.5 Functional enrichment analysis

2.6 Protein–protein interaction (PPI) network construction

2.7 Machine-learning algorithms

2.8 Evaluation of receiver operating characteristic curve (ROC) and nomogram

2.9 Peripheral blood collection, validation of the expression of hub genes and evaluation of the predictive model

2.10 Immune infiltration analysis, single-sample gene-set enrichment analysis (ssGSEA) and therapeutic agents screening

2.11 Statistical analysis

3 RESULTS

3.1 MR analysis of genetic susceptibility to BD and AAA

3.2 DEG identification via Limma in BD and AAA

3.3 Identification of significant module genes in BD and AAA via WGCNA

3.4 Functional enrichment analysis of BD-related DEGs in AAA

3.5 PPI network construction and potential hub gene selection

3.6 Selection of candidate hub genes using machine learning techniques

3.7 Diagnostic value evaluation and nomogram construction

3.8 Validation of the expression pattern of three hub genes and evaluation of the predictive value of the nomogram model

3.9 Immune cell infiltration analysis, ssGSEA and therapeutic agent screening

4 DISCUSSION

5 CONCLUSION

AUTHOR CONTRIBUTIONS

ACKNOWLEDGEMENTS

FUNDING INFORMATION

CONFLICT OF INTEREST STATEMENT

Open Research

DATA AVAILABILITY STATEMENT

Supporting Information

REFERENCES

Figures

References

Related

Information