Volume 2020, Issue 1 8824717

Research Article

Open Access

Identification of a Set of Genes Improving Survival Prediction in Kidney Renal Clear Cell Carcinoma through Integrative Reanalysis of Transcriptomic Data

Banlai Ruan

orcid.org/0000-0002-7551-3156

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Xianzhen Feng,

Xianzhen Feng

orcid.org/0000-0002-9956-8637

Department of Obstetrics and Gynecology, The Third People’s Hospital of Linyi, Linyi, 276000 Shandong Province, China

Search for more papers by this author

Xueyi Chen,

Xueyi Chen

orcid.org/0000-0002-9535-4543

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Search for more papers by this author

Zhiwei Dong,

Zhiwei Dong

orcid.org/0000-0003-0600-8954

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Qi Wang,

Qi Wang

orcid.org/0000-0001-7155-8878

Department of Oncology, Affiliated Hospital of Qingdao University, Qingdao, 266000 Shandong Province, China qdu.edu.cn

Search for more papers by this author

Kai Xu,

Kai Xu

orcid.org/0000-0003-4163-2904

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Jinping Tian,

Jinping Tian

orcid.org/0000-0002-4186-0985

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Jie Liu,

Jie Liu

orcid.org/0000-0003-1173-6809

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Ziyin Chen,

Ziyin Chen

orcid.org/0000-0003-0335-1028

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Wenzhen Shi,

Wenzhen Shi

orcid.org/0000-0002-8055-1114

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Search for more papers by this author

Man Wang,

Man Wang

orcid.org/0000-0002-7364-7994

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Lu Qian,

Corresponding Author

Lu Qian

[email protected]

orcid.org/0000-0003-2388-0357

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Search for more papers by this author

Qianshan Ding,

Corresponding Author

Qianshan Ding

[email protected]

orcid.org/0000-0002-9989-6876

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, 430070 Hubei Province, China rmhospital.com

Search for more papers by this author

Banlai Ruan,

Banlai Ruan

orcid.org/0000-0002-7551-3156

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Xianzhen Feng,

Xianzhen Feng

orcid.org/0000-0002-9956-8637

Department of Obstetrics and Gynecology, The Third People’s Hospital of Linyi, Linyi, 276000 Shandong Province, China

Search for more papers by this author

Xueyi Chen,

Xueyi Chen

orcid.org/0000-0002-9535-4543

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Search for more papers by this author

Zhiwei Dong,

Zhiwei Dong

orcid.org/0000-0003-0600-8954

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Qi Wang,

Qi Wang

orcid.org/0000-0001-7155-8878

Department of Oncology, Affiliated Hospital of Qingdao University, Qingdao, 266000 Shandong Province, China qdu.edu.cn

Search for more papers by this author

Kai Xu,

Kai Xu

orcid.org/0000-0003-4163-2904

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Jinping Tian,

Jinping Tian

orcid.org/0000-0002-4186-0985

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Jie Liu,

Jie Liu

orcid.org/0000-0003-1173-6809

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Ziyin Chen,

Ziyin Chen

orcid.org/0000-0003-0335-1028

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Wenzhen Shi,

Wenzhen Shi

orcid.org/0000-0002-8055-1114

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Search for more papers by this author

Man Wang,

Man Wang

orcid.org/0000-0002-7364-7994

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Search for more papers by this author

Lu Qian,

Corresponding Author

Lu Qian

[email protected]

orcid.org/0000-0003-2388-0357

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Search for more papers by this author

Qianshan Ding,

Corresponding Author

Qianshan Ding

[email protected]

orcid.org/0000-0002-9989-6876

Medical Research Center, Xi’an No. 3 Hospital, the Affiliated Hospital of Northwest University, Xi’an, 710016 Shaanxi Province, China

The College of Life Sciences, Northwest University, Xi’an, 710069 Shaanxi Province, China nwu.edu.cn

Hubei Yicanhealth Co., Ltd., Wuhan, 430070 Hubei Province, China

Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, 430070 Hubei Province, China rmhospital.com

Search for more papers by this author

First published: 13 October 2020

https://doi.org/10.1155/2020/8824717

Citations: 6

Academic Editor: Jian Ma

Share a link

Email
Wechat
Bluesky

Abstract

Background. With an enormous amount of research concerning kidney cancer being conducted, various treatments have been applied to its cure. However, high recurrence and metastasis rates continue to pose a threat to the survival of patients with kidney renal clear cell carcinoma (KIRC). Methods. Data from The Cancer Genome Atlas were downloaded, and a series of analyses were performed, including differential analysis, Cox analysis, weighted gene coexpression network analysis, least absolute shrinkage and selection operator analysis, multivariate Cox analysis, survival analysis, and receiver operating characteristic curve and functional enrichment analysis. Results. A total of 5,777 differentially expressed genes were identified from the differential analysis. The Cox analysis showed 1,853 significant genes (P < 0.01). Weighted gene coexpression network analysis revealed that 226 genes in the module were related to clinical parameters, including Tumor-Node-Metastasis (TNM) staging. Least absolute shrinkage and selection operator and multivariate Cox analyses suggested that four genes (CDKL2, LRFN1, STAT2, and SOWAHB) had a potential function in predicting the survival time of patients with KIRC. Survival analysis uncovered that a high risk of these four genes was associated with an unfavorable prognosis. Receiver operating characteristic curve analysis further confirmed the accuracy of the risk score model. The analysis of clinicopathological parameters of the four identified genes revealed that they were associated with the progression of KIRC. Conclusion. The gene expression model consisting of CDKL2, LRFN1, STAT2, and SOWAHB is a promising tool for predicting the prognosis of patients with KIRC. The results of this study may provide insights into the diagnosis and treatment of KIRC.

1. Introduction

Kidney cancer is one of the most prevalent types of cancer worldwide [1, 2]. Belonging to kidney cancer, kidney renal clear cell carcinoma (KIRC) is characterized by high recurrence and metastasis rates, challenging the health and quality of life of patients [3, 4]. According to statistics, following surgery, the recurrence rate of KIRC may reach 40% [5, 6]. In KIRC, cancer cells often metastasize to other organs [7–9]. In addition, diagnosis of KIRC in the early stage of disease is difficult due to its insidious symptoms. These reasons contribute to the difficulty in treating KIRC.

Bioinformatics analysis has been increasingly important in cancer research for predicting the prognosis of patients and exploring novel therapy targets. Weighted gene coexpression network analysis (WGCNA), least absolute shrinkage and selection operator (LASSO) analysis, and functional enrichment analysis are three of the most popular bioinformatics tools. For example, a recent study identified key pathways and genes in the dynamic progression of hepatocellular carcinoma based on WGCNA [10]. WGCNA may also be applied to construct competing endogenous RNA networks, which are involved in regulating cancer progression [11, 12]. LASSO analysis is often employed to screen the most crucial genes and reduce the number of genes in some models [13]. For instance, a recent study used LASSO analysis to identify prognostic long noncoding RNA signatures in bladder cancer [14]. Functional enrichment analysis is widely utilized in studies to find crucial pathways [15–17].

In this study, mainly using the aforementioned tools, we aimed to establish a model for improving the prediction of survival of patients with KIRC. With data from The Cancer Genome Atlas (TCGA), after obtaining differentially expressed genes (DEGs), Cox analysis was performed to preliminarily detect prognosis-related genes. Subsequently, WGCNA was used to set up a gene coexpression network, and LASSO analysis was employed to delete highly correlated genes, and multivariate Cox analysis was utilized to construct a survival prediction model. We found that a panel of four genes, including cyclin-dependent kinase like 2 (CDKL2), leucine-rich repeat and fibronectin type III domain-containing 1 (LRFN1), signal transducer and activator of transcription 2 (STAT2), and sosondowah ankyrin repeat domain family member B (SOWAHB), was a promising module for predicting the survival of patients with KIRC. Subsequently, functional enrichment analysis was performed to analyze the biological events regulated by this module.

2. Materials and Methods

2.1. Data Acquisition and Processing

RNA sequencing data of KIRC samples (72 normal samples and 538 tumor samples) and relevant clinical information of patients with KIRC were downloaded from TCGA (https://portal.gdc.cancer.gov/). Survival information of 530 samples was available, and the details of the patients are presented in Table 1. Data regarding disease-free survival were downloaded from cBioPortal (http://www.cbioportal.org/). In the process of constructing a risk model, 530 samples were divided into two groups using the R package caret (265 in the training and testing groups, respectively) (Table S1).

Table 1. Clinicopathological features.

Clinicopathological parameters	Frequency	Percentage
Gender
Male	346	64.43%
Female	191	35.57%
Pathologic stage
I-II	326	60.71%
III-IV	208	38.73%
Unknown	3	0.56%
T stage
T1-T2	344	64.06%
T3-T4	193	35.94%
N stage
N0	240	44.69%
N1	17	3.17%
NX	280	52.14%
M stage
M0	446	83.05%
M1	81	15.08%
MX	10	1.86%
Age
<60	247	46.00%
≥60	290	54.00%

2.2. Identification of DEGs

The function package edgeR was utilized to conduct a differential analysis. We selected ∣Log-fold change | >1 and false discovery rate < 0.05 as significant cutoff values based on the Benjamini–Hochberg method. A heat map was generated to show the expression levels of genes in normal and tumor samples.

2.3. WGCNA

WGCNA was performed to combine significant prognostic DEGs with clinical traits [18]. The function hclust was used to cluster samples and delete outliers. The soft-thresholding power was chosen based on the criterion of approximate scale-free topology after the function pickSoftThreshold was performed. According to the soft-thresholding power β, a weighted gene network with a relatively large minimum module size of 30 was constructed. The parameter mergeCutHeight was the threshold to merge of modules. Next, the modules that were significantly associated with the clinical traits were identified. Subsequently, the correlation between modules and clinical traits was determined. The associations of individual genes with clinical traits were quantified by defining gene significance (GS) as the correlation between genes and clinical traits. For each module, the quantitative measure of module membership (MM) was treated as the correlation of the module eigengene and the gene expression profile. GS and MM were highly correlated, illustrating that genes significantly associated with a trait were often also the most important (central) elements of modules related to the trait. Based on this, genes highly significantly associated with clinical traits could be identified.

2.4. Construction of a Cox Model

LASSO analysis and multivariate Cox regression analysis were conducted to construct a risk model. The 226 significant prognosis genes (blue module) identified through these analyses were ranked according to their P values. The top 30 significant prognostic genes were calculated by LASSO analysis. After deleting high correlation genes, a multivariate Cox analysis was performed. P values < 0.05 indicated statistical significance. The hazard ratio and 95% confidence interval for each variable were calculated.

2.5. Functional Enrichment Analysis

Functional enrichment analysis included Gene Ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. GO and KEGG analyses were carried out using the R package clusterProfiler. GO analysis contained biological process, cellular component, and molecular function. P < 0.05 indicated statistical significance.

2.6. Correlation between Clinical Traits and CDKL2, LRFN1, STAT2, and SOWAHB

Correlation between the four genes and clinical parameters (Tumor-Node-Metastasis (TNM) stage, pathological stage, and grade) was analyzed to further confirm the importance of the identified genes. P < 0.05 indicated statistical significance.

2.7. Survival Analysis

Online survival analysis and relapse-free survival (RFS) analysis were performed through Gene Expression Profiling Interactive Analysis (GEPIA; http://gepia.cancer-pku.cn/index.html) to recognize significant prognostic biomarkers. P < 0.05 indicated statistical significance. Moreover, survival analysis of the risk model was performed using R package survival, and receiver operating characteristic (ROC) curve was constructed based on the R package survival ROC.

3. Results

3.1. DEGs in KIRC Samples

The workflow is shown in Figure 1(a). RNA sequencing data of KIRC samples were processed by edgeR, and 5,777 DEGs (3,913 upregulation and 1,863 downregulation) were obtained. The heat map of the top 100 DEGs represents the expression level of DEGs in normal tissues and tumor tissues (Figure 1(b)). From the heat map, a significant difference in the expression levels of genes between normal tissues and tumor tissues was observed. Subsequently, univariate Cox analysis was performed, and 1,853 genes significantly related to the prognosis of patients were identified (data not shown).

Details are in the caption following the image — **Figure 1 (a)**
Open in figure viewer PowerPoint

Flowchart of this work and expression of DEGs in TCGA data. (a) KIRC: kidney renal clear cell carcinoma; GO: Gene Ontology; KEGG: Kyoto Encyclopedia of Genes and Genomes; WGCNA: weighted gene coexpression network analysis; LASSO: least absolute shrinkage and selection operator; ROC: receiver operating characteristic curve. (b) Heat map of DEGs. From green to black to red, the expression of gene increased. The blue panels represented normal samples; the red panels represented tumor samples. DEGs: differentially expressed genes.

3.2. Results of WGCNA

After performing hclust, two samples (TCGA-B0-4696-01 and TCGA-BP-4770-01) were deleted (Figure 2(a)). According to scale independence and mean connectivity, the soft-thresholding power β = 6 was considered to be the fittest value, which was responsible for high correlation and high connectivity between genes (Figure 2(b)). Consistent with the thresholding power, these DEGs were divided into eight effective gene modules, and the grey module was considered an ineffective module for preserving nonmodular genes (Figure 2(c)). Through the correlation between GS and MM, we noted the blue module, in which genes were related to TNM staging and tumor grade. As shown in Figure 2(d), blue module genes were highly connected with clinical traits. Correlation between the blue module and T was 0.31, P = 7e − 13; correlation between the blue module and N was 0.34, P = 2e − 15; correlation between the blue module and M was 0.24, P = 2e − 08; correlation between the blue module and clinical stage was 0.29, P = 9e − 12; and correlation between the blue module and tumor grade was 0.33, P = 3e − 15. In addition, Cytoscape (https://cytoscape.org/download.html) was used to construct a gene coexpression network based on blue module genes (Figure 2(e)). From the gene coexpression network, we observed that most genes exhibited a strong correlation.

3.3. Construction of the Gene Risk-Score System

To construct a risk-score system, we selected the top 30 genes from the blue module which were deemed to be the most significant genes according to their P value (Table 2). Through LASSO analysis and multivariate Cox regression analysis, a gene risk-score system was obtained using relative coefficients (Figures 3(a) and 3(b)). Subsequently, RELT TNF receptor (RELT), transmembrane protein 245 (TMEM245), receptor accessory protein 4 (REEP4), leucine-rich repeat and fibronectin type III domain-containing 1 (LRFN1), and vesicle-associated membrane protein 1 (VAMP1) were excluded, and the final risk score formula was as follows: PI = (−0.23946 × expression level of CDKL2) + (0.58372 × expression level of STAT2) + (−0.12572 × expression level of SOWAHB) + (0.25274 × expression level of LRFN1). Among these genes, CDKL2 and SOWAHB had negative coefficients in the univariate and multivariate Cox regression analyses, suggesting that upregulating their expression levels would improve the survival time of patients with KIRC. According to the risk score, we divided patients into high- and low-risk groups. In both the training and testing groups, the 5-year survival rate in the high- and low-risk groups was 40% and 80%, respectively (Figures 3(c) and 3(d)). The ROC curve analysis further confirmed the accuracy of the risk-score model, and the area under the curve was 0.78 and 0.753 in the training and testing groups, respectively. After dividing 530 patients into the high-/low-risk groups in the training and testing groups, the risk scores of the patients were negatively associated with the patients’ survival time (Figures 3(g)–3(j)). The heat map suggested that STAT2 and LRFN1 were high-risk genes, whereas CDKL2 and SOWAHB were low-risk genes (Figures 3(k) and 3(l)). These results suggested that STAT2, LRFN1, CDKL2, and SOWAHB were prognosis-related genes, and the aforementioned formula could be used to assess the risk of death in patients.

Table 2. Univariate Cox regression analysis of the top 30 genes in the blue module.

Gene	HR	HR, 95 CI (low)	HR, 95 CI (high)	P value
CHFR	1.000833824	1.000626465	1.001041226	3.18E − 15
STAT2	1.000139807	1.000104508	1.000175106	8.28E − 15
RELT	1.00168529	1.00123172	1.002139065	3.17E − 13
LRFN1	1.003620135	1.002620544	1.004620723	1.18E − 12
REEP4	1.000911177	1.000655593	1.001166827	2.75E − 12
VAMP1	1.00096664	1.000690669	1.001242688	6.53E − 12
TCIRG1	1.000176371	1.000125551	1.000227193	1.03E − 11
SOWAHB	0.998170856	0.997638284	0.998703712	1.77E − 11
C17orf62	1.000290856	1.000205244	1.000376475	2.75E − 11
IGFLR1	1.002196092	1.001540895	1.002851717	4.88E − 11
STAC3	1.002374184	1.001648701	1.003100193	1.37E − 10
FKBP11	1.000228649	1.000158225	1.000299078	1.97E − 10
HAPLN3	1.000541992	1.000373872	1.000710141	2.62E − 10
MICAL1	1.000230074	1.000158663	1.000301489	2.70E − 10
SH3BGRL3	1.000044229	1.000030246	1.000058211	5.66E − 10
CASP4	1.000282876	1.000188726	1.000377036	3.88E − 09
CDKL2	0.997021707	0.99601962	0.998024803	6.12E − 09
IFI30	1.001967645	1.001302802	1.002632928	6.46E − 09
NOD2	1.001225892	1.00081042	1.001641536	7.23E − 09
TMEM245	0.999832818	0.999775898	0.99988974	8.61E − 09
MPP5	0.999453585	0.999267007	0.999640198	9.59E − 09
IL15RA	1.000470293	1.000309107	1.000631504	1.07E − 08
FCGR1B	1.004084857	1.002680825	1.005490855	1.13E − 08
ARHGEF1	1.000150458	1.000098375	1.000202543	1.49E − 08
RNF166	1.000630491	1.00041224	1.000848789	1.49E − 08
ACADSB	0.99962285	0.999491681	0.999754037	1.76E − 08
UNC13D	1.000234229	1.000152484	1.00031598	1.95E − 08
PPP1R18	1.000179058	1.000116469	1.00024165	2.05E − 08
MYO6	0.999791047	0.99971764	0.999864459	2.43E − 08
RHBDF2	1.000227139	1.000147305	1.00030698	2.45E − 08

Abbreviations: HR: hazard ratio; CI: confidence interval.

3.4. Correlations between the Four Genes and Clinical Traits

The associations between genes and clinical traits (T stage, N stage, M status, clinical stage, tumor grade, etc.) were analyzed to further clarify the clinical importance of the four identified genes. The results showed that CDKL2 and SOWAHB had lower expression levels, while LRFN1 and STAT2 had higher expression levels in T3/4 tumors versus T1/2 tumors (Figure 4(a)), tumors with lymphatic metastasis (Figure 4(b)), tumors with distant metastasis (Figure 4(c)), and stage III/IV tumors versus stage I/II tumors (Figure 4(d)). In addition, we found that the risk model was also related to TNM, staging, and survival status (Figure 4(e)). To verify these results, we performed correlation analysis using an online website (https://mexpress.be/index.html). The findings showed that these four genes were related to numerous clinical traits (Figure S1). These results suggested that STAT2, LRFN1, CDKL2, and SOWAHB may have an impact on the progression, invasion, and metastasis of KIRC. Additionally, online overall survival (OS) and RFS analyses were performed to further confirm the prognostic value of these genes. The results indicated that high expression levels of STAT2 and LRFN1 were associated with poor prognosis of patients with KIRC; in contrast, high expression levels of CDKL2 and SOWAHB indicated favorable prognosis among patients with KIRC (Figures 5(a)–5(i)). Additionally, the risk model showed that RFS of patients with high risk was shorter compared with that in the low-risk group, in both the training and testing groups (Figures 5(m) and 5(n)).

3.5. GO and KEGG Pathway Analysis

We performed functional enrichment analysis with the R package clusterProfiler to investigate the function and pathway potentially regulated by the genes in the blue module. These genes were mainly enriched in the following: biological process (including T cell activation, regulation of T cell activation, regulation of lymphocyte activation, response to virus, and regulation of cell-cell adhesion (Figure 6(a)); cellular component (including actin cytoskeleton, endocytic vesicle, secretory granule membrane, phagocytic vesicle, and ficolin-1-rich granule membrane (Figure 6(b)); and molecular function, (including actin binding, GTPase regulator activity, nucleoside-triphosphatase regulator activity, GTPase activator activity, and G protein-coupled receptor binding (Figure 6(c))). The pathways potentially regulated by these genes were related to the NOD-like receptor signaling pathway, cytokine-cytokine receptor interaction, osteoclast differentiation, viral protein interaction with cytokine and cytokine receptor, JAK-STAT signaling pathway, T helper 17 cell differentiation, etc. (Figure 6(d)). These results suggested that genes in the blue module were involved in regulating the progression of KIRC via these pathways.

4. Discussion

Comprehensive analysis of the gene expression signature in cancer tissues is of great significance in cancer research. It benefits the diagnosis of cancer and provides novel therapeutic targets for its treatment. In the field of KIRC research, depending on the open source gene expression profile data, several pilot studies apply bioinformatics analysis to construct prognosis prediction models or screen hub genes in cancer progression. For instance, using WGCNA and a protein-protein interaction network, a recent study analyzed the gene expression pattern of 26 pairs of tumor tissues/adjacent tissues and identified four hub genes involved in the progression of KIRC, including AGXT, PTGER3, SLC12A3, and ALOX5 [19]. Another study used LASSO and best subset regression to detect prognostic genes in KIRC from TCGA data, including PADI1, ATP6V0D2, DPP6, C9orf135, and PLG [20]. Furthermore, through TCGA data, another research group screened key splicing factors regulating the alternative splicing events during the tumorigenesis of KIRC, which helps elucidate the mechanism of KIRC progression [21]. High-throughput technologies, such as gene expression chip and RNA sequencing, provide a considerable amount of data to researchers. Optimization of the workflow and combination of multiple analysis methods will provide novel clues. This study presented a novel model for predicting the prognosis of patients with KIRC. Functional enrichment analysis suggested that the genes involved in this model were crucial modulators in the progression of KIRC. Additionally, this model only consisted of four genes, concise and precise, showing a preferable application perspective.

Prognosis factors are important indicators of disease treatment [22, 23]. Cox regression analysis is an effective tool to find out prognosis factors [24]. Cox regression analysis includes univariate Cox analysis and multivariate Cox analysis [25]. Moreover, univariate Cox analysis is usually used to screen potential prognosis factors, and multivariate Cox analysis is frequently applied to construct prognosis models [22, 26–30]. In our study, a risk model was constructed based on Cox regression analysis, which benefits the prognostic evaluation and personalized medicine for patients with KIRC. In our risk model, among the four identified genes, CDKL2 and SOWAHB were protective factors for patients with KIRC, whereas LRFN1 and STAT2 were risk factors for these patients. Belonging to the STAT family, STAT2 is a well-characterized oncogene and a crucial component of the interferon- (IFN-) α/β/γ signaling pathway. Together with STAT1 and IRF9, STAT2 forms the IFN-stimulated gene factor 3 (ISGF3) complex and translocates into the nuclei to trigger the transcription of target genes after activation [31]. Moreover, STAT2 is highly expressed or abnormally activated in multiple types of cancer and promotes malignant biological behaviors, including the proliferation, migration, invasion, and epithelial-to-mesenchymal transition of cancer cells [31–33]. However, the expression characteristics, biological function, and underlying mechanism of STAT2 in KIRC have not been systemically investigated. Interestingly, a recent study reported that the IFN-γ signaling pathway is significantly activated in renal cancer patients with metastatic disease [34]. The results of that study suggested that STAT2 may participate in the progression of KIRC, which is consistent with our finding in the present study. It is worth investigating the role of STAT2 and STAT2-related pathways in the progression of KIRC in future studies.

The function of CDKL2 in different types of cancer is distinct. It functions as an oncogene in breast cancer to facilitate the process of epithelial-to-mesenchymal transition, inducing the expression of zinc finger E-box-binding homeobox 1 (ZEB1) and promoting the conversion of CD24^high cells to CD44^high cancer cells [35]. However, in gliomas, hepatocellular carcinoma, and gastric cancer, its underexpression or hypermethylation of its promoter indicates poor prognosis of patients; additionally, overexpression of CDKL2 in gastric cancer cells suppresses the growth and invasion of cancer cells [36–38]. These results suggest that CDKL2 is a tumor suppressor in these types of cancer. Previously, there was no report on the role of CDKL2 in KIRC. Herein, our data implied that downregulation of CDKL2 in KIRC tissues indicated poor prognosis of patients. It is possible that CDKL2 functions as a tumor suppressor in KIRC; however, this hypothesis requires further investigation through in vitro and in vivo studies.

Importantly, the model utilized in the present study identified two rarely investigated genes, namely, LRFN1 and SOWAHB. A genome-wide association study suggested that SOWAHB was associated with the susceptibility of chronic obstructive pulmonary disease [39]. The biological function of SOWAHB in cancer biology remains obscure. A previous study showed that LRFN1 belongs to the SALM/LRFN family and is a neuronal component in the developing of mature vertebrate nervous system [40, 41]. Our data suggest that LRFN1 and SOWAHB are potential regulators in KIRC progression. Therefore, it is desirable to investigate their biological functions in the following studies.

5. Conclusion

In summary, we utilized a comprehensive analysis to construct a novel risk-score model for KIRC, by which we can predict the prognosis of patients. Our results provide potential biomarkers and therapeutic targets, which may be beneficial for the diagnosis and treatment of KIRC.

Conflicts of Interest

The authors declare that they have no competing interest.

Authors’ Contributions

RBL, DQS, and QL conceived and designed the study; RBL, FXZ, WQ, XK, TJP, LJ, SWZ, and CXY conducted the data selection management; RBL and DQS performed the bioinformatics analyses and statistical analyses and interpreted the results; RBL, DZW, WM, and DQS drafted the manuscript; DQS obtained financial support; RBL and CZY reviewed the manuscript; DQS approved the final version of the manuscript.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (Approval No. 81703030; supervisor: Ding Qianshan).

Open Research

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Supporting Information

References

1 Ferlay J., Shin H. R., Bray F., Forman D., Mathers C., and Parkin D. M., Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008, International Journal of Cancer. (2010) 127, no. 12, 2893–2917, https://doi.org/10.1002/ijc.25516, 2-s2.0-78049485263, 21351269.
10.1002/ijc.25516
CAS PubMed Web of Science® Google Scholar
2 Song J., Peng J., Zhu C., Bai G., Liu Y., Zhu J., and Liu J., Identification and validation of two novel prognostic lncRNAs in kidney renal clear cell carcinoma, Cellular Physiology and Biochemistry. (2018) 48, no. 6, 2549–2562, https://doi.org/10.1159/000492699, 2-s2.0-85052785908.
10.1159/000492699
CAS PubMed Google Scholar
3 Tahbaz R., Schmid M., and Merseburger A. S., Prevention of kidney cancer incidence and recurrence, Current Opinion in Urology. (2018) 28, no. 1, 62–79, https://doi.org/10.1097/MOU.0000000000000454, 2-s2.0-85036552170.
10.1097/MOU.0000000000000454
PubMed Web of Science® Google Scholar
4 Siegel R. L., Miller K. D., and Jemal A., Cancer statistics, 2017, CA: a Cancer Journal for Clinicians. (2017) 67, no. 1, 7–30, https://doi.org/10.3322/caac.21387, 2-s2.0-85008220577, 28055103.
10.3322/caac.21387
PubMed Web of Science® Google Scholar
5 Janowitz T., Welsh S. J., Zaki K., Mulders P., and Eisen T., Adjuvant therapy in renal cell carcinoma-past, present, and future, Seminars in Oncology. (2013) 40, no. 4, 482–491, https://doi.org/10.1053/j.seminoncol.2013.05.004, 2-s2.0-84883177229, 23972712.
10.1053/j.seminoncol.2013.05.004
CAS PubMed Web of Science® Google Scholar
6 Ravaud A., Motzer R. J., Pandha H. S., George D. J., Pantuck A. J., Patel A., Chang Y. H., Escudier B., Donskov F., Magheli A., Carteni G., Laguerre B., Tomczak P., Breza J., Gerletti P., Lechuga M., Lin X., Martini J. F., Ramaswamy K., Casey M., Staehler M., Patard J. J., and S-TRAC Investigators, Adjuvant sunitinib in high-risk renal-cell carcinoma after nephrectomy, The New England Journal of Medicine. (2016) 375, no. 23, 2246–2254, https://doi.org/10.1056/NEJMoa1611406, 2-s2.0-85003550836, 27718781.
10.1056/NEJMoa1611406
CAS PubMed Web of Science® Google Scholar
7 Xu Y., Hou R., Lu Q., Deng Y., and Hu B., Renal clear cell carcinoma metastasis to the breast ten years after nephrectomy: a case report and literature review, Diagnostic Pathology. (2017) 12, no. 1, https://doi.org/10.1186/s13000-017-0666-8, 2-s2.0-85032732245, 29096639.
10.1186/s13000-017-0666-8
PubMed Web of Science® Google Scholar
8 Matsumoto K., Hayakawa N., Nakamura S., and Oya M., Bladder metastasis from renal cell carcinoma: retrospective analysis of 65 reported cases, Clinical & Experimental Metastasis. (2015) 32, no. 2, 135–141, https://doi.org/10.1007/s10585-015-9698-1, 2-s2.0-84925501216, 25630270.
10.1007/s10585-015-9698-1
CAS PubMed Web of Science® Google Scholar
9 Kawakami H., Kuwatani M., Yamato H., Shinada K., Hirano S., Kondo S., Yonemori A., Matsuno Y., and Asaka M., Pancreatic metastasis from renal cell carcinoma with intraportal tumor thrombus, Internal Medicine. (2008) 47, no. 22, 1967–1970, https://doi.org/10.2169/internalmedicine.47.1418, 2-s2.0-59849102433, 19015609.
10.2169/internalmedicine.47.1418
PubMed Web of Science® Google Scholar
10 Yin L., Cai Z., Zhu B., and Xu C., Identification of key pathways and genes in the dynamic progression of HCC based on WGCNA, Genes. (2018) 9, no. 2, https://doi.org/10.3390/genes9020092, 2-s2.0-85042070021, 29443924.
10.3390/genes9020092
PubMed Web of Science® Google Scholar
11 Wang J. D., Zhou H. S., Tu X. X., He Y., Liu Q. F., Liu Q., and Long Z. J., Prediction of competing endogenous RNA coexpression network as prognostic markers in AML, Aging (Albany NY). (2019) 11, no. 10, 3333–3347, https://doi.org/10.18632/aging.101985, 2-s2.0-85066788404, 31164492.
10.18632/aging.101985
CAS PubMed Google Scholar
12 Ren Z. H., Shang G. P., Wu K., Hu C. Y., and Ji T., WGCNA co-expression network analysis reveals ILF3-AS1 functions as a CeRNA to regulate PTBP1 expression by sponging miR-29a in gastric cancer, Frontiers in Genetics. (2020) 11, https://doi.org/10.3389/fgene.2020.00039.
10.3389/fgene.2020.00039
Web of Science® Google Scholar
13 Gao H., Wu Y., Li J., Li H., Li J., and Yang R., Forward LASSO analysis for high-order interactions in genome-wide association study, Briefings in Bioinformatics. (2014) 15, no. 4, 552–561, https://doi.org/10.1093/bib/bbt037, 2-s2.0-84904756276, 23775311.
10.1093/bib/bbt037
PubMed Web of Science® Google Scholar
14 He A., He S., Peng D., Zhan Y., Li Y., Chen Z., Gong Y., Li X., and Zhou L., Prognostic value of long non-coding RNA signatures in bladder cancer, Aging (Albany NY). (2019) 11, no. 16, 6237–6251, https://doi.org/10.18632/aging.102185, 2-s2.0-85071788833.
10.18632/aging.102185
CAS PubMed Google Scholar
15 Qu J., Huang C., and Zhang J., Genome-wide functional analysis of SSR for an edible mushroom Pleurotus ostreatus, Gene. (2016) 575, 2 Part 2, 524–530, https://doi.org/10.1016/j.gene.2015.09.027, 2-s2.0-84950141649, 26386282.
10.1016/j.gene.2015.09.027
PubMed Web of Science® Google Scholar
16 Gusev Y., Computational methods for analysis of cellular functions and pathways collectively targeted by differentially expressed microRNA, Methods. (2008) 44, no. 1, 61–72, https://doi.org/10.1016/j.ymeth.2007.10.005, 2-s2.0-37349042181, 18158134.
10.1016/j.ymeth.2007.10.005
CAS PubMed Web of Science® Google Scholar
17 Chen L., Zhang Y. H., Wang S. P., Zhang Y. H., Huang T., and Cai Y. D., Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways, PLoS One. (2017) 12, no. 9, article e0184129, https://doi.org/10.1371/journal.pone.0184129, 2-s2.0-85028940999, 28873455.
10.1371/journal.pone.0184129
PubMed Web of Science® Google Scholar
18 Langfelder P. and Horvath S., WGCNA: an R package for weighted correlation network analysis, BMC Bioinformatics. (2008) 9, no. 1, https://doi.org/10.1186/1471-2105-9-559, 2-s2.0-60549111634.
10.1186/1471-2105-9-559
PubMed Web of Science® Google Scholar
19 Cui H., Shan H., Miao M. Z., Jiang Z., Meng Y., Chen R., Zhang L., and Liu Y., Identification of the key genes and pathways involved in the tumorigenesis and prognosis of kidney renal clear cell carcinoma, Scientific Reports. (2020) 10, no. 1, https://doi.org/10.1038/s41598-020-61162-4, 32144299.
10.1038/s41598-020-61162-4
PubMed Web of Science® Google Scholar
20 Zhang Z., Lin E., Zhuang H., Xie L., Feng X., Liu J., and Yu Y., Construction of a novel gene-based model for prognosis prediction of clear cell renal cell carcinoma, Cancer Cell International. (2020) 20, no. 1, https://doi.org/10.1186/s12935-020-1113-6.
10.1186/s12935-020-1113-6
Web of Science® Google Scholar
21 Chen T., Zheng W., Chen J., Lin S., Zou Z., Li X., and Tan Z., Systematic analysis of survival-associated alternative splicing signatures in clear cell renal cell carcinoma, Journal of Cellular Biochemistry. (2020) 121, no. 10, 4074–4084, https://doi.org/10.1002/jcb.29590, 31886566.
10.1002/jcb.29590
CAS PubMed Web of Science® Google Scholar
22 Boberg K. M., Rocca G., Egeland T., Bergquist A., Broomé U., Caballeria L., Chapman R., Hultcrantz R., Mitchell S., Pares A., Rosina F., and Schrumpf E., Time-dependent Cox regression model is superior in prediction of prognosis in primary sclerosing cholangitis, Hepatology. (2002) 35, no. 3, 652–657, https://doi.org/10.1053/jhep.2002.31872, 2-s2.0-18244388747, 11870380.
10.1053/jhep.2002.31872
CAS PubMed Web of Science® Google Scholar
23 Johnston M. E., Langton K. B., Haynes R. B., and Mathieu A., Effects of computer-based clinical decision support systems on clinician performance and patient outcome. A critical appraisal of research, Annals of Internal Medicine. (1994) 120, no. 2, 135–142, https://doi.org/10.7326/0003-4819-120-2-199401150-00007, 2-s2.0-0028156822, 8256973.
10.7326/0003-4819-120-2-199401150-00007
CAS PubMed Web of Science® Google Scholar
24 Allgulander C. and Fisher L. D., Survival analysis (or time to an event analysis), and the Cox regression model--methods for longitudinal psychiatric research, Acta Psychiatrica Scandinavica. (1986) 74, no. 6, 529–535, https://doi.org/10.1111/j.1600-0447.1986.tb06279.x, 2-s2.0-0022901103, 3548221.
10.1111/j.1600-0447.1986.tb06279.x
CAS PubMed Web of Science® Google Scholar
25 Shen Y., Peng X., and Shen C., Identification and validation of immune-related lncRNA prognostic signature for breast cancer, Genomics. (2020) 112, no. 3, 2640–2646, https://doi.org/10.1016/j.ygeno.2020.02.015, 32087243.
10.1016/j.ygeno.2020.02.015
CAS PubMed Web of Science® Google Scholar
26 Meng L., He X., Zhang X. et al., Predicting the clinical outcome of melanoma using an immune-related gene pairs signature, PLoS One. (2020) 15, no. 10, https://doi.org/10.1186/s12935-020-1113-6.
10.1371/journal.pone.0240331
Web of Science® Google Scholar
27 Liu G. M., Zeng H. D., Zhang C. Y., and Xu J. W., Identification of a six-gene signature predicting overall survival for hepatocellular carcinoma, Cancer Cell International. (2019) 19, no. 1, https://doi.org/10.1186/s12935-019-0858-2, 2-s2.0-85067128774.
10.1186/s12935-019-0858-2
Web of Science® Google Scholar
28 Jiang Y., Zhang Q., Hu Y., Li T., Yu J., Zhao L., Ye G., Deng H., Mou T., Cai S., Zhou Z., Liu H., Chen G., Li G., and Qi X., ImmunoScore signature, Annals of Surgery. (2018) 267, no. 3, 504–513, https://doi.org/10.1097/SLA.0000000000002116, 2-s2.0-85007174711, 28002059.
10.1097/SLA.0000000000002116
PubMed Web of Science® Google Scholar
29 Jiang Y., Chen C., Xie J., Wang W., Zha X., Lv W., Chen H., Hu Y., Li T., Yu J., Zhou Z., Xu Y., and Li G., Radiomics signature of computed tomography imaging for prediction of survival and chemotherapeutic benefits in gastric cancer, eBioMedicine. (2018) 36, 171–182, https://doi.org/10.1016/j.ebiom.2018.09.007, 2-s2.0-85053178687, 30224313.
10.1016/j.ebiom.2018.09.007
PubMed Web of Science® Google Scholar
30 Zeng D., Zhou R., Yu Y., Luo Y., Zhang J., Sun H., Bin J., Liao Y., Rao J., Zhang Y., and Liao W., Gene expression profiles for a prognostic immunoscore in gastric cancer, The British Journal of Surgery. (2018) 105, no. 10, 1338–1348, https://doi.org/10.1002/bjs.10871, 2-s2.0-85051184934, 29691839.
10.1002/bjs.10871
CAS PubMed Web of Science® Google Scholar
31 Lee C. J., An H. J., Kim S. M., Yoo S. M., Park J., Lee G. E., Kim W. Y., Kim D. J., Kang H. C., Lee J. Y., Lee H. S., Cho S. J., and Cho Y. Y., FBXW7-mediated stability regulation of signal transducer and activator of transcription 2 in melanoma formation, Proceedings of the National Academy of Sciences of the United States of America. (2020) 117, no. 1, 584–594, https://doi.org/10.1073/pnas.1909879116, 31843895.
10.1073/pnas.1909879116
CAS PubMed Web of Science® Google Scholar
32 Liu X., Chen J., and Zhang J., AdipoR1-mediated miR-3908 inhibits glioblastoma tumorigenicity through downregulation of STAT2 associated with the AMPK/SIRT1 pathway, Oncology Reports. (2017) 37, no. 6, 3387–3396, https://doi.org/10.3892/or.2017.5589, 2-s2.0-85019609998.
10.3892/or.2017.5589
CAS PubMed Web of Science® Google Scholar
33 Ogony J., Choi H. J., Lui A., Cristofanilli M., and Lewis-Wambi J., Interferon-induced transmembrane protein 1 (IFITM1) overexpression enhances the aggressive phenotype of SUM149 inflammatory breast cancer cells in a signal transducer and activator of transcription 2 (STAT2)-dependent manner, Breast Cancer Research. (2016) 18, no. 1, https://doi.org/10.1186/s13058-016-0683-7, 2-s2.0-84958529629, 26897526.
10.1186/s13058-016-0683-7
PubMed Web of Science® Google Scholar
34 Lo U. G., Bao J., Cen J., Yeh H. C., Luo J., Tan W., and Hsieh J. T., Interferon-induced IFIT5 promotes epithelial-to-mesenchymal transition leading to renal cancer invasion, American journal of clinical and experimental urology. (2019) 7, no. 1, 31–45, 30906803.
PubMed Web of Science® Google Scholar
35 Li L., Liu C., Amato R. J., Chang J. T., du G., and Li W., CDKL2 promotes epithelial-mesenchymal transition and breast cancer progression, Oncotarget. (2014) 5, no. 21, 10840–10853, https://doi.org/10.18632/oncotarget.2535, 2-s2.0-84916910437, 25333262.
10.18632/oncotarget.2535
PubMed Google Scholar
36 Yi R., Yang S., Liao Y., Hu Z., Long H., Zeng Y., Wang X., Qiu C., Xu A., Lin J., and Wu Z., Decreased CDKL2 expression is correlated with the progression and poor prognosis of glioma, Pathology, Research and Practice. (2020) 216, no. 5, https://doi.org/10.1016/j.prp.2020.152920.
10.1016/j.prp.2020.152920
Web of Science® Google Scholar
37 Zhou Y., Qiu X. P., Li Z. H., Zhang S., Rong Y., Yang G. H., and Fang-Zheng, Clinical significance of aberrant cyclin-dependent kinase-like 2 methylation in hepatocellular carcinoma, Gene. (2019) 683, 35–40, https://doi.org/10.1016/j.gene.2018.10.009, 2-s2.0-85054818220.
10.1016/j.gene.2018.10.009
CAS PubMed Web of Science® Google Scholar
38 Fang C. L., Uen Y. H., Chen H. K., Hseu Y. C., Lin C. C., Hung S. T., Sun D. P., and Lin K. Y., Loss of cyclin-dependent kinase-like 2 predicts poor prognosis in gastric cancer, and its overexpression suppresses cells growth and invasion, Cancer Medicine. (2018) 7, no. 7, 2993–3002, https://doi.org/10.1002/cam4.1577, 2-s2.0-85047537438.
10.1002/cam4.1577
CAS PubMed Web of Science® Google Scholar
39 Boueiz A., Lutz S. M., Cho M. H., Hersh C. P., Bowler R. P., Washko G. R., Halper-Stromberg E., Bakke P., Gulsvik A., Laird N. M., and Beaty T. H., Genome-wide association study of the genetic determinants of emphysema distribution, American Journal of Respiratory and Critical Care Medicine. (2017) 195, no. 6, 757–771, https://doi.org/10.1164/rccm.201605-0997OC, 2-s2.0-85015965783, 27669027.
10.1164/rccm.201605-0997OC
CAS PubMed Web of Science® Google Scholar
40 Morimura N., Inoue T., Katayama K. I., and Aruga J., Comparative analysis of structure, expression and PSD95-binding capacity of Lrfn, a novel family of neuronal transmembrane proteins, Gene. (2006) 380, no. 2, 72–83, https://doi.org/10.1016/j.gene.2006.05.014, 2-s2.0-33748418347, 16828986.
10.1016/j.gene.2006.05.014
CAS PubMed Web of Science® Google Scholar
41 Nam J., Mah W., and Kim E., The SALM/Lrfn family of leucine-rich repeat-containing cell adhesion molecules, Seminars in Cell & Developmental Biology. (2011) 22, no. 5, 492–498, https://doi.org/10.1016/j.semcdb.2011.06.005, 2-s2.0-80053561794, 21736948.
10.1016/j.semcdb.2011.06.005
CAS PubMed Web of Science® Google Scholar

Citing Literature

All articles

Filename	Description
dim8824717-sup-0001-f1.zipapplication/x-compressed, 2.9 MB	Supplementary 1 Supplementary Figure S1 Association between 4 genes, including STAT2, LRFN1, CDKL2, and SOWAHB, and various clinical features in the KIRC cohort from TCGA. Statistics P ≥ 0.05, ^∗P < 0.05, ^∗∗P < 0.01, and ^∗∗∗P < 0.001; r representing coefficient index when analyzing continuous parameters. KIRC: kidney renal clear cell carcinoma; TCGA: The Cancer Genome Atlas.
dim8824717-sup-0002-f2.docxWord 2007 document , 19.7 KB	Supplementary 2 Supplementary Table S1: sample information of training and testing groups from 530 KIRC samples. KIRC: kidney renal clear cell carcinoma.

Identification of a Set of Genes Improving Survival Prediction in Kidney Renal Clear Cell Carcinoma through Integrative Reanalysis of Transcriptomic Data

Abstract

1. Introduction