RESEARCH ARTICLE

Open Access

Gene signatures predict biochemical recurrence-free survival in primary prostate cancer patients after radical therapy

Qiang Su

orcid.org/0000-0003-3230-2882

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Clinical Laboratory Medicine, Beijing Shijitan Hospital, Capital Medical University, Beijing, China

Beijing Key Laboratory of Urinary Cellular Molecular Diagnostics, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Zhenyu Liu,

Zhenyu Liu

CAS Key Laboratory of Molecular Imaging, Beijing Key Laboratory of Molecular Imaging, the State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China

CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing, China

School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author

Chi Chen,

Chi Chen

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Han Gao,

Han Gao

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Yongbei Zhu,

Yongbei Zhu

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Liusu Wang,

Liusu Wang

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Meiqing Pan,

Meiqing Pan

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Jiangang Liu,

Jiangang Liu

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Xin Yang,

Xin Yang

School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author

Jie Tian,

Corresponding Author

Jie Tian

[email protected]

orcid.org/0000-0003-0498-0432

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing, China

Engineering Research Center of Molecular and Neuro Imaging of Ministry of Education, School of Life Science and Technology, Xidian University, Xi’an, Shaanxi, China

Correspondence

Jie Tian, Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing 100191, China.

Email: [email protected]

Search for more papers by this author

Qiang Su,

Qiang Su

orcid.org/0000-0003-3230-2882

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Clinical Laboratory Medicine, Beijing Shijitan Hospital, Capital Medical University, Beijing, China

Beijing Key Laboratory of Urinary Cellular Molecular Diagnostics, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Zhenyu Liu,

Zhenyu Liu

CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing, China

School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author

Chi Chen,

Chi Chen

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Han Gao,

Han Gao

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Yongbei Zhu,

Yongbei Zhu

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Liusu Wang,

Liusu Wang

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Meiqing Pan,

Meiqing Pan

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Jiangang Liu,

Jiangang Liu

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

Search for more papers by this author

Xin Yang,

Xin Yang

School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author

Jie Tian,

Corresponding Author

Jie Tian

[email protected]

orcid.org/0000-0003-0498-0432

Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing, China

Key Laboratory of Big Data-Based Precision Medicine (Beihang University), Ministry of Industry and Information Technology, Beijing, China

CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing, China

Engineering Research Center of Molecular and Neuro Imaging of Ministry of Education, School of Life Science and Technology, Xidian University, Xi’an, Shaanxi, China

Correspondence

Jie Tian, Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Medicine and Engineering, Beihang University, Beijing 100191, China.

Email: [email protected]

Search for more papers by this author

First published: 28 August 2021

https://doi.org/10.1002/cam4.4092

Citations: 4

Funding information

Ministry of Science and Technology of China, Grant/Award Numbers: 2017YFA0205200; Ministry of Education of China, Grant/Award Numbers: 201902075003; National Natural Science Foundation of China, Grant/Award Numbers: 81922040, 81930053, and 92059103; Beijing Natural Science Foundation, Grant/Award Numbers: Z200027; Strategic Priority Research Program of Chinese Academy of Sciences, Grant/Award Numbers: XDB32030200, XDB01030200; Chinese Academy of Sciences, Grant/Award Numbers: QYZDJ-SSW-JSC005; The Youth Innovation Promotion Association CAS, Grant/Award Numbers: 2019136; The Youth Fund of Beijing Shijitan Hospital, Grant/Award Numbers: 2020-q06.

Share a link

Email
Wechat
Bluesky

Abstract

Background

This study evaluated the predictive value of gene signatures for biochemical recurrence (BCR) in primary prostate cancer (PCa) patients.

Methods

Clinical features and gene expression profiles of PCa patients were attained from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) datasets, which were further classified into a training set (n = 419), a validation set (n = 403). The least absolute shrinkage and selection operator Cox (LASSO-Cox) method was used to select discriminative gene signatures in training set for biochemical recurrence-free survival (BCRFS). Selected gene signatures established a risk score system. Univariate and multivariate analyses of prognostic factors about BCRFS were performed using the Cox proportional hazards regression models. A nomogram based on multivariate analysis was plotted to facilitate clinical application. Kyoto Encyclopedia of Gene and Genomes (KEGG) and Gene Ontology (GO) analyses were then executed for differentially expressed genes (DEGs).

Results

Notably, the risk score could significantly identify BCRFS by time-dependent receiver operating characteristic (t-ROC) curves in the training set (3-year area under the curve (AUC) = 0.820, 5-year AUC = 0.809) and the validation set (3-year AUC = 0.723, 5-year AUC = 0.733).

Conclusions

Clinically, the nomogram model, which incorporates Gleason score and the risk score, could effectively predict BCRFS and potentially be utilized as a useful tool for the screening of BCRFS in PCa.

1 INTRODUCTION

The second most common male malignancy is prostate cancer (PCa) in the world.¹ In 2020, estimated new cases and deaths of PCa in the United States will account for 21% and 10%, respectively.² Primary PCa is usually managed with radical prostatectomy (RP) or radical radiotherapy (RT).³ Unfortunately, 30%–50% of patients with RT and 20%–40% of patients with RP will develop BCR within ten years.^{4, 5} BCR is defined as two consecutive rising prostate-specific antigen (PSA) values >0.2 ng/ml following RP or >2 ng/ml higher than the PSA nadir value following RT.⁶ It is well known that BCR contributes to distant metastasis. Generally, 24%–34% of men with BCR will progress to metastasis,^{7, 8} who should be carefully monitored and endured salvage therapy. Most earlier studies have focused exclusively on the outcomes of PCa following RP or RT. Accordingly, more accurate rapid methods are eagerly needed to identify BCR of primary PCa patients after radical therapy (including RP and RT).

In recent years, notable improvement has been made in precision oncology that applies molecular and medical imaging information to improve the diagnosis and therapy of urological malignancies.^{9, 10} In particular, molecular information has outstanding interpretability and discriminative power. For example, gene signatures exhibit an excellent discrimination power for BCR.^11-13 Accumulating evidence has suggested gene deregulation related to the prognosis of PCa, such as CRTC2, MYC, and PTEN.^14-18 However, gene signatures in the early identification of patients at high-risk BCR of primary PCa after radical therapy have rarely been reported. Therefore, it is necessary to decipher gene signatures together with underlying molecular mechanisms predicting BCRFS based on genomic information from different platforms.

The current study applied four microarray datasets, which were obtained from GEO and TCGA. Three GEOs were merged as a training set and one dataset from TCGA as a validation set. Afterward, LASSO-Cox was applied to identify prognostic gene signatures to predict BCRFS and to establish a risk score. Accordingly, the prognostic value of the risk score in both sets was verified. Then, a nomogram was built up to estimate BCRFS time. Finally, GO and KEGG on gene signatures were performed to explore molecular mechanisms and crucial genes.

2 MATERIALS AND METHODS

2.1 Data preprocessing

In this study, eligible datasets were selected based on the following inclusion criteria: (a) the dataset must include patients with primary prostate cancer (PCa) following radical therapy and (b) patients with clear clinical and pathological information (i.e., gene expression values, Gleason score, BCR event, time to BCR, total follow-up time). Exclusion criteria were as follows: (a) datasets with a small sample size (n < 50) and (b) datasets without complete data for analysis. Gene expression and complete clinical data from 822 (419 samples from GEO and 403 from TCGA) PCa samples that met the inclusion and exclusion criteria were downloaded from the three GEO (GSE70768, GSE70769, GSE116918) and TCGA datasets, serving as the training and validation sets, respectively. The main characteristics of the datasets are shown in Table 1. To ensure data integrity for each indicator, incomplete raw information (i.e., age in the training set) was excluded for further COX analysis. In addition to Gleason score and follow-up BCR information, the training set included preoperative PSA, clinical T (cT) stage, and radical therapy (RP = 196, RT = 223); meanwhile, the validation set included radical therapy (RP = 403). The characteristics of patients with prostate cancer in the training set and validation set are shown in Table 2. We have made subgroup analyses for every variable in the training set, and the results are shown in Figure S1. Three GEO datasets were merged and applied function “Normalize Between Array” from the R package “limma” for standardization.

TABLE 1. Characteristics of the included datasets

Dataset	Country	Number of samples	GPL	Number of genes
GSE70768	United Kingdom	110T	GPL10558	48,107
GSE70769	United Kingdom	86T	GPL10558	48,107
GSE116918	United Kingdom	223T	GPL 25318	121,563
TCGA	N/A	403T	N/A	5,6754

Abbreviations: GPL, Gene Expression Omnibus Platform; GSE, Gene Expression Omnibus Series; N/A, not applicable; T, tumor samples; TCGA, The Cancer Genome Atlas.

TABLE 2. The characteristics of patients with prostate cancer in the training set and validation set

Characteristics	Training set (n = 419)	Validation set (n = 403)
cT stage n (%)
T1	151 (36.0)	150 (37.2)
T2	147 (35.1)	140 (34.7)
T3	117 (27.9)	45 (11.2)
T4	4 (1)	1 (0.3)
Unknow	0 (0)	67 (16.6)
Gleason n (%)
5	2 (0.5)	0 (0)
6	72 (17.2)	37 (9.2)
7	227 (54.2)	198 (49.1)
8	60 (14.3)	56 (13.9)
9	57 (13.6)	109 (27.0)
10	1 (0.2)	3 (0.8)
Biochemical recurrence n (%)
Yes	93 (22.2)	52 (12.9)
No	326 (77.8)	351 (87.1)
Follow-up time (months, mean ± SD)	45.61±19.49	28.53±17.70

Abbreviations: cT, clinical tumor; SD, standard deviation.

2.2 Identification of gene signatures

The batch influence was adjusted for GEO and TCGA by R package “sva.” Gene expression profiling was merged with clinical information for analyses. To select gene signatures with predictive value, LASSO-Cox regression was applied using the R package “glmnet.”^{19, 20} The risk score was founded by weighting individual normalized expression value of gene signature and LASSO coefficient.

2.3 Validation of gene signatures

According to the median value of risk score in the training set, both training and external validation sets were classified into high-risk and low-risk groups. Kaplan–Meier (K–M) survival curves were drawn by R packages “survival” and “survminer.” Then, logarithmic rank (log-rank) tests were performed to compare differences in BCRFS time between the high- and low-risk groups. To visualize BCRFS differentially, a heatmap was constructed using the R package “pheatmap.” Multivariate and univariate Cox regression models were established using the R package “survival.” R package “timeROC” was applied to build a t-ROC curve, which was used to assess the predictive accuracy of the risk score system for BCRFS. Afterward, based on the results of multivariate models, a nomogram was depicted using the R package “rms.” To assess the performance of the nomogram, calibration plots and C-index were used in both training and validation sets.

2.4 Bioinformatical analysis

To estimate the potential functions of DEGs in low-risk versus (vs.) high-risk groups, the KEGG pathway and GO annotation were performed using the R package “clusterProfiler.”²¹ Briefly, GO and KEGG annotation sets were derived from the R package “org. Hs.eg.db.” GO reveals the catalogs of biological process (BP), cellular component (CC), and molecular function (MF). All visualizations were produced using R packages “ggplot2” and “GOplot.” After multiple-test correction, KEGG pathways and GO terms with corrected P (P.adjust) value <0.05 were considered to be prominently enriched in DEGs.

2.5 Statistical analysis

Data analysis was implemented using the R program (version 3.6.3, https://www.r-project.org) with the following libraries: base-package, survival-package, glmnet-package, survminer-package, timeROC-package, limma-package, rms-package, and clusterProfiler-package. Support Vector Machine (SVM) and Random Forest(RF) models were carried out in Python, using the Scikit-learn (version 0.24.0). BCRFS curves were depicted by Kaplan–Meier plots, and the difference in BCRFS was assessed by the log-rank test. Multivariate and univariate Cox regression models were used to ascertain independent prognostic factors. Time-dependent ROC curves were constructed, and AUCs were used to predict the performance of BCRFS in 3, and 5 years, respectively. Nomogram was validated with C-index. DEGs were defined as differential expression for |logFC| > 0.5 and an adjusted P value <0.05. All P values <0.05 were considered statistically significant.

3 RESULTS

3.1 Prognostic gene signatures identification in the training set

Gene expression variables in the training set were submitted to high-throughput LASSO-Cox proportional hazards regression analysis. All gene variables were reduced to the most useful potential predictors for BCRFS. The optimal λ value was chosen by “Leave-one-out” cross-validation, and the λ value of 0.11517381 with log(λ) = −2.1613129 was selected (Figure 1A). Six BCRFS-associated gene signatures (NOX4, F12, TPX2, PHYHD1, AURKA, and YIPF1) were identified by LASSO-COX models. (Figure 1B).

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

Selection strategy for gene signatures. (A) “Leave-one-out” cross-validation for parameter selection in LASSO-COX regression models, and the optimal λ value of 0.11517381 with log(λ) = −2.1613129 was selected; (B) six BCRFS-associated gene signatures were selected by LASSO-COX models. LASSO, least absolute shrinkage and selection operator method; BCRFS, biochemical recurrence-free survival

3.2 Construction of the risk score

The risk score was established by the summating of every gene signature expression value multiplied by its corresponding coefficient, as follows: risk score = (0.046043 × NOX4) + (0.043807 × F12) + (0.066203 × TPX2) + (−0.027543 × PHYHD1) + (0.068834 × AURKA) + (−0.01182 × YIPF1). The expression value of every gene was log2-transformed and standardized. The distributions of risk scores for the training set and validation set were shown, respectively (Figure 2A,B). The distributions of BCRFS and BCR status for both sets are shown in Figure 2C,D. The risk score was ranked and the high-risk score indicates poor BCRFS. The median risk score of the training set was used to classify all patients into high-risk (>0.183427) versus low-risk (<0.183427) groups. The heatmaps of six prognostic genes expression values were presented in Figure 2E,F.

3.3 Validation of the risk score

Based on multivariate and univariate Cox regression(CR) models, the risk score, which was adjusted by the clinical variables in both sets, was an independent prognostic factor for BCRFS (p < 0.05). Gleason score was prominently associated with BCR in the training set (n = 419) (p < 0.05) (Table 3). Conversely, no apparent association was observed between BCR and preoperative PSA or cT stage (Table S1; Figure S1). Similarly, in the validation set (n = 403), a high Gleason score was notably associated with BCR(p < 0.05). Based on Kaplan–Meier survival curves, there were meaningful differences between high-risk and low-risk groups for both sets (p < 0.001) (Figure 3A,B). According to t-ROC, the risk score was a strong prognostic factor for BCRFS in the training set (3-year AUC = 0.82, 5-year AUC = 0.82) (Figure 3C) and the validation set (3-year AUC = 0.71, 5-year AUC = 0.67) (Figure 3D). Python version with scikit-learn 0.24.0 was used to construct support vector machine (SVM) and a random forest (RF) classifier model to calculate risk score (RF, 3-year AUC score = 0.73, 5-year AUC score = 0.76; SVM, 3-year AUC score = 0.81, 5-year AUC score = 0.81). The best-performing model was the CR model, which was selected to calculate the risk score (Table S2).

TABLE 3. Univariate and multivariate Cox proportional hazards regression analyses for predicting biochemical recurrence in the training set (n = 419) and validation set (n = 403)

Variables	Univariate Cox analysis		Multivariate Cox analysis
Variables	HR (95% CI)	P value	HR (95% CI)	P value
Training set (n = 419)
Gleason score
Cont.	1.355 (1.108–1.656)	0.003**	1.426 (1.158–1.757)	<0.001***
Risk score
Cont.	11.417 (7.160–18.206)	<0.001***	11.584 (7.313–18.349)	<0.001***
Validation set (n = 403)
Gleason score
Cont.	2.036 (1.540–2.693)	<0.001***	1.639 (1.178–2.281)	0.003**
Risk score
Cont.	3.884 (2.383–6.331)	<0.001***	2.215 (1.181–4.154)	0.013*

Abbreviations: CI, confidence interval; Cont, continuous; HR, hazard ratio.
* P value < 0.05
** P value < 0.01
*** P value < 0.001.

3.4 Establishment of the nomogram

Following the results of multivariate and univariate Cox analyses, Gleason score and risk score were used to draw a nomogram in the training set. The nomogram predicted the probability of BCRFS in patients with PCa for 3 and 5 years, while the risk score was a dominant factor (Figure 4). The likelihood of BCRFS decreased with an increase in risk score, revealing that our gene signatures might hold promising predictive value for BCRFS. The calibration plots exhibited outstanding conformity between the actual observation and the nomogram prediction for 3- and 5-years BCRFS in the training set (Figure 5A,B) and validation set (Figure 5C,D).

The C-index of the constructed nomogram for estimating BCRFS was 0.793 in the training set. Compared to the Gleason score (C-index of 0.588), risk score (C-index of 0.790), the nomogram showed better predictive accuracy. For the validation set, the constructed nomogram had a C-index of 0.722 that was also finer to Gleason score (C-index of 0.676) and risk score (C-index of 0.710) for BCRFS (Table 4).

TABLE 4. The C-index of the nomogram and other factors in the training and validation sets

Variables	Training set	Validation set
Variables	C-index	C-index
Nomogram	0.793	0.722
Gleason score	0.588	0.676
Risk score	0.790	0.710

Abbreviation: C-index, concordance index.

3.5 Bioinformatics analysis

Above all, three DEGs (PHYHD1, AURKA, and TPX2) between low-risk (n = 210) and high-risk cases (n = 209) were identified using the R package “limma” in the training set, under cut-off criteria of an adjusted P value < 0.05 and |logFC| > 0.5 (Table 5). According to bioinformatics analysis, 145 enriched considerably GO terms belong to the molecular function (MF), biological process (BP), and cellular component (CC) categories (P adjusted < 0.05) (Table S3). The most enriched BP terms were associated with mitotic spindle organization, spindle assembly, and microtubule cytoskeleton organization involved in mitosis. The three most dominant terms in CC were mitotic spindle, spindle pole, and spindle. In the MF category, histone kinase activity was the most abundant term, followed by protein serine/threonine/tyrosine kinase activity and dioxygenase activity (Figure 6a). Moreover, two significantly enriched GO terms belong to KEGG categories (P adjust < 0.05). As shown in Figure 6B, the notably enriched KEGG pathways of the DEGs were “progesterone-mediated oocyte maturation” and “Oocyte meiosis.” Ultimate, a chord diagram, was created to measure the relationship between DEGs and GO terms. Figure 6C summarizes the top three pathways enriched in the BP, CC, and MF.

TABLE 5. DEGs between low-risk cases and high-risk cases in the training set, under cut-off criteria of |logFC| > 0.5 and adjusted P value < 0.05. For each gene, the LogFC, AveExpr, P value, and FDR from limma are given

Gene	LogFC	AveExpr	P value	FDR
TPX2	4.79279	5.08858	6.63E−55	3.98E−54
PHYHD1	−5.05780	8.83455	3.52E−43	1.06E−42
AURKA	0.79642	3.53217	1.81E−10	3.61E−10

Abbreviations: AveExpr, average expression; DEGs, differentially expressed genes; FDR, false discovery rate adjusted P value; LogFC, log fold change.

4 DISCUSSION

The current study focuses on appraising the potential prognostic values of gene signatures in BCR using public datasets. Three GEOs associated with BCR are integrated as a training set to obtain optimal gene signatures. Besides, the TCGA dataset serves as an external validation set. The risk score system consisting of 6-gene signatures is significantly associated with BCRFS by a series of bioinformatical and statistical analyses, which is consistently observed in the validation set. These results indicate that gene signatures have promising predictive value for BCRFS of primary PCa patients after radical therapy. DEGs are explored regarding MF, BP, CC, and KEGG pathways to understand better mechanisms underlying BCR pathogenesis.

Based on uni- and multivariate Cox regression models, risk score and Gleason score can predict prognosis in both sets. In contrast, no significant association is observed between BCR and preoperative PSA in the training set. These findings are consistent with previous reports that a high Gleason score was appreciably related to early BCR,whereas factors (i.e., age at diagnosis, preoperative PSA) were not associated with BCRFS.^{22, 23} However, our consequences were inconsistent with other reports.^{24, 25} This inconsistency may be related to race and radical therapy.

Although radical therapy was a potential prognostic factor for the BCRFS in univariate and subgroup analyses, it was no longer a prognostic factor with multivariate analysis (Figure S1; Table S1). Similar results have been reported in low-intermediate risk patients with PCa.^{26, 27} Nevertheless, this was inconsistent with some previous studies that the therapeutic effect of RP was better than RT, and the probability of BCR after RP was lower than RT.^{4, 5} The reason may be that the data came from different datasets and the sample size was small. The experimental results need to be further verified by larger sample size. BCR mainly arises from PCa process itself or as a result of the side effects during treatment. For instance, positive surgical margins (PSM) and lymph node metastases were associated with BCRFS.^{28, 29} However, some clinical and pathological parameters (i.e., surgical margin and extracapsular extension) were missing in the training set, so they could not be added for further analysis. In future experiments, more clinical and pathological parameters need to be analyzed. This article focus on the biological characteristics of the disease itself rather than on the therapeutic effect. Furthermore, the predictive contribution of the therapeutic effect was much smaller than the risk score in this article. Thus, the therapeutic effect was not that substantial and did not affect the correctness and reliability of our conclusions.

Several studies highlighted different gene signatures associated with BCR following RP. In a case–control study, a 10-gene molecular signature(HDDA10) showed superior performance for predicting BCR in PCa patients with RP (AUC = 0.65).¹² Meanwhile, an original gene signature model predicted 3-years BCRFS in PCa patients after RP (AUC = 0.836).¹¹ In addition, CDO1 promoter methylation was proposed as a feasible predictive biomarker for BCRFS in PCa patients following RP, even though it flunked to reach statistical significance in multivariate analysis.¹³ Our gene signatures may offer a broader range of possibilities for clinical application.

A few biomarkers of our gene signatures have previously been studied in PCa. For example, TPX2, a risk biomarker in our study, positively associated with the BCR of PCa and played an essential role in the proliferation and aggression of PCa.³⁰ TPX2 depletion led to the growth inhibition of PCa cells and reduced tumorigenesis.³¹ AURKA, another essential risk biomarker in our study, was correlated with poor prognosis in lethal treatment-related neuroendocrine prostate cancer.³² Also, the inhibition of TPX2 and AURKA stimulated mitotic catastrophe (MC) or apoptosis in PCa cells, and the possible mechanism might be the Glioma pathogenesis-related protein 1 (GLIPR1) through heat shock cognate protein 70 (Hsc70)-mediated suppression of TPX2 and AURKA.³³ Our conclusions show excellent agreement with these results.

Notably, PHYHD1, which has not been studied in PCa, may be involved in the process of BCR. PHYHD1 had been investigated in other tumors. For instance, one research had shown that the DNA methylation level of PHYHD1 was related to the invasion of non-functioning pituitary adenoma.³⁴ However, the underlying mechanism of its action in PCa remains to be established.

There have been several studies that investigated the possible mechanisms of prostate cancer progression. For instance, the centrosome was associated with cell mitosis, and its defects contributed to the change in cellular and gene that accompany the progression, dissemination, and lethality of prostate cancer.³⁵ Another study demonstrated that spindle orientation controls cell fate of PCa.³⁶ These results resonate well with GO and KEGG results where “mitotic spindle organization” and “Oocyte meiosis” have been the most significantly enriched in prominent GO terms, suggesting their roles as significant progressive pathway signatures in BCR.

This study has the following restrictions. First, this study is restricted by its retrospective proposal and validation. A prospective evaluation would improve the reliability of our findings. Second, experimental evidence to support this conclusion is not yet available and is worthy of further assessment.

In conclusion, the gene signatures in our study have a good fit and discrimination, so does risk score classification, indicating excellent predictive values for BCRFS. Besides, based on risk score and Gleason score, the nomogram can predict 3 and 5-year BCRFS rates precisely, thus providing evidence of treatment for PCa patients. It is worthy of wider clinical application.

ACKNOWLEDGMENTS

This paper was supported by grants from the Ministry of Science and Technology of China (No. 2017YFA0205200), the Ministry of Education of China (No. 201902075003), the National Natural Science Foundation of China (No. 81922040, 81930053), the Beijing Natural Science Foundation (No. 7182109, Z200027), the National Key R&D Program of China (No. 2017YFA0205200), the Strategic Priority Research Program of Chinese Academy of Sciences (No. XDB32030200, XDB01030200), the Chinese Academy of Sciences (No. QYZDJ-SSW-JSC005), the Youth Innovation Promotion Association CAS (No. 2019136). The authors acknowledge the GEO and TCGA databases for providing the platforms and investigators for uploading their meaningful datasets. The authors thank Dr. Ye Yan, Professor of Peking University Third Hospital, for his help in clarifying the clinical value of this study.

CONFLICT OF INTEREST

The authors confirm that there are no conflicts of interest.

ETHICAL APPROVAL STATEMENT

All analyses were based on previously published studies, thus no ethical approval and patient consent are required.

Open Research

DATA AVAILABILITY STATEMENT

The data used to support the findings of this study are available from the GEO (GSE70768, GSE70769, GSE116918, available online: https://www.ncbi.nlm.nih.gov/geo/) and TCGA datasets (available online: https://cancer genome.nih.gov/).

Supporting Information

REFERENCES

1Bray F, Ferlay J, Soerjomataram I, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018; 68(6): 394-424.
10.3322/caac.21492
PubMed Web of Science® Google Scholar
2Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA: Cancer J Clin. 2020; 70(1): 7-30.
10.3322/caac.21590
PubMed Web of Science® Google Scholar
3Mottet N, Bellmunt J, Bolla M, et al. EAU-ESTRO-SIOG guidelines on prostate cancer. Part 1: screening, diagnosis, and local treatment with curative intent. Eur Urol. 2017; 71(4): 618-629.
10.1016/j.eururo.2016.08.003
PubMed Web of Science® Google Scholar
4Kupelian PA, Mahadevan A, Reddy CA, et al. Use of different definitions of biochemical failure after external beam radiotherapy changes conclusions about relative treatment efficacy for localized prostate cancer. Urology. 2006; 68(3): 593-598.
10.1016/j.urology.2006.03.075
PubMed Web of Science® Google Scholar
5Lin X, Kapoor A, Gu Y, et al. Assessment of biochemical recurrence of prostate cancer (review). Int J Oncol. 2019; 55(6): 1194-1212.
CAS PubMed Web of Science® Google Scholar
6Cornford P, Bellmunt J, Bolla M, et al. EAU-ESTRO-SIOG Guidelines on prostate cancer. part II: treatment of relapsing, metastatic, and castration-resistant prostate cancer. Eur Urol. 2017; 71(4): 630-642.
10.1016/j.eururo.2016.08.002
PubMed Web of Science® Google Scholar
7Pound CR, Partin AW, Eisenberger MA, et al. Natural history of progression after PSA elevation following radical prostatectomy. JAMA. 1999; 281(17): 1591-1597.
10.1001/jama.281.17.1591
CAS PubMed Web of Science® Google Scholar
8Boorjian SA, Thompson RH, Tollefson MK, et al. Long-term risk of clinical progression after biochemical recurrence following radical prostatectomy: the impact of time from surgery to recurrence. Eur Urol. 2011; 59(6): 893-899.
10.1016/j.eururo.2011.02.026
PubMed Web of Science® Google Scholar
9Barbieri CE, Chinnaiyan AM, Lerner SP, et al. The emergence of precision urologic oncology: a collaborative review on biomarker-driven therapeutics. Eur Urol. 2017; 71(2): 237-246.
10.1016/j.eururo.2016.08.024
PubMed Web of Science® Google Scholar
10Shao L, Yan YE, Liu Z, et al. Radiologist-like artificial intelligence for grade group prediction of radical prostatectomy for reducing upgrading and downgrading from biopsy. Theranostics. 2020; 10(22): 10200-10212.
10.7150/thno.48706
CAS PubMed Web of Science® Google Scholar
11Shi R, Bao X, Weischenfeldt J, et al. A novel gene signature-based model predicts biochemical recurrence-free survival in prostate cancer patients after radical prostatectomy. Cancers. 2019; 12(1): E1.
10.3390/cancers12010001
PubMed Web of Science® Google Scholar
12Abou-Ouf H, Alshalalfa M, Takhar M, et al. Validation of a 10-gene molecular signature for predicting biochemical recurrence and clinical metastasis in localized prostate cancer. J Cancer Res Clin Oncol. 2018; 144(5): 883-891.
10.1007/s00432-018-2615-7
CAS PubMed Web of Science® Google Scholar
13Meller S, Zipfel L, Gevensleben H, et al. CDO1 promoter methylation is associated with gene silencing and is a prognostic biomarker for biochemical recurrence-free survival in prostate cancer patients. Epigenetics. 2016; 11(12): 871-880.
10.1080/15592294.2016.1241931
PubMed Web of Science® Google Scholar
14Lee H, Lee M, Hong SK. CRTC2 as a novel prognostic biomarker for worse pathologic outcomes and biochemical recurrence after radical prostatectomy in patients with prostate cancer. Investig Clin Urol. 2019; 60(2): 84-90.
10.4111/icu.2019.60.2.84
PubMed Web of Science® Google Scholar
15Pettersson A, Gerke T, Penney KL, et al. MYC overexpression at the protein and mRNA level and cancer outcomes among men treated with radical prostatectomy for prostate cancer. Cancer Epidemiol Biomark Prev. 2018; 27(2): 201-207.
10.1158/1055-9965.EPI-17-0637
CAS PubMed Web of Science® Google Scholar
16Jamaspishvili T, Berman DM, Ross AE, et al. Clinical implications of PTEN loss in prostate cancer. Nat Rev Urol. 2018; 15(4): 222-234.
10.1038/nrurol.2018.9
CAS PubMed Web of Science® Google Scholar
17Zhou X, Yang XU, Sun X, et al. Effect of PTEN loss on metabolic reprogramming in prostate cancer cells. Oncol Lett. 2019; 17(3): 2856-2866.
CAS PubMed Web of Science® Google Scholar
18Shao Y, Ye G, Ren S, et al. Metabolomics and transcriptomics profiles reveal the dysregulation of the tricarboxylic acid cycle and related mechanisms in prostate cancer. Int J Cancer. 2018; 143(2): 396-407.
10.1002/ijc.31313
CAS PubMed Web of Science® Google Scholar
19Friedman J, Hastie T, Tibshirani R. regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010; 33(1): 1-22.
10.18637/jss.v033.i01
PubMed Web of Science® Google Scholar
20Simon N, Friedman J, Hastie T, et al. Regularization paths for Cox's proportional hazards model via coordinate descent. J Stat Softw. 2011; 39(5): 1-13.
10.18637/jss.v039.i05
PubMed Web of Science® Google Scholar
21Yu G, Wang L-G, Han Y, et al. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: J Integ Biol. 2012; 16(5): 284-287.
10.1089/omi.2011.0118
CAS PubMed Web of Science® Google Scholar
22Nørgaard M, Haldrup C, Storebjerg T, et al. Comprehensive evaluation of TFF3 promoter hypomethylation and molecular biomarker potential for prostate cancer diagnosis and prognosis. Int J Mol Sci. 2017; 18(9): 1–17.
10.3390/ijms18092017
Web of Science® Google Scholar
23Sekiguchi A, Ishiyama H, Satoh T, et al. 125Iodine monotherapy for Japanese men with low- and intermediate-risk prostate cancer: outcomes after 5 years of follow-up. J Radiat Res. 2014; 55(2): 328-333.
10.1093/jrr/rrt113
PubMed Web of Science® Google Scholar
24Hong JH, Kwon YS, Kim IY. Risk stratification for disease progression in pT3 prostate cancer after robot-assisted radical prostatectomy. Asian J Androl. 2017; 19(6): 700-706.
10.4103/1008-682X.193569
CAS PubMed Web of Science® Google Scholar
25Takahara K, Sumitomo M, Fukaya K, et al. Clinical and oncological outcomes of robot-assisted radical prostatectomy with nerve sparing vs. non-nerve sparing for high-risk prostate cancer cases. Oncol Lett. 2019; 18(4): 3896-3902.
PubMed Web of Science® Google Scholar
26Nilsson S, Norlen BJ, Widmark A. A systematic overview of radiation therapy effects in prostate cancer. Acta Oncol. 2004; 43(4): 316-381.
10.1080/02841860410030661
PubMed Web of Science® Google Scholar
27Kim DS, Jeon SH, Chang S-G, et al. Comparison of biochemical recurrence in prostate cancer patients treated with radical prostatectomy or radiotherapy. Korean J Urol. 2015; 56(10): 703-709.
10.4111/kju.2015.56.10.703
PubMed Google Scholar
28Jo JK, Hong SK, Byun SS, et al. Positive surgical margin in robot-assisted radical prostatectomy: correlation with pathology findings and risk of biochemical recurrence. Minerva Urol Nefrol. 2017; 69(5): 493-500.
PubMed Web of Science® Google Scholar
29Morizane S, Honda M, Shimizu R, et al. Small-volume lymph node involvement and biochemical recurrence after robot-assisted radical prostatectomy with extended lymph node dissection in prostate cancer. Int J Clin Oncol. 2020; 25(7): 1398-1404.
10.1007/s10147-020-01682-1
PubMed Web of Science® Google Scholar
30Zou J, Huang R, Jiang F, et al. Overexpression of TPX2 is associated with progression and prognosis of prostate cancer. Oncol Lett. 2018; 16(3): 2823-2832.
PubMed Web of Science® Google Scholar
31Pan HW, Su HH, Hsu CW, et al. Targeted TPX2 increases chromosome missegregation and suppresses tumor cell growth in human prostate cancer. OncoTargets Ther. 2017; 10: 3531-3543.
10.2147/OTT.S136491
PubMed Web of Science® Google Scholar
32Mosquera JM, Beltran H, Park K, et al. Concurrent AURKA and MYCN gene amplifications are harbingers of lethal treatment-related neuroendocrine prostate cancer. Neoplasia. 2013; 15(1): 1-10.
10.1593/neo.121550
CAS PubMed Web of Science® Google Scholar
33Li L, Yang G, Ren C, et al. Glioma pathogenesis-related protein 1 induces prostate cancer cell death through Hsc70-mediated suppression of AURKA and TPX2. Mol Oncol. 2013; 7(3): 484-496.
10.1016/j.molonc.2012.12.005
CAS PubMed Web of Science® Google Scholar
34Cheng S, Xie W, Miao Y, et al. Identification of key genes in invasive clinically non-functioning pituitary adenoma by integrating analysis of DNA methylation and mRNA expression profiles. J Transl Med. 2019; 17(1): 407.
10.1186/s12967-019-02148-3
CAS PubMed Web of Science® Google Scholar
35Pihan GA, Purohit A, Wallace J, et al. Centrosome defects can account for cellular and genetic changes that characterize prostate cancer progression. Can Res. 2001; 61(5): 2212-2219.
CAS PubMed Web of Science® Google Scholar
36Shafer MER, Nguyen AHT, Tremblay M, et al. Lineage specification from prostate progenitor cells requires Gata3-dependent mitotic spindle orientation. Stem Cell Rep. 2017; 8(4): 1018-1031.
10.1016/j.stemcr.2017.02.004
CAS PubMed Web of Science® Google Scholar

Citing Literature

Volume10, Issue18

September 2021

Pages 6492-6502

Filename	Description
cam44092-sup-0001-FigS1.tifTIFF image, 2.1 MB	Fig S1
cam44092-sup-0002-TableS1.docxWord document, 16 KB	Table S1
cam44092-sup-0003-TableS2.docxWord document, 14.7 KB	Table S2
cam44092-sup-0004-TableS3.docxWord document, 36.3 KB	Table S3

Gene signatures predict biochemical recurrence-free survival in primary prostate cancer patients after radical therapy