miRNA–target gene regulatory networks: A Bayesian integrative approach to biomarker selection with application to kidney cancer
Thierry Chekouo
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 1400 Pressler Street, Unit 1411, Texas, 77030-3722 U.S.A.
Search for more papers by this authorCorresponding Author
Francesco C. Stingo
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 1400 Pressler Street, Unit 1411, Texas, 77030-3722 U.S.A.
email: [email protected]Search for more papers by this authorJames D. Doecke
CSIRO Computational Informatics/Australian e-Health Research Centre Level 5, UQ Health Sciences Building, 901/16 Royal Brisbane, Queensland, 4029 Australia
Search for more papers by this authorKim-Anh Do
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 1400 Pressler Street, Unit 1411, Texas, 77030-3722 U.S.A.
Search for more papers by this authorThierry Chekouo
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 1400 Pressler Street, Unit 1411, Texas, 77030-3722 U.S.A.
Search for more papers by this authorCorresponding Author
Francesco C. Stingo
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 1400 Pressler Street, Unit 1411, Texas, 77030-3722 U.S.A.
email: [email protected]Search for more papers by this authorJames D. Doecke
CSIRO Computational Informatics/Australian e-Health Research Centre Level 5, UQ Health Sciences Building, 901/16 Royal Brisbane, Queensland, 4029 Australia
Search for more papers by this authorKim-Anh Do
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 1400 Pressler Street, Unit 1411, Texas, 77030-3722 U.S.A.
Search for more papers by this authorSummary
The availability of cross-platform, large-scale genomic data has enabled the investigation of complex biological relationships for many cancers. Identification of reliable cancer-related biomarkers requires the characterization of multiple interactions across complex genetic networks. MicroRNAs are small non-coding RNAs that regulate gene expression; however, the direct relationship between a microRNA and its target gene is difficult to measure. We propose a novel Bayesian model to identify microRNAs and their target genes that are associated with survival time by incorporating the microRNA regulatory network through prior distributions. We assume that biomarkers involved in regulatory networks are likely associated with survival time. We employ non-local prior distributions and a stochastic search method for the selection of biomarkers associated with the survival outcome. We use KEGG pathway information to incorporate correlated gene effects within regulatory networks. Using simulation studies, we assess the performance of our method, and apply it to experimental data of kidney renal cell carcinoma (KIRC) obtained from The Cancer Genome Atlas. Our novel method validates previously identified cancer biomarkers and identifies biomarkers specific to KIRC progression that were not previously discovered. Using the KIRC data, we confirm that biomarkers involved in regulatory networks are more likely to be associated with survival time, showing connections in one regulatory network for five out of six such genes we identified.
Supporting Information
All Supplemental Data may be found in the online version of this article.
Filename | Description |
---|---|
biom12266-sup-0001-SuppData.pdf1.6 MB | Supporting Materials. |
biom12266-sup-0002-SuppData-Code.zip3.3 MB | Supporting Materials Code. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
References
- Ambros, V. (2004). The functions of animal microRNAs. Nature 431, 350–355.
- Bagga, S., Bracht, J., Hunter, S., Massirer, K., Holtz, J., Eachus, R., and Pasquinelli, A. E. (2005). Regulation by let-7 and lin-4 miRNAs results in target mRNA Degradation. Cell 122, 553–563.
- Becker Buscaglia, E. and Li, Y. (2011). Apoptosis and the target genes of miR-21. Chinese Journal of Cancer 30, 371–380.
- Chin, L., Hahn, W. C., Getz, G., and Meyerson, M. (2011). Making sense of cancer genomic data. Genes and Development 25, 534–555.
- Daemen, A., Gevaert, O., Ojeda, F., Debucquoy, A., Suykens, J., Sempoux, C., Machiels, J. P., Haustermans, K., and De Moor, B. (2009). A kernel-based integration of genome-wide data for clinical decision support. Genome Medicine 1, 39.
-
Do, K.,
Qin, Z., and
Vannucci, M.
(2013).
Advances in Statistical Bioinformatics: Models and Integrative Inference for High-Throughput Data.
Cambridge, UK: Cambridge University Press.
10.1017/CBO9781139226448 Google Scholar
- Doecke, J. D., Chekouo, T. T., Stingo, F., and Do., K.-A. (2014). miRNA target gene identification: Sourcing miRNA target gene relationships for the analyses of TCGA Illumina MiSeq and RNA-Seq Hiseq platform data. International Journal of Human Genetics 14, 17–22.
-
Gelfand, A.
(1996).
Model determination using sampling-based me thods. In: Markov Chain Monte Carlo in Practice,
W. R. Gilks,
S. Richardson, and
D. Spiegelhalter, (eds), 145–161. Boca Rotan, FL: Chapman and Hall.
10.1007/978-1-4899-4485-6_9 Google Scholar
- He, X., Zhou, A., Lu, H., Chen, Y., Huang, G., Yue, X., Zhao, P., and Wu, Y. (2013). Suppression of mitochondrial complex I influences cell metastatic properties. PLoS ONE 8, e61677.
- Herranz, H. and Cohen, S. M. (2010). MicroRNAs and gene regulatory networks: Managing the impact of noise in biological systems. Genes and Development 24, 1339–44.
- Huang, J. C., Morris, Q. D., and Frey, B. J. (2007). Bayesian inference of microRNA targets from sequence and expression data. Journal of Computational Biology 14, 550–563.
- Jackson, R. J. and Standart, N. (2007). How do microRNAs regulate gene expression? Science's STKE: Signal Transduction Knowledge Environment 2007, re1.
- Johnson, V. E. and Rossell, D. (2012). Bayesian model selection in high-dimensional settings. Journal of the American Statistical Association 107, 649–660.
- Kanehisa, M. and Goto, S. (2000). KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research 28, 27–30.
- Lawrie, C. H. (2013). MicroRNAs and lymphomagenesis: A functional review. British Journal of Haematology 160, 571–581.
- Li, C. and Li, H. (2008). Network-constrained regularization and variable selection for analysis of genomic data. Bioinformatics 24, 1175–1182.
- Li, J., Min, M. R., Bonner, A. J., and Zhang, Z. (2009). A probabilistic framework to improve microrna target prediction by incorporating proteomics data. Journal of Bioinformatics and Computational Biology 7, 955–972.
- Li, Y., Liang, C., Wong, K.-C., Jin, K., and Zhang, Z. (2014). Inferring probabilistic mirnamrna interaction signatures in cancers: A role-switch approach. Nucleic Acids Research 42, 1–11.
- Long, Q., Johnson, B. A., Osunkoya, A. O., Lai, Y.-H., Zhou, W., Abramovitz, M., Xia, M., Bouzyk, M. B., Nam, R. K., Sugar, L., Stanimirovic, A., Williams, D. J., Leyland-Jones, B. R., Seth, A. K., Petros, J. A., and Moreno, C. S. (2011). Protein-coding and microRNA biomarkers of recurrence of prostate cancer following radical prostatectomy. American Journal of Pathology 179, 46–54.
- Maia, B. M., Rocha, R. M., and Calin, G. A. (2013). Clinical significance of the interaction between non-coding RNAs and the epigenetics machinery: Challenges and opportunities in oncology. Epigenetics 9, 75–80.
- Muniategui, A., Pey, J., Planes, F. J., and Rubio, A. (2012). Joint analysis of miRNA and mRNA expression data. Briefings in Bioinformatics 14, 263–278.
- Pan, W., Xie, B., and Shen, X. (2010). Incorporating predictor network in penalized regression with application to microarray data. Biometrics 66, 474–484.
- Petersen, C. P., Bordeleau, M.-E., Pelletier, J., and Sharp, P. A. (2006). Short RNAs repress translation after initiation in mammalian cells. Molecular Cell 21, 533–542.
- Pomerantz, J. L. and Baltimore, D. (1999). NF-kappaB activation by a signaling complex containing TRAF2, TANK and TBK1, a novel IKK-related kinase. EMBO Journal 18, 6694–6704.
-
Qian, J.,
Siragam, V.,
Lin, J., and
Jichun Ma, Z. D.
(2011).
The role of microRNAs in the formation of cancer stem cells: Future directions for miRNAs.
Hypothesis
9, e10.
10.5779/hypothesis.v9i1.224 Google Scholar
- Qin, L.-X. (2008). An integrative analysis of microRNA and mRNA expression—A case study. Cancer Informatics 6, 369–379.
- Sha, N., Tadesse, M. G., and Vannucci, M. (2006). Bayesian variable selection for the analysis of microarray data with censored outcomes. Bioinformatics 22, 2262–2268.
- Simon, N., Friedman, J. H., Hastie, T., and Tibshirani, R. (2011). Regularization paths for Cox's proportional hazards model via coordinate descent. Journal of Statistical Software 39, 1–13.
- Stingo, F. C., Chen, Y. A., Tadesse, M. G., and Vannucci, M. (2011). Incorporating biological information into linear models: A Bayesian approach to the selection of pathways and genes. Annals of Applied Statistics 5, 1978–2002.
- Stingo, F. C., Chen, Y. A., Vannucci, M., Barrier, M., and Mirkes, P. E. (2010). A Bayesian graphical modeling approach to microRNA regulatory network inference. Annals of Applied Statistics 4, 2024–2048.
- Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., and Altman, R. B. (2001). Missing value estimation methods for DNA microarrays. Bioinformatics 17, 520–525.
- Tseng, C. W., Lin, C. C., Chen, C. N., Huang, H. C., and Juan, H. F. (2011). Integrative network analysis reveals active microRNAs and their functions in gastric cancer. BMC Systems Biology 5, 99.
- Wang, W., Baladandayuthapani, V., Morris, J. S., Broom, B. M., Manyam, G., and Do, K.-A. (2013). iBAG: Integrative Bayesian analysis of high-dimensional multiplatform genomics data. Bioinformatics 29, 149–159.
- Waters, K. M., Pounds, J. G., and Thrall, B. D. (2006). Data merging for integrated microarray and proteomic analysis. Briefings in Functional Genomics and Proteomics 5, 261–272.
- Wilczynski, B. and Furlong, E. (2010). Challenges for modeling global gene regulatory networks during development: Insights from Drosophila. Developmental Biology 340, 161–169.
- Zhang, J. D. and Wiemann, S. (2009). KEGGgraph: A graph approach to KEGG PATHWAY in R and Bioconductor. Bioinformatics 25, 1470–1471.