Assessment of patient clinical descriptions and pathogenic variants from gene panel sequences in the CAGI-5 intellectual disability challenge
Marco Carraro
Department of Biomedical Sciences, University of Padua, Padua, Italy
These authors contributed equally to this study.
Search for more papers by this authorAlexander Miguel Monzon
Department of Biomedical Sciences, University of Padua, Padua, Italy
These authors contributed equally to this study.
Search for more papers by this authorLuigi Chiricosta
Department of Biomedical Sciences, University of Padua, Padua, Italy
Search for more papers by this authorFrancesco Reggiani
Department of Biomedical Sciences, University of Padua, Padua, Italy
Department of Information Engineering, University of Padua, Padua, Italy
Search for more papers by this authorMaria Cristina Aspromonte
Department of Woman and Child Health, University of Padua, Padua, Italy
Search for more papers by this authorMariagrazia Bellini
Department of Woman and Child Health, University of Padua, Padua, Italy
Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
Search for more papers by this authorKymberleigh Pagel
Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Search for more papers by this authorYuxiang Jiang
Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Search for more papers by this authorPredrag Radivojac
Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Search for more papers by this authorKunal Kundu
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Search for more papers by this authorLipika R. Pal
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Search for more papers by this authorYizhou Yin
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Search for more papers by this authorGaia Andreoletti
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Search for more papers by this authorJohn Moult
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Search for more papers by this authorStephen J. Wilson
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Search for more papers by this authorPanagiotis Katsonis
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Search for more papers by this authorOlivier Lichtarge
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Search for more papers by this authorJingqi Chen
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorYaqiong Wang
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorZhiqiang Hu
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorSteven E. Brenner
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorCarlo Ferrari
Department of Information Engineering, University of Padua, Padua, Italy
Search for more papers by this authorAlessandra Murgia
Department of Woman and Child Health, University of Padua, Padua, Italy
Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
Search for more papers by this authorCorresponding Author
Silvio C.E. Tosatto
Department of Biomedical Sciences, University of Padua, Padua, Italy
Institute of Neuroscience, National Research Council (CNR), Padua, Italy
These authors contributed equally to this study.
Correspondence Silvio Tosatto, Department of Biomedical Sciences, University of Padua. Viale G. Colombo 3, 35131, Padua, Italy. Email: [email protected]
Emanuela Leonardi, Department of Woman and Child Health, University of Padua, Padua. Corso Stati Uniti, 4, 35127, Padua, Italy. Email: [email protected]
Search for more papers by this authorCorresponding Author
Emanuela Leonardi
Department of Woman and Child Health, University of Padua, Padua, Italy
Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
These authors contributed equally to this study.
Correspondence Silvio Tosatto, Department of Biomedical Sciences, University of Padua. Viale G. Colombo 3, 35131, Padua, Italy. Email: [email protected]
Emanuela Leonardi, Department of Woman and Child Health, University of Padua, Padua. Corso Stati Uniti, 4, 35127, Padua, Italy. Email: [email protected]
Search for more papers by this authorMarco Carraro
Department of Biomedical Sciences, University of Padua, Padua, Italy
These authors contributed equally to this study.
Search for more papers by this authorAlexander Miguel Monzon
Department of Biomedical Sciences, University of Padua, Padua, Italy
These authors contributed equally to this study.
Search for more papers by this authorLuigi Chiricosta
Department of Biomedical Sciences, University of Padua, Padua, Italy
Search for more papers by this authorFrancesco Reggiani
Department of Biomedical Sciences, University of Padua, Padua, Italy
Department of Information Engineering, University of Padua, Padua, Italy
Search for more papers by this authorMaria Cristina Aspromonte
Department of Woman and Child Health, University of Padua, Padua, Italy
Search for more papers by this authorMariagrazia Bellini
Department of Woman and Child Health, University of Padua, Padua, Italy
Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
Search for more papers by this authorKymberleigh Pagel
Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Search for more papers by this authorYuxiang Jiang
Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Search for more papers by this authorPredrag Radivojac
Khoury College of Computer and Information Sciences, Northeastern University, Boston, Massachusetts
Search for more papers by this authorKunal Kundu
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Search for more papers by this authorLipika R. Pal
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Search for more papers by this authorYizhou Yin
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, Maryland
Search for more papers by this authorGaia Andreoletti
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Search for more papers by this authorJohn Moult
Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland
Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Search for more papers by this authorStephen J. Wilson
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Search for more papers by this authorPanagiotis Katsonis
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Search for more papers by this authorOlivier Lichtarge
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
Search for more papers by this authorJingqi Chen
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorYaqiong Wang
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorZhiqiang Hu
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorSteven E. Brenner
Department of Plant and Microbial Biology, University of California, Berkeley, California
Search for more papers by this authorCarlo Ferrari
Department of Information Engineering, University of Padua, Padua, Italy
Search for more papers by this authorAlessandra Murgia
Department of Woman and Child Health, University of Padua, Padua, Italy
Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
Search for more papers by this authorCorresponding Author
Silvio C.E. Tosatto
Department of Biomedical Sciences, University of Padua, Padua, Italy
Institute of Neuroscience, National Research Council (CNR), Padua, Italy
These authors contributed equally to this study.
Correspondence Silvio Tosatto, Department of Biomedical Sciences, University of Padua. Viale G. Colombo 3, 35131, Padua, Italy. Email: [email protected]
Emanuela Leonardi, Department of Woman and Child Health, University of Padua, Padua. Corso Stati Uniti, 4, 35127, Padua, Italy. Email: [email protected]
Search for more papers by this authorCorresponding Author
Emanuela Leonardi
Department of Woman and Child Health, University of Padua, Padua, Italy
Fondazione Istituto di Ricerca Pediatrica (IRP), Città della Speranza, Padova, Italy
These authors contributed equally to this study.
Correspondence Silvio Tosatto, Department of Biomedical Sciences, University of Padua. Viale G. Colombo 3, 35131, Padua, Italy. Email: [email protected]
Emanuela Leonardi, Department of Woman and Child Health, University of Padua, Padua. Corso Stati Uniti, 4, 35127, Padua, Italy. Email: [email protected]
Search for more papers by this authorAbstract
The Critical Assessment of Genome Interpretation-5 intellectual disability challenge asked to use computational methods to predict patient clinical phenotypes and the causal variant(s) based on an analysis of their gene panel sequence data. Sequence data for 74 genes associated with intellectual disability (ID) and/or autism spectrum disorders (ASD) from a cohort of 150 patients with a range of neurodevelopmental manifestations (i.e. ID, autism, epilepsy, microcephaly, macrocephaly, hypotonia, ataxia) have been made available for this challenge. For each patient, predictors had to report the causative variants and which of the seven phenotypes were present. Since neurodevelopmental disorders are characterized by strong comorbidity, tested individuals often present more than one pathological condition. Considering the overall clinical manifestation of each patient, the correct phenotype has been predicted by at least one group for 93 individuals (62%). ID and ASD were the best predicted among the seven phenotypic traits. Also, causative or potentially pathogenic variants were predicted correctly by at least one group. However, the prediction of the correct causative variant seems to be insufficient to predict the correct phenotype. In some cases, the correct prediction has been supported by rare or common variants in genes different from the causative one.
Supporting Information
Filename | Description |
---|---|
humu23823-sup-0001-Suppl_mat_CAGI5_ID_Monzon_Carraro_R1_clean.pdf1.3 MB | Supplementary information |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
REFERENCES
- Almuhtaseb, S., Oppewal, A., & Hilgenkamp, T. I. M. (2014). Gait characteristics in individuals with intellectual disabilities: A literature review. Research in Developmental Disabilities, 35(11), 2858–2883.
- An, J. Y., Cristino, A. S., Zhao, Q., Edson, J., Williams, S. M., Ravine, D., … Claudianos, C. (2014). Towards a molecular characterization of autism spectrum disorders: An exome sequencing and systems approach. Translational Psychiatry, 4, e394.
- Aspromonte, M. C., Bellini, M., Gasparini, A., Carraro, M., Bettella, E., Polli, R., …Cesca, F. (2019). Characterization of intellectual disability and autism comorbidity through gene panel sequencing.” .https://doi.org/.org/10.1101/545772
- Barabási, A.-L., Gulbahce, N., & Loscalzo, J. (2011). Network medicine: A network-based approach to human disease. Nature Reviews Genetics, 12(1), 56–68.
- Bowley, C., & Kerr, M. (2000). Epilepsy and intellectual disability. Journal of Intellectual Disability Research: JIDR, 44(Pt 5), 529–543.
- Cai, B., Li, B., Kiga, N., Thusberg, J., Bergquist, T., Chen, Y.-C., … Mooney, S. D. (2017). Matching phenotypes to whole genomes: Lessons learned from four iterations of the personal genome project community challenges. Human Mutation, 38(9), 1266–1276.
- Chandonia, J.-M., Adhikari, A., Carraro, M., Chhibber, A., Cutting, G. R., Fu, Y., … Buckley, B. A. (2017). Lessons from the CAGI-4 Hopkins clinical panel challenge. Human Mutation, 38(9), 1155–1168.
- Desmet, F.-O., Dalil, H., Lalande, M., Collod-Béroud, G., Claustres, M., & Béroud, C. (2009). Human splicing finder: An online bioinformatics tool to predict splicing signals. Nucleic Acids Research, 37(9), e67.
- Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861–874.
- Gauthier, J., Tabrez, J. S., Huashan, P., Yokomaku, D., Hamdan, F. F., Champagne, N., … Rouleau, G. A. (2011). Truncating mutations in NRXN2 and NRXN1 in autism spectrum disorders and schizophrenia. Human Genetics, 130(4), 563–573.
- Ioannidis, N. M., Rothstein, J. H., Pejaver, V., Middha, S., McDonnell, S. K., Baheti, S., … Sieh, W. (2016). REVEL: An ensemble method for predicting the pathogenicity of rare missense variants. American Journal of Human Genetics, 99(4), 877–885.
- Iossifov, I., O. ’Roak, B. J., Sanders, S. J., Ronemus, M., Krumm, N., Levy, D., … Wigler, M. (2014). The contribution of de novo coding mutations to autism spectrum disorder. Nature, 515(7526), 216–221.
- Jain, S., White, M., & Radivojac, P. (2016). Estimating the class prior and posterior from noisy positives and unlabeled data.” arXiv [stat.ML]. arXiv. http://arxiv.org/abs/1606.08561
- Jian, X., Boerwinkle, E., & Liu, X. (2014). In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Research, 42(22), 13534–13544.
- Katsonis, P., & Lichtarge, O. (2014). A formal perturbation equation between genotype and phenotype determines the evolutionary action of protein-coding variations on fitness. Genome Research, 24(12), 2050–2058.
- Krumm, N., O’Roak, B. J., Shendure, J., & Eichler, E. E. (2014). A de novo convergence of autism genetics and molecular neuroscience. Trends in Neurosciences, 37(2), 95–105.
- Landrum, M. J., Jennifer, M. L., Benson, M., Brown, G., Chao, C., Chitipiralla, S., … Maglott, D. R. (2016). ClinVar: Public archive of interpretations of clinically relevant variants. Nucleic Acids Research, 44(D1), D862–D868.
- Landrum, M. J., Lee, J. M., Riley, G. R., Jang, W., Rubinstein, W. S., Church, D. M., & Maglott, D. R. (2014). ClinVar: Public archive of relationships among sequence variation and human phenotype. Nucleic Acids Research, 42(Database issue), D980–D985.
- Lek, M., Konrad, J. K., Minikel, E. V., Samocha, K. E., Banks, E., Fennell, T., … MacArthur, D. G. (2016). Analysis of protein-coding genetic variation in 60,706 humans. Nature, 536(7616), 285–291.
- Lesch, K.-P. (2016). Maturing insights into the genetic architecture of neurodevelopmental disorders - from common and rare variant interplay to precision psychiatry. Journal of Child Psychology and Psychiatry, and Allied Disciplines, 57(6), 659–661.
- Li, H. (2014). Toward better understanding of artifacts in variant calling from high-coverage samples. Bioinformatics, 30(20), 2843–2851.
- Lin, C.-H., Konecki, D. M., Liu, M., Wilson, S. J., Nassar, H., Wilkins, A. D., …Lichtarge, O. (2018). Multimodal network diffusion predicts future disease-gene-chemical associations. Bioinformatics, https://doi.org/10.1093/bioinformatics/bty858. October.
- Mata, I. F., Jang, Y., Kim, C.-H., Hanna, D. S., Dorschner, M. O., Samii, A., … Zabetian, C. P. (2015). The RAB39B p.G192R mutation causes X-linked dominant Parkinson's disease. Molecular Neurodegeneration, 10, 50.
- Mattingly, C. J., Colby, G. T., Forrest, J. N., & Boyer, J. L. (2003). The Comparative Toxicogenomics Database (CTD). Environmental Health Perspectives, 111(6), 793–795.
- McLaren, W., Gil, L., Hunt, S. E., Riat, H. S., Ritchie, G. R. S., Thormann, A., … Cunningham, F. (2016). The Ensembl Variant Effect Predictor. Genome Biology, 17(1), 122.
- Mitchell, K. J. (2011). The genetics of neurodevelopmental disease. Current Opinion in Neurobiology, 21(1), 197–203.
- Nabieva, E., Jim, K., Agarwal, A., Chazelle, B., & Singh, M. (2005). Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps. Bioinformatics, 21(Suppl 1), i302–i310. June.
- O’Roak, B. J., Deriziotis, P., Lee, C., Vives, L., Schwartz, J. J., Girirajan, S., … Eichler, E. E. (2011). Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nature Genetics, 43(6), 585–589.
- Pagel, K. A., Pejaver, V., Lin, G. N., Nam, H.-J., Mort, M., Jonathan, D. N. C., … Radivojac, P. (2017). When loss-of-function is loss of function: Assessing mutational signatures and impact of loss-of-function genetic variants. Bioinformatics, 33(14), i389–i398.
- Pejaver, V., Urresti, J., Lugo-Martinez, J., Pagel, K. A., Lin, G. N., Nam, H.-J., … Radivojac, P. (2017). MutPred2: Inferring the molecular and phenotypic impact of amino acid variants. bioRxiv, https://doi.org/10.1101/134981
- Pinto, D., Delaby, E., Merico, D., Barbosa, M., Merikangas, A., Klei, L., … Scherer, S. W. (2014). Convergence of genes and cellular pathways dysregulated in autism spectrum disorders. American Journal of Human Genetics, 94(5), 677–694.
- Piton, A., Redin, C., & Mandel, J.-L. (2013). XLID-causing mutations and associated genes challenged in light of data from large-scale human exome sequencing. American Journal of Human Genetics, 93(2), 368–383.
- Potter, J. (1978). Handbook of Clinical Neurology, Vol. 30 (congenital Malformations of the Brain and Skull, Part I): By P. J. Vinken and G. W. Bruyn (Eds.), in Collaboration with N.C. Myrianthopoulos, Xii + 708 Pages, 391 Illustrations, 44 Tables, North-Holland Publishing Company, Amsterdam, 1977, US 121.75, Dfl 280.00, Subscription Price US 103.50, Dfl 238.00. Journal of the Neurological Sciences 38 (3): 442.
- Pruitt, K. D., Garth, R. B., Hiatt, S. M., Thibaud-Nissen, F., Astashyn, A., Ermolaeva, O., … Ostell, J. M. (2014). RefSeq: An update on mammalian reference sequences. Nucleic Acids Research, 42(Database issue), D756–D763.
- Radivojac, P., Peng, K., Clark, W. T., Peters, B. J., Mohan, A., Boyle, S. M., & Mooney, S. D. (2008). An integrated approach to inferring gene-disease associations in humans. Proteins, 72(3), 1030–1037.
- Stenson, P. D., Ball, E. V., Mort, M., Phillips, A. D., Shiel, J. A., Thomas, N. S., … Cooper, D. N. (2003). Human Gene Mutation Database (HGMD): 2003 update. Human Mutation, 21(6), 577–581.
- Stenson, P. D., Mort, M., Ball, E. V., Evans, K., Hayden, M., Heywood, S., … Cooper, D. N. (2017). The Human Gene Mutation Database: Towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Human Genetics, 136(6), 665–677.
- Tonnsen, B. L., Andrea, D. B., Bradley, C. C., Charles, J., Cohen, A., & Carpenter, L. A. (2016). Prevalence of autism spectrum disorders among children with intellectual disability. American Journal on Intellectual and Developmental Disabilities, 121(6), 487–500.
- Vihinen, M. (2012). How to evaluate performance of prediction methods? measures and their interpretation in variation effect analysis. BMC Genomics, 13(Suppl 4), S2.
- Wang, K., Li, M., & Hakonarson, H. (2010). ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Research, 38(16), e164.
- Whiffin, N., Angharad, M. R., Minikel, E., Zappala, Z., Walsh, R., O’Donnell-Luria, A. H., … Ware, J. S. (2019). Using high-resolution variant frequencies empowers clinical genome interpretation and enables investigation of genetic architecture. American Journal of Human Genetics, 104(1), 187–190.
- Xiong, H. Y., Alipanahi, B., Lee, L. J., Bretschneider, H., Merico, D., Yuen, R. K. C., … Frey, B. J. (2015). RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science, 347(6218), 1254806.
- Yang, H., Robinson, P. N., & Wang, K. (2015). Phenolyzer: Phenotype-based prioritization of candidate genes for human diseases. Nature Methods, 12(9), 841–843.