The Effects of Mutations on Protein Function: A Comparative Study of Three Databases of Mutations in Humans
Ariel Azia
The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 52900 (Israel) phone/fax: +972-3-5318124
Vladimir N. Uversky
Department of Molecular Medicine, University of South Florida, Tampa, FL 33612 (USA)
Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region (Russia)
Amnon Horovitz
Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100 (Israel)
Corresponding Author
Ron Unger
The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 52900 (Israel) phone/fax: +972-3-5318124
Abstract
Single-nucleotide polymorphisms (SNPs) in protein-coding regions of the human genome are a major factor in determining human variation in health and disease. Here, we analyze the amino acid changes and functional effects caused by non-synonymous SNPs. Three databases were used: (i) Variation, mutations found in the general human population; (ii) Cosmic, mutations found in cancer cells; and (iii) Pathogenic, a curated subset of the mutations in Variation that are associated with diseases. We analyzed the distributions of amino acid changes in these datasets and found that mutations in the Pathogenic dataset, in particular, tend to introduce order-promoting residues. We also studied the effects of the mutations using the program PolyPhen-2, which predicts the functional impact of non-synonymous mutations. To evaluate the significance of these predicted effects, we compared them with the predicted effects of the same amino acid replacements introduced at other positions in the same proteins, which served as a control. A mutation can be deleterious because the amino acid change is drastic (for example, a change from a hydrophobic to a hydrophilic residue) or because of its location in the protein. On both counts, mutations in the Variation dataset tend to be less deleterious than expected by chance, whereas mutations in the Pathogenic dataset tend to be more deleterious than their control mutations. Mutations in the Cosmic dataset are more deleterious than those in its control set but less deleterious than those in Pathogenic.
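The control described in the abstract (the same amino acid replacement placed at other positions in the same protein) can be illustrated with a minimal Python sketch. The function name `control_substitutions`, the 1-based position convention, and the toy sequence are assumptions made for illustration only; they are not taken from the paper, and the sketch only enumerates candidate control substitutions without scoring them with PolyPhen-2.

```python
from typing import List, Tuple

Substitution = Tuple[int, str, str]  # (1-based position, reference residue, new residue)


def control_substitutions(sequence: str, position: int, ref: str, alt: str) -> List[Substitution]:
    """List the same ref->alt replacement at every other occurrence of ref.

    Hypothetical helper: positions are 1-based, and the observed mutation
    itself is excluded so that only control sites are returned.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError("position is outside the sequence")
    if sequence[position - 1] != ref:
        raise ValueError("reference residue does not match the sequence")
    return [
        (i, ref, alt)
        for i, residue in enumerate(sequence, start=1)
        if residue == ref and i != position
    ]


# Toy example (made-up sequence): an R->W mutation observed at position 3.
print(control_substitutions("MARKRA", 3, "R", "W"))  # [(5, 'R', 'W')]
```

Each returned tuple could then be passed to a predictor in the same way as the observed mutation, so that the observed score is compared against scores for the identical residue change at other locations.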