Improving the orientation-dependent statistical potential using a reference state
Yufeng Liu
MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084 China
Search for more papers by this authorJianyang Zeng
Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, 100084 China
Search for more papers by this authorCorresponding Author
Haipeng Gong
MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084 China
Correspondence to: Haipeng Gong; MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China. E-mail: [email protected]Search for more papers by this authorYufeng Liu
MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084 China
Search for more papers by this authorJianyang Zeng
Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, 100084 China
Search for more papers by this authorCorresponding Author
Haipeng Gong
MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084 China
Correspondence to: Haipeng Gong; MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing 100084, China. E-mail: [email protected]Search for more papers by this authorABSTRACT
Statistical potentials are frequently engaged in the protein structural prediction and protein folding for conformational evaluation. Theoretically, to describe the many-body effect, pairwise interaction between two atom groups should be corrected by their relative geometric orientation. The potential functions developed by this means are called orientation-dependent statistical potentials and have exhibited substantially improved performance. However, none of the currently available orientation-dependent statistical potentials use any reference state, which has been proven to greatly enhance the power of distance-dependent statistical potentials in numerous previous studies. In this work, we designed a reasonable reference state for the orientation-dependent statistical potentials: using the average geometric relationship between atom pairs in known structures by neglecting their residue identities. The statistical potential developed using this reference state (called ORDER_AVE) prevails most available rival potentials in a series of tests on the decoy sets, although the information of side chain atoms (except the β-carbon) is absent in its construction. Proteins 2014; 82:2383–2393. © 2014 Wiley Periodicals, Inc.
Supporting Information
Additional Supporting Information may be found in the online version of this article.
Filename | Description |
---|---|
prot24600-sup-0001-suppinfo.pdf240.4 KB | Supplementary Information |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
REFERENCES
- 1Chellapa GD, Rose GD. Reducing the dimensionality of the protein-folding search problem. Protein Sci 2012; 21: 1231–1240.
- 2Rykunov D, Fiser A. New statistical potential for quality assessment of protein models and a survey of energy functions. BMC Bioinformatics 2010; 11: 128.
- 3Sippl MJ. Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 1990; 213: 859–883.
- 4Sippl MJ. Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures. J Comput-Aided Mol Des 1993; 7: 473–501.
- 5Samudrala R, Moult J. An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. J Mol Biol 1998; 275: 895–916.
- 6Lu H, kolnick J. A distance-dependent atomic knowledge-based potential for improved protein structure selection. Proteins 2001; 44: 223–232.
- 7Zhou H, Zhou Y. Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein Sci 2002; 11: 2714–2726.
- 8Zhang C, Liu S, Zhou H, Zhou Y. An accurate, residue-level, pair potential of mean force for folding and binding based on the distance-scaled, ideal-gas reference state. Protein Sci 2004; 13: 400–411.
- 9Shen MY, Sali A. Statistical potential for assessment and prediction of protein structures. Protein Sci 2006; 15: 2507–2524.
- 10Ferrada E, Vergara IA, Melo F. A knowledge-based potential with an accurate description of local interactions improves discrimination between native and near-native protein conformations. Cell Biochem Biophys 2007; 49: 111–124.
- 11Mirzaie M, Eslahchi C, Pezeshk H, Sadeghi M. A distance-dependent atomic knowledge-based potential and force for discrimination of native structures from decoys. Proteins 2009; 77: 454–463.
- 12Tanaka S, Scheraga HA. Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. Macromolecules 1976; 9: 945–950.
- 13Miyazawa S, Jernigan RL. Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules 1985; 18: 534–552.
- 14McConkey BJ, Sobolev V, Edelman M. Discrimination of native protein structures using atom-atom contact scoring. Proc Natl Acad Sci USA 2003; 100: 3215–3220.
- 15Arab S, Sadeghi M, Eslahchi C, Pezeshk H, Sheari A. A pairwise residue contact area-based mean force potential for discrimination of native protein structure. BMC Bioinformatics 2010; 11: 16.
- 16Khare SD, Fleishman SJ. Emerging themes in the computational design of novel enzymes and protein-protein interfaces. FEBS Lett 2013; 587: 1147–1154.
- 17Leaver-Fay A, Tyka M, Lewis SM, Lange OF, Thompson J, Jacak R, Kaufman K, Renfrew PD, Smith CA, Sheffler W, Davis IW, Cooper S, Treuille A, Mandell DJ, Richter F, Ban YE, Fleishman SJ, Corn JE, Kim DE, Lyskov S, Berrondo M, Mentzer S, Popovic Z, Havranek JJ, Karanicolas J, Das R, Meiler J, Kortemme T, Gray JJ, Kuhlman B, Baker D, Bradley P. ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 2011; 487: 545–574.
- 18Combs SA, Deluca SL, Deluca SH, Lemmon GH, Nannemann DP, Nguyen ED, Willis JR, Sheehan JH, Meiler J. Small-molecule ligand docking into comparative models with Rosetta. Nat Protoc 2013; 8: 1277–1298.
- 19Wu S, Skolnick J, Zhang Y. Ab initio modeling of small proteins by iterative TASSER simulations. BMC Biol 2007; 5: 17.
- 20Xu D, Zhang Y. Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins 2012; 80: 1715–1735.
- 21Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res 2000; 28: 235–242.
- 22Miyazawa S, Jernigan RL. How effective for fold recognition is a potential of mean force that includes relative orientations between contacting residues in proteins? J Chem Phys 2005; 122: 024901.
- 23Yang Y, Zhou Y. Specific interactions for ab initio folding of protein terminal regions with secondary structures. Proteins 2008; 72: 793–803.
- 24Yang Y, Zhou Y. Ab initio folding of terminal segments with secondary structures reveals the fine difference between two closely related all-atom statistical energy functions. Protein Sci 2008; 17: 1212–1219.
- 25Zhang J, Zhang Y. A novel side-chain orientation dependent potential derived from random-walk reference state for protein fold selection and structure prediction. PLoS One 2010; 5: e15386.
- 26Zhou H, Skolnick J. GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. Biophys J 2011; 101: 2043–2052.
- 27Chen L, He J. A distance and orientation dependent energy function of amino acid key blocks. Biopolymers 2014; 101: 681–692.
- 28Feng Y, Kloczkowski A, Jernigan RL. Four-body contact potentials derived from two protein datasets to discriminate native structures from decoys. Proteins 2007; 68: 57–66.
- 29Feng Y, Kloczkowski A, Jernigan RL. Potentials 'R' Us web-server for protein energy estimations with coarse-grained knowledge-based potentials. BMC Bioinformatics 2010; 11: 92.
- 30Gniewek P, Leelananda SP, Kolinski A, Jernigan RL, Kloczkowski A. Multibody coarse-grained potentials for native structure recognition and quality assessment of protein models. Proteins 2011; 79: 1923–1929.
- 31Faraggi E, Kloczkowski A. A global machine learning based scoring function for protein structure prediction. Proteins 2014; 82: 752–759.
- 32Sanchez-Gonzalez G, Kim JK, Kim DS, Garduno-Juarez R. A beta-complex statistical four body contact potential combined with a hydrogen bond statistical potential recognizes the correct native structure from protein decoy sets. Proteins 2013; 81: 1420–1433.
- 33Liu Y, Gong H. Using the unfolded state as the reference state improves the performance of statistical potentials. Biophys J 2012; 103: 1950–1959.
- 34Colubri A, Jha AK, Shen MY, Sali A, Berry RS, Sosnick TR, Freed KF. Minimalist representations and the importance of nearest neighbor effects in protein folding simulations. J Mol Biol 2006; 363: 835–857.
- 35DeBartolo J, Colubri A, Jha AK, Fitzgerald JE, Freed KF, Sosnick TR. Mimicking the folding pathway to improve homology-free protein structure prediction. Proc Natl Acad Sci USA 2009; 106: 3734–3739.
- 36Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol 1990; 215: 403–410.
- 37Fitzgerald JE, Jha AK, Colubri A, Sosnick TR, Freed KF. Reduced C(beta) statistical potentials can outperform all-atom potentials in decoy identification. Protein Sci 2007; 16: 2123–2139.
- 38Xue B, Faraggi E, Zhou Y. Predicting residue-residue contact maps by a two-layer, integrated neural-network method. Proteins 2009; 76: 176–183.
- 39Zou T, Ozkan SB. Local and non-local native topologies reveal the underlying folding landscape of proteins. Phys Biol 2011; 8: 066011.
- 40Park B, Levitt M. Energy functions that discriminate X-ray and near native folds from well-constructed decoys. J Mol Biol 1996; 258: 367–392.
- 41Simons KT, Kooperberg C, Huang E, Baker D. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol 1997; 268: 209–225.
- 42Samudrala R, Xia Y, Levitt M, Huang ES. A combined approach for ab initio construction of low resolution protein tertiary structures from sequence. Pac Symp Biocomput 1999: 505–516.
- 43Xia Y, Huang ES, Levitt M, Samudrala R. Ab initio construction of protein tertiary structures using a hierarchical approach. J Mol Biol 2000; 300: 171–185.
- 44Keasar C, Levitt M. A novel approach to decoy set generation: designing a physical energy function having local minima with native structure characteristics. J Mol Biol 2003; 329: 159–174.
- 45John B, Sali A. Comparative protein structure modeling by iterative alignment, model building and model assessment. Nucleic Acids Res 2003; 31: 3982–3992.
- 46Tsai J, Bonneau R, Morozov AV, Kuhlman B, Rohl CA, Baker D. An improved protein decoy set for testing energy functions for protein structure prediction. Proteins 2003; 53: 76–87.
- 47Qian B, Raman S, Das R, Bradley P, McCoy AJ, Read RJ, Baker D. High-resolution structure prediction and the crystallographic phase problem. Nature 2007; 450: 259–264.
- 48Moult J, Fidelis K, Kryshtafovych A, Tramontano A. Critical assessment of methods of protein structure prediction (CASP)—round IX. Proteins: Struct Funct Bioinformatics 2011; 79: 1–5.
- 49Melo F, Sanchez R, Sali A. Statistical potentials for fold assessment. Protein Sci 2002; 11: 430–448.
- 50Zhang Y, Skolnick J. Scoring function for automated assessment of protein structure template quality. Proteins: Struct Funct Bioinformatics 2004; 57: 702–710.
- 51Pappu RV, Srinivasan R, Rose GD. The Flory isolated-pair hypothesis is not valid for polypeptide chains: Implications for protein folding. Proc Natl Acad Sci 2000; 97: 12565–12570.
- 52Ohkubo YZ, Brooks CL. Exploring Flory's isolated-pair hypothesis: Statistical mechanics of helix–coil transitions in polyalanine and the C-peptide from RNase A. Proc Natl Acad Sci 2003; 100: 13916–13921.
- 53Fitzkee NC, Rose GD. Sterics and solvation winnow accessible conformational space for unfolded proteins. J Mol Biol 2005; 353: 873–887.
- 54Danielsson J, Andersson A, Jarvet J, Gräslund A. 15N relaxation study of the amyloid β-peptide: structural propensities and persistence length. Magn Reson Chem 2006; 44: S114–S121.
- 55Punta M, Rost B. PROFcon: novel prediction of long-range contacts. Bioinformatics 2005; 21: 2960–2968.
- 56Izarzugaza JM, Grana O, Tress ML, Valencia A, Clarke ND. Assessment of intramolecular contact predictions for CASP7. Proteins 2007; 69 Suppl 8: 152–158.