Recognition of coarse-grained protein tertiary structure
Corresponding Author
Timothy Lezon
Department of Physics, Pennsylvania State University, University Park, Pennsylvania
Department of Physics, 104 Davey Laboratory, Penn State, University Park, PA 16802===Search for more papers by this authorJayanth R. Banavar
Department of Physics, Pennsylvania State University, University Park, Pennsylvania
Search for more papers by this authorAmos Maritan
Dipartimento di Fisica G. Galileo, Via Marzolos, Italy
Search for more papers by this authorCorresponding Author
Timothy Lezon
Department of Physics, Pennsylvania State University, University Park, Pennsylvania
Department of Physics, 104 Davey Laboratory, Penn State, University Park, PA 16802===Search for more papers by this authorJayanth R. Banavar
Department of Physics, Pennsylvania State University, University Park, Pennsylvania
Search for more papers by this authorAmos Maritan
Dipartimento di Fisica G. Galileo, Via Marzolos, Italy
Search for more papers by this authorAbstract
A model of the protein backbone is considered in which each residue is characterized by the location of its Cα atom and one of a discrete set of conformal (ϕ, ψ) states. We investigate the key differences between a description that offers a locally precise fit to known backbone structures and one that provides a globally accurate fit to protein structures. Using a statistical scoring scheme and threading, a protein's local best-fit conformation is highly recognizable, but its global structure cannot be directly determined from an amino acid sequence. The incorporation of information about the conformal states of neighboring residues along the chain allows one to accurately translate the local structure into a global structure. We present a two-step algorithm, which recognizes up to 95% of the tested protein native-state structures to within a 2.5 Å root mean square deviation. Proteins 2004;55:000–000. © 2004 Wiley-Liss, Inc.
REFERENCES
- 1 Bryngelson JD, Wolynes PG. Spin glasses and the statistical-mechanics of protein folding. Proc Natl Acad Sci USA 1987; 84: 7524–7528.
- 2 Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG. Funnels, path-ways and the energy landscape of protein folding: a synthesis. Proteins 1995; 21: 167–195.
- 3 Wolynes PG, Onuchic JN, Thirumalai D. Navagating the folding routes. Science 1995; 267: 1619–1620.
- 4 Dill KA, Chan HS. From Levinthal to pathways to funnels. Nat Struc Biol 1997; 4: 10–19.
- 5 Baker D. A surprising simplicity to protein folding. Nature 2000; 405: 39–42.
- 6 Pauling L, Corey RB. Conformations of polypeptide chains with favored orientations around single bonds: two new pleated sheets. Proc Natl Acad Sci USA 1951; 37: 729–740.
- 7 Pauling L, Corey RB, Branson HR. The structure of proteins: two hydrogen-bonded helical conformations of the polypeptide chain. Proc Natl Acad Sci USA 1951; 37: 205–211.
- 8
Banavar JR,
Maritan A.
Computational approach to the protein folding problem.
Proteins
2001;
42:
433–435.
10.1002/1097-0134(20010301)42:4<433::AID-PROT10>3.0.CO;2-S CAS PubMed Web of Science® Google Scholar
- 9 Friedrichs MS, Wolynes PG. Toward protein tertiary structure recognition by means of associative memory Hamiltonians. Science 1989; 246: 371–373.
- 10 Goldstein R, Luthey-Schulten ZA, Wolynes PG. Optimal protein-folding codes from spin-glass theory. Proc Natl Acad Sci USA 1992; 89: 4918–4922.
- 11 Goldstein R, Luthey-Schulten ZA, Wolynes PG. Protein tertiary structure recognition using optimized Hamiltonians with local interactions. Proc Natl Acad Sci USA 1992; 89: 9029–9033.
- 12 Hao MH, Scheraga HA. How optimization of potential functions affects protein folding. Proc Natl Acad Sci USA 1996; 93: 4984–4989.
- 13 Hao MH, Scheraga HA. Optimizing potential functions for protein folding. J Phys Chem 1996; 100: 14540–14548.
- 14 Hendlich M, Lackner S, Witckus H, Floechner H, Froschauer R, Gotts-bachner K, Casari G, Sippl MJ. Identification of native protein folds amongst a large number of incorrect models; the calculation of low energy conformations from potentials of mean force. J Mol Biol 1990; 216: 167–180.
- 15 Kolinski A, Godzik A, Skolnick J. A general method for the prediction of the three-dimensional structure and folding pathway of globular proteins: application to designed helical proteins. J Chem Phys 1993; 98: 7400–7433.
- 16 Koretke KK, Luthey-Schulten AZ, Wolynes PG. Self-consistently optimized statistical mechanical energy functions for sequence structure alignment. Protein Sci 1996; 5: 1043–4059.
- 17 Maiorov VN, Crippen GM. Contact potential that recognizes the correct folding of globular proteins. J Mol Biol 1992; 227: 876–888.
- 18 Mirny LA, Shakhnovich EI. How to derive a protein folding potential? A new approach to an old problem. J Mol Biol 1996; 264: 1164–1179.
- 19 Miyazawa S, Jernigan RL. Estimation of effective inter-residue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules 1985; 18: 534–552.
- 20 Miyazawa S, Jernigan RL. Residue–residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol 1996; 256: 623–644.
- 21
Miyazawa S,
Jernigan RL.
Self-consistent estimation of inter-residue protein contact energies based on an equilibrium mixture approximation of residues.
Proteins
1999;
34:
49–68.
10.1002/(SICI)1097-0134(19990101)34:1<49::AID-PROT5>3.0.CO;2-L CAS PubMed Web of Science® Google Scholar
- 22 Sippl MJ. Detection of native-like models for amino acid sequences of unknown three-dimensional structure in a database of known protein conformations. Proteins 1992; 13: 258–271.
- 23 Skolnick J, Kolinski A. Simulations of folding of globular proteins. Science 1990; 250: 1121–1125.
- 24 Tanaka S, Sheraga HA. Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. Macromolecules 1976; 9: 945–950.
- 25 Zhang C, Kim S. Environment-dependent residue contact energies for proteins. Proc Natl Acad Sci USA 2000; 97: 2550–2555.
- 26 Maritan A, Micheletti C, Trovato A, Banavar JR. Optimal shapes of compact strings. Nature 2000; 406: 287–290.
- 27 Banavar JR, Maritan A, Micheletti C, Trovato A. Geometry and physics of proteins. Proteins 2002; 47: 315–322.
- 28 Banavar JR, Flammini A, Marenduzzo D, Maritan A, Trovato A. Geometry of compact tubes and protein structures. ComPlexUs 2003; 1: 4–13.
- 29 Banavar JR, Maritan A, Seno F. Anisotropic effective interactions in a coarse-grained tube picture of proteins. Proteins 2002; 49: 246–254.
- 30 Banavar JR, Maritan A. A comment on the protein folds as platonic forms. J Theor Biol 2003; 223: 263–265.
- 31 Banavar JR, Maritan A. Geometrical approach to protein folding: a tube picture. Rev Mod Phys 2003; 75: 23–34.
- 32 Banavar JR, Gonzalez O, Maddocks JH, Maritan A. Self-interactions of strands and sheets. J Stat Phys 2003; 110: 35–50.
- 33 Baldwin RL, Rose GD. Is protein folding hierarchic? I. Local structure and peptide folding. Trends Biochem Sci 1999; 24: 26–33.
- 34 Srinivasan R, Rose GD. Ab initio prediction of protein structure using LINUS. Proteins 2002; 47: 489–495.
- 35 Chothia C. One thousand families for the molecular biologist. Nature (Lond) 1992; 357: 543–544.
- 36 Jones DT, Taylort WR, Thornton JM. A new approach to protein fold recognition. Nature 1992; 358: 86–89.
- 37 Park BH, Levitt M. The complexity and accuracy of discrete state models of protein structure. J Mol Biol 1995; 249: 493–507.
- 38 Ramachandran GN, Sasisekharan V. Conformation of polypeptides and proteins. Adv Protein Chem 1968; 23: 283–438.
- 39 Krauth W, Mezard M. Learning algorithms with optimal stability in neural networks. J Phys A 1987; 20: L745–L752.
- 40 Wang G, Dunbrack RL Jr. PISCES: a protein sequence culling server. Bioinformatics 2003; 19: 1589–1591.
- 41 Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 1983; 22: 2577–2637.
- 42 Rooman MJ, Kocher JPA, Wodak SJ. Prediction of protein back-bone conformation based on seven structure assignments. J Mol Biol 1991; 221: 961–979.
- 43
Fano RM.
Transmission of information.
Cambridge, MA:
MIT Press;
1961.
10.1063/1.3057290 Google Scholar
- 44 Presta LG, Rose GD. Helix signals in proteins. Science 1988; 240: 1632–1641.
- 45 Aurora R, Srinivasan R, Rose GD. Rules for alpha-helix termination by glycine. Science 1994; 264: 1126–1130.
- 46 Richardson JS. The anatomy and taxonomy of protein structure. Adv Protein Chem 1981; 34: 167–339.
- 47 Fitzkee NC, Rose GD. Ramachandran revisited: steric restrictions beyond the dipeptide. Unpublished manuscript.