PIE—Efficient filters and coarse grained potentials for unbound protein–protein docking
D. V. S. Ravikant
Department of Computer Science, Cornell University, Ithaca, New York 14853
Search for more papers by this authorCorresponding Author
Ron Elber
Department of Chemistry and Biochemistry, Institute of Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas 78712
Department of Chemistry and Biochemistry, Institute of Computational Engineering and Sciences, University of Texas at Austin, 1 University Station, ICES, C0200, Austin, TX 78712===Search for more papers by this authorD. V. S. Ravikant
Department of Computer Science, Cornell University, Ithaca, New York 14853
Search for more papers by this authorCorresponding Author
Ron Elber
Department of Chemistry and Biochemistry, Institute of Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas 78712
Department of Chemistry and Biochemistry, Institute of Computational Engineering and Sciences, University of Texas at Austin, 1 University Station, ICES, C0200, Austin, TX 78712===Search for more papers by this authorAbstract
Identifying correct binding modes in a large set of models is an important step in protein–protein docking. We identified protein docking filter based on overlap area that significantly reduces the number of candidate structures that require detailed examination. We also developed potentials based on residue contacts and overlap areas using a comprehensive learning set of 640 two-chain protein complexes with mathematical programming. Our potential showed substantially better recognition capacity compared to other publicly accessible protein docking potentials in discriminating between native and nonnative binding modes on a large test set of 84 complexes independent of our training set. We were able to rank a near-native model on the top in 43 cases and within top 10 in 51 cases. We also report an atomic potential that ranks a near-native model on the top in 46 cases and within top 10 in 58 cases. Our filter+potential is well suited for selecting a small set of models to be refined to atomic resolution. Proteins 2010. © 2009 Wiley-Liss, Inc.
Supporting Information
Additional Supporting Information may be found in the online version of this article.
Filename | Description |
---|---|
PROT_22550_sm_suppinfo.pdf16.6 MB | Supporting Information. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
REFERENCES
- 1 SöllnerT,BennettMK,WhiteheartSW,SchellerRH,RothmanJE. A protein assembly-disassembly pathway in vitro that may correspond to sequential steps of synaptic vesicle docking, activation, and fusion. Cell 1993; 75: 409–418.
- 2 ArentsG,BurlingameRW,WangBC,LoveWE,MoudrianakisEN. The nucleosomal core histone octamer at 3.1 A resolution: a tripartite protein assembly and a left-handed superhelix. Proc Natl Acad Sci U S A 1991; 88: 10148–10152.
- 3 LesnéS,KohMT,KotilinekL,KayedR,GlabeCG,YangA,GallagherM,AsheKH. A specific amyloid-β protein assembly in the brain impairs memory. Nature 2006; 440: 352–357.
- 4 SachsK,PerezO,Pe'erD,LauffenburgerDA,NolanGP. Causal protein-signaling networks derived from multiparameter single-cell data. Science 2005; 308: 523–529.
- 5 HammHE. The many faces of G protein signaling. J Biol Chem 1998; 273: 669–672.
- 6 UetzP,GiotL,CagneyG,MansfieldTA,JudsonRS,KnightJR,LockshonD,NarayanV,SrinivasanM,PochartP,Qureshi-EmiliA,LiY,GodwinB,ConoverD,KalbfleischT,VijayadamodarG,YangM,JohnstonM,FieldsS,RothbergJM. A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 2000; 403: 623–627.
- 7 GandhiTKB,ZhongJ,MathivananS,KarthickL,ChandrikaKN,MohanSS,SharmaS,PinkertS,NagarajuS,PeriaswamyB,MishraG,NandakumarK,ShenB,DeshpandeN,NayakR,SarkerM,BoekeJD,ParmigianiG,SchultzJ,BaderJS,PandeyA. Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets. Nat Genet 2006; 38: 285–293.
- 8 GiotL,BaderJS,BrouwerC,ChaudhuriA,KuangB,LiY,HaoYL,OoiCE,GodwinB,VitolsE,VijayadamodarG,PochartP,MachineniH,WelshM,KongY,ZerhusenB,MalcolmR,VarroneZ,CollisA,MintoM,BurgessS,McDanielL,StimpsonE,SpriggsF,WilliamsJ,NeurathK,IoimeN,AgeeM,VossE,FurtakK,RenzulliR,AanensenN,CarrollaS,BickelhauptE,LazovatskyY,DaSilvaA,ZhongJ,StanyonCA,FinleyRL,Jr.;WhiteKP,BravermanM,JarvieT,GoldS,LeachM,KnightJ,ShimketsRA,McKennaMP,ChantJ,RothbergJM. A protein interaction map of Drosophila melanogaster. Science 2003; 302: 1727–1736.
- 9 WodakSJ,JaninJ. Computer analysis of protein-protein interaction. J Mol Biol 1978; 124: 323–342.
- 10 Katchalski-KatzirE,SharivI,EisensteinM,FriesemAA,AflaloC,VakserIA. Molecular surface recognition: determination of geometric fit between proteins and their ligands by correlation techniques. Proc Natl Acad Sci U S A 1992; 89: 2195–2199.
- 11 FischerD,LinSL,WolfsonHJ,NussinovR. A Geometry-based suite of molecular docking processes. J Mol Biol 1995; 248: 459–477.
- 12 AbagyanR, TotrovM, KuznetsovD. ICM—a new method for protein modeling and design: applications to docking and structure prediction from the distorted native conformation. J Comput Chem 1994; 15: 488–506.
- 13 GabbHA,JacksonRM,SternbergMJE. Modelling protein docking using shape complementarity, electrostatics and biochemical information. J Mol Biol 1997; 272: 106–120.
- 14
RitchieDW,KempGJL.
Protein docking using spherical polar Fourier correlations.
Proteins
2000;
39:
178–194.
10.1002/(SICI)1097-0134(20000501)39:2<178::AID-PROT8>3.0.CO;2-6 CAS PubMed Web of Science® Google Scholar
- 15 ChenR,WengZ. Docking unbound proteins using shape complementarity, desolvation, and electrostatics. Proteins 2002; 47: 281–294.
- 16 GrayJJ,MoughonS,WangC,Schueler-FurmanO,KuhlmanB,RohlCA,BakerD. Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. J Mol Biol 2003; 331: 281–299.
- 17
PalmaPN, KrippahlL,WamplerJE,MouraJJG.
BiGGER: a new (soft) docking algorithm for predicting protein interactions.
Proteins
2000;
39:
372–384.
10.1002/(SICI)1097-0134(20000601)39:4<372::AID-PROT100>3.0.CO;2-Q CAS PubMed Web of Science® Google Scholar
- 18 MandellJG,RobertsVA,PiqueME,KotlovyiV,MitchellJC,NelsonE,TsigelnyI,Ten EyckLF. Protein docking using continuum electrostatics and geometric fit. Protein Eng 2001; 14: 105–113.
- 19 CamachoCJ,VajdaS. Protein docking along smooth association pathways. Proc Natl Acad Sci U S A 2001; 98: 10636–10641.
- 20
MoontG,GabbHA,SternbergMJE.
Use of pair potentials across protein interfaces in screening predicted docked complexes.
Proteins
1999;
35:
364–373.
10.1002/(SICI)1097-0134(19990515)35:3<364::AID-PROT11>3.0.CO;2-4 CAS PubMed Web of Science® Google Scholar
- 21 KeskinO,BaharI,BadretdinovAY,PtitsynOB,JerniganRL. Empirical solvent-mediated potentials hold for both intra-molecular and inter-molecular inter-residue interactions. Protein Sci 1998; 7: 2578–2586.
- 22
GlaserF,SteinbergDM,VakserIA,Ben-TalN.
Residue frequencies and pairing preferences at protein-protein interfaces.
Proteins
2001;
43:
89–102.
10.1002/1097-0134(20010501)43:2<89::AID-PROT1021>3.0.CO;2-H CAS PubMed Web of Science® Google Scholar
- 23 LuH,LuL,SkolnickJ. Development of unified statistical potentials describing protein-protein interactions. Biophys J 2003; 84: 1895–1901.
- 24 MiyazawaS,JerniganRL. Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules 1985; 18: 534–552.
- 25 MaiorovVN,GrippenGM. Contact potential that recognizes the correct folding of globular proteins. J Mol Biol 1992; 227: 876–888.
- 26
VendruscoloM,NajmanovichR,DomanyE.
Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading?
Proteins
2000;
38:
134–148.
10.1002/(SICI)1097-0134(20000201)38:2<134::AID-PROT3>3.0.CO;2-A CAS PubMed Web of Science® Google Scholar
- 27
TobiD,ShafranG,LinialN,ElberR.
On the design and analysis of protein folding potentials.
Proteins
2000;
40:
71–85.
10.1002/(SICI)1097-0134(20000701)40:1<71::AID-PROT90>3.0.CO;2-3 CAS PubMed Web of Science® Google Scholar
- 28 TobiD,BaharI. Optimal design of protein docking potentials: efficiency and limitations. Proteins 2006; 62: 970–981.
- 29 BordnerAJ,GorinAA. Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces. BMC Bioinformatics 2008; 9: 234.
- 30 RobertCH,JaninJ. A soft, mean-field potential derived from crystal contacts for predicting protein-protein interactions. J Mol Biol 1998; 283: 1037–1047.
- 31 LiuS,ZhangC,ZhouH,ZhouY. A physical reference state unifies the structure-derived potential of mean force for protein folding and binding. Proteins 2004; 56: 93–101.
- 32 MartinO,SchomburgD. Efficient comprehensive scoring of docked protein complexes using probabilistic support vector machines. Proteins 2008; 70: 1367–1378.
- 33 MintserisJ,WieheK,PierceB,AndersonR,ChenR,JaninJ,WengZ. Protein-protein docking benchmark 2.0: an update. Proteins 2005; 60: 214–216.
- 34 LeGrandSM,MerzKM. Rapid approximation to molecular surface area via the use of Boolean logic and look-up tables. J Comput Chem 1993; 14: 349–352.
- 35 JorgensenWL,Tirado-RivesJ. The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. J Am Chem Soc 1988; 110: 1657–1666.
- 36 AltschulSF,MaddenTL,SchafferAA,ZhangJ,ZhangZ,MillerW,LipmanDJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl Acids Res 1997; 25: 3389–3402.
- 37 NeedlemanSB,WunschCD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 1970; 48: 443–453.
- 38 EswarN,WebbB,Marti-RenomMA,MadhusudhanMS,EramianD,ShenMY,PieperU,SaliA. Comparative protein structure modeling using MODELLER. In: JE Coligan, BM Dunn, DW Speicher, PT Wingfield, editors. Current protocols in protein science. Hoboken, NJ: John Wiley & Sons, Inc; 2007: 2.9.1–2.9.31.
- 39 WagnerM,MellerJ,ElberR. Large-scale linear programming techniques for the design of protein folding potentials. Math Prog 2004; 101: 301–318.
- 40 ZhangY,SkolnickJ. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res 2005; 33: 2302–2309.
- 41 MintserisJ,PierceB,WieheK,AndersonR,ChenR,WengZ. Integrating statistical pair potentials into protein complex prediction. Proteins 2007; 69: 511–520.
- 42 DuhovnyD,NussinovR,WolfsonHJ. Efficient unbound docking of rigid molecules. Proceedings of the Second International Workshop on Algorithms in Bioinformatics. New York. Springer-Verlag; 2002.
- 43 PierceB,WengZ. ZRANK: reranking protein docking predictions with an optimized energy function. Proteins 2007; 67: 1078–1086.
- 44 MéndezR,LeplaeR,MariaLD,WodakSJ. Assessment of blind predictions of protein-protein interactions: current status of docking methods. Proteins 2003; 52: 51–67.
- 45 PierceB,WengZ. A combination of rescoring and refinement significantly improves protein docking performance. Proteins 2008; 72: 270–279.
- 46 BoeckmannB,BairochA,ApweilerR,BlatterM-C,EstreicherA,GasteigerE,MartinMJ,MichoudK,O'DonovanC,PhanI,PilboutS,SchneiderM. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res 2003; 31: 365–370.
- 47 ChuangG-Y,KozakovD,BrenkeR,ComeauSR,VajdaS. DARS (decoys as the reference state) potentials for protein-protein docking. Biophys J 2008; 95: 4217–4227.
- 48 KabschW,SanderC. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 1983; 22: 2577–2637.
- 49 Pinak ChakrabartiJJ. Dissecting protein-protein recognition sites. Proteins 2002; 47: 334–343.
- 50 RanjitPB. Dissecting subunit interfaces in homodimeric proteins. Proteins 2003; 53: 708–719.
- 51 AnashkinaA, KuznetsovE,EsipovaN,TumanyanV. Comprehensive statistical analysis of residues interaction specificity at protein-protein interfaces. Proteins 2007; 67: 1060–1077.
- 52 MintserisJ,WengZ. Atomic contact vectors in protein-protein recognition. Proteins 2003; 53: 629–639.
- 53 AnsariS,HelmsV. Statistical analysis of predominantly transient protein-protein interfaces. Proteins 2005; 61: 344–355.