Volume 78, Issue 1 pp. 181-196
Research Article

Quantifying the evolutionary divergence of protein structures: The role of function change and function conservation

Alberto Pascual-García

Alberto Pascual-García

Centro de Biología Molecular ‘Severo Ochoa’ (CSIC-UAM), Cantoblanco, Madrid 28049, Spain

Search for more papers by this author
David Abia

David Abia

Centro de Biología Molecular ‘Severo Ochoa’ (CSIC-UAM), Cantoblanco, Madrid 28049, Spain

Search for more papers by this author
Raúl Méndez

Raúl Méndez

Centro de Biología Molecular ‘Severo Ochoa’ (CSIC-UAM), Cantoblanco, Madrid 28049, Spain

Search for more papers by this author
Gonzalo S. Nido

Gonzalo S. Nido

Centro de Biología Molecular ‘Severo Ochoa’ (CSIC-UAM), Cantoblanco, Madrid 28049, Spain

Search for more papers by this author
Ugo Bastolla

Corresponding Author

Ugo Bastolla

Centro de Biología Molecular ‘Severo Ochoa’ (CSIC-UAM), Cantoblanco, Madrid 28049, Spain

Centro de Biologia Molecular ‘Severo Ochoa’, (CSIC-UAM), Cantoblanco, Madrid 28049, Spain===Search for more papers by this author
First published: 27 November 2009
Citations: 33

The authors state no conflict of interest.

Abstract

The molecular clock hypothesis, stating that protein sequences diverge in evolution by accumulating amino acid substitutions at an almost constant rate, played a major role in the development of molecular evolution and boosted quantitative theories of evolutionary change. These studies were extended to protein structures by the seminal paper by Chothia and Lesk, which established the approximate proportionality between structure and sequence divergence. Here we analyse how function influences the relationship between sequence and structure divergence, studying four large superfamilies of evolutionarily related proteins: globins, aldolases, P-loop and NADP-binding. We introduce the contact divergence, which is more consistent with sequence divergence than previously used structure divergence measures. Our main findings are: (1) Small structure and sequence divergences are proportional, consistent with the molecular clock. Approximate validity of the clock is also supported by the analysis of the clustering coefficient of structure similarity networks. (2) Functional constraints strongly limit the structure divergence of proteins performing the same function and may allow to identify incomplete or wrong functional annotations. (3) The rate of structure versus sequence divergence is larger for proteins performing different functions than for proteins performing the same function. We conjecture that this acceleration is due to positive selection for new functions. Accelerations in structure divergence are also suggested by the analysis of the clustering coefficient. (4) For low sequence identity, structural diversity explodes. We conjecture that this explosion is related to functional diversification. (5) Large indels are almost always associated with function changes. Proteins 2010. © 2009 Wiley-Liss, Inc.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.