Volume 61, Issue 3 pp. 473-480
Prediction Report

Predicting protein secondary structure and solvent accessibility with an improved multiple linear regression method

Sanbo Qin

Sanbo Qin

National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, People's Republic of China

Graduate School of the Chinese Academy of Sciences, Beijing, People's Republic of China

Search for more papers by this author
Yun He

Yun He

National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, People's Republic of China

Graduate School of the Chinese Academy of Sciences, Beijing, People's Republic of China

Search for more papers by this author
Xian-Ming Pan

Corresponding Author

Xian-Ming Pan

National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, People's Republic of China

Department of Biological Sciences and Biotechnology, Tsinghua University, Beijing, People's Republic of China

Department of Biological Sciences and Biotechnology, Tsinghua University, Beijing, People's Republic of China===Search for more papers by this author
First published: 08 September 2005
Citations: 15

Abstract

We have improved the multiple linear regression (MLR) algorithm for protein secondary structure prediction by combining it with the evolutionary information provided by multiple sequence alignment of PSI-BLAST. On the CB513 dataset, the three states average overall per-residue accuracy, Q3, reached 76.4%, while segment overlap accuracy, SOV99, reached 73.2%, using a rigorous jackknife procedure and the strictest reduction of eight states DSSP definition to three states. This represents an improvement of approximately 5% on overall per-residue accuracy compared with previous work. The relative solvent accessibility prediction also benefited from this combination of methods. The system achieved 77.7% average jackknifed accuracy for two states prediction based on a 25% relative solvent accessibility mode, with a Mathews' correlation coefficient of 0.548. The improved MLR secondary structure and relative solvent accessibility prediction server is available at http://spg.biosci.tsinghua.edu.cn/. Proteins 2005. © 2005 Wiley-Liss, Inc.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.