Volume 37, Issue 2 pp. 88-96

Predicting Protein Subcellular Location Using Digital Signal Processing

Yu-Xi PAN

Yu-Xi PAN

Bio-X Life Science Research Center, Shanghai Jiaotong University, Shanghai 200030, China;

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Search for more papers by this author
Da-Wei LI

Da-Wei LI

Bio-X Life Science Research Center, Shanghai Jiaotong University, Shanghai 200030, China;

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Search for more papers by this author
Yun DUAN

Yun DUAN

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Bio-X Life Science Research Center, Shanghai Jiaotong University, Shanghai 200030, China;

Search for more papers by this author
Zhi-Zhou ZHANG

Zhi-Zhou ZHANG

Bio-X Life Science Research Center, Shanghai Jiaotong University, Shanghai 200030, China;

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Search for more papers by this author
Ming-Qing XU

Ming-Qing XU

Bio-X Life Science Research Center, Shanghai Jiaotong University, Shanghai 200030, China;

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Search for more papers by this author
Guo-Yin FENG

Guo-Yin FENG

Bio-X Life Science Research Center, Shanghai Jiaotong University, Shanghai 200030, China;

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Search for more papers by this author
Lin HE

Corresponding Author

Lin HE

Institute for Nutritional Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Science, Shanghai 200030, China;

Neuropsychiatric & Human Genetics Group, Bio-X Center, Shanghai Jiaotong University, Shanghai 200030, China

*Corresponding author: Tel, 86-21-62822491; Fax, 86-21-62822491; E-mail, [email protected] & [email protected]Search for more papers by this author
First published: 09 February 2005
Citations: 2

This work was supported by the grants from the Major State Basic Research Development Program of China (No. 001CB510301), the National High Technology Research and development Program of China (No. 2002AA223021), the National Natural Science Foundation of China, and Shanghai Municipal Commission for Science and Technology

Abstract

Abstract The biological functions of a protein are closely related to its attributes in a cell. With the rapid accumulation of newly found protein sequence data in databanks, it is highly desirable to develop an automated method for predicting the subcellular location of proteins. The establishment of such a predictor will expedite the functional determination of newly found proteins and the process of prioritizing genes and proteins identified by genomic efforts as potential molecular targets for drug design. The traditional algorithms for predicting these attributes were based solely on amino acid composition in which no sequence order effect was taken into account. To improve the prediction quality, it is necessary to incorporate such an effect. However, the number of possible patterns in protein sequences is extremely large, posing a formidable difficulty for realizing this goal. To deal with such difficulty, a well-developed tool in digital signal processing named digital Fourier transform (DFT) [1] was introduced. After being translated to a digital signal according to the hydrophobicity of each amino acid, a protein was analyzed by DFT within the frequency domain. A set of frequency spectrum parameters, thus obtained, were regarded as the factors to represent the sequence order effect. A significant improvement in prediction quality was observed by incorporating the frequency spectrum parameters with the conventional amino acid composition. One of the crucial merits of this approach is that many existing tools in mathematics and engineering can be easily applied in the predicting process. It is anticipated that digital signal processing may serve as a useful vehicle for many other protein science areas.

Edited by Jun YU

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.