Identification of putative plant cold responsive regulatory elements by gene expression profiling and a pattern enumeration algorithm
Joel Kreps
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
These authors are equal contributors. Present address: Diversa Corporation, 4955 Directors Place, San Diego, CA 92121-1609, USA
Search for more papers by this authorPaul Budworth
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
These authors are equal contributors. Present address: Diversa Corporation, 4955 Directors Place, San Diego, CA 92121-1609, USA
Search for more papers by this authorSteve Goff
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
Present address: Syngenta Biotechnology Inc., 3054 Cornwallis Road, Research Triangle Park, NC 27709, USA
Search for more papers by this authorCorresponding Author
Ronglin Wang
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
Present address: Syngenta Biotechnology Inc., 3054 Cornwallis Road, Research Triangle Park, NC 27709, USA
Correspondence (tel +1 919 765 5114; fax +1 919 541 8557; e-mail [email protected])Search for more papers by this authorJoel Kreps
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
These authors are equal contributors. Present address: Diversa Corporation, 4955 Directors Place, San Diego, CA 92121-1609, USA
Search for more papers by this authorPaul Budworth
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
These authors are equal contributors. Present address: Diversa Corporation, 4955 Directors Place, San Diego, CA 92121-1609, USA
Search for more papers by this authorSteve Goff
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
Present address: Syngenta Biotechnology Inc., 3054 Cornwallis Road, Research Triangle Park, NC 27709, USA
Search for more papers by this authorCorresponding Author
Ronglin Wang
Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA
Present address: Syngenta Biotechnology Inc., 3054 Cornwallis Road, Research Triangle Park, NC 27709, USA
Correspondence (tel +1 919 765 5114; fax +1 919 541 8557; e-mail [email protected])Search for more papers by this authorSummary
A pattern enumeration algorithm named GBSSR has been developed to analyse co-expressed gene groups identified through gene chip expression profiling to search for putative cis-regulatory elements, an important step toward understanding transcriptional factors, quantitative trait loci and gene regulatory networks. Without making any statistical assumptions, this algorithm establishes the frequency distribution of all eligible 6–15 bp strings by extensive bootstrap sampling from an entire genome worth of promoters, enabling those over-represented in a co-expressed gene group to be identified. Using a well-studied plant cold responsive gene system as a positive control, several known cold responsive elements were identified as top ranking candidates, along with some potentially novel ones. A typical analysis of 40 co-expressed genes takes a relatively inexpensive Linux cluster with 32 × 1.4 GHz Intel CPUs about 7 days to process.
References
- Altman, R.B. and Raychaudhuri, S. (2001) Whole genome expressin analysis: challenges beyond clustering. Curr. Opin. Struct. Biol. 11, 340–347.
- Arnone, M. and Davidson, E. (1997) The hardwiring of development: organization and function of genomic regulatory systems. Development, 124, 1851–1864.
- Bailey, T.L. and Elkan, C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In: Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28–36. Menlo Park, CA: AAAI Press.
- Baker, S.S., Wilhelm, K.S. and Thomashow, M.F. (1994) The 5′-region of Arabidopsis thaliana cor15a has cis-acting elements that confer cold-, drought- and ABA-regulated gene expression. Plant Mol. Biol. 24, 701–713.
- Birnbaum, K., Benfey, P.N. and Shasha, D.E. (2001) cis Element/Transcriptional factor analysis (cis/TF): a method for discovering transcriptional factor/cis element relationships. Genome Res. 11, 1567–1573.
- Brazma, A., Jonassen, I., Eidhammer, I. and Gilbert, D. (1998a) Approaches to the automatic discovery of patterns in biosequences. J. Comput. Biol. 5, 279–305.
- Brazma, A., Jonassen, I., Vilo, J. and Ukkonen, E. (1998b) Predicting gene regulatory elements in silico on a genomic scale. Genome Res. 8, 1202–1215.
- Chang, C.W. and Sun, T.P. (2002) Characterization of cis-regulatory regions responsible for developmental regulation of the gebberellin biosynthetic gene GA1 in Arabidopsis thaliana. Plant Mol Biol. 49, 579–589.
- Dunn, M.A., White, A.J., Vural, S. and Hughes, M.A. (1998) Identification of promoter elements in a low-temperature-responsive gene (blt4.9) from barley (Hordeum vulgare L.). Plant Mol. Biol. 38, 551–564.
-
Efron, B. and
Tibshirani, R.J. (1993) An Introduction to the Bootstrap. Chapman & Hall/CRC.
10.1007/978-1-4899-4541-9 Google Scholar
- Frith, M.C., Hansen, U. and Weng, Z. (2001) Detection of cis-element clusters in higher eukaryotic DNA. Bioinformatics, 17, 878–889.
- Fujibuchi, W., Anderson, J.S. and Landsman, D. (2001) PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches. Nucl. Acids Res. 29, 3988–3996.
- Ge, H., Liu, Z., Church, G.M. and Vidal, M. (2001) Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae. Nature Genet. 29, 482–486.
- Harmer, S.L., Hogenesch, J.B., Straume, M., Chang, H.S., Han, B., Zhu, T., Wang, X., Kreps, J.A. and Kay, S.A. (2000) Orchestrated transcription of key pathways in Arabidopsis by the circadian clock. Science, 290, 2110–2113.
- Van Helden, J., Andre, B. and Collado-Vides, J. (1998) Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J. Mol. Biol. 281, 827–842.
- Van Helden, J., Rios, A.F. and Collado-Vides, J. (2000) Discovering regulatory elements in non-coding sequences by analysis of spaced dyads. Nucl. Acids Res. 28, 1808–1818.
- Higo, K., Ugawa, Y., Iwamoto, M. and Korenaga, T. (1999) Plant cis-acting regulatory DNA elements (PLACE) database. Nucl. Acids Res. 27, 297–300.
- Hu, Y.-J., Sandmeyer, S., McLaughlin, C. and Kibler, D. (2000) Combinatorial motif analysis and hypothesis generation on a genome scale. Bioinformatics, 16, 222–232.
- Hughes, J.D., Estep, P.W., Tavazoie, S. and Church, G.M. (2000) Computational identifcation of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J. Mol. Biol. 296, 1205–1214.
- Kielbasa, S.M., Korbel, J.O., Beule, D., Schuchhardt, J. and Herzel, H. (2001) Combining frequency and positional information to predict transcription factor binding sites. Bioinformatics, 17, 1019–1026.
- Kim, J.C., Lee, S.H., Cheong, Y.H., Yoo, C.M., Lee, S.I., Chun, H.J., Yun, D.J., Hong, D.J., Lee, S.Y., Lim, C.O. and Cho, M.J. (2001) A novel cold-inducible zinc finger protein from soybean, SCOF-1, enhances cold tolerance in transgenic plants. Plant J. 25, 247–259.
- Kreps, J.A., Wu, Y., Chang, H.-S., Zhu, T., Wang, X. and Harper, J.F. (2002) Transcriptome changes for Arabidopsis in response to salt, osmotic and cold stress. Plant Physiol. in press.
- Lawrence, C., Altschul, S., Boguski, M., Liu, J., Neuwald, A. and Wootton, J. (1993) Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science, 262, 208–213.
- Liu, X., Brutlag, D.L. and Liu, J.S. (2001) Bioprospector: discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes. PAC Symp. Biocomput. 127–138.
- Loots, G.G., Locksley, R.M., Blankespoor, C.M., Wang, Z.E., Miller, W., Rubin, E.M. and Frazer, K.A. (2000) Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science, 288, 136–140.
- Ohler, U. and Niemann, H. (2001) Identification and analysis of eukaryotic promoters: Recent computational approaches. Trends Genet. 17, 56–60.
- Pilpel, Y., Sudarsanam, P. and Church, G. (2001) Identifying regulatory networks by combinatorial analysis of promoter elements. Nature Genet. 29, 153–159.
- Schwechheimer, C., Zourelidou, M. and Bevan, M.W. (1998) Plant transcription factor studies. Annu. Rev. Plant Physiol. Plant Mol. Biol. 49, 127–150.
- Spellman, P.T., Sherlock, G., Zhang, M.Q., Iyer, V.R., Anders, K., Eisen, M., Brown, P.O., Botstein, D. and Futcher, B. (1998) Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol. Biol. Cell, 9, 3273–3297.
- Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J. and Church, G.M. (1999) Systematic determination of genetic network architecture. Nature Genet. 22, 281–285.
- Thomashow, M.F. (2001) So what's new in the field of plant cold acclimation? Lots! Plant Physiol. 125, 89–93.
- Watanabe, M., Rebbert, M.L., Andreazzoli, M., Takahashi, N., Toyama, R., Zimmerman, S., Whitman, M. and Dawid, I.B. (2002) Regulation of the Lim-1 gene is mediated through conserved FAST-1/FoxH1 sites in the first intron. Dev. Dyn. 225, 448–456.
- Yamaguchi-Shinozaki, K. and Shinozaki, K. (1994) A novel cis-acting element in an Arabidopsis gene is involved in responsiveness to drought, low-temperature, or high-salt stress. Plant Cell, 6, 251–264.
- Zhu, J.K. (2001) Cell signaling under salt, water and cold stresses. Curr. Opin. Plant Biol. 4, 401–406.
- Zhu, J.K. (2002) Salt and Drought Stress Signal Transduction in Plants. Annu. Rev. Plant Physiol. Plant Mol. Biol. 53, 247–273.
- Zhu, T. and Wang, X. (2000) Large-scale profiling of the arabidopsis transcriptome. Plant Physiol. 124, 1472–1476.