Polar zippers: Their role in human disease
Corresponding Author
Max Perutz
MRC Laboratory of Molecular Biology, Cambridge CB2 2QH, United Kingdom
MRC Laboratory of Molecular Biology, Cambridge CB2 2QH, UKSearch for more papers by this authorCorresponding Author
Max Perutz
MRC Laboratory of Molecular Biology, Cambridge CB2 2QH, United Kingdom
MRC Laboratory of Molecular Biology, Cambridge CB2 2QH, UKSearch for more papers by this authorAbstract
Ascaris hemoglobin consists of 8 subunits, each of which contains a C-terminal peptide with the sequence Glu-Glu-Lys-His repeated 4 times. When plotted on a β-strand, this sequence leads to alternate lysines and glutamates on one side of the strand, and alternate glutamates and histidines on the other side, suggestive of a polar zipper that links the subunits together. A computer search of the protein database showed that the same or similar sequences also occur in other proteins. Some contain long repeats of Asp-Arg or Glu-Arg, among them the small nuclear ribonucleo-U1 70K protein, which is an autoantigen in systemic lupus erythematosis. These repeats appear to constitute the dominant epitopes in the autoimmune reaction. Single chains with Asp-Arg repeats may form α-helices in which alternate positively charged ridges and negatively charged grooves compensate each other. Several separate chains with Asp-Arg repeats could compensate each other's charges optimally by zipping together to β-sheets.
Several homeodomains of Drosophila, as well as the human transcription factor SP1, contain repeats of glutamines. Molecular modeling, circular dichroism, and electron and X-ray diffraction studies of a synthetic poly(L-glutamine) showed that it forms β-sheets held together by hydrogen bonds between the main-chain and side-chain amides. Published data suggest that the function of these glutamine repeats consists of joining essential transcription factors bound to distant segments of DNA.
The study of the structure and function of glutamine repeats has assumed medical importance with the discovery that Huntington's disease and 3 other dominantly inherited diseases are associated with a lengthening of glutamine repeats in the proteins coded for by the affected genes.
References
- Ali N, Bienz M. 1991. Functional dissection of abdominal-B protein. Mech Dev 35: 55–64.
- Ashida T, Tanaka I, Yamane T. 1981. β-Pleated sheets in oligopeptide crystals. Int J Pept Protein Res 17: 322–329.
- Bouini NM, Leiserson WM, Benzer S. 1993. The eyes absent gene: Genetic control of cell survival and differentiation in the developing Drosophila eye. Cell 72: 379–395.
- Brack A, Orgel LE. 1975. β-Structures of alternating polypeptides and their possible prebiotic significance. Nature 256: 383–387.
- Courey AJ, Holtzman DA, Jackson SP, Tjian R. 1989. Synergistic activations by the glutamine-rich domains of human transcription factor SP1. Cell 59: 827–836.
- Courey AJ, Tjian R. 1988. Analysis of Sp1 in vivo reveals multiple transcriptional domains, including a novel glutamine-rich motif. Cell 55: 887–898.
- Cowan PM, McGavin S. 1955. Structure of poly-L-proline. Nature 176: 501–503.
- Darawshe S, Tsafadyah Y, Daniel E. 1987. Quaternary structure of erythrocruorin from the nematode Ascaris suum. Biochem J 242: 689–694.
- De Baere I, Liu L, Moens L, Van Beeumen J, Gielens C, Richelle J, Trolman C, Finch J, Gerstein M, Perutz M. 1992. Polar zipper sequence in the high affinity hemoglobin of Ascaris suum: Amino acid sequence and structural interpretations. Proc Natl Acad Sci USA 89: 4638–4642.
- Etzerodt M, Vignali V, Ciliberto G, Scherly D, Mattaj IW, Philipson L. 1988. Structure and expression of a Xenopus gene encoding an snRNP protein (U1 70K). EMBO J 7: 4311–4321.
- Greenfield N, Fasman GD. 1969. Computed circular dichroism spectra for the evaluation of protein conformation. Biochemistry 8: 4108–4116.
- Hammarback JA, Obar RA, Hughes SM, Vallee RB. 1991. MAP IB is encoded as a polyprotein that is processed to form a complex N-terminal microtubules-binding domain. Neuron 7: 129–139.
- Haynes SR, Mozer BA, Bhatia-Dey N, Dawid IB. 1989. The Drosophila fsh locus, a maternal effect homeotic gene, encodes apparent membrane proteins. Dev Biol 134: 246–257.
- Hennessey JP, Johnson WC Jr. 1981. Information content in the circular dichroism of proteins. Biochemistry 20: 1085–1094.
- Hoogeveen AT, Willemsen R, Meyer N, de Rooij KE, Roos RAC, van Ommen GJB, Galjaard H. 1993. Characterization and localization of the Huntington disease gene products. Hum Mol Genet 2: 2069–2073.
- Huntington's Disease Collaborative Research Group. 1993. A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington's disease chromosomes. Cell 72: 971–983.
- Kadonaga JT, Carner KR, Masiarz FR, Tjian R. 1987. Isolation of cDNA encoding transcription factor SP1 and functional analysis of the DNA binding domain. Cell 51: 1079–1090.
- Kao CC, Lieberman PM, Schmidt MC, Zhou Q, Pei R, Berk AJ. 1990. Cloning of a transcriptionally active human TATA binding factor. Science 248: 1646–1650.
- Koide R, Ikeuchi T, Onodera O, Tanaka H, Igarashi S, Endo K, Takahashi H, Kondo R, Ishikawa A, Hayashi T, Saito M, Tomoda A, Mijke T, Naito H, Ikuta F, Tsuji S. 1994. Unstable expansion of CAG repeat in hereditary dentatorubral-pallidoluysian atrophy (DRPLA). Nature Genet 6: 9–13.
- Landschulz WH, Johnson PF, McKnight SL. 1988. The leucine zipper: A hypothetical structure common to a new class of proteins. Science 240: 1759–1764.
- La Spada AR, Roling DB, Harding EA, Warner CL, Spiegel R, Hausmanova-Petrusevicz I, Yee WC, Fischbeck KH. 1992. Meiotic stability and genotype-phenotype correlation of the trinucleotide repeat in X-linked spinal and bulbar muscular atrophy. Nature Genet 2: 301–304.
- La Spada AR, Wilson EM, Lubahn DB, Harding AE, Fischbeck KH. 1991. Androgen receptor gene mutations in X-linked spinal and bulbar muscular atrophy. Nature 352: 77–79.
- Leijonmark M, Liljas A. 1987. Structure of the C-terminal domain of the ribosomal protein L7/L12 from Escherichia coli at 1.7 Å. J Mol Biol 195: 555–580.
- Lévi-Strauss M, Carrol MC, Steinmetz M, Meo T. 1988. A previously undetected MHC gene with an unusual periodic structure. Science 240: 201–204.
- Liljas A. 1982. Structural studies of ribosomes. Prog Biophys Mol Biol 40: 161–228.
- Maassen JA, Schop EN, Brands HMG, Van Hemert J, Leustra A, Möller W. 1985. Molecular cloning and analysis of cDNA sequences for two ribosomal proteins from Artemia. Eur J Biochem 149: 609–616.
- Mack JW, Steven AC, Steinert PM. 1993. The mechanism of interaction of filaggrin with intermediate filaments. J Mol Biol 232: 50–66.
- Mancebo R, Lo PCH, Mount SM. 1990. Structure and expression of the Drosophila melanogaster gene for the U1 ribonuclear-protein particle 70K protein. Mol Cell Biol 90: 2492–2502.
- Martin JB. 1993. Molecular genetics of neurological diseases. Science 262: 674–676.
- Matilla T, Volpini V, Genis D, Rosell J, Corral J, Davalos A, Molins A, Estivill X. 1993. Presymptomatic analysis of spinocerebellar atoxia type 1 (SCA1) via the expansion of the SLA1 CAG-repeat in a large pedigree displaying anticipation and parental male bias. Hum Mol Genet 2: 2123–2128.
- McLachlan AD, Karn J. 1982. Periodic charge distributions in the myosin rod amino acid sequence match cross bridge spacings in muscle. Nature 299: 226–231.
- Mhatre AN, Trifiro MA, Kaufman M, Kazemi-Esfarani P, Figlewicz D, Rouleau G, Pinsky L. 1993. Reduced transcriptional regulatory competence of the androgen receptor in X-linked spinal and bulbar muscular atrophy. Nature Genet 5: 184–188.
- Noble M, Lewis SA, Cowan NJ. 1989. The microtubule-binding domain of microtubule-associated protein MAP1B contains a repeated sequence motif unrelated to that of MAP2 and tau. J Cell Biol 109: 3367–3376.
- Orr MT, Chung MY, Banfi S, Kwiatkovski TJ Jr, Servadio A, Beaudet AL, McCall AE, Duvick LA, Ranum LPW, Zhoghbi HY. 1993. Expansion of an unstable binucleotide CAG repeat in spinocerebellar ataxia type 1. Nature Genet 4: 221–226.
- O'Shea EK, Klem JD, Kim PS, Alber T. 1991. X-ray structure of the GCN4 leucine zipper, a two-stranded parallel coiled coil. Science 254: 539–544.
- Pelsue S, Agris PF. 1994a. Immunoreactivity between a monoclonal lupus autoantibody and the arginine/aspartic acid repeats within the U1-snRNP 70K autoantigen is conformationally restricted. J Protein Chem 13: 401–408.
- Pelsue S, Agris PF. 1994b. Gene usage by an anti U1 snRNP 70 monoclonal autoantibody derived from a lupus-prone mouse. J Autoimmun 7: 165–177.
- Pelsue S, Jung KD, Agris PF. 1993. Immunochemical analysis of an arginine-rich systemic Lupus erythematosus autoepitope. Autoimmunity 15: 231–236.
- Perutz MF, Johnson T, Suzuki M, Finch JT. 1994. Glutamine repeats as polar zippers: Their possible role in inherited neurodegenerative diseases. Proc Natl Acad Sci USA 91: 5355–5358.
- Perutz MF, Staden R, Moens L, De Baere I. 1993. Polar zippers. Curr Biol 3: 249–253.
- Rippon WB, Chen MH, Walton AG. 1973. Spectroscopic characterisation of poly(Glu-Ala). J Mol Biol 75: 369–375.
- Ross CA, McInnis MG, Margolis RL, Li SH. 1993. Genes with triplet repeats: Candidate mediators of neuropsychiatric disorders. Trends Neurosci 16: 254–260.
- Schneuwly S, Kuroiwa A, Baumgartner P, Gehring WJ. 1986. Structural organisation and sequence of the homeotic gene Antennapedia of Drosophila melanogaster. EMBO J 5: 733–739.
- Seipke G, Aufmann MA, Wagner KG. 1974. Synthesis and properties of alternating poly(Lys-Phe) and comparison with the random copolymer (Lys51-Phe49). Biopolymers 13: 1621–1633.
- Sherman DR, Kloek AP, Krishnan BR, Guinn B, Goldberg DE. 1992. Ascaris hemoglobin gene: Plant-like structure reflects ancestral globin gene. Proc Natl Acad Sci USA 89: 11696–11700.
- Smoller D, Friedel C, Schmid A, Bettler D, Lam L, Yedvobnick B. 1990. The Drosophila neurogenic locus mastermind encodes a nuclear protein unusually rich in amino acid homopolymers. Genes & Dev 4: 1688–1700.
- Spritz RA, Strunk K, Surowy CS, Hoch SD, Barton DE, Francke U. 1987. The human U1–70K snRNP protein: cDNA cloning, chromosomal localization, expression, alternative splicing and RNA-binding. Nucleic Acids Res 15: 10373–10378.
- Stahl HD, Bianco AE, Crewther PE, Anders RF, Kyne AP, Coppel RL, Mitchell GF, Kemp DJ, Brown GV. 1986. Sorting large numbers of clones expressing Plasmodium falciparum antigens in Escherichia coli by differential antibody screening. Mol Biol Med 3: 351–368.
- St. Pierre S, Ingwall RT, Verlander MS, Goodman M. 1978. Conformational studies of sequential polypeptides combining lysine and tyrosine. Biochemistry 17: 1837–1848.
- Struther-Arnott S, Dover SD, Elliott A. 1967. Structure of poly-L-alanine: Refined atomic coordinates for an antiparallel beta-pleated sheet. J Mol Biol 30: 201–208.
- Surowy CS, Hoganson G, Gosink J, Strunk K, Spritz BA. 1990. The human RD protein is closely related to nuclear RNA-binding proteins and has been highly conserved. Gene 90: 299–302.
- Trudelle Y. 1975. Conformational study of the sequential (Tyr-Glu)n copolymer in aqueous solution. Polymer 16: 9–15.
- Wharton KA, Yedvobnick B, Finnerty VG, Artavanis-Tsakonas S. 1985. opa: A novel family of transcribed repeats shared by the Notch locus and other developmentally regulated loci in D. melanogaster. Cell 40: 55–62.
- Zhou Q, Boyer TG, Berk AJ. 1993. Factors (TAFs) required for activated transcription interact with TATA box-binding protein conserved core domain. Genes & Dev 7: 180–187.