Invited Review

A historical account of Hoogsteen base-pairs in duplex DNA

Evgenia N. Nikolova

Department of Chemistry & Biophysics, The University of Michigan, 930 North University Avenue, Ann Arbor, MI, 48109-1055

Integrative Structural & Computational Biology Department, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA, 92037

Huiqing Zhou

Department of Chemistry & Biophysics, The University of Michigan, 930 North University Avenue, Ann Arbor, MI, 48109-1055

Federico L. Gottardo

Department of Chemistry & Biophysics, The University of Michigan, 930 North University Avenue, Ann Arbor, MI, 48109-1055

Heidi S. Alvey

Department of Chemistry & Biophysics, The University of Michigan, 930 North University Avenue, Ann Arbor, MI, 48109-1055

Isaac J. Kimsey

Department of Chemistry & Biophysics, The University of Michigan, 930 North University Avenue, Ann Arbor, MI, 48109-1055

Corresponding Author

Hashim M. Al-Hashimi

Department of Chemistry & Biophysics, The University of Michigan, 930 North University Avenue, Ann Arbor, MI, 48109-1055

Correspondence to: Hashim M. Al-Hashimi; e-mail: [email protected]

First published: 02 July 2013

https://doi.org/10.1002/bip.22334

Citations: 93

This article was originally published online as an accepted preprint. The “Published Online” date corresponds to the preprint version. You can request a copy of the preprint by emailing the Biopolymers editorial office at [email protected]

ABSTRACT

In 1957, a unique pattern of hydrogen bonding between N3 and O4 on uracil and N7 and N6 on adenine was proposed to explain how poly(rU) strands can associate with poly(rA)-poly(rU) duplexes to form triplexes. Two years later, Karst Hoogsteen visualized such a noncanonical A–T base-pair through X-ray analysis of co-crystals containing 9-methyladenine and 1-methylthymine. Subsequent X-ray analyses of guanine and cytosine derivatives yielded the expected Watson–Crick base-pairing, but those of adenine and thymine (or uridine) did not yield Watson–Crick base-pairs, instead favoring “Hoogsteen” base-pairing. More than two decades ensued without experimental “proof” for A–T Watson–Crick base-pairs, while Hoogsteen base-pairs continued to surface in AT-rich sequences, closing base-pairs of apical loops, in structures of DNA bound to antibiotics and proteins, damaged and chemically modified DNA, and in polymerases that replicate DNA via Hoogsteen pairing. Recently, NMR studies have shown that base-pairs in duplex DNA exist as a dynamic equilibrium between Watson–Crick and Hoogsteen forms. There is now little doubt that Hoogsteen base-pairs exist in significant abundance in genomic DNA, where they can expand the structural and functional versatility of duplex DNA beyond that which can be achieved based only on Watson–Crick base-pairing. Here, we provide a historical account of the discovery and characterization of Hoogsteen base-pairs, hoping that this will inform future studies exploring the occurrence and functional importance of these alternative base-pairs. © 2013 Wiley Periodicals, Inc. Biopolymers 99: 955–968, 2013.

INTRODUCTION

In 1953, sixty years ago, Watson and Crick proposed their iconic double helix structure for deoxyribonucleic acid (DNA) based on very little experimental data.1 Although the structure is best known for its double helical appearance, its most important feature was, and remains to this date, the specific pairing of purine with pyrimidine nucleobases (guanine with cytosine and adenine with thymine) through complementary hydrogen bonds (Figure 1).1 This endowed the structure with the ability to self-duplicate, making DNA, and not proteins as was widely believed at the time, the likely carrier of genetic information.2 Despite the absence of any experimental data in support of the specific pairing proposed by Watson and Crick, and despite the fact that there are alternative modes for pairing purines with pyrimidines, the pairing proposed by Watson and Crick utilized bases in their most probable tautomeric forms and, most importantly, resulted in similar overall shapes for all four base-pair combinations, so that any sequence could be accommodated within the same double helix framework.

**Figure 1.** Chemical structures of A–T and G–C Watson–Crick (WC) and Hoogsteen (HG) base-pairs.

Although the discovery of the double helix set in motion one of the greatest scientific revolutions, the structure itself was met with a good deal of skepticism. The available X-ray fiber diffraction data obtained on noncrystalline DNA fibers, particularly B-form DNA, did not provide adequate resolution to determine atomic positions. This is because molecules in the fiber are generally not rotationally oriented relative to one another in a regular manner. Indeed, this was the main reason Rosalind Franklin pursued the more complicated diffraction pattern presented by the dry “A-form” version of DNA,3, 4 where the molecules are not in random rotational orientations, allowing for a more objective 3D crystallographic analysis and where one could, in Franklin's own words, “let the data speak for itself.”

While the story of the double helix is well known to scientists and nonscientists alike, it is not commonly known that definitive proof for the DNA double helix structure did not come until 1980 – more than a quarter century after Watson and Crick initially proposed their model – when Drew, Dickerson and coworkers solved the single crystal structure of a DNA dodecamer using heavy atom X-ray crystallography.5, 6 In the ensuing period, experimental evidence began to accumulate for an alternative base-pair, referred to now as the “Hoogsteen” base-pair (Figure 1),7, 8 which, together with other alternative structures of DNA such as left-handed Z-DNA,9 raised doubts about the B-form structure proposed by Watson and Crick. Today, there is little doubt that Hoogsteen (HG) base-pairs do indeed represent an alternative pairing scheme that can expand the structural and functional versatility of duplex DNA beyond that which can be achieved based only on Watson–Crick (WC) base-pairing. The purpose of this review is to provide a historical account of the discovery and characterization of HG base-pairs, hoping that this will inform future studies exploring the occurrence and functional importance of these alternative base-pairs.

PURINE-PYRIMIDINE CO-CRYSTALS

Soon after Watson and Crick proposed their double helix structure, experimentalists rushed to gather data to test its various aspects. Improvements in X-ray cameras and analytical methods for refining models to fit X-ray fiber diffraction data in the following years provided additional evidence in support of the general features of the DNA double helix model. However, fine details of the structure, including the specific base-pairs proposed by Watson and Crick, could not be assessed. Throughout the late 1950s and into the 1970s, much effort was directed toward solving X-ray structures of isolated purine-pyrimidine dimers. The idea was that the monomers might associate to form intermolecular complexes that reflect the pairing that occurs in the double helix. The high-resolution diffraction data afforded by single crystals allowed for an objective characterization of hydrogen bonding interactions between the bases.

The first such study was reported in 1959, when Karst Hoogsteen – an associate of Robert Corey at Caltech – used single crystal X-ray analysis to determine the structures of co-crystals containing 9-methyladenine and 1-methylthymine, where methyl groups were used to block hydrogen bonding to nitrogen atoms otherwise bonded to sugar carbons in DNA.7 Rather than observing a Watson–Crick base-pair, Hoogsteen observed a markedly different pairing scheme (Figure 1), in which the adenine base was flipped upside down. In DNA, such a flip is accomplished by a 180-degree rotation of the adenine base around the glycosidic bond (N9–C1′), changing the base from an anti to a syn conformation. As in WC base-pairs, the thymine base formed two hydrogen bonds with the adenine base, one of which (thymine O4 and adenine N6) is identical to that proposed by Watson and Crick. However, the second hydrogen bond is not between thymine N3 and adenine N1, but, rather, between thymine N3 and N7 of the flipped adenine base (Figure 1). This very same hydrogen bonding scheme was proposed 2 years earlier by Rich and his colleagues to explain how poly(rU) strands might associate with poly(rA)-poly(rU) duplexes to form triplexes.10 Hoogsteen recognized that relative to the scheme proposed by Watson and Crick, this hydrogen bonding scheme required translation of the complementary bases into closer proximity, which requires constriction of the DNA helix diameter by ∼ 2.5 Å.7, 8

In 1963, 1 year after Watson and Crick were awarded the Nobel Prize in Physiology or Medicine, single crystal structures were reported for G–C base-pairs using crystals containing 9-ethylguanine and 1-methylcytosine or 1-methyl-5-bromocytosine.11 WC base-pairing was observed in the two cases, even though the two intermolecular complexes crystallized in different space groups and experienced different packing arrangements. As predicted earlier by Pauling and his colleagues,12 G–C base-pairs were stabilized by three and not two hydrogen bonds as proposed by Watson and Crick.1, 2 Subsequent X-ray diffraction analyses of co-crystals involving guanine and cytosine derivatives consistently showed the expected WC base-pairs.13 However, in sharp contrast, attempts to generate co-crystals of adenine and thymine (or uridine) derivatives failed to yield WC base-pairs and, in most cases, favored “Hoogsteen” base-pairs.13-15

In 1968, Guschlbauer and colleagues proposed the formation of G–C⁺ HG base-pairs in poly(dG)-poly(dC) at pH 3–4 based on optical rotatory dispersion spectra suggesting that guanine adopted a syn conformation.16 As in A–T base-pairs, the transition from WC to HG entails flipping of the guanine base from an anti to a syn conformation and the preservation of hydrogen bonding between cytosine N4 and guanine O6. However, unlike A–T base-pairs, formation of a second hydrogen bond with N7 of the flipped guanine requires protonation of cytosine N3. The transition results in a net loss of one hydrogen bond and the build-up of a positive charge on the cytosine. Because of this, protonated G–C⁺ HG base-pairs were generally thought to be less energetically favored than A–T HG base-pairs and more stably formed at lower pH.17 G–C⁺ HG base-pairs were subsequently used to explain how poly(rC) associates with poly(dC)-poly(dG) duplexes to form triplexes under acidic conditions,18 and NMR studies provided chemical shift evidence for G–C⁺ HG base-pairs protonated at cytosine N3 in a poly(dC)-poly(dC) complex with dGMP at low pH.19 The first crystallographic observation of a G–C⁺ HG base-pair came many years later for a DNA duplex bound to the bisintercalating antibiotic triostin A.20

HG base-pairs presented an alternative to WC base-pairs, especially for A–T rich sequences, and this contributed to skepticism about the details of the DNA double helix structure. Among the skeptics was Linus Pauling, who had proposed an incorrect triple helix model for the DNA structure the same year that Watson and Crick proposed their double helix model.21 Indeed, Pauling felt that awarding Watson and Crick a Nobel Prize for their discovery was premature “because of existing uncertainty about the detailed structure of nucleic acid” (personal correspondence to the Nobel Committee for Chemistry and Physics). The only real experimental indication that both G–C and A–T/U could form WC base-pairs in double and triple helices came from subsequent solution NMR studies of tRNA and polynucleotide complexes in the 1970s.19, 22-27 These studies, which were performed under physiological solution conditions in the absence of potentially perturbing crystal packing forces, showed distinct chemical shift signatures that were consistent with theoretical predictions for WC rather than HG hydrogen bonding in A–T/U base-pairs.

HG BASE-PAIRS IN NAKED DUPLEXES

In 1973, Rich and colleagues reported the single crystal X-ray structures of the AU and GC dinucleoside phosphates and the results were heralded as “the double helix at atomic resolution.”28, 29 These structures verified key aspects of the double helix proposed by Watson and Crick. Both structures revealed a right-handed double helix with two strands running anti-parallel to each other. Importantly, both structures featured WC type base-pairing. This was the first time a WC rather than HG base-pair was observed involving adenine. The structures seemed to put the controversy regarding HG versus WC A–T base-pairs to rest, since it was the WC form that was favored when the bases were constrained in a double helix. As noted by Alex Rich, James Watson phoned him after receiving a preprint of the AU manuscript and said “he had his first good night sleep in 20 years!”30 In retrospect, the observation of A–U WC base-pairs in A-form RNA created a false sense of comfort as recent studies have shown that, in contrast to DNA, HG base-pairs are not likely to form in A-form RNA, as suggested by the lack of HG base-pairs in A-form RNA duplexes in over 1000 high resolution crystal structures surveyed in the PDB.

Six years later, in 1979, following advances in phosphotriester methods for chemical synthesis of large quantities of homogeneous oligonucleotides,31, 32 Rich and colleagues reported the first single crystal X-ray structure of DNA for the d(CG)₃ sequence.9 Prior studies had shown that repeating polymers of inosine-cytosine33 and guanine-cytosine34 resulted in a “reverse” circular dichroism (CD) spectrum and this was interpreted as evidence for a left-handed helix. The structure of d(CG)₃ revealed a stunning left-handed double helix with an unusual zig-zag shape and was called “Z-DNA.”9 To convert B-DNA into Z-DNA, both bases in the base-pair have to be flipped upside down. As in HG base-pairs, the guanine base is flipped into a syn conformation; however, in Z-DNA, the concomitant flipping of the cytosine base and sugar allows the two flipped bases to regroup into WC base-pairs, with the flipping of the cytosine sugar giving rise to the unusual zig-zag backbone. Thus, although the first single crystal structure of DNA provided evidence for G–C WC base-pairs, it fueled skepticism about the overall structure of the double helix.

In the early 1980s, models were put forward for a Z-DNA structure composed exclusively of HG base-pairs,35 particularly for A–T rich sequences that frequently exhibited unusual diffraction patterns when dried (referred to as D- or E-type X-ray diffraction patterns).35-37 This form of DNA required helical structures with 7–7.5 base-pairs per turn, which cannot be stereochemically achieved by right-handed B-form DNA. Spectroscopic studies of poly(rA)-poly(rU) sequences that bear substituents at the adenine C2 position, which sterically block WC base-pairing, also suggested formation of duplexes with parallel or anti-parallel chain polarity, in which strands are held together by HG or reverse HG base-pairing, respectively.38-40

During the same period, in 1980, Drew and Dickerson reported the first single crystal X-ray structure of a DNA polymer longer than one helical turn.5 They used heavy atom replacement approaches to solve the structure of a synthetic DNA dodecamer sequence d(CGCGAATTCGCG). The structure adopted a right-handed duplex containing WC G–C and A–T base-pairs—precisely as proposed by Watson and Crick. It is this structure that is considered to be crystallographic proof that DNA can indeed adopt the structure proposed by Watson and Crick.

The ability to prepare large quantities of highly pure DNA samples in a facile manner, in parallel with developments in 13C/15N isotopic enrichment and solution state NMR spectroscopy of nucleic acids, resulted in the high-resolution X-ray and NMR structure determination of diverse DNA sequences in the 1980s and 1990s showing WC B-form DNA duplexes. However, spectroscopic evidence for HG base-pairs continued to mount in the 1990s and 2000s in the context of A–T rich sequences,41 in poly(dG-dC)-poly(dG-dC) sequences at low pH as possible intermediates along the B-to-Z DNA transition,42 as well as in noncanonical DNA regions as closing base-pairs of apical loops.43, 44 But it was not until 2002, when Subirana and colleagues reported the first crystal structure of an AT-repeat not capped by GC base-pairs, that the first single crystal X-ray structure of a naked DNA duplex containing exclusively HG base-pairs was resolved.45 The structure of d(AT)₃ revealed an anti-parallel right-handed double helix made up exclusively of HG base-pairs, with an overall structure similar to that of B-form DNA (Figure 2).45, 46 Key differences included a change in the position of the helical axis relative to the base-pairs, reduction in helical radius and C1′–C1′ distance by ∼2.5–3.0 Å, altered hydrogen bonding donor/acceptor pattern in the major and minor grooves, a narrower and less electronegative minor groove, which favors hydrophobic interactions, and distinct helix stacking and hydration patterns relative to B-DNA. Together, these features provide a distinct physicochemical presentation of the genetic code for potential sequence-specific recognition by the cellular machinery. Similar HG structures were subsequently reported for the related sequences d(ATATATCT)47 and d(CGATATATATAT).48

It is important to note that, in all cases, solution state NMR studies of the above DNA sequences under the same conditions used to grow crystals argued against formation of a HG helix, and in favor of a prototypical WC B-form double helix (Zhou et al. unpublished data).46 This suggests that crystal packing plays an important role in stabilizing the HG double helix.

DNA-ANTIBIOTIC COMPLEXES

In 1984, Rich and colleagues reported the single crystal X-ray structure of a DNA double helix with sequence d(CGTACG) bound to triostin A,50 a cyclic octadepsipeptide anti-tumor antibiotic containing two quinoxaline rings that binds DNA and inhibits replication and transcription in vivo (Figure 3).51, 52 This was the first structure of a peptide antibiotic in complex with an oligonucleotide. The structure showed that the two quinoxaline rings bis-intercalate in the minor groove of the DNA double helix and surround the WC G–C base-pairs, disrupting stacking interactions to the central A–T base-pairs (Figure 3). Remarkably, although the two central A–T base-pairs are not covered by the two triostin A molecules, they form HG rather than WC base-pairs. This marked the first crystallographic observation of the co-existence of WC and HG base-pairs within the same duplex. No direct contacts are observed between the antibiotic and the exposed Watson–Crick face of the A–T bases. Rather, the helical constriction at the HG base-pairs appears to stabilize the complex by allowing close packing of the oligonucleotide around the end of the triostin A. Thus, several favorable van der Waals contacts would be lost if the deoxyribose rings were further apart as in WC base-pairs. Similar structures were subsequently reported for DNA bound to the related echinomycin antibiotic,53 and for triostin A bound to d(GCGTACGC),20 which featured two central A–T HG base-pairs, and two terminal G–C⁺ HG base-pairs, marking the first crystallographic observation of protonated G–C⁺ HG base-pairs within a duplex (Figure 3).

Soon after, chemical footprinting studies performed in solution showed that sites that form HG base-pairs in X-ray structures of DNA-echinomycin complexes are hyperreactive to diethyl pyrocarbonate (DEPC),54 which preferentially reacts with exposed N7 atoms of syn purines in noncanonical Z-DNA55 and cruciform loops.56 However, these results were challenged by footprinting studies employing DEPC and other reagents that target thymines, that showed little change in thymine chemical reactivity when replacing adenine with 7-deazaadenine, which has a diminished ability to form HG base-pairs.57-60 Moreover, oligonucleotides containing 7-deazaadenine and 7-deazaguanine bound echinomycin with affinity comparable to that of their unmodified counterparts, suggesting that HG base-pairs are not essential for binding.60 These studies argued that hyperreactivity does not arise from formation of HG base-pairs but, rather, from unwinding and extension of the DNA helix upon drug binding.

Subsequent NMR studies by Feigon, Patel, and their co-workers confirmed formation of HG base-pairs in DNA-antibiotic complexes,61, 62 although their occurrence was shown to be highly dependent on sequence, temperature, and pH.63-65 A–T base-pairs generally form WC, and if they ever form HG, they do so transiently at physiological temperatures. Even the terminal HG A–T base-pairs were only favored in DNA-antibiotic complexes having purine 5′ and pyrimidine 3′ to CG (i.e., ACGT, GCGC) and only at low pH for G–C⁺ HG base-pairs.61, 63, 66, 67

Despite many studies, to date, it remains unclear whether quinoxaline antibiotics stabilize HG base-pairs in DNA in vivo and whether this is related in any way to their biological activity.

DNA-PROTEIN COMPLEXES

In the late 1990s, X-ray structures emerged showing that certain proteins bind and in some cases specifically recognize HG base-pairs embedded in B-form DNA (Figure 4). These studies raised the possibility that proteins exploit the unique structural and chemical features of HG base-pairs in sequence-specific DNA recognition, and therefore, provided evidence for a functional role for HG base-pairs in vivo.

The first crystallographic observation of HG base-pairs in a protein-DNA complex was reported by Rice et al. in 1996,68 who visualized a single A–T HG base-pair immediately adjacent to a nicked site in the X-ray structure of a highly bent (>160°) 35 base-pair (bp) DNA bound to the integration host factor (IHF) protein (Figure 4). Interestingly, a hydrogen bond was observed between the backbone amide group of an arginine residue and N3 of the syn A, suggesting specific recognition of the Watson–Crick face in the HG base-pair. However, the nick is involved in crystal packing with a neighboring molecule in the complex and HG formation helps move the phosphate backbone away from a neighboring molecule. In addition, the protein makes specific contacts with N3 of an anti-A in a symmetric site in the DNA lacking the nick, suggesting interactions that are specific for WC rather than HG base-pairing. Moreover, NMR studies of IHF binding to a shorter recognition sequence containing the first nicked site argue against the presence of an A–T HG base-pair in solution.69

Subsequent X-ray structures of TATA elements bound to the TATA box-binding protein (TBP) revealed a G–C⁺ HG base-pair in the mutant TATAAAC box in a region of DNA unwinding and intercalation.70 No direct contacts were observed between the syn guanine base and the protein. However, the HG base-pair appears to contribute to binding by preventing steric clashes between the protein leucine 72 and the guanine exocyclic NH₂, while still preserving favorable van der Waals contacts with two neighboring phenylalanine residues. A second G–C HG base-pair was observed but attributed to crystal packing forces. Interestingly, the ∼150-fold weaker binding affinity observed for TBP to this mutant TATA box,71 which could be correlated to the selection of a transient HG over a WC base-pair at that site,72 has been implicated in the transcriptional regulation of the human osteocalcin gene.73 This observation suggests a biological role for the formation of a G–C⁺ HG base-pair at the mutant promoter site.

Both IHF and TBP induce large distortions in the DNA, which could facilitate formation of HG base-pairs. In contrast, Wolberger and coworkers observed a single A–T HG base-pair within an otherwise undistorted B-form WC duplex in the X-ray structure of MATα2 homeodomain nonspecifically bound to DNA.74 Van der Waals contacts were observed between an arginine side chain and the syn adenine base as well as the sugar-phosphate backbone of the adenine and the neighboring thymine. Once again, the HG base-pair appears to avoid unfavorable steric clashes that would otherwise arise with a WC base-pair. The HG base-pair is accommodated within the duplex DNA without inducing major distortions, even for the directly neighboring base-pairs. The ease with which HG base-pairs could seamlessly fit within B-DNA raised the possibility that HG base-pairs may have been incorrectly assigned to be WC base-pairs due to misinterpretation of ambiguous electron density at medium to low resolution.74

More recently, HG base-pairs have been observed in the complex of the dcm very-short-patch repair (Vsr) DNA endonuclease, which participates in the nucleotide excision repair of G–T mismatches arising from deamination of 5-methylcytosines, with its specific hemi-deaminated/hemi-methylated DNA recognition sequence.75 Remarkably, the A–T HG base-pair, which is sandwiched between the mismatched and hemi-methylated sites, is also found in the equivalent unbound DNA site within the same crystal but not in a slightly different unmethylated sequence,76 implying that its presence could be an inherent property of the specific DNA sequence and not due to protein-induced distortions in DNA structure.

Two neighboring A–T HG base-pairs were subsequently observed in structures of a palindromic CATG/CATG sequence bound to the DNA binding domain of p53.77 Although no direct contacts are observed with the syn adenines, the formation of the HG base-pairs results in a narrowed minor groove in the region flanking the CATG site, leading to enhanced negative electrostatic potential that is further stabilized by insertion of the positively charged arginine side chains. Remarkably, these base-pairs adopt WC geometry in X-ray structures with a longer spacer length77 or a different intervening sequence78, 79 between DNA half-sites, which is accompanied by a different organization between p53 dimers, an altered DNA helix conformation, and different DNA-tetramer binding affinities.80 These studies suggest that WC and HG base-pairs likely exist in equilibrium with each other and that their selection in DNA-p53 complexes is largely dictated by the nature of the DNA binding sequence.

DAMAGED DNA

By the 1960s, it had become clear that DNA could be damaged by exogenous and endogenous factors, and that this in turn may be linked to disease states such as cancer.81 During the 1970s and 1980s, enzymes that recognize and repair damaged DNA began to be uncovered, resulting in great interest in characterizing the structure of damaged DNA.82 These studies showed that HG base-pairing provides an important mechanism for stacking and hydrogen bonding, in cases where the Watson–Crick face of the purine bases is damaged, preventing favorable WC base-pairing.

The first evidence for HG-type base-pairs in damaged DNA was reported in the late 1980s in solution NMR studies by Patel and co-workers showing that guanine adducts on the Watson–Crick edge or the C8 positions strongly favor a syn base orientation.83-86 Subsequent NMR studies showed HG-type pairing in various purine lesions, including WC face alkylation adducts (e.g., 1,N2-propanoguanine87, 88 and 1,N2-ethenoguanine89), the bulky guanine C8 mutagenic adduct aminofluorene-C8-guanine,86, 90 and the common mutagenic lesion N1-methyladenine (Figure 5).91 The direct observation of HG base-pairing (rather than extrahelical states) in a wide variety of lesions in naked DNA in the 1990s and 2000s established HG base-pairs as an energetically close alternative to WC base-pairs.

There is great speculation and experimental evidence that HG-type pairs play important roles in DNA damage and mismatch repair. For example, it is likely that the enzyme AlkB, which repairs the mutagenic lesion N1-methyladenine, initially recognizes the HG base-pair between N1-methyladenine and thymine (Figure 5)91, 92 before flipping out the damaged purine for oxidative demethylation. The flipping of one purine base to a syn conformation is also often observed in purine-purine mismatches, where the syn–anti base-pair configuration affords a shorter helical radius that can be more readily accommodated within B-DNA as compared to the anti–anti configuration. There is X-ray structural evidence that the DNA mismatch repair enzyme MutS specifically recognizes HG type purine-purine and purine-pyrimidine mismatches, even though they may not be the dominant conformation in unbound DNA, by making specific hydrophobic and hydrogen bonding minor groove contacts with the syn adenine/guanine base in A–C, A–A, and G–G mismatches.93 The recognition of the increased population of syn–anti rather than anti–anti configuration in certain mismatched base-pairs may help the enzyme discriminate against undamaged anti–anti Watson–Crick base-pairs. Thus, HG base-pairs not only provide a mechanism for maintaining the overall structural integrity of damaged or incorrectly replicated DNA, they can play an important role in DNA repair mechanisms.

It is worth noting that HG base-pairs have also been observed in DNA containing non-natural modifications in the sugar-phosphate backbone, including the addition of an ethylene bridge between C3′ and C5′ in “bicyclo-DNA,” which fixes the gamma backbone torsion angle to a noncanonical orientation,94 a single-residue substitution of sugar O4′ with a methylene group,95 or in dinucleotide d(TA) analogs containing a nonionic diisopropylsilyl-modified backbone at very low temperatures.96

DNA REPLICATION

Watson–Crick base-pairs were the most important aspect of the DNA double helix structure because, as succinctly stated in the very last sentence of their 1953 Nature paper, it “immediately suggests a possible copying mechanism for the genetic material.”1 Four years later, Kornberg discovered the enzyme that catalyzed template DNA replication97 and ensuing biochemical and structural studies established that high fidelity DNA polymerases replicate DNA by Watson–Crick pairing of the incoming dNTP with the template strand. In particular, multiple studies have demonstrated that the active site of replicative DNA polymerases is highly selective toward insertion of the correct dNTP and that catalytic efficiency is severely diminished when Watson–Crick geometry is not present.98-102 This strict stereochemical requirement for Watson–Crick pairing, together with efficient 3′–5′ proofreading exonuclease activity, prevents misincorporation of incorrect or damaged nucleotides during DNA synthesis, which is essential for genome stability.

During the 1990s, studies revealed that certain families of DNA polymerases (the X and Y families)103-105 contributed to damage-induced mutagenesis. Such specialized polymerases function in the replication and repair of damaged DNA, which could otherwise present severe replication blocks for common replicative polymerases, and thus play an important role in the maintenance of genome stability. These enzymes are also characterized by much lower replication fidelity than replicative polymerases, as their active sites are more tolerant toward noncanonical geometries between template and incoming nucleotide and they often lack the 3′–5′ exonuclease domain (in the case of the Y family).104, 106 It was later shown that some members of the Y family of DNA polymerases efficiently bypass DNA damage by replicating the template DNA via HG rather than WC base-pairing. HG-based replication was first visualized in X-ray structures of an archaeal DNA Polη homolog, Dpo4, by Yang and coworkers nearly a decade ago.107 The structure showed that Dpo4 replicates UV cross-linked thymine dimers by forming a HG base-pair between the 5′ thymine and an incoming ddATP, thus avoiding backbone distortion and allowing discrimination against guanine and pyrimidines.107

Aggarwal and coworkers108 subsequently showed, using X-ray crystallography and biochemical experiments, that another member of this family, human DNA Polι, employs HG base-pairing as a general mechanism to replicate both damaged and undamaged DNA. A striking X-ray structure of Polι showed a template adenine in the active site of the enzyme adopting a syn conformation and forming a HG base-pair with an incoming dTTP (Figure 6).108 Unlike replicative polymerases or other members of the Y family, Polι features a narrower active site, which strongly favors formation of HG-type base-pairs that are characterized by shorter C1′–C1′ distances than WC base-pairs. The ability to insert the correct nucleotide across an adenine base also provided a rationale for prior biochemical studies showing a much higher efficiency of correct base incorporation across a templating adenine than across a templating thymine, which in fact favors G misincorporation because of its high propensity for forming an anti G–T wobble base-pair.109 This raised HG base-pairs to a prominent position previously reserved only for WC base-pairs: they provided a basis for copying DNA.
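
The geometric argument above, that a narrow active site selects for the shorter C1′–C1′ distance of a HG pair, can be illustrated with a minimal sketch. The reference distances (about 10.5 Å for WC and about 8.5 Å for HG), the midpoint cutoff, and the coordinates below are assumptions introduced here for illustration; they are commonly quoted approximations and are not values reported in this review.

```python
import math

# Illustrative reference values in angstroms. These are commonly quoted
# approximations and are NOT taken from this review.
TYPICAL_WC_C1_C1 = 10.5
TYPICAL_HG_C1_C1 = 8.5
# Hypothetical cutoff placed halfway between the two reference values.
CUTOFF = (TYPICAL_WC_C1_C1 + TYPICAL_HG_C1_C1) / 2.0


def c1_c1_distance(c1_a, c1_b):
    """Euclidean distance between two C1' atoms given as (x, y, z) tuples."""
    return math.dist(c1_a, c1_b)


def is_constricted(distance, cutoff=CUTOFF):
    """True when the pair is narrower than the cutoff, i.e. compatible with HG."""
    return distance < cutoff


# Made-up coordinates, chosen only so that the distances are 10.5 and 8.5.
pairs = {
    "WC-like": ((0.0, 0.0, 0.0), (10.5, 0.0, 0.0)),
    "HG-like": ((0.0, 0.0, 0.0), (0.0, 8.5, 0.0)),
}
for label, (a, b) in pairs.items():
    d = c1_c1_distance(a, b)
    print(f"{label}: C1'-C1' = {d:.1f} A, constricted = {is_constricted(d)}")
```

A short C1′–C1′ distance is only suggestive. Assigning a HG pair also requires the purine to be in the syn conformation and the hydrogen-bonding pattern to match, which this sketch does not check.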

The proposal that hPolι replicates DNA via HG base-pairing was quickly met with skepticism. In an accompanying News and Views article, Wang110 pointed out that, based on the weak electron density for the active site A–T base-pair, it is difficult to resolve a WC from a HG geometry. He also questioned the ability of such a polymerase to form protonated G–C⁺ base-pairs at physiological pH, given the low intrinsic pK_a of cytosine N3 (∼4.2–4.4).111 Aggarwal et al.112 later put the matter to rest by (i) solving X-ray structures of Polι that unambiguously showed a protonated G–C⁺ HG base-pair at pH 6.5, reinforcing their hypothesis that Polι has evolved to favor HG base-pairing by constraining the C1′–C1′ distance between template and incoming nucleotide in its narrow active site, and (ii) showing selective inhibition of DNA synthesis by Polι, but not by other polymerases, when 7-deazaadenine or 7-deazaguanine, which are incapable of HG base-pairing, was used as the templating residue.113 Several other structures capturing DNA synthesis by Polι followed, ultimately demonstrating that major purine alkylation and oxidation lesions, including 1,N6-ethenoadenine,114 N2-ethylguanine,115 O6-methylguanine116 and 8-oxoguanine,117 adopt a syn conformation and, where possible, form HG-type base-pairs with incoming complementary pyrimidine and purine nucleotides (Figure 6) (reviewed in Makarova et al.118). These observations, in conjunction with biological studies showing that Polι is important for cell survival in the presence of alkylating agents119, 120 and oxidative stress,121 provide the most compelling evidence to date for a biological function for HG base-pairs in duplex DNA.
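
The pK_a objection can be made concrete with the Henderson–Hasselbalch relation. The sketch below estimates the fraction of cytosine N3 that is protonated at a given pH, assuming the intrinsic pK_a range quoted above applies unchanged; the pK_a inside a HG base-pair or a polymerase active site may well be shifted, so the numbers are a baseline rather than a prediction. The function name is introduced here for illustration.

```python
def fraction_protonated(ph: float, pka: float) -> float:
    """Fraction of a titratable site in its protonated form (Henderson-Hasselbalch)."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


# Intrinsic pKa range for cytosine N3 quoted in the text; the pH values are
# the crystallization pH mentioned above plus two near-physiological values.
for ph in (6.5, 7.0, 7.4):
    for pka in (4.2, 4.4):
        frac = fraction_protonated(ph, pka)
        print(f"pH {ph}, pKa {pka}: {frac:.3%} protonated")
```

With the intrinsic pK_a, well under 1% of cytosine N3 is protonated at neutral pH (about 0.16% at pH 7.0 for pK_a 4.2), which is why a stable G–C⁺ HG pair at pH 6.5 was considered notable.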

TRANSIENT HOOGSTEEN BASE-PAIRS

The earliest fiber X-ray diffraction studies of DNA highlighted its polymorphic nature and the ability of the double helix to adopt different forms depending on environmental conditions and sequence context. Subsequent studies showed that DNA does indeed come in many different forms and that even B-DNA is not rigid but can undergo large deformations and thermal fluctuations in a sequence-dependent, biologically important manner.122 This flexibility is not confined to the weakly constrained sugar and phosphodiester backbone but also includes the Watson–Crick base-pairs themselves.123 Chemical probing and hydrogen exchange studies spanning the 1970s–1990s established that WC base-pairs break apart and open on millisecond timescales and that the open state exists in at most ∼0.002% abundance for A–T or ∼0.00008% for G–C base-pairs.124-128 There are now several X-ray structures that capture these open states in DNA bound to proteins, establishing their functional significance.

Two years ago, NMR studies from our laboratory showed that both A–T and G–C Watson–Crick base-pairs can transiently undergo excursions toward HG base-pairs in duplex DNA.71, 129 The transient HG base-pairs were characterized using recently developed NMR R_1ρ relaxation dispersion methods that make it possible to observe and structurally characterize fleeting states of macromolecules.130-132 The transient HG base-pairs had populations of ∼0.1–1%, making them nearly three orders of magnitude more abundant than the open state, with the G–C⁺ HG base-pairs being less abundant than their A–T counterparts at physiological pH by at least a factor of 20 because of the additional protonation event required at cytosine N3.17 The transient HG base-pairs have lifetimes on the order of hundreds of microseconds to milliseconds (∼0.3–1.5 ms), significantly longer than the lifetimes of base-pair open states, which were found to be in the nanosecond range.128 The free energy and enthalpy of the WC-to-HG transition were found to closely match those of base-pair opening,128 suggesting that the transition may be limited by a base-pair breaking event that could be coupled to the purine anti–syn isomerization, inside or outside of the double helix. It is remarkable that the HG base-pairs are energetically less favorable than their WC counterparts by a mere ∼3 kcal/mol in the case of A–T base-pairs, roughly the equivalent of one strong hydrogen bond. This energetic difference is small compared with the forces that DNA experiences in cells from protein binding and supercoiling-induced torsional stress, or with those arising from crystal packing or changes in pH. Studies suggest that the transient HG base-pairs occur universally across all DNA sequence contexts, in a noncooperative manner, and with small, albeit significant, sequence-specific differences in population and lifetimes (unpublished results).
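
The link between the quoted populations and the ∼3 kcal/mol energy gap follows from a two-state Boltzmann relation, ΔG = −RT ln[p/(1 − p)]. The sketch below is a minimal illustration that assumes a simple WC/HG two-state equilibrium and a temperature of 25 °C, neither of which is specified in the text; the function name is introduced here for illustration.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def delta_g_from_population(p_minor: float, temperature_k: float = 298.15) -> float:
    """Free energy (kcal/mol) of a minor state relative to the major state.

    Assumes a two-state equilibrium in which p_minor is the fractional
    population of the minor (here, HG) state.
    """
    if not 0.0 < p_minor < 1.0:
        raise ValueError("population must be strictly between 0 and 1")
    k_eq = p_minor / (1.0 - p_minor)
    return -R_KCAL * temperature_k * math.log(k_eq)


# Population range quoted in the text for transient HG base-pairs.
for p in (0.001, 0.01):
    dg = delta_g_from_population(p)
    print(f"p_HG = {p:.1%} -> dG = {dg:.2f} kcal/mol")
```

Under these assumptions, populations of 1% and 0.1% correspond to roughly 2.7 and 4.1 kcal/mol, respectively, which brackets the ∼3 kcal/mol figure cited for A–T base-pairs.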

The picture that emerges is one in which every base-pair in DNA exists as an equilibrium between WC and HG forms, with external parameters operating on the DNA resolving one or the other base-pair type. This helps explain the long-standing controversy over observations of WC versus HG base-pairs: small changes in conditions can favor one form over the other. It is striking that the difference in the abundance of transient G–C⁺ and A–T HG base-pairs mirrors the differences in efficiency observed in Polι replication of A/T versus G/C. The HG base-pairs transiently expose the Watson–Crick faces of purines, and may potentially help explain the much greater abundance of N1 methylation in adenine versus guanine. Most importantly, the observation of transient HG base-pairs in duplex DNA, with energetics comparable to WC, raises the possibility that HG base-pairs exist in much greater abundance in vivo, particularly in A–T rich regions of the genome. When combined with the current difficulties in resolving WC from HG based on X-ray diffraction data, it may well be the case that HG base-pairs have gone undetected in X-ray structures currently deposited in the PDB, particularly for A–T base-pairs. We hope that this review provides the impetus to examine X-ray diffraction data more critically before ruling out the possibility of HG base-pairs.

FUTURE OUTLOOK

Thus, 60 years later, there are fundamental questions that remain to be answered regarding the structure of the DNA double helix.

What fraction of the existing DNA structures in the Protein Data Bank contain base-pairs that, due to biases and poor electron density, have been misinterpreted to be WC rather than HG? It should be relatively straightforward to re-interpret the electron density for DNA structures and to examine, base-pair by base-pair, to what extent the data favor WC versus HG. We predict that there will be many structures that will be revised to include uncertain base-pair geometry, or even HG base-pairs, particularly for A–T base-pairs.
Are HG base-pairs that are observed in X-ray structures also observed under solution conditions? Here, there is a need to develop new methods to allow the characterization of HG versus WC under solution conditions. Developments in NMR that can allow studies of large protein-DNA complexes will undoubtedly be important—but other, high-throughput, approaches are also needed to streamline such applications.
If HG base-pairs do occur, to what extent are they functionally important? Are the HG base-pairs “passively” present because they merely provide a more stable form under a particular condition or do they “actively” participate in biological function? Here, there is a danger of rushing to interpret results with 7-deazapurines, which diminish the ability to form HG base-pairs. It is conceivable that the 7-deazapurines can still form distorted HG base-pairs that preserve function, or affect an aspect of function in vivo that is not explored in such in vitro studies.
To what extent do HG or any other type of base-pair occur in vivo? Today, there are no methods for characterizing, at the atomic level, the nature of the base-pairs that hold together DNA duplexes in vivo. In eukaryotes, DNA is wrapped around nucleosomes and packaged with proteins to form chromatin fibers that make up chromosomes. The DNA is subjected to extreme packing and supercoiling, which present forces that in all likelihood exceed the energetic differences between WC and HG observed in duplex DNA. Given the growing evidence that HG base-pairs tend to be favored under tight packing conditions, and in regions of stress, we can predict that the genome may in fact be enriched with HG base-pairs in vivo relative to relaxed duplex DNA. One can imagine the existence of “HG islands” in A–T rich regions that have been shown to be important for DNA minor groove recognition by a variety of protein factors and anticancer drugs regulating replication/transcription, DNA bending, supercoiling-induced DNA destabilization, nucleosome positioning, and chromosomal translocation.133-140 A challenge for the future will be the development of methods for visualizing the high-resolution structure of DNA in vivo.

REFERENCES

1 Watson, J. D.; Crick, F. H. Nature 1953, 171, 737–738.
10.1038/171737a0
2 Watson, J. D.; Crick, F. H. Nature 1953, 171, 964–967.
10.1038/171964b0
3 Franklin, R. E.; Gosling, R. G. Nature 1953, 171, 740–741.
10.1038/171740a0
4 Franklin, R. E.; Gosling, R. G. Acta Crystallogr 1953, 6, 673–677.
10.1107/S0365110X53001939
5 Wing, R.; Drew, H.; Takano, T.; Broka, C.; Tanaka, S.; Itakura, K.; Dickerson, R. E. Nature 1980, 287, 755–758.
10.1038/287755a0
6 Dickerson, R. E.; Drew, H. R. J Mol Biol 1981, 149, 761–786.
10.1016/0022-2836(81)90357-0
7 Hoogsteen, K. Acta Crystallogr 1959, 12, 822–823.
10.1107/S0365110X59002389
8 Hoogsteen, K. Acta Crystallogr Sect A: Foundations Crystallogr 1963, 16, 907–916.
10.1107/S0365110X63002437
9 Wang, A. H.; Quigley, G. J.; Kolpak, F. J.; Crawford, J. L.; van Boom, J. H.; van der Marel, G.; Rich, A. Nature 1979, 282, 680–686.
10.1038/282680a0
10 Felsenfeld, G.; Rich, A. Biochim Biophys Acta 1957, 26, 457–468.
10.1016/0006-3002(57)90091-4
11 Sobell, H. M.; Tomita, K. I.; Rich, A. Proc Natl Acad Sci USA 1963, 49, 885–892.
10.1073/pnas.49.6.885
12 Corey, R. B.; Pauling, L. Arch Biochem Biophys 1956, 65, 164–181.
10.1016/0003-9861(56)90185-0
13 Haschemeyer, A. E.; Sobell, H. M. Acta Crystallogr 1965, 19, 125–130.
10.1107/S0365110X65002864
14 Haschemeyer, A. E.; Sobell, H. M. Proc Natl Acad Sci USA 1963, 50, 872–877.
10.1073/pnas.50.5.872
15 Mathews, F. S.; Rich, A. J Mol Biol 1964, 8, 89–95.
10.1016/S0022-2836(64)80151-0
16 Courtois, Y.; Fromageot, P.; Guschlbauer, W. Eur J Biochem 1968, 6, 493–501.
10.1111/j.1432-1033.1968.tb00472.x
17 Nikolova, E. N.; Goh, G. B.; Brooks, C. L., 3rd; Al-Hashimi, H. M. J Am Chem Soc 2013, 135, 6766–6769.
10.1021/ja400994e
18 Haas, B. L.; Guschlbauer, W. Nucleic Acids Res 1976, 3, 205–218.
10.1093/nar/3.1.205
19 Kallenbach, N. R.; Daniel, W. E., Jr.; Kaminker, M. A. Biochemistry 1976, 15, 1218–1224.
10.1021/bi00651a007
20 Quigley, G. J.; Ughetto, G.; van der Marel, G. A.; van Boom, J. H.; Wang, A. H.; Rich, A. Science 1986, 232, 1255–1258.
10.1126/science.3704650
21 Pauling, L.; Corey, R. B. Proc Natl Acad Sci USA 1953, 39, 84–97.
10.1073/pnas.39.2.84
22 Kearns, D. R.; Patel, D. J.; Shulman, R. G. Nature 1971, 229, 338–339.
10.1038/229338a0
23 Cross, A. D.; Crothers, D. M. Biochemistry 1971, 10, 4015–4023.
10.1021/bi00798a002
24 Wong, Y. P.; Kearns, D. R.; Reid, B. R.; Shulman, R. G. J Mol Biol 1972, 72, 725–740.
10.1016/0022-2836(72)90187-8
25 Shulman, R. G.; Hilbers, C. W.; Wong, Y. P.; Wong, K. L.; Lightfoot, D. R.; Reid, B. R.; Kearns, D. R. Proc Natl Acad Sci USA 1973, 70, 2042–2045.
10.1073/pnas.70.7.2042
26 Crothers, D. M.; Hilbers, C. W.; Shulman, R. G. Proc Natl Acad Sci USA 1973, 70, 2899–2901.
10.1073/pnas.70.10.2899
27 Patel, D. J.; Tonelli, A. E. Biopolymers 1974, 13, 1943–1964.
10.1002/bip.1974.360131003
28 Day, R. O.; Seeman, N. C.; Rosenberg, J. M.; Rich, A. Proc Natl Acad Sci USA 1973, 70, 849–853.
10.1073/pnas.70.3.849
29 Rosenberg, J. M.; Seeman, N. C.; Kim, J. J.; Suddath, F. L.; Nicholas, H. B.; Rich, A. Nature 1973, 243, 150–154.
10.1038/243150a0
30 Rich, A. Q Rev Biophys 2009, 42, 117–137.
10.1017/S0033583509004776
31 Arentzen, R.; Van Boeckel, C.; Van der Marel, G.; Van Boom, J., 1979, 137–139.
32 de Rooij, J. F. M.; Wille-Hazeleger, G.; van Deursen, P. H.; Serdijn, J.; van Boom, J. H. Recueil des Travaux Chimiques des Pays-Bas 1979, 98, 537–548.
10.1002/recl.19790981105
33 Mitsui, Y.; Langridge, R.; Shortle, B. E.; Cantor, C. R.; Grant, R. C.; Kodama, M.; Wells, R. D. Nature 1970, 228, 1166–1169.
10.1038/2281166a0
34 Pohl, F. M.; Jovin, T. M. J Mol Biol 1972, 67, 375–396.
10.1016/0022-2836(72)90457-3
35 Drew, H. R.; Dickerson, R. E. Embo J 1982, 1, 663–667.
10.1002/j.1460-2075.1982.tb01227.x
36 Davies, D. R.; Baldwin, R. L. J Mol Biol 1963, 6, 251–255.
10.1016/S0022-2836(63)80086-8
37 Selsing, E.; Arnott, S. J Mol Biol 1975, 98, 243–248.
10.1016/S0022-2836(75)80112-4
38 Ikehara, M.; Hattori, M.; Fukui, T. Eur J Biochem 1972, 31, 329–334.
10.1111/j.1432-1033.1972.tb02537.x
39 Ishikawa, F.; Frazier, J.; Howard, F. B.; Miles, H. T. J Mol Biol 1972, 70, 475–490.
10.1016/0022-2836(72)90554-2
40 Hattori, M.; Ikehara, M.; Miles, H. T. Biochemistry 1974, 13, 2754–2761.
10.1021/bi00710a015
41 Liu, K.; Miles, H. T.; Frazier, J.; Sasisekharan, V. Biochemistry 1993, 32, 11802–11809.
10.1021/bi00095a008
42 Segers-Nolten, G. M.; Sijtsema, N. M.; Otto, C. Biochemistry 1997, 36, 13241–13247.
10.1021/bi971326w
43 Blommers, M. J.; van de Ven, F. J.; van der Marel, G. A.; van Boom, J. H.; Hilbers, C. W. Eur J Biochem 1991, 201, 33–51.
10.1111/j.1432-1033.1991.tb16253.x
44 Ronning, D. R.; Guynet, C.; Ton-Hoang, B.; Perez, Z. N.; Ghirlando, R.; Chandler, M.; Dyda, F. Mol Cell 2005, 20, 143–154.
10.1016/j.molcel.2005.07.026
45 Abrescia, N. G.; Thompson, A.; Huynh-Dinh, T.; Subirana, J. A. Proc Natl Acad Sci USA 2002, 99, 2806–2811.
10.1073/pnas.052675499
46 Abrescia, N. G.; Gonzalez, C.; Gouyette, C.; Subirana, J. A. Biochemistry 2004, 43, 4092–4100.
10.1021/bi0355140
47 Pous, J.; Urpi, L.; Subirana, J. A.; Gouyette, C.; Navaza, J.; Campos, J. L. J Am Chem Soc 2008, 130, 6755–6760.
10.1021/ja078022+
48 De Luchi, D.; Tereshko, V.; Gouyette, C.; Subirana, J. A. Chembiochem 2006, 7, 585–587.
10.1002/cbic.200500449
49 Zheng, G.; Lu, X. J.; Olson, W. K. Nucleic Acids Res 2009, 37, W240–W246.
10.1093/nar/gkp358
50 Wang, A. H.; Ughetto, G.; Quigley, G. J.; Hakoshima, T.; van der Marel, G. A.; van Boom, J. H.; Rich, A. Science 1984, 225, 1115–1121.
10.1126/science.6474168
51 Ward, D. C.; Reich, E.; Goldberg, I. H. Science 1965, 149, 1259–1263.
10.1126/science.149.3689.1259
52 Sato, K.; Shiratori, O.; Katagiri, K. J Antibiot (Tokyo) 1967, 20, 270–276.
53 Ughetto, G.; Wang, A. H.; Quigley, G. J.; van der Marel, G. A.; van Boom, J. H.; Rich, A. Nucleic Acids Res 1985, 13, 2305–2323.
10.1093/nar/13.7.2305
54 Mendel, D.; Dervan, P. B. Proc Natl Acad Sci USA 1987, 84, 910–914.
10.1073/pnas.84.4.910
55 Herr, W. Proc Natl Acad Sci USA 1985, 82, 8009–8013.
10.1073/pnas.82.23.8009
56 Scholten, P. M.; Nordheim, A. Nucleic Acids Res 1986, 14, 3981–3993.
10.1093/nar/14.10.3981
57 McLean, M. J.; Waring, M. J. J Mol Recognit 1988, 1, 138–151.
10.1002/jmr.300010307
58 Portugal, J.; Fox, K. R.; McLean, M. J.; Richenberg, J. L.; Waring, M. J. Nucleic Acids Res 1988, 16, 3655–3670.
10.1093/nar/16.9.3655
59 McLean, M. J.; Seela, F.; Waring, M. J. Proc Natl Acad Sci USA 1989, 86, 9687–9691.
10.1073/pnas.86.24.9687
60 Sayers, E. W.; Waring, M. J. Biochemistry 1993, 32, 9094–9107.
10.1021/bi00086a014
61 Gao, X. L.; Patel, D. J. Biochemistry 1988, 27, 1744–1751.
10.1021/bi00405a054
62 Gilbert, D. E.; van der Marel, G. A.; van Boom, J. H.; Feigon, J. Proc Natl Acad Sci USA 1989, 86, 3006–3010.
10.1073/pnas.86.9.3006
63 Gilbert, D. E.; Feigon, J. Biochemistry 1991, 30, 2483–2494.
10.1021/bi00223a027
64 Gilbert, D. E.; Feigon, J. Nucleic Acids Res 1992, 20, 2411–2420.
10.1093/nar/20.10.2411
65 Park, J. Y.; Choi, B. S. J Biochem 1995, 118, 989–995.
10.1093/jb/118.5.989
66 Gallego, J.; Luque, F. J.; Orozco, M.; Burgos, C.; Alvarez-Builla, J.; Rodrigo, M. M.; Gago, F. J Med Chem 1994, 37, 1602–1609.
10.1021/jm00037a010
67 Gallego, J.; Luque, F. J.; Orozco, M.; Gago, F. J Biomol Struct Dyn 1994, 12, 111–129.
10.1080/07391102.1994.10508091
68 Rice, P. A.; Yang, S.; Mizuuchi, K.; Nash, H. A. Cell 1996, 87, 1295–1306.
10.1016/S0092-8674(00)81824-3
69 Dhavan, G. M.; Lapham, J.; Yang, S.; Crothers, D. M. J Mol Biol 1999, 288, 659–671.
10.1006/jmbi.1999.2690
70 Patikoglou, G. A.; Kim, J. L.; Sun, L.; Yang, S. H.; Kodadek, T.; Burley, S. K. Genes Dev 1999, 13, 3217–3230.
10.1101/gad.13.24.3217
71 Hoopes, B. C.; LeBlanc, J. F.; Hawley, D. K. J Mol Biol 1998, 277, 1015–1031.
10.1006/jmbi.1998.1651
72 Nikolova, E. N.; Kim, E.; Wise, A. A.; O'Brien, P. J.; Andricioaei, I.; Al-Hashimi, H. M. Nature 2011, 470, 498–502.
10.1038/nature09775
73 Meyer, T.; Gustafsson, J. A.; Carlstedt-Duke, J. DNA Cell Biol 1997, 16, 919–927.
10.1089/dna.1997.16.919
74 Aishima, J.; Gitti, R. K.; Noah, J. E.; Gan, H. H.; Schlick, T.; Wolberger, C. Nucleic Acids Res 2002, 30, 5244–5252.
10.1093/nar/gkf661
75 Bunting, K. A.; Roe, S. M.; Headley, A.; Brown, T.; Savva, R.; Pearl, L. H. Nucleic Acids Res 2003, 31, 1633–1639.
10.1093/nar/gkg273
76 Tsutakawa, S. E.; Jingami, H.; Morikawa, K. Cell 1999, 99, 615–623.
10.1016/S0092-8674(00)81550-0
77 Kitayner, M.; Rozenberg, H.; Rohs, R.; Suad, O.; Rabinovich, D.; Honig, B.; Shakked, Z. Nat Struct Mol Biol 2010, 17, 423–429.
10.1038/nsmb.1800
78 Malecka, K. A.; Ho, W. C.; Marmorstein, R. Oncogene 2009, 28, 325–333.
10.1038/onc.2008.400
79 Chen, Y.; Dey, R.; Chen, L. Structure 2010, 18, 246–256.
10.1016/j.str.2009.11.011
80 Kitayner, M.; Rozenberg, H.; Kessler, N.; Rabinovich, D.; Shaulov, L.; Haran, T. E.; Shakked, Z. Mol Cell 2006, 22, 741–753.
10.1016/j.molcel.2006.05.015
81 Alexander, P. Advanc Cancer Res 1954, 2, 1–72.
10.1016/S0065-230X(08)60491-1
82 Friedberg, E. C. Cell Res 2008, 18, 3–7.
10.1038/cr.2007.113
83 Patel, D. J.; Shapiro, L.; Kozlowski, S. A.; Gaffney, B. L.; Jones, R. A. J Mol Biol 1986, 188, 677–692.
10.1016/S0022-2836(86)80014-6
84 Kalnik, M. W.; Li, B. F.; Swann, P. F.; Patel, D. J. Biochemistry 1989, 28, 6182–6192.
10.1021/bi00441a009
85 Kouchakdjian, M.; Marinelli, E.; Gao, X. L.; Johnson, F.; Grollman, A.; Patel, D. Biochemistry 1989, 28, 5647–5657.
10.1021/bi00439a047
86 Norman, D.; Abuaf, P.; Hingerty, B. E.; Live, D.; Grunberger, D.; Broyde, S.; Patel, D. J. Biochemistry 1989, 28, 7462–7476.
10.1021/bi00444a046
87 Singh, U. S.; Moe, J. G.; Reddy, G. R.; Weisenseel, J. P.; Marnett, L. J.; Stone, M. P. Chem Res Toxicol 1993, 6, 825–836.
10.1021/tx00036a012
88 Weisenseel, J. P.; Reddy, G. R.; Marnett, L. J.; Stone, M. P. Chem Res Toxicol 2002, 15, 127–139.
10.1021/tx0101090
89 Shanmugam, G.; Kozekov, I. D.; Guengerich, F. P.; Rizzo, C. J.; Stone, M. P. Chem Res Toxicol 2008, 21, 1795–1805.
10.1021/tx8001466
90 Mao, B.; Hingerty, B. E.; Broyde, S.; Patel, D. J. Biochemistry 1998, 37, 81–94.
10.1021/bi972257o
91 Yang, H.; Zhan, Y.; Fenn, D.; Chi, L. M.; Lam, S. L. FEBS Lett 2008, 582, 1629–1633.
10.1016/j.febslet.2008.04.013
92 Lu, L.; Yi, C.; Jian, X.; Zheng, G.; He, C. Nucleic Acids Res 2010, 38, 4415–4425.
10.1093/nar/gkq129
93 Natrajan, G.; Lamers, M. H.; Enzlin, J. H.; Winterwerp, H. H.; Perrakis, A.; Sixma, T. K. Nucleic Acids Res 2003, 31, 4814–4821.
94 Bolli, M.; Litten, J. C.; Schutz, R.; Leumann, C. J. Chem Biol 1996, 3, 197–206.
10.1016/S1074-5521(96)90263-X
95 Isaksson, J.; Zamaratski, E.; Maltseva, T. V.; Agback, P.; Kumar, A.; Chattopadhyaya, J. J Biomol Struct Dyn 2001, 18, 783–806.
10.1080/07391102.2001.10506707
96 Xiao, Z.; Weisz, K. J Am Chem Soc 2010, 132, 3862–3869.
10.1021/ja910220s
97 Lehman, I. R.; Bessman, M. J.; Simms, E. S.; Kornberg, A. J Biol Chem 1958, 233, 163–170.
10.1016/S0021-9258(19)68048-8
98 Joyce, C. M.; Steitz, T. A. Annu Rev Biochem 1994, 63, 777–822.
10.1146/annurev.bi.63.070194.004021
99 Doublie, S.; Sawaya, M. R.; Ellenberger, T. Structure 1999, 7, R31–R35.
10.1016/S0969-2126(99)80017-3
100 Steitz, T. A. J Biol Chem 1999, 274, 17395–17398.
10.1074/jbc.274.25.17395
101 Kunkel, T. A.; Bebenek, K. Annu Rev Biochem 2000, 69, 497–529.
10.1146/annurev.biochem.69.1.497
102 Reha-Krantz, L. J. Biochim Biophys Acta 2010, 1804, 1049–1063.
10.1016/j.bbapap.2009.06.012
103 Pata, J. D. Biochim Biophys Acta 2010, 1804, 1124–1135.
10.1016/j.bbapap.2010.01.020
104 Washington, M. T.; Carlson, K. D.; Freudenthal, B. D.; Pryor, J. M. Biochim Biophys Acta 2010, 1804, 1113–1123.
10.1016/j.bbapap.2009.07.004
105 Yamtich, J.; Sweasy, J. B. Biochim Biophys Acta 2010, 1804, 1136–1150.
10.1016/j.bbapap.2009.07.008
106 Yang, W.; Woodgate, R. Proc Natl Acad Sci USA 2007, 104, 15591–15598.
10.1073/pnas.0704219104
107 Ling, H.; Boudsocq, F.; Plosky, B. S.; Woodgate, R.; Yang, W. Nature 2003, 424, 1083–1087.
10.1038/nature01919
108 Nair, D. T.; Johnson, R. E.; Prakash, S.; Prakash, L.; Aggarwal, A. K. Nature 2004, 430, 377–380.
10.1038/nature02692
109 Tissier, A.; McDonald, J. P.; Frank, E. G.; Woodgate, R. Genes Dev 2000, 14, 1642–1650.
110 Wang, J. Nature 2005, 437, E6–E7; discussion E7.
10.1038/nature04199
111 Izatt, R. M.; Christensen, J. J.; Rytting, J. H. Chem Rev 1971, 71, 439–481.
10.1021/cr60273a002
112 Nair, D. T.; Johnson, R. E.; Prakash, L.; Prakash, S.; Aggarwal, A. K. Structure 2005, 13, 1569–1577.
10.1016/j.str.2005.08.010
113 Johnson, R. E.; Prakash, L.; Prakash, S. Proc Natl Acad Sci USA 2005, 102, 10466–10471.
10.1073/pnas.0503859102
114 Nair, D. T.; Johnson, R. E.; Prakash, L.; Prakash, S.; Aggarwal, A. K. Nat Struct Mol Biol 2006, 13, 619–625.
10.1038/nsmb1118
115 Pence, M. G.; Blans, P.; Zink, C. N.; Hollis, T.; Fishbein, J. C.; Perrino, F. W. J Biol Chem 2009, 284, 1732–1740.
10.1074/jbc.M807296200
116 Pence, M. G.; Choi, J. Y.; Egli, M.; Guengerich, F. P. J Biol Chem 2010, 285, 40666–40672.
10.1074/jbc.M110.183665
117 Kirouac, K. N.; Ling, H. Proc Natl Acad Sci USA 2011, 108, 3210–3215.
10.1073/pnas.1013909108
118 Makarova, A. V.; Kulbachinskiy, A. V. Biochemistry (Mosc) 2012, 77, 547–561.
10.1134/S0006297912060016
119 Johnson, R. E.; Yu, S. L.; Prakash, S.; Prakash, L. Mol Cell Biol 2007, 27, 7198–7205.
10.1128/MCB.01079-07
120 Plosky, B. S.; Frank, E. G.; Berry, D. A.; Vennall, G. P.; McDonald, J. P.; Woodgate, R. Nucleic Acids Res 2008, 36, 2152–2162.
10.1093/nar/gkn058
121 Petta, T. B.; Nakajima, S.; Zlatanou, A.; Despras, E.; Couve-Privat, S.; Ishchenko, A.; Sarasin, A.; Yasui, A.; Kannouche, P. EMBO J 2008, 27, 2883–2895.
10.1038/emboj.2008.210
122 Rich, A. Gene 1993, 135, 99–109.
10.1016/0378-1119(93)90054-7
123 Travers, A. A. Philos Transact A Math Phys Eng Sci 2004, 362, 1423–1438.
10.1098/rsta.2004.1390
124 Englander, S. W.; Downer, N. W.; Teitelbaum, H. Annu Rev Biochem 1972, 41, 903–924.
10.1146/annurev.bi.41.070172.004351
125 Gueron, M.; Kochoyan, M.; Leroy, J. L. Nature 1987, 328, 89–92.
10.1038/328089a0
126 Gueron, M.; Leroy, J. L. Methods Enzymol 1995, 261, 383–413.
10.1016/S0076-6879(95)61018-9
127 Russu, I. M. Methods Enzymol 2004, 379, 152–175.
10.1016/S0076-6879(04)79009-6
128 Coman, D.; Russu, I. M. J Biol Chem 2005, 280, 20216–20221.
10.1074/jbc.M502773200
129 Nikolova, E. N.; Gottardo, F. L.; Al-Hashimi, H. M. J Am Chem Soc 2012, 134, 3667–3670.
10.1021/ja2117816
130 Massi, F.; Johnson, E.; Wang, C.; Rance, M.; Palmer, A. G., 3rd. J Am Chem Soc 2004, 126, 2247–2256.
10.1021/ja038721w
131 Korzhnev, D. M.; Orekhov, V. Y.; Kay, L. E. J Am Chem Soc 2005, 127, 713–721.
10.1021/ja0446855
132 Hansen, A. L.; Nikolova, E. N.; Casiano-Negroni, A.; Al-Hashimi, H. M. J Am Chem Soc 2009, 131, 3818–3819.
10.1021/ja8091399
133 Bowater, R.; Aboul-ela, F.; Lilley, D. M. Biochemistry 1991, 30, 11495–11506.
10.1021/bi00113a003
134 Carrera, P.; Azorin, F. Nucleic Acids Res 1994, 22, 3671–3680.
10.1093/nar/22.18.3671
135 Woynarowski, J. M.; Trevino, A. V.; Rodriguez, K. A.; Hardies, S. C.; Benham, C. J. J Biol Chem 2001, 276, 40555–40566.
10.1074/jbc.M103390200
136 Rohs, R.; West, S. M.; Sosinsky, A.; Liu, P.; Mann, R. S.; Honig, B. Nature 2009, 461, 1248–1253.
10.1038/nature08473
137 Segal, E.; Widom, J. Curr Opin Struct Biol 2009, 19, 65–71.
10.1016/j.sbi.2009.01.004
138 Gordon, B. R.; Li, Y.; Cote, A.; Weirauch, M. T.; Ding, P.; Hughes, T. R.; Navarre, W. W.; Xia, B.; Liu, J. Proc Natl Acad Sci USA 2011, 108, 10690–10695.
10.1073/pnas.1102544108
139 Kato, T.; Kurahashi, H.; Emanuel, B. S. Curr Opin Genet Dev 2012, 22, 221–228.
10.1016/j.gde.2012.02.004
140 Rajewska, M.; Wegrzyn, K.; Konieczny, I. FEMS Microbiol Rev 2012, 36, 408–434.
10.1111/j.1574-6976.2011.00300.x

Volume 99, Issue 12

50th Anniversary Special Issue on Nucleic Acids

December 2013

Pages 955-968
