



structural communications
The structure of Haemophilus influenzae prephenate dehydrogenase suggests unique features of bifunctional TyrA enzymes
aStanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory,
Menlo Park, CA, USA,bJoint Center for Structural Genomics, https://www.jcsg.org , USA,cProtein Sciences Department, Genomics Institute of the Novartis Research Foundation,
San Diego, CA, USA,dCenter for Research in Biological Systems, University of California, San Diego, La
Jolla, CA, USA,eDepartment of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA,fProgram on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute,
La Jolla, CA, USA, and gPhoton Science, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
*Correspondence e-mail: [email protected]
Chorismate mutase/prephenate dehydrogenase from Haemophilus influenzae Rd KW20 is a bifunctional enzyme that catalyzes the rearrangement of chorismate to prephenate and the NAD(P)+-dependent oxidative decarboxylation of prephenate to 4-hydroxyphenylpyruvate in tyrosine biosynthesis. The of the prephenate dehydrogenase component (HinfPDH) of the TyrA protein from H. influenzae Rd KW20 in complex with the inhibitor tyrosine and cofactor NAD+ has been determined to 2.0 Å resolution. HinfPDH is a dimeric enzyme, with each monomer consisting of an N-terminal α/β dinucleotide-binding domain and a C-terminal α-helical dimerization domain. The structure reveals key active-site residues at the domain interface, including His200, Arg297 and Ser179 that are involved in catalysis and/or ligand binding and are highly conserved in TyrA proteins from all three kingdoms of life. Tyrosine is bound directly at the catalytic site, suggesting that it is a competitive inhibitor of HinfPDH. Comparisons with its structural homologues reveal important differences around the active site, including the absence of an α–β motif in HinfPDH that is present in other TyrA proteins, such as Synechocystis sp. arogenate dehydrogenase. Residues from this motif are involved in discrimination between NADP+ and NAD+. The loop between β5 and β6 in the N-terminal domain is much shorter in HinfPDH and an extra helix is present at the C-terminus. Furthermore, HinfPDH adopts a more closed conformation compared with TyrA proteins that do not have tyrosine bound. This conformational change brings the substrate, cofactor and active-site residues into close proximity for catalysis. An ionic network consisting of Arg297 (a key residue for tyrosine binding), a water molecule, Asp206 (from the loop between β5 and β6) and Arg365′ (from the additional C-terminal helix of the adjacent monomer) is observed that might be involved in gating the active site.
Keywords: tyrosine biosynthesis; prephenate; chorismate; Haemophilus influenzae; structural genomics.
3D view: 2pv7
PDB reference: chorismate mutase/prephenate dehydrogenase, 2pv7
1. Introduction
The TyrA protein family comprises dehydrogenases that are dedicated to L-tyrosine biosynthesis. These dehydrogenases can be classified into three groups according
to their substrate specificity. Prephenate dehydrogenases (TyrAp or PDH) only use
prephenate as a substrate, arogenate dehydrogenases (TyrAa of ADH) only accept arogenate
and cyclohexadienyl dehydrogenases (TyrAc or CDH) use either prephenate or arogenate.
The TyrA dehydrogenases convert prephenate to tyrosine through two different routes
(Fig. 1). In the prephenate pathway, PDH enzymes catalyze the NAD(P)+-dependent oxidative decarboxylation of prephenate to 4-hydroxyphenylpyruvate (HPP),
which is then converted to L-tyrosine by an aminotransferase. In the arogenate route, prephenate is first transaminated
to L-arogenate by prephenate aminotransferase. L-Arogenate is then decarboxylated and converted to L-tyrosine by ADH. In addition to diverse substrate specificity, the TyrA family also
exhibits diversity with respect to its cofactor specificity. TyrA proteins may be
specific for NAD+ or NADP+ or may use both. TyrA proteins exist as either monofunctional or bifunctional proteins.
The common fusion partners of TyrA proteins include chorismate mutase (aroQ; Calhoun et al., 2001
), 3-phosphoshikimate 1-carboxyvinyltransferase (aroF; Beller et al., 2006
) and an ACT (aspartate kinase–chorismate mutase–TyrA) regulatory domain (Chipman
& Shaanan, 2001
; Grant, 2006
).
![]() |
Figure 1 Tyrosine biosynthetic pathway. The figure is modified from The Enzyme Database (https://www.enzyme-database.org ). |
TyrAp is mostly present in low-GC Gram-positive organisms, such as Bacillus subtilis. TyrAa is abundant in higher plants and in at least three bacterial lineages, cyanobacteria,
actinomycetes and Nitrosomonas europaea, whereas TyrAc is found in most bacteria. An analysis of the phylogenetic relationship
of TyrA enzymes identified a distinct within the TyrAc group, denoted here as TyrAc_Δ (Song et al., 2005). When the primary sequences of these TyrAc_Δ proteins are aligned with those of other TyrA groups, it is immediately apparent
that the TyrAc_Δ proteins contain a number of deletions within the catalytic core region and possess
a second functional domain, which classifies them as bifunctional enzymes. Biochemical
studies have shown that this displays narrower substrate and cofactor specificity compared with the parent TyrAc
enzymes. The TyrA enzymes from Escherichia coli and Klebsiella pneumoniae are the two best characterized TyrAc_Δ enzymes and both prefer prephenate over arogenate by more than one order of magnitude
and only use NAD+ as cofactor (Ahmad & Jensen, 1987
; Turnbull et al., 1990
). The studies further suggested that the TyrAc family, with its broad substrate specificity,
represents the ancestral enzymes from which the TyrAc_Δ, TyrAa and TyrAp enzymes have evolved to exhibit a narrower range of substrate specificity
(Song et al., 2005
).
The regulation of TyrA activity is important as prephenate is a common precursor in
the biosynthesis of tyrosine and phenylalanine. TyrA enzymes are regulated by various
mechanisms, including feedback inhibition and gene regulation by the Tyr operon (Cobbett
& Delbridge, 1987). Kinetic studies of chorismate mutase/prephenate dehydrogenase (CM/PDH) from E. coli have led to the proposal of two different types of mechanism for tyrosine inhibition.
Christopherson (1985
) concluded that tyrosine acts as a competitive inhibitor in the dehydrogenase reaction,
whereas Turnbull, Morrison et al. (1991
) suggested that tyrosine binds at a distinct allosteric site. B. subtilis PDH is inhibited competitively by tyrosine and noncompetitively by tryptophan and
HPP (Champney & Jensen, 1970
). B. subtilis PDH has a C-terminal fusion of an ACT regulatory domain. The ACT domain was first
identified in 1995 and is a small-molecule binding domain that is found in enzymes
involved in amino-acid metabolism and transcription regulation. Small-molecule binding
to the ACT domain is thought to control the through allosteric regulation. Thus, the noncompetitive inhibition by tryptophan
and HPP in B. subtilis PDH might be a consequence of the presence of the ACT domain. Crystal structures
of PDH enzymes from our study and from Aquifex aeolicus (Sun et al., 2009
) both revealed bound tyrosine, but only at the active site, which supports the role
of tyrosine as a competitive inhibitor. The source of tyrosine in these two structures
originated from protein expression and cocrystallization, respectively. Not all TyrA
enzymes are inhibited by tyrosine; Synechocystis ADH, for example, is completely insensitive to by tyrosine (Legrand et al., 2006
). Studies on A. aeolicus PDH (Sun et al., 2009
) showed that His217 is critical for the inhibitory effect of tyrosine where a His217Ala
mutation completely abolished the inhibitory effect of tyrosine. Studies on E. coli TyrA reported similar results, in which a His257Ala mutation (His257 is equivalent
to His217 in A. aeolicus PDH) abolished inhibition by tyrosine (Christendat et al., 1998
). Comparison of the of Synechocystis ADH with that of A. aeolicus PDH revealed that Val182 is present in this location, which could account for the
loss of tyrosine inhibition.
Studies of the enzymatic mechanisms of TyrA enzymes have revealed some variations
in the reaction pathway. The kinetic data for Synechocystis ADH suggested a sequential substrate-binding event in which arogenate first binds
to the protein, followed by the cofactor (Beller et al., 2006), whereas kinetic studies on E. coli and Arabidopsis thaliana suggested a random addition of NAD+ and prephenate (Sampathkumar & Morrison, 1982
; Rippert & Matringe, 2002
). Kinetic studies using 13C-labeled substrates suggested a mechanism in which decarboxylation and proton transfer
occur in a concerted manner (Hermes et al., 1984
). Kinetic and mutagenesis studies of several systems have identified key active-site
residues (Christendat et al., 1998
; Christendat & Turnbull, 1999
). Conserved histidine, arginine and serine residues are critical for For instance, in E. coli TyrA His197 has been proposed to facilitate hydride transfer from prephenate to NAD+ by polarizing the 4-OH group of prephenate, whereas Arg294 is critical for substrate
binding. A His197Ala mutation decreased the dehydrogenase activity significantly and
an Arg294Gln mutation greatly increased the Km, but did not affect the kcat. In A. aeolicus PDH, Ser126 is hydrogen bonded to the 4′-OH of tyrosine and to NAD+ and could participate in both catalysis and ligand binding. The Ser126Ala mutation
reduced kcat 15-fold and increased Km tenfold.
The TyrA gene from Haemophilus influenza Rd KW20 encodes a bifunctional enzyme, chorismate mutase (EC 5.4.99.5)/prephenate
dehydrogenase (EC 1.3.1.12) (CM/PDH), with a molecular weight of 43 kDa (residues
1–377) and a calculated of 5.56. The of the prephenate dehydrogenase component (HinfPDH; residues 81–377) of this TyrA enzyme was determined in complex with tyrosine
and NAD+ at 2.0 Å resolution and represents the first PDH structure from a bifunctional TyrA
enzyme. The structure was determined using the semi-automated high-throughput pipeline
of the Joint Center for Structural Genomics (JCSG; Lesley et al., 2002) as part of the National Institute of General Medical Sciences' Protein Structure
Initiative (PSI; https://www.nigms.nih.gov/Initiatives/PSI/ ).
2. Materials and methods
2.1. Protein production and crystallization
Clones were generated using the Polymerase Incomplete Primer Extension (PIPE) cloning
method (Klock et al., 2008). The gene encoding HinfPDH (GenBank AAC22939, gi|1574749; Swiss-Prot TYRA) was amplified by (PCR) from H. influenzae Rd KW20 genomic DNA using PfuTurbo DNA polymerase (Stratagene) and I-PIPE (Insert) primers (forward primer, 5′-ctgtacttccagggcATGCGTGAATCCTATGCCAATGAAAACC-3′;
reverse primer, 5′-aattaagtcgcgttaGCATAAAACGGCGTAGAACATCTTCAAT-3′; target sequence
in upper case) that included sequences for the predicted 5′ and 3′ ends. The expression
vector pSpeedET, which encodes an amino-terminal tobacco etch virus (TEV) protease-cleavable
expression and purification tag (MGSDKIHHHHHHENLYFQ/G), was PCR-amplified with V-PIPE
(Vector) primers. The V-PIPE and I-PIPE PCR products were mixed to anneal the amplified
DNA fragments together. E. coli GeneHogs (Invitrogen) competent cells were transformed with the V-PIPE/I-PIPE mixture
and dispensed onto selective LB–agar plates. The cloning junctions were confirmed
by DNA sequencing. Using the PIPE method, the part of the gene encoding residues Met1–Phe80
was excluded from the final construct. Expression was performed in selenomethionine-containing
medium with suppression of normal methionine synthesis (Van Duyne et al., 1993
). At the end of lysozyme was added to the culture to a final concentration of 250 µg ml−1 and the cells were harvested and frozen. After one freeze–thaw cycle, the cells were
sonicated in lysis buffer [50 mM HEPES pH 8.0, 50 mM NaCl, 10 mM imidazole, 1 mM tris(2-carboxyethyl)phosphine–HCl (TCEP)] and the lysate was clarified by centrifugation
at 32 500g for 30 min. The soluble fraction was passed over nickel-chelating resin (GE Healthcare)
pre-equilibrated with lysis buffer, the resin was washed with wash buffer [50 mM HEPES pH 8.0, 300 mM NaCl, 40 mM imidazole, 10%(v/v) glycerol, 1 mM TCEP] and the protein was eluted with elution buffer [20 mM HEPES pH 8.0, 300 mM imidazole, 10%(v/v) glycerol, 1 mM TCEP]. The was buffer-exchanged with TEV buffer (20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP) using a PD-10 column (GE Healthcare) and incubated with 1 mg TEV protease per
15 mg of eluted protein. The protease-treated was run over nickel-chelating resin (GE Healthcare) pre-equilibrated with HEPES crystallization
buffer (20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP) and the resin was washed with the same buffer. The flowthrough and wash fractions
were combined and concentrated to 19.6 mg ml−1 by centrifugal ultrafiltration (Millipore) for crystallization trials. HinfPDH was crystallized using the nanodroplet vapor-diffusion method (Santarsiero et al., 2002
) with standard JCSG crystallization protocols (Lesley et al., 2002
). Sitting drops composed of 200 nl protein mixed with 200 nl crystallization solution
were equilibrated against a 50 µl reservoir at 293 K for 28 d prior to harvest. The
crystallization reagent was composed of 0.04 M potassium dihydrogen phosphate, 20.0%(v/v) glycerol and 16.0%(w/v) PEG 8000. A rod-shaped crystal of approximate dimensions 0.1 × 0.05 × 0.05 mm was
harvested for data collection. No additional cryoprotectant was added to the crystal.
Initial diffraction screening was carried out using the Stanford Automated Mounting
system (SAM; Cohen et al., 2002
) at the Stanford Synchrotron Radiation Lightsource (SSRL, Menlo Park, California,
USA). The crystal was indexed in the tetragonal P41212. The oligomeric state of HinfPDH was determined to be a dimer by gel filtration using a 0.8 × 30 cm Shodex Protein
KW-803 column (Thomson Instruments) equilibrated in 20 mM Tris, 200 mM NaCl, 0.5 mM TCEP pH 7.5 and pre-calibrated with gel-filtration standards (Bio-Rad). Protein concentrations
were determined using the Coomassie Plus assay (Pierce).
2.2. Data collection, structure solution and refinement
Multiple-wavelength anomalous diffraction (MAD) data were collected on beamline 11-1
at the SSRL using a 0.1 × 0.1 mm X-ray beam at wavelengths corresponding to the inflection
(λ1) and remote (λ2) wavelengths of a selenium MAD experiment. The data sets were collected at 100 K
using a Rayonix MAR Mosaic MX-325 CCD detector. The MAD data were integrated and reduced
using XDS and scaled with the program XSCALE (Kabsch, 1993, 2010a
,b
). An initial solution was obtained with SHELXD (Sheldrick, 2008
) and the phases were refined using autoSHARP (Vonrhein et al., 2007
), which gave a mean figure of merit of 0.48 with 14 selenium sites. Automated model
building was performed with ARP/wARP (Cohen et al., 2004
). Model completion and were performed with Coot (Emsley & Cowtan, 2004
) and REFMAC 5.2 (Winn et al., 2003
) using the high-energy remote (λ2) data set. The included phase restraints from SHARP and TLS with three TLS groups per chain. CCP4 programs were used for data conversion and other calculations (Collaborative Computational
Project, Number 4, 1994
).
During structure + cofactor. As these molecules were not added during the crystallization experiment,
they must have been acquired during protein expression and have remained bound during
purification and crystallization. Data-collection and are summarized in Table 1.
![]() ![]() ‡Rmeas = ![]() ![]() ![]() ![]() §Rcryst = ![]() ![]() ¶Rfree is the same as Rcryst but for 5.1% of the total reflections that were chosen at random and omitted from ††This value represents the total B that includes TLS and residual B components. ‡‡The estimated overall coordinate error (Collaborative Computational Project, Number 4, 1994 ![]() ![]() |
2.3. Validation and deposition
The quality of the JCSG Quality Control server (https://smb.slac.stanford.edu/jcsg/QC ). This server verifies the stereochemical quality of the model using AutoDepInputTool (Yang et al., 2004), MolProbity (Chen et al., 2010
) and WHATIF v.5.0 (Vriend, 1990
), the agreement between the atomic model and the data using SFCHECK v.4.0 (Vaguine et al., 1999
) and RESOLVE (Terwilliger, 2004
), the protein sequence using ClustalW (Thompson et al., 1994
), the atom occupancies using MOLEMAN2 (Kleywegt, 2000
) and the consistency of NCS pairs. Protein quaternary-structure analysis was conducted
using the PISA server (https://www.ebi.ac.uk/msd-srv/prot_int/pistart.html ; Krissinel & Henrick, 2005
). Fig. 2
(d) was adapted from PDBsum (Laskowski, 2009
); all other figures were prepared with PyMOL (DeLano, 2002
). Atomic coordinates and experimental structure factors have been deposited in the
PDB under accession code 2pv7 .
![]() |
Figure 2 Structure of HinfPDH. (a) Ribbon diagram of a HinfPDH monomer complexed with NAD+ and a tyrosine molecule. Helices α1–α12 and strands β1–β7 are indicated. The C-terminal domain is colored green and the helices and β-strands of the N-terminal domain are colored cyan and red, respectively. Bound tyrosine and NAD+ molecules are shown in ball-and-stick representation; C, O, N and phosphate atoms are colored yellow, red, blue and orange, respectively. (b) Ribbon diagram of the HinfPDH dimer; monomer A is colored green and monomer B is colored orange. Helices are shown as cylinders. (c) Top view compared with (b) of the HinfPDH dimer. (d) Diagram showing the secondary-structure elements of HinfPDH superimposed on its primary sequence. The labeling of secondary-structure elements is in accord with PDBsum (https://www.ebi.ac.uk/pdbsum ), where α-helices (H1–H12) and β-strands (β1–β7) are sequentially labeled, β-turns and γ-turns are designated by Greek letters (β, γ) and β-hairpins are indicated by red loops. |
3. Results and discussion
3.1. Overall structure
The final model consists of a dimer of the PDH domain of TyrA (residues 92–371 for
chains A and B), two nicotinamide adenine dinucleotides (NAD+), two tyrosines and 393 water molecules in the (Figs. 2a, 2
b and 2
c). No electron density was observed for Gly80–Phe91, Asp312–Glu315 and Asn372–Gly377
in chain A and Gly80–Val91 and Asn372–Gly377 in chain B. The side chains of Lys212 and Lys348 from chain A and of Arg132, Lys212, Lys239, Gln325 and Ala371 from chain B were omitted owing to poor electron density. The Matthews coefficient (VM; Matthews, 1968
) was 2.98 Å3 Da−1 and the estimated solvent content was 58.7%. The Ramachandran plot produced by MolProbity showed that 98% of the residues are in favored regions, with no outliers.
Each monomer consists of an N-terminal α/β dinucleotide-binding domain (residues 92–243) and a C-terminal α-helical dimerization domain (residues 244–371) (Fig. 2a). The active site is located at the domain interface. The N-terminal domain adopts
a modified Rossmann fold, which consists of a parallel seven-stranded β-sheet (strand order β2-β1-β3-β4-β5-β6-β7) with the α1 helix on one face of the β-sheet and the α2, α3, α4 and α5 helices on the other. Structural comparisons of HinfPDH and other nucleotide-binding proteins, including other TyrA enzymes, show that
HinfPDH lacks the α–β structural motif between α2 and β2 that is present and is part of the β-sheet in other nucleotide-binding proteins. The C-terminal domain consists of seven
helices (α6–α12) that form the dimer interface; the helices from each monomer are intertwined into
a tightly packed helical bundle with a buried surface of 11 000 Å2 (Figs. 2
b and 2
c). The two monomers are structurally similar to each other, with an r.m.s.d. of 0.2 Å
for 268 equivalent Cα atoms.
3.2. The active site
The active site is located in the cleft between the N- and C-terminal domains (Figs.
2a, 2
b and 2
c). One tyrosine and one NAD+ are bound in each monomer (Fig. 3
a). His200, Ser179 and Arg297 (Fig. 3
b) are among the important residues for enzyme catalysis and/or ligand binding and
are conserved in TyrA enzymes across all kingdoms of life. The His200 imidazole is
hydrogen bonded to the 4′-OH of the bound tyrosine. Ser179 hydrogen bonds to both
the ribose O atom of nicotinamide nucleoside and the 4′-OH of the bound tyrosine and
is important for orientating prephenate and NAD+ for catalysis. The Arg297 guanadinium forms a pair of electrostatic interactions
with the tyrosine carboxyl, which also interacts with Gln301 from the α8 helix. The tyrosine amino group hydrogen bonds to Tyr306 from α9 and Tyr288′ from α8 of the adjacent monomer. His260 is located close to the tyrosine amino group and
could be involved in regulation of tyrosine inhibition in a similar way to His217
in A. aeolicus.
![]() |
Figure 3 Active site of HinfPDDH. (a) Final model of tyrosine and NAD+ molecules fitted in 2Fo − Fc electron-density maps prior to model building as output from the last step of XSOLVE (JCSG, unpublished work). The map is contoured at 1σ. (b) Active site of HinfPDDH showing key active-site residues and the bound tyrosine and NAD+ molecules in ball-and-stick representation. C atoms are colored gray for the protein and yellow for the ligands; O, N and phosphate atoms are colored red, blue and orange, respectively. |
The aromatic ring of the bound tyrosine packs against the nicotinamide ring of NAD+ such that the 4′-OH of tyrosine is approximately 4 Å away from C4 of the nicotinamide
ring. Assuming that prephenate adopts the same binding mode, the structure suggests
that the hydride is transferred from prephenate to the si face of NAD+, which is consistent with a previous proton NMR study on the E. coli TyrA enzyme using isotope-labeled NAD-4-d (Hermes et al., 1984).
The binding mode of NAD+ is similar to those of arogenate dehydrogenase from Synechocystis sp. and prephenate dehydrogenase from A. aeolicus (Legrand et al., 2006; Sun et al., 2006
). The pyrophosphate of NAD+ interacts with the P-loop (Gly108–Gly113) between α1 and β1, forming hydrogen bonds to Lys111 and the main-chain of Lys111 and Leu112. The diol of the adenylyl ribose hydrogen bonds to Asp131 and
Lys132 from the loop located between α2 and β2. The adenine ring is sandwiched between Val152 and Arg132 from helices α2 and α3, with its N1 hydrogen bonded to Trp135 from α2. The diol of the nicotinamide ribose is hydrogen bonded to Ser179 and Val152. The
nicotinamide ring interacts with the protein mainly through hydrophobic interactions.
3.3. Structural comparison with other TyrA enzymes
A search with FATCAT (Ye & Godzik, 2004) using the HinfPDH coordinates identified the closest structural homologues of HinfPDH in the PDB as the prephenate dehydrogenases from A. aeolicus (AaeoPDH; PDB code 2g5c ; Sun et al., 2006
) and Streptococcus thermophilus (SthePDH; PDB code 3dzb ; Z. Zhang, S. Eswaramoorthy, S. K. Burley & S. Swaminathan, unpublished work) and
the arogenate dehydrogenase from Synechocystis sp. (SyneADH; PDB code 2f1k ; Legrand et al., 2006
). HinfPDH is bifunctional, whereas the other three enzymes are monofunctional. The pairwise
sequence identities between HinfPDH and AaeoPDH, SthePDH and SyneADH are 20, 27 and 25%, respectively. Despite the low sequence identity, the overall
structures of these enzymes are very similar. The structures of AaeoPDH complexed with the ligands NAD+ (PDB code 2g5c ), NAD+ and L-tyrosine (PDB code 3ggg ), NADH and 4-hydroxylphenylpyruvate (HPP; PDB code 3ggo ) and NAD+ and 4-hydroxyphenylpropionate (PDB code 3ggp ) are available (Sun et al., 2009
), but only the first two structures were used for comparison because they are sufficient
to represent the two unique enzyme states; they are denoted AaeoPDH and AaeoPDH–Tyr–NAD+, respectively. SyneADH has an NADP+ bound at the active site and SthePDH has no ligand bound. The dehydrogenase activity of SyneADH is strictly dependent on arogenate and NADP+ (Legrand et al., 2006
). Conversely, NAD+ and prephenate are the preferred cofactor and substrate for AaeoPDH, although a very low level of dehydrogenase activity is detected when NADP+ with prephenate or NAD+ with arogenate are used (Bonvin et al., 2006
).
Although the overall fold is similar, structural comparisons revealed important differences
around the active site. Compared with other TyrA enzymes, HinfPDH lacks an α–β structural motif between α2 and β2, the loop between β5 and β6 (Lβ5–β6) is much shorter, and an extra helix α12 is found at the C-terminus (Fig. 4). Multiple sequence alignment of many representative TyrA proteins clearly shows
that the α–β motif and Lβ5–β6 represent unique structural differences between bifunctional and monofunctional TyrA
enzymes (Fig. 5
), but the extra C-terminal helix is less obviously discernable from the sequence
comparisons, presumably because the exact end point of the prephenate dehydrogenase
domain is difficult to determine for cases such as TyrA-aroF or TyrA-ACT fusions in which aroF and ACT are fused at the C-terminal end.
![]() |
Figure 4 Structural differences between HinfPDH and its structural homologues. (a) Superimposition of HinfPDH, AaeoPDH, SthePDH and SyneADH reveals unique structural differences in HinfPDH including a missing α–β motif, a shorter Lβ5–β6 loop, and an extra C-terminal helix. HinfPDH is colored green or orange for each monomer. The AaeoPDH, SthePDH and SyneADH structures are colored magenta, yellow and blue, respectively. Bound tyrosine and NAD+ molecules are shown in ball-and-stick representation. (b) Enlarged view of the area around the α–β motif showing that HinfPDH is missing the α–β motif compared with AaeoPDH, SthePDH and SyneADH. (c) Enlarged view of the area around Lβ5–β6. Lys111 of HinfPDH is shown with a green backbone, Ser155 of AaeoPDH is in magenta and Gln120 of SyneADH is in blue. (d) Enlarged view of the C-terminal area showing that an extra C-terminal helix is present in HinfPDH. The α7 and α8 helices of HinfPDH superimposed onto structurally equivalent helices in AaeoPDH, SthePDH and SyneADH are also indicated. |
![]() |
Figure 5 Multiple sequence alignment of representative TyrA enzymes. Regions around the α–β motif and Lβ5–β6 are shown. Abbreviations: Hinf, Haemophilus influenzae Rd KW20; Etas, Erwinia tasmaniensis Et1/99; Ecol, Escherichia coli str. K-12 substr. MG1655; Plum, Photorhabdus luminescens subsp. laumondii TTO1; Pmul, Pasteurella multocida subsp. multocida str. Pm70; Vcho, Vibrio cholerae O1 biovar eltor str. N16961; Vpar, Vibrio parahaemolyticus RIMD 2210633; Yent, Yersinia enterocolitica subsp. enterocolitica 8081; Aful, Archaeoglobus fulgidus; Syne, Synechocystis sp. (Legrand et al., 2006 ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
In HinfPDH, Lβ5–β6 is eight residues shorter than in AaeoPDH, SthePDH and SyneADH. This loop is well ordered in the HinfPDH, AaeoPDH and SyneADH structures (Fig. 4c) but is disordered in the SthePDH structure owing to the absence of bound cofactor. In AaeoPDH and SyneADH, this loop extends along the cofactor-binding site. Ser155 in AaeoPDH and Gln120 in SyneADH are in structurally equivalent positions (Fig. 4
c) and both form hydrogen bonds to the pyrophosphate O atom of the bound cofactor.
However, in HinfPDH no residue is structurally equivalent because of the shorter loop and Lys111 from
α1 instead provides the equivalent interaction with the pyrophosphate O atom (Fig.
4
c). The equivalent residues to Lys111 in AaeoPDH and SyneADH are Phe40 and Leu10, respectively, and neither side chain can form hydrogen bonds
to the pyrophosphate. This suggests that although a shorter Lβ5–β6 loop has evolved in HinfPDH, the ability to bind cofactor is not affected.
In SyneADH, the phosphate group of the adenosine ribose of NADP+ is recognized by helical residues in the α–β motif. The phosphate group is stabilized by electrostatic interaction with Arg31 and hydrogen-bonding interactions with Gln32 and Thr35, as well as with the main chain of Arg31 and Gln32. The equivalent residues in AaeoPDH are Ile63, Asn64 and Ser67. A loss of electrostatic interaction caused by the substitution of Arg by Ile may explain why AaeoPDH prefers NAD+ over NADP+ as cofactor. In SthePDH, the equivalent residues are Arg36, Ser37 and Ser40. Thus, the ability to form electrostatic and hydrogen-bonding interactions is similar to that of SyneADH, suggesting that SthePDH is capable of binding NADP+. In HinfPDH, Arg132 is structurally equivalent to SyneADH Arg31, but residues equivalent to Gln32 and Thr35 are absent owing to the lack of the α–β structural motif. It is possible that HinfPDH also prefers NAD+ over NADP+ in a manner similar to bifunctional TyrA enzymes from E. coli and K. pneumonia.
A smaller local difference among these TyrA structures is the loop joining α7 and α8 and the adjacent residues (Fig. 4d). This loop takes a wider turn in HinfPDH compared with that in AaeoPDH, SthePDH and SyneADH and is positioned next to α12. Ser284–Leu290 in this region are highly conserved in TyrAc_Δ sequences, suggesting that they play important roles.
3.4. Global conformational change
Another important difference between HinfPDH and other TyrA structures is the relative orientation of the respective N- and
C-terminal domains. Pairwise structural alignment of HinfPDH with AaeoPDH, AaeoPDH–Tyr–NAD+, SthePDH and SyneADH using only the N-terminal domain gives r.m.s.d.s of 1.8, 1.7, 1.9 and 1.5 Å, respectively,
for 128–132 superimposed Cα atoms (0.8–0.9 Å for core β-sheet residues). Upon structural superimposition, it is immediately noticeable that
differences in the relative orientation of the N- and C-terminal domains are present
in these TyrA structures (Figs. 6a and 6
b). A hinge region around Glu242–Asn244 connects the N- and C-terminal domains at
the domain interface opposite to the substrate-binding site. Superimposition using
the C-terminal domain gives similar results, although the results are less obvious
owing to internal structural differences within the C-terminal domains of these TyrA
structures. Therefore, the discussion below is based on superimpositions using the
N-terminal domain.
![]() |
Figure 6 Superimposition of TyrA structures showing differences in the relative orientation of the N- and C-terminal domains and a close-up view of the active site of HinfPDH. (a) Superimposition of HinfPDH, AaeoPDH, AaeoPDH–Tyr–NAD+, SthePDH and SyneADH reveals differences in the relative orientation of the N- and C-terminal domains in these TyrA proteins. HinfPDH, AaeoPDH, AaeoPDH (monomer B)–Tyr–NAD+, SthePDH and SyneADH are colored green, magenta, orange, cyan and blue, respectively. (b) Same orientation as (a); for clarity, only HinfPDH and SyneADH are shown. (c) An ionic network in the active site of HinfPDH consists of Arg297, a bridging water molecule, Asp206 and Arg365′ from adjacent molecule in a dual conformation. His260, Tyr288, Gln301 and Tyr306 that could be involved in substrate selectivity are also shown. Hydrogen bonds are indicated as dashed lines. |
Using the C-terminal domain of HinfPDH as a reference, the C-terminal domains of AaeoPDH, SthePDH and SyneADH are farther away from the substrate-binding site; therefore, HinfPDH represents the most closed form, SyneADH is the most open form and AaeoPDH and SthePDH are in intermediate states (Figs. 6a and 6
b). Monomer B of AaeoPDH–Tyr–NAD+ has both NAD+ and tyrosine bound and the conformation is closed, similar to HinfPDH. Monomer A of AaeoPDH–Tyr–NAD+ only has NAD+ bound and the conformation is similar to AaeoPDH. Comparison of the AaeoPDH and AaeoPDH–Tyr–NAD+ structures suggest that tyrosine induces a conformational change upon binding. Hence,
it is possible that the binding of tyrosine to HinfPDH also induces a conformational change from an open to a closed form and the closed
form is captured in the current HinfPDH structure. A of apo HinfPDH could provide direct evidence for this proposal. In the closed conformation, α6, α8, α9 and α8′ (α8 from monomer B) are close to the bound tyrosine and α8 and α12′ (α12 from monomer B) are near Lβ5–β6, facilitating closure of the active site and the proper alignment of active-site
residues for catalysis. Key active-site residues in this region include His260 (α6), Arg297, Gln301 (α8), Tyr288′ (α8′), Tyr306 (α9), Arg365′ (α12′) and Asp206 (Lβ5–β6). If a tyrosine molecule is modeled adjacent to the cofactor in AaeoPDH and SthePDH, the residue equivalent to Arg297 of HinfPHDH is too distant to interact with the tyrosine. The conformational change is independent
of cofactor binding as the structures of SyneADH with NADP+ bound, AaeoPDH with NAD+ bound and SthePDH without any cofactor are all in similar open conformations.
In the closed conformation of HinfPDH an ionic network consisting of Arg297, a bridging water molecule, Asp206 from
Lβ5–β6 and Arg365′ from α12′ is observed that may be involved in gating the active site (Fig. 6c). The bridging water is present in both monomers. The Arg365′ side chain adopts dual
conformations, with one conformation participating in the ionic network and the other
pointing away from the active site. This dual conformation may be part of the gating
mechanism, in which an ionic network forms on closure of the active site after substrate
is bound and is broken when the product is released. It is worth noting that Asp206
is absolutely conserved and Arg365 is highly conserved in chorismate mutase/prephenate
dehydrogenase sequences but not in monofunctional TyrA enzymes. Thus, the active-site
gating mechanism might be different in bifunctional and monofunctional TyrA enzymes.
3.5. Insights into the catalytic mechanism
Previous studies on the pH-dependence of the E. coli TyrA enzyme showed that a catalytic group with a pKa value of about 6.5 is deprotonated for dehydrogenase activity (Turnbull, Cleland
et al., 1991). Subsequent site-directed mutagenesis experiments revealed that this critical catalytic
residue is His197 (Christendat et al., 1998
). The catalytic mechanism of the oxidative decarboxylation of E. coli TyrA was investigated and suggested a concerted mechanism in which hydride transfer
and decarboxylation occur in a concerted manner. It was proposed that His197 provides
the driving force for the dehydrogenase reaction by polarizing the 4′-hydroxyl group
of prephenate (Christendat et al., 1998
). It was also postulated that since the end-product of the reaction is aromatic,
polarization of the 4′-OH group is sufficient to lower the energy barrier for the
reaction, rather than deprotonation of the 4′-OH group to form a β-keto acid intermediate in a stepwise mechanism. The of HinfPDH shows that the N∊2 atom of His200 (equivalent to His197 in E. coli TyrA) is hydrogen bonded to the 4′-hydroxyl group of tyrosine at a distance of ∼2.6 Å.
A hydrogen-bonding network between His200, His248 and Asp249 is observed in which
His200 Nδ1 hydrogen bonds to His248 N(H)∊2 and His248 N(H)δ1 hydrogen bonds to Asp249 Oδ1. Asp249 is located near the protein surface. As for His200, His248 and Asp249 are
highly conserved in TyrA sequences. This hydrogen-bonding network can help to maintain
His200 N∊2 in a deprotonated state. The Ser179 hydrogen bond to the 4′-OH of the bound tyrosine
can provide an additional driving force for the reaction by polarizing the 4′-OH group
since the equivalent Ser126 in AaeoPDH is critical for catalysis. In addition, HinfPDH and E. coli TyrA share 57% sequence identity in their prephenate dehydrogenase domains and all
key active-site residues are conserved. Hence, HinfPDH is likely to adopt a concerted mechanism for dehydrogenase reaction as found
for E. coli TyrA. In HinfPDH, tyrosine is bound directly at the catalytic site, suggesting that it acts as
a competitive inhibitor.
How TyrA enzymes evolved to be specific for prephenate or arogenate is intriguing
because prephenate and arogenate have very similar structures. In HinfPDH, Gln301, Tyr306, Tyr288 and His260 are positioned close to the amino and carboxyl
groups of the bound tyrosine and could be involved in substrate specificity (Fig.
6c). Gln301, Tyr306 and Tyr288 are highly conserved in bifunctional TyrA sequences.
In other TyrA sequences, Gln301 is replaced by Gly, Ser or Thr. Tyr306 is relatively
conserved as Trp in TyrAp and TyrAc, but is Trp, Gly or Val in TyrAa. Tyr288 is located
at the N-terminus of α8, where some local structural differences are found between HinfPDH and other TyrA structures. Alignment of TyrA sequences shows a gap of approximately
four residues around Tyr288 in the TyrAp, TyrAa and TyrAc sequences. His260 is conserved
in TyrA, except for some TyrAas, where it is Val or Gln. Given the structural resemblance
between prephenate and arogenate, further experiments to elucidate the exact prephenate-binding
mode will advance our understanding of substrate specificity in TyrA enzymes.
4. Conclusions
HinfPDH is the first prephenate dehydrogenase structure to be determined from a bifunctional
TyrA enzyme. This structure reveals active-site residues that are important for catalysis
and/or ligand binding and are consistent with previously determined structures of
other TyrA enzymes. The comparison of HinfPDH with other known TyrA structures indicates important differences that appear to
be characteristic features that differentiate the bifunctional and monofunctional
TyrA enzymes and suggest that the regulation of is likely to differ between bifunctional and monofunctional TyrA enzymes. These structural
differences may be related to the presence of a chorismate mutase domain in the bifunctional
TyrA enzymes. A of full-length H. influenzae TyrA should provide insight into this question. Additional information about the
proteins described in this study is available from TOPSAN (Krishna et al., 2010) at https://www.topsan.org/explore?PDBid=2pv7 .
Supporting information
3D view: 2pv7
PDB reference: chorismate mutase/prephenate dehydrogenase, 2pv7
Acknowledgements
This work was supported by the NIH, National Institute of General Medical Sciences, Protein Structure Initiative grant U54 GM074898. Portions of this research were carried out at the Stanford Synchrotron Radiation Lightsource (SSRL). The SSRL is a national user facility operated by Stanford University at the SLAC National Accelerator Laboratory on behalf of the US Department of Energy, Office of Basic Energy Sciences. The SSRL Structural Molecular Biology Program is supported by the Department of Energy, Office of Biological and Environmental Research and by the National Institutes of Health (National Center for Research Resources, Biomedical Technology Program and the National Institute of General Medical Sciences. Genomic DNA from H. influenzae Rd KW20 (ATCC No. 5190) was obtained from the American Type Culture Collection (ATCC). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences or the National Institutes of Health.
References
Ahmad, S. & Jensen, R. A. (1987). FEBS Lett. 216, 133–139. CrossRef CAS PubMed Web of Science Google Scholar
Beller, H. R., Chain, P. S., Letain, T. E., Chakicherla, A., Larimer, F. W., Richardson,
P. M., Coleman, M. A., Wood, A. P. & Kelly, D. P. (2006). J. Bacteriol. 188, 1473–1488. Web of Science CrossRef PubMed CAS Google Scholar
Bonvin, J., Aponte, R. A., Marcantonio, M., Singh, S., Christendat, D. & Turnbull,
J. L. (2006). Protein Sci. 15, 1417–1432. Web of Science CrossRef PubMed CAS Google Scholar
Calhoun, D. H., Bonner, C. A., Gu, W., Xie, G. & Jensen, R. A. (2001). Genome Biol. 2, 1–16. CrossRef Google Scholar
Champney, W. S. & Jensen, R. A. (1970). J. Biol. Chem. 245, 3763–3770. CAS PubMed Web of Science Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral,
G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chipman, D. M. & Shaanan, B. (2001). Curr. Opin. Struct. Biol. 11, 694–700. Web of Science CrossRef PubMed CAS Google Scholar
Christendat, D., Saridakis, V. C. & Turnbull, J. L. (1998). Biochemistry, 37, 15703–15712. Web of Science CrossRef CAS PubMed Google Scholar
Christendat, D. & Turnbull, J. L. (1999). Biochemistry, 38, 4782–4793. Web of Science CrossRef PubMed CAS Google Scholar
Christopherson, R. I. (1985). Arch. Biochem. Biophys. 240, 646–654. CrossRef CAS PubMed Web of Science Google Scholar
Cobbett, C. S. & Delbridge, M. L. (1987). J. Bacteriol. 169, 2500–2506. CAS PubMed Web of Science Google Scholar
Cohen, A. E., Ellis, P. J., Miller, M. D., Deacon, A. M. & Phizackerley, R. P. (2002).
J. Appl. Cryst. 35, 720–726. Web of Science CrossRef CAS IUCr Journals Google Scholar
Cohen, S. X., Morris, R. J., Fernandez, F. J., Ben Jelloul, M., Kakaris, M., Parthasarathy,
V., Lamzin, V. S., Kleywegt, G. J. & Perrakis, A. (2004). Acta Cryst. D60, 2222–2229. Web of Science CrossRef CAS IUCr Journals Google Scholar
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef IUCr Journals Google Scholar
Cruickshank, D. W. J. (1999). Acta Cryst. D55, 583–601. Web of Science CrossRef CAS IUCr Journals Google Scholar
Diederichs, K. & Karplus, P. A. (1997). Nature Struct. Biol. 4, 269–275. CrossRef CAS PubMed Web of Science Google Scholar
DeLano, W. L. (2002). PyMOL Molecular Viewer. DeLano Scientific LLC, Palo Alto, California, USA. https://www.pymol.org . Google Scholar
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126–2132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Grant, G. A. (2006). J. Biol. Chem. 281, 33825–33829. Web of Science CrossRef PubMed CAS Google Scholar
Hermes, J. D., Tipton, P. A., Fisher, M. A., O'Leary, M. H., Morrison, J. F. & Cleland,
W. W. (1984). Biochemistry 23, 6263–6275. CrossRef CAS PubMed Web of Science Google Scholar
Kabsch, W. (1993). J. Appl. Cryst. 26, 795–800. CrossRef CAS Web of Science IUCr Journals Google Scholar
Kabsch, W. (2010a). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kabsch, W. (2010b). Acta Cryst. D66, 133–144. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kleywegt, G. J. (2000). Acta Cryst. D56, 249–265. Web of Science CrossRef CAS IUCr Journals Google Scholar
Klock, H. E., Koesema, E. J., Knuth, M. W. & Lesley, S. A. (2008). Proteins, 71, 982–994. Web of Science CrossRef PubMed CAS Google Scholar
Krishna, S. S., Weekes, D., Bakolitsa, C., Elsliger, M.-A., Wilson, I. A., Godzik,
A. & Wooley, J. (2010). Acta Cryst. F66, 1143–1147. Web of Science CrossRef IUCr Journals Google Scholar
Krissinel, E. & Henrick, K. (2005). CompLife 2005, edited by M. R. Berthold, R. Glen, K. Diederichs, O. Kohlbacher & I. Fischer, pp.
163–174. Berlin/Heidelberg: Springer-Verlag. Google Scholar
Laskowski, R. A. (2009). Nucleic Acids Res. 37, D355–D359. Web of Science CrossRef PubMed CAS Google Scholar
Legrand, P., Dumas, R., Seux, M., Rippert, P., Ravelli, R., Ferrer, J.-L. & Matringe,
M. (2006). Structure, 14, 767–776. Web of Science CrossRef PubMed CAS Google Scholar
Lesley, S. A. et al. (2002). Proc. Natl Acad. Sci. USA, 99, 11664–11669. Web of Science CrossRef PubMed CAS Google Scholar
Matthews, B. W. (1968). J. Mol. Biol. 33, 491–497. CrossRef CAS PubMed Web of Science Google Scholar
Rippert, P. & Matringe, M. (2002). Eur. J. Biochem. 269, 4753–4761. Web of Science CrossRef PubMed CAS Google Scholar
Sampathkumar, P. & Morrison, J. F. (1982). Biochim. Biophys. Acta, 702, 212–219. CrossRef CAS PubMed Web of Science Google Scholar
Santarsiero, B. D., Yegian, D. T., Lee, C. C., Spraggon, G., Gu, J., Scheibe, D.,
Uber, D. C., Cornell, E. W., Nordmeyer, R. A., Kolbe, W. F., Jin, J., Jones, A. L.,
Jaklevic, J. M., Schultz, P. G. & Stevens, R. C. (2002). J. Appl. Cryst. 35, 278–281. Web of Science CrossRef CAS IUCr Journals Google Scholar
Sheldrick, G. M. (2008). Acta Cryst. A64, 112–122. Web of Science CrossRef CAS IUCr Journals Google Scholar
Song, J., Bonner, C. A., Wolinsky, M. & Jensen, R. A. (2005). BMC Biol. 3, 13. Google Scholar
Sun, W., Shahinas, D., Bonvin, J., Hou, W., Kimber, M. S., Turnbull, J. & Christendat,
D. (2009). J. Biol. Chem. 284, 13223–13232. Web of Science CrossRef PubMed CAS Google Scholar
Sun, W., Singh, S., Zhang, R., Turnbull, J. L. & Christendat, D. (2006). J. Biol. Chem. 281, 12919–12928. Web of Science CrossRef PubMed CAS Google Scholar
Terwilliger, T. (2004). J. Synchrotron Rad. 11, 49–52. Web of Science CrossRef CAS IUCr Journals Google Scholar
Thompson, J. D., Higgins, D. G. & Gibson, T. J. (1994). Nucleic Acids Res. 22, 4673–4680. CrossRef CAS PubMed Web of Science Google Scholar
Turnbull, J., Cleland, W. W. & Morrison, J. F. (1990). Biochemistry, 29, 10245–10254. CrossRef CAS PubMed Web of Science Google Scholar
Turnbull, J., Cleland, W. W. & Morrison, J. F. (1991). Biochemistry, 30, 7777–7782. CrossRef PubMed CAS Web of Science Google Scholar
Turnbull, J., Morrison, J. F. & Cleland, W. W. (1991). Biochemistry 30, 7783–7788. CrossRef PubMed CAS Web of Science Google Scholar
Vaguine, A. A., Richelle, J. & Wodak, S. J. (1999). Acta Cryst. D55, 191–205. Web of Science CrossRef CAS IUCr Journals Google Scholar
Van Duyne, G. D., Standaert, R. F., Karplus, P. A., Schreiber, S. L. & Clardy, J.
(1993). J. Mol. Biol. 229, 105–124. CrossRef CAS PubMed Web of Science Google Scholar
Vonrhein, C., Blanc, E., Roversi, P. & Bricogne, G. (2007). Methods Mol. Biol. 364, 215–230. PubMed CAS Google Scholar
Vriend, G. (1990). J. Mol. Graph. 8, 52–56. CrossRef CAS PubMed Web of Science Google Scholar
Winn, M. D., Murshudov, G. N. & Papiz, M. Z. (2003). Methods Enzymol. 374, 300–321. Web of Science CrossRef PubMed CAS Google Scholar
Xia, T. & Jensen, R. A. (1990). J. Biol. Chem. 265, 20033–20036. CAS PubMed Web of Science Google Scholar
Xie, G., Bonner, C. A. & Jensen, R. A. (2000). Comput. Biochem. Physiol. C Toxicol. Pharmacol. 125, 65–83. CAS Google Scholar
Yang, H., Guranovic, V., Dutta, S., Feng, Z., Berman, H. M. & Westbrook, J. D. (2004).
Acta Cryst. D60, 1833–1839. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ye, Y. & Godzik, A. (2004). Nucleic Acids Res. 32, W582–W585. Web of Science CrossRef PubMed CAS Google Scholar
Zhao, G., Xia, T., Ingram, L. O. & Jensen, R. A. (1993). Eur. J. Biochem. 212, 157–165. CrossRef CAS PubMed Web of Science Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.