Volume 18, Issue 4 pp. 839-844
Protein Structure Report
Free Access

C-terminal domain of SARS-CoV main protease can form a 3D domain-swapped dimer

Nan Zhong

Nan Zhong

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, People' Republic of China

Nan Zhong, Shengnan Zhang, and Fei Xue authors contributed equally to the work.

Search for more papers by this author
Shengnan Zhang

Shengnan Zhang

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, People' Republic of China

Nan Zhong, Shengnan Zhang, and Fei Xue authors contributed equally to the work.

Search for more papers by this author
Fei Xue

Fei Xue

Structural Biology Laboratory, Tsinghua University, Beijing 100084, China

Nan Zhong, Shengnan Zhang, and Fei Xue authors contributed equally to the work.

Search for more papers by this author
Xue Kang

Xue Kang

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, People' Republic of China

Search for more papers by this author
Peng Zou

Peng Zou

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, People' Republic of China

Search for more papers by this author
Jiaxuan Chen

Jiaxuan Chen

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Life Science, Peking University, Beijing 100871, People' Republic of China

Search for more papers by this author
Chao Liang

Chao Liang

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

Search for more papers by this author
Zihe Rao

Zihe Rao

Structural Biology Laboratory, Tsinghua University, Beijing 100084, China

Search for more papers by this author
Changwen Jin

Changwen Jin

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, People' Republic of China

College of Life Science, Peking University, Beijing 100871, People' Republic of China

Search for more papers by this author
Zhiyong Lou

Corresponding Author

Zhiyong Lou

Structural Biology Laboratory, Tsinghua University, Beijing 100084, China

Zhiyong Lou, Laboratory of Structural Biology, Tsinghua University, Beijing 100084, People's Republic of China

Bin Xia, Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People s Republic of China

Search for more papers by this author
Bin Xia

Corresponding Author

Bin Xia

Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People' Republic of China

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, People' Republic of China

College of Life Science, Peking University, Beijing 100871, People' Republic of China

Zhiyong Lou, Laboratory of Structural Biology, Tsinghua University, Beijing 100084, People's Republic of China

Bin Xia, Beijing Nuclear Magnetic Resonance Center, Peking University, Beijing 100871, People s Republic of China

Search for more papers by this author
First published: 10 February 2009
Citations: 27

Abstract

SARS coronavirus main protease (Mpro) plays an essential role in the extensive proteolytic processing of the viral polyproteins (pp1a and pp1ab), and it is an important target for anti-SARS drug development. We have reported that both the Mpro C-terminal domain alone (Mpro-C) and the N-finger deletion mutant of Mpro (Mpro-Δ7) exist as a stable dimer and a stable monomer (Zhong et al., J Virol 2008; 82:4227-4234). Here, we report structures of both Mpro-C monomer and dimer. The structure of the Mpro-C monomer is almost identical to that of the C-terminal domain in the crystal structure of Mpro. Interestingly, the Mpro-C dimer structure is characterized by 3D domain-swapping, in which the first helices of the two protomers are interchanged and each is enwrapped by four other helices from the other protomer. Each folding subunit of the Mpro-C domain-swapped dimer still has the same general fold as that of the Mpro-C monomer. This special dimerization elucidates the structural basis for the observation that there is no exchange between monomeric and dimeric forms of Mpro-C and Mpro-Δ7.

Introduction

SARS coronavirus (SARS-CoV) was identified as the etiological agent of the pandemic transmissible disease, severe acute respiratory syndrome.1-3 The SARS-CoV 5′ two-thirds genome of the virus encodes two overlapping polyproteins, pp1a (486 kDa) and pp1ab (790 kDa), which are proteolytically processed into 16 matured nonstructural proteins (nsp1-16) by two proteases included in these two polyproteins. These nonstructural proteins mediate the viral replication and transcription.4 Main protease of SARS-CoV (Mpro) plays an important role in the extensive proteolytic processing of the viral polyproteins, which makes it essential for the viral life cycle and represents an attractive target for antiviral agent development.5-7

The first structure of Mpro was solved in 2003 and revealed a homodimer which is highly similar to other previously reported coronavirus main proteases structures.5, 8 It has been reported that Mpro exists in solution as an equilibrium between monomeric and dimeric forms,9 and only the dimeric form is enzymatically active.10 Shi et al.11 have reported that the Mpro N-terminal alone is a monomer, whereas the Mpro C-terminal domain alone (Mpro-C) exists as a stable dimer. However, our previous studies demonstrated that Mpro-C exists as both a stable monomer and a stable dimer in solution, and so does the N-finger deletion mutant of Mpro (Mpro-Δ7) which can also form a stable dimer through dimerization of the C-terminal domain.12

Here, we report structures of monomeric and dimeric forms of the C-terminal domain of Mpro (Mpro-C). Mpro-C monomer maintains the same fold as that in the crystal structure of Mpro. On the other hand, the Mpro-C dimer has a novel structure characterized by 3D domain-swapping, which provides the structural basis for the dimer stability.

Results

Solution structure of Mpro-C monomer

We have obtained nearly complete backbone resonance assignments for the Mpro-C monomer except those for residues F219 and E288, and more than 95% of side-chain resonances were assigned. The solution structures of the Mpro-C monomer were calculated using interproton NOE-derived distance restraints together with the dihedral angle and hydrogen bond restraints (Table I). The 20 structures with the lowest energies are shown in Figure 1(A), together with the ribbon diagrams of the mean structure [Fig. 1(B)]. The Ramachandran plot indicates that a majority of residues (99.7%) have their Phi and Psi angles in allowed regions, and only 0.3% of them are in disallowed region. The root mean square deviation (RMSD) for backbone heavy atoms in secondary structure elements is 0.20 ± 0.05 Å and that for all heavy atoms in secondary structure elements is 0.62 ± 0.08 Å (Table I). All these indicate that the solution structure is determined with good quality.

Details are in the caption following the image

Solution structure of Mpro-C monomer. (A) Superimposition of the backbone trace of 20 representative structures; (B) Ribbon diagram of the mean structure with secondary structural elements labeled; (C) Comparison of the structure of Mpro-C monomer (blue) with its corresponding part in the crystal structure of Mpro (1UK2, magenta). domains I and II of Mpro are shown in light gray.

Table I. Restraints and Structural Statistics for Mpro-C Monomer
Distance restraints
 Intra-residue 1492
 Sequential (|ij| = 1) 656
 Medium (1<|ij|<5) 449
 Long-range (|ij|>5) 620
 Ambiguous 823
 Total 4040
Hydrogen bond restraints
 Hydrogen bond 32
Dihedral angle restraints
 Φ 58
 Ψ 58
 Total 116
Violation statistics
 NOE violation (>0.2 Å) 1
 Torsion angle violation (>2°) 0
Ramachandran plot statistics
 Residues in most  favored regions (%) 85.3
 Residues in allowed  regions (%) 13.7
 Residues in generously  allowed regions (%) 0.7
 Residues in disallowed  regions (%) 0.3
RMSD from mean structure (Å)
Backbone Heavy atoms
 Global 3.7 ± 0.8 4.0 ± 0.7
 Structured region 0.6 ± 0.1 0.9 ± 0.1
 Regular secondary structure 0.20 ± 0.05 0.62 ± 0.08

Just as the C-terminal domain in the full-length Mpro, the Mpro-C monomer adopts a globular all-alpha fold consisting of five α-helices (α1 (T201-N214), α2 (L227-Y237), α3 (Q244-T257), α4 (V261-N274), and α5 (P293-Q299)), two well-defined loops (L2 (N238-T243) and L3 (G258-A260)), and two flexible loops (L1 (D216-T226) and L4 (G275-T292)). Helix α1 is enwrapped by helices α234, and α5, along with L4, which forms the hydrophobic core [Fig. 1(B)]. The mean structure of the Mpro-C monomer overlaps well with the corresponding part in the structure of Mpro (1UK2) except for loops L1 and L4, with a 0.5 Å RMSD for backbone heavy atoms in secondary structure elements [Fig. 1(C)]. As loops L1 and L4 are relatively flexible in solution structure, the structural difference for these two loops is not unexpected. Therefore, the structure of Mpro-C monomer should remain the same as that of the C-terminal domain of Mpro.

Crystal structure of Mpro-C dimer

As the NMR data quality for the Mpro-C dimer is very poor, we have determined its crystal structure with a resolution of 2.4 Å. The crystal structure of the Mpro-C dimer was determined by molecular replacement using the structure of the C-terminal 120 residues from Mpro crystal structure as the search model (Table II).

Table II. X-Ray Diffraction Data and Model Refinement Statistics for Mpro-C dimer
Parameters
 Data collection statistics
  Cell parameters a = 51.4 Å, b = 51.3 Å, c = 51.4 Å
α = 112.2°, β = 112.0°, γ = 104.4°
  Space group P1
  Wavelength used (Å) 1.5418
  Resolution (Å) 50.0 (2.5) – 2.4
  No. of all reflections 44,600
  No. of unique reflections 15,252
  Completeness (%) 96.0 (88.1)
  Average I/σ(I) 27.3 (5.2)
  Rmerge (%) 5.1 (19.0)
 Refinement statistics
  No. of reflections   used (σ(F) > 0) 14,727
  Rwork (%) 20.8
  Rfree (%) 24.3
  No. of protein residues 402
  No. of protein atoms 3162
  No. of solvent molecules 60
  RMSD bond distance (Å) 0.007
  RMSD bond angle (°) 1.322
  Average B-value (Å2) 40.2
 Ramachandran plot statistics
  Residues in most   favored regions 91.9%
  Residues in additionally   allowed regions 7.8%
  Residues in generously   allowed regions 0.3%
  • a Numbers in parentheses are corresponding values for the highest resolution shell.
  • b Rmerge = ΣhΣl | Iih − <Ih> |/ΣhΣI <Ih>, where <Ih> is the mean of the observations Iih of reflection h.
  • c Rwork = Σ(||Fp(obs)| − |Fp(calc)||)/ Σ|Fp(obs)|; Rfree = R factor for a selected subset (5%) of the reflections that was not included in prior refinement calculations.

The N-terminal 11 residues and C-terminal 8 or 9 residues in two protomers are invisible on the electron density map. To our surprise, the structure of the Mpro-C dimer is characterized by 3D domain-swapping with the two helices α1 of the two molecules interchange their positions (Fig. 2). Each helix α1 is now surrounded by helices α2–α5 of the other molecule and these five helices from two molecules adopts a compact globular all-alpha fold the same as that of Mpro-C monomer, which is called a “folding subunit.” Thus, the Mpro-C dimer is consisted of two identical folding subunits and each is reconstituted by chain fragments from both protomers. In other words, helix α1 and helices α2–α5 from one protomer are now located in different folding subunits for the Mpro-C dimer, and the two protomers are symmetrically related by a crystallographic twofold axis. The two folding subunits of the Mpro-C dimer are linked by two hinge loops consisting of residues D216-T226 which form loop L1 in the structure of the Mpro-C monomer. As a result, a new short 310 helix is formed by residues Trp218, Phe219, and Leu220 on both hinge loops. Consequently, helix α4, which is covered by loop L1 in the Mpro-C monomer, is now exposed in both folding subunits of the Mpro-C dimer [Fig. 2(D)].

Details are in the caption following the image

Crystal structure of Mpro-C dimer. (A and B). Ribbon diagrams of the 3D domain-swapped dimer are shown with 90° rotation. The secondary structural elements are labeled. The two protomers are shown in red and blue, respectively. (C). Topology diagrams of Mpro-C dimer with the same color scheme as in A and B. (D). Superimposition of Mpro-C dimer structure with the mean structure of Mpro-C monomer. The two protomers of Mpro-C dimer are shown in red and blue, respectively. Mpro-C monomer is in light grey.

According to the nomenclature of Liu et al.,13 the interface between domains of a domain-swapped oligomer with the structural characteristics presented in the monomer is termed the closed interface and that found only in the oligomer is termed the open interface. The Mpro-C dimer has an extensive close interface of about 3000 Å2, which involves interchanging of α1 helices of both protomers and produces tight entanglement of the two protomers. This elucidates the structure basis for the exceeding stability of the Mpro-C dimer and the lack of exchange between the monomer and dimer.12 However, there is no apparent interaction to define an open interface between the two hinge loops. In addition, the electron densities of residues on the hinge loops are relatively weak and their main chain B-factors are relatively high (>50 Å2), which suggest that the hinge loops linking the two folding subunits are rather flexible. Thus, we expect that the relative orientation of two folding subunits in the Mpro-C dimer should not be fixed in solution.

Discussion

We have reported that Mpro-C exists in both monomeric and a dimeric forms in solution, and here, we have solved the structures of both forms. We found that the Mpro-C monomer structure adopts the same conformation as that of the C-terminal domain in the crystal structure of full-length Mpro. However, the Mpro-C dimer has an unusual structure which is a 3D domain-swapped dimer. We have carried out denature-refolding experiments and found that both stable monomeric and dimeric forms can be regenerated from refolding of either Mpro-C monomer or dimer (data not shown). Also, taking into consideration that both Mpro-C monomer and dimer are produced in E. coli and there is no exchange between the two, it seems that forming domain-swapped dimer is not dependent on other components of the virus or human cells, but should be an intrinsic ability of Mpro-C.

Furthermore, our previous NMR studies demonstrated that the two C-terminal domains of the Mpro-Δ7 dimer have identical dimerization pattern to that of the Mpro-C dimer.12 Thus, Mpro-Δ7 can also form a domain-swapped dimer through domain-swapping of their C-terminal domains, which also explains why the Mpro-Δ7 dimer is stable and does not exchange with the Mpro-Δ7 monomer. However, what puzzles us is that we failed to detect any domain-swapped dimer for the full-length Mpro expressed in E. coli. Why could the full-length Mpro not form domain-swapped dimer? Could it be that the N-finger of Mpro is able to interfere and prevent the 3D domain-swapping dimerization of the C-terminal domain of Mpro? Is there a biological relevance for this 3D domain-swapping? It is clear that the two N-termini of both protomers in the crystal structure of the Mpro-C dimer stretch out from the proteins in opposite directions, and the relative orientation of the two folding subunits in the domain-swapped Mpro-C dimer is different from that of the two C-terminal domains in the crystal structure of the full-length Mpro. Even if the full-length Mpro could form a domain-swapped dimer, the two N-terminal domains of the dimer would be far away, and the domain-swapped dimer could not adopt the active conformation. In other words, a domain-swapped dimer of the full-length Mpro would not be active. But, why would the virus retain the C-terminal domain within Mpro that has the potential to deactivate the enzyme? These are the questions we will pursue in the future.

Materials and Methods

For the Mpro-C, the DNA fragment encoding residues 187–306 was cloned into pET21a vector. Samples of Mpro-C monomer and dimer were prepared according to previously published method.12 All NMR samples were at a concentration of about 1 mM and were prepared in buffer containing 50 mM potassium phosphate (pH 7.0), 1 mM EDTA, 0.03% NaN3, in 90% H2O/10% D2O, plus Complete, an EDTA-free Protease Inhibitor Cocktail (Roche, Germany). All NMR experiments were performed at 298 K on a Bruker Avance 500 MHz (with cryoprobe), 600 MHz NMR, and 800 MHz spectrometers. Resonance assignments were obtained using standard methods.14 The interproton nuclear Overhauser effect (NOE) was employed to generate the distance restraints. Dihedral angles were determined from backbone chemical shifts using TALOS.15 Hydrogen bond restraints were generated from the H-D exchange experiments in combined with the CSI secondary structural prediction.16, 17 Structures were calculated and refined using the program CYANA and AMBER.18-20

Mpro-C dimer sample was concentrated to 30 mg/mL in 20 mM Tris pH 7.0. Crystallization was performed by the sitting-drop vapor-diffusion method at 289 K in 48-well plates. The crystals selected for diffraction studies grew in 0.2M Sodium chloride, 0.1M BIS-TRIS (pH 5.5), 25% w/v PEG3350. A 2.4 Å resolution diffraction data set was collected at 100 K from a single Mpro-C dimer crystal using an in-house Rigaku MM-007 generator and an R-Axis VI++ detector. The beam was focused by osmic mirror. A total of 270 frames of data were collected. Processing of diffraction images and scaling of the integrated intensities were performed using the HKL2000 software package.21 Initial phases were obtained by molecular replacement with PHASER22 using the crystal structure of SARS-CoV main protease (PDB code: 2H2Z, excluding the N terminal 186 residues) as the searching model. The final manual rebuilding and refinement were performed in COOT,23 Refmac,24 and CNS regarding to the 2Fo-Fc and 1Fo-Fc density map.

The solution and crystal structures were analyzed using the program packages PROCHECK25 and Figures were created by using MOLMOL.26

Coordinates

Coordinates and structure factors have been deposited at the PDB under accession code 3EBN for the crystal structure of the Mpro-C dimer and 2K7X for the solution structure of the Mpro-C monomer.

Acknowledgements

All NMR experiments were carried out at the Beijing NMR Center.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.