An in
Nature Communications volume 14, Article number: 1545 (2023) Cite this article
1750 Accesses
21 Altmetric
Metrics details
The main protease from SARS-CoV-2 (Mpro) is responsible for cleavage of the viral polyprotein. Mpro self-processing is called maturation, and it is crucial for enzyme dimerization and activity. Here we use C145S Mpro to study the structure and dynamics of N-terminal cleavage in solution. Native mass spectroscopy analysis shows that mixed oligomeric states are composed of cleaved and uncleaved particles, indicating that N-terminal processing is not critical for dimerization. A 3.5 Å cryo-EM structure provides details of Mpro N-terminal cleavage outside the constrains of crystal environment. We show that different classes of inhibitors shift the balance between oligomeric states. While non-covalent inhibitor MAT-POS-e194df51-1 prevents dimerization, the covalent inhibitor nirmatrelvir induces the conversion of monomers into dimers, even with intact N-termini. Our data indicates that the Mpro dimerization is triggered by induced fit due to covalent linkage during substrate processing rather than the N-terminal processing.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of COVID-191. Like SARS-CoV and Middle East Respiratory Syndrome coronavirus (MERS-CoV), SARS-CoV-2 is single-strand RNA virus (ssRNA) that belongs to the genera of the beta-coronaviruses1,2. The SARS-CoV-2 genome is composed of nearly 30,000 nucleotides which contains the ORF1ab gene, a large open-reading-frame (ORF) responsible for encoding 16 non-structural proteins (nsp's) as two polyproteins after ribosomal frameshifting, named 1a and 1b1,2,3. Proteolytic processing of viral polyproteins is essential for the viral life cycle, and it's performed by two SARS-CoV-2 encoded cysteine proteases: the papain-like protease (PLpro), which is one of the domains of nsp3, and the viral main protease (Mpro or 3CLpro), encoded by the nsp54,5. PLpro cleaves the viral polyprotein at three sites, while Mpro is responsible for cleaving the polyprotein at eleven distinct sites4,5, including its own N- and C-termini6. Mpro is one of the most promising targets for drug development against SARS-CoV-25,7. All the gathered structural and biochemical information on this target has been crucial for the rapid development of new antivirals8,9,10,11, including the recent drug Paxlovid/nirmatrelvir10, approved for emergency use in USA, Europe and China.
To obtain heterologous expressed mature Mpro, researchers adopted the general strategy of adding nsp4 C-terminal portion to the N-terminal portion of nsp5 constructs, allowing the self-cleavage and dimerization of Mpro in-vitro6. Previously, we described the activity and biochemical profile of SARS-CoV-2 Mpro C145S mutant containing the C-terminal portion of nsp4 at its N-termini6. This serine mutation generated an active but much slower version of Mpro, a valuable tool for studying the biochemical aspects of this enzyme. Differently than the dimeric Mpro, this sample behaved as a dynamic mix of monomers, dimers, trimers and tetramers in solution6. The residual activity of this serine mutant permitted the slow cleavage of N-termini nsp4-nsp5 peptide, which allowed us to monitor the oligomeric states of Mpro in solution during the maturation process6. We observed that the tetrameric form of Mpro C145S achieves the equilibrium as dimers over the course of two days, in a phenomenon that is directly proportional to the N-termini cleavage, therefore inferring that the N-terminal processing was critical for dimerization6. Additionally, we previously revealed the crystal structure of Mpro C145S in complex with the nsp5-nsp6 C-terminal peptide, obtained from the tetrameric Mpro sample, presenting a detailed depiction of this key cleavage maturation event. In summary, two mature Mpro dimers assemble in a dimer-dimer transient complex, positioning the C-terminal of one Mpro molecule towards the active site of another, and followed by C-terminal processing in a trans-cleavage event6.
Here we report the cryo-electron microscopy (cryo-EM) structure of SARS-CoV-2 Mpro C145S mutant containing the nsp4-nsp5 cleavage portion, at 3.5 Å resolution. In this structure, we glimpse the in-solution state of Mpro, bound to two unfolded nsp5 protomers during N-terminal processing. We demonstrate that dimerization is not dependent of N-terminal processing, as previously assumed, but likely a consequence of induced fit required for the covalent linkage during the substrate processing.
Native mass spectroscopy analysis of peaks containing multimers of SARS-CoV-2 Mpro C145S revealed that the sample is composed of a mix of monomers, dimers, trimers and tetramers (Fig. 1), as previously indicated by SEC-MALS analysis6. Additionally, we observed monomer particles that contain sequences of folded and unfolded uncleaved nsp4-nsp5 peptide. In line with our previous model of maturation, in which the cleavage of N-terminal peptide serves as the trigger for dimerization.
From left to right, peaks show monomers cleaved (blue semicircle) an uncleaved (red semicircle), dimers formed by cleaved (blue circles) or half-cleaved (blue-red circle) particles, trimers formed by two cleaved and one uncleaved particles (two thirds blue, one-third red circles) and tetramers formed by two cleaved and two uncleaved particles (two-quarters blue, two-quarters red). Minor peaks of other forms are described in supplementary materials. Graphs were plotted from individual native mass spectrometry experiments.
However, native mass spectroscopy also revealed that the peaks containing oligomeric states could be formed by the combination of uncleaved (theoretical mass of 34,554.54 Da) and cleaved (theoretical mass of 33,780.58 Da) nsp4-nsp5 particles. The same goes both for trimers or tetramers peaks, that surprisingly could be composed by all combinatory possibilities of cleaved and non-cleaved peptides (Fig. 1 and Supplementary Fig. 1). This contradicts preliminary models of Mpro maturation, in which N-terminal processing dictates the dimerization6,12,13. It is also clear by the mass relative quantities that the equilibria of oligomeric states favor the states where more cleaved elements are present (Fig. 1 and Supplementary Fig. 1), still befitting with that model were N-terminal cleavage is directly involved in dimerization.
To confirm these results, we performed a secondary native mass spectroscopy analysis of sample 2 using different setup. Using an Orbitrap Q-Exactive UHMR, seven charge state distributions were observed. We compared the data of with the previous run from Agilent instrument in Supplementary Fig. 6. An abundant charge state distribution near m/z ~ 2500 and centered at 11+ ion corresponds to Mpro with molecular mass 34,556 ± 24 Da (m1 in Supplementary Fig. 6). A low-abundance charge state distribution with a similar magnitude of the charge is also observed, which corresponds to a species with mass 33,865 ± 44 Da (m2 in Supplementary Fig. 6). These two charge state distributions can be assigned to the different Mpro monomers. Multiple peak series can be found between m/z 4000–5000 (all of which are centered at charge state 16 + ), which correspond to ions with molecular weights consistent with m1m1 homo- and m1m2 heterodimers, i.e., 67,687 ± 44 Da for homodimers and 68,466 ± 9 Da for heterodimers. Interestingly, the additional peaks in the spectrum correspond to charge state that deconvolute to molecular weights that are consistent with higher order oligomeric states, with different amounts of m1 and m2. Still, the data shows that sample 2 is composed of native particles combining cleaved and uncleaved protomers, confirming our previous observation.
Here we describe the cryo-Em structure of SARS-CoV-2 Mpro C145S at 3.5 Å (Fig. 2). The final model showed all 306 residues from C145S Mpro visible for both chains, plus clear evidence of an at least 11 residues long peptide occupying both active sites (Fig. 3a). The final model showed no rotamers, Cβ or Ramachandran outliers, and a map-model correlation coefficient (CC) of 0.80 (Fig. 3b, c). Model and/or map were deposited under the codes 8EY2 (for PDB) and EMD-28666 (for EMDB). Final cryo-EM model is very similar to X-ray known structures (Fig. 3d), with an RMSD of 0.7 Å (for 3,909 atoms) when compared with the mature dimeric form of Mpro (PDB 7KPH), or 1.2 Å (for 3,822 atoms) when compared with X-ray structure of C145S mutant (PDB 7N5Z) (Fig. 3d and e). Statistics and parameters from data collection and processing are available in Table 1.
a Aligned micrographs, with scale bar at the bottom. b CTF-function calculated from obtained micrographs. c Extracted particles examples. d Detailed schematic of steps taken for final reconstruction, highlighting obtained 2D and 3D classes, and first high-resolution reconstruction. e Fourier shell correlation (FSC) between half maps of the final reconstructions. At the top, graph shows FSCs versus spatial frequency calculated in directions x (blue), y (green) and z (red). Average cos phase is in black, and global FSC is plotted in yellow. At the bottom, percentage of per angle FSC (blue) overlaid with gold standard FSC plot (red). f Local resolution projected on the final map from two orientations.
a Four-sides rotation view of final map displayed as surface, with chains A and B coloured in white and grey, respectively, and active site peptide map coloured in cyan. b Chain A (blue) domain III model fitted into final map (grey). c Chain A (blue) and B (salmon) interface region fitted into final cryo-EM map (grey). d Superposition of X-ray Mpro model (yellow, PDB 7KPH), X-ray SARS-CoV-2 C145S Mpro (pink, PDB 7N5Z) and SARS-CoV-2 C145S Mpro cryo-EM model (dark blue). e SARS-CoV-2 C145S Mpro cryo-EM model chain A (left) and (right) coloured according its RMSD versus X-ray model of Mpro (PDB 7KPH).
It was previously assumed that the tetrameric sample of Mpro C145S observed in the solution had the same organization than the one obtained with crystal structure of dimer-dimer association6. However, the cryo-EM structure of Mpro C145S revealed a dimeric particle of Mpro anchored to the nsp4-nsp5 region in the moment that precedes cleavage, with detailed density of P and P‘ key residues (Fig. 3a). The 3.5 Å resolution structure provides structural insights into an important step of Mpro maturation, the N-terminal cleavage, exhibiting a clear electron density of the peptide in both chains A and B of the dimer (Fig. 3a). The electric potential map around the active site Ser145* indicates a non-covalent interaction of the nsp4/nps5 peptide (residues SAVLQ of nsp4 and residues 1-6 of nsp5 Mpro, SGFRKM), forming an extended β-sheet maintained mostly by hydrogen bonds at Thr24, Gly143, Ser145*, His163, Glu166 and Gln189 (Fig. 4a). Substrate positions exhibited a binding position highly like the crystallographic structures of Mpro in complex with the physiological peptide substrate (PDBid 7N6N and 7DVP) with RMSD values of respectively of 0.57 and 0.9 calculated between 30 and 54 atoms.
a Active site view of Mpro C145S chain A surface (in grey) bound to nsp4-nsp5 peptide (yellow sticks). Subsites are denotated from S4 to S5’. b Detailed view of Mpro C145S chain A active site residues (in grey) bound to nsp4-nsp5 peptide (yellow sticks), with cryo-EM map showed as surface (contour level of 4.55). c Interaction scheme between nsp4-nsp5 peptide and Mpro C145S chain A. d Selected low-pass filtered particles, highlighting dimer particles (marked with a blue line) bound to monomeric uncleaved particles (marked with a red line). Scale bar is show at the bottom left. e Schematic representation of Mpro C145S dimer (blue) bound to uncleaved particles (red).
In the cryo-EM model, we can see Gln0 occupying the S1 subsite, with its NE2 side chain moiety interacting with OE1 atom of residue Glu166 through a hydrogen bond (2.7 Å and 2.5 Å for chains A and B, respectively). Additionally, OG atom of Ser145* forms a hydrogen bond with main chain hydroxyl group of Gln0. The Gly143 main chain amide donates a hydrogen bond to P1’ carbonyl oxygen, stabilizing the oxyanion hole during catalysis. Leu-1 at the S2 subsite interacts with Gln189 through a hydrogen bond formed between Leu-1 main chain amide and Gln189 side chain OE1 (Fig. 4a, b). Met49, Met149 and Gln189 from the S2 subsite assume a more open conformation compared with captured unbound forms of the enzyme. At the S3 subsite, there is an interaction between the polar atoms of the Val-2 main chain and Glu166 side chain (Fig. 4c). The hydroxyl group of main chain Ala-3 in P4 donates a hydrogen bond to NE2 of Gln189. The lack of polar interactions between protein sites S2-S4 with protein residues would help to explain the variety of distinct amino acids these positions are willing to accommodate. The density map of active site anchored Mpro particle only extends towards residues Met6 from Mpro, suggesting that the rest of the particle is too mobile for model reconstruction.
By carefully inspecting low pass filtered particles used for final reconstruction, we noticed an elongated satellite particle anchored around each one of the dimer active sites regions, with size and shape consistent with a partially folded monomer of Mpro (Fig. 4d). Contrasting with the dimer-dimer complex formed during the C-terminal cleavage, these elongated protomers are randomly distributed around the active site, indicating that the particles are misfolded (Fig. 4d, e). Moreover, the elongated particles shaped as nsp5 monomers does not seem to form dimers (Fig. 4e). These observations are in line with our previous model of immature Mpro, in which we showed that the addition of three non-cleavable amino acids to its N-termini disrupts Mpro folding and prevents dimerization6. Nevertheless, our structure reveals a unique view of the dimeric form of Mpro caught during the cleavage of the N-termini in solution. We could not generate 2D or 3D classes highlighting the satellite particles, which agrees with the partially folded hypothesis.
Many in-silico molecular simulations and docking studies have tried to describe Mpro in solution movement and substrate recognition14,15, no experimentally determined model is available to validate these free of the enclosure of crystal constrains. In here, we used the cryo-EM model to study the dynamic of Mpro during N-terminal cleavage. For that, the particles used for model reconstruction were analyzed using cryoSPARC 3DVA tool for sorting in solution variability of Mpro. The first of four eigenvector components explored generated low-quality volumes that were not interpretable. The other three (components 2, 3 and 4) produced volume frames that were used for analysis of particle variability. In all three, one can see a volume with shape of Mpro bound to the nsp4 peptide in both active sites, suggesting that particles of other oligomeric states were filtered during data processing. As expected, the analysis indicate that the highest conformational plasticity is concentrated in the active site region of Mpro, with the highest RMSD concentrated in the regions of helix 43-53, and in the β-hairpins 19-27, 62-65 and 166-171. Helix 43-53 contains key residues that are involved in the shaping of hydrophobic subsite S2, including Met49. Together with Met165, Met49 form a substrate recognition cavity, with a preference for hydrophobic side chains such as leucine and valine6. The β-hairpin 19-27, responsible for the formation of S1’ subsite also appears to be remarkably mobile when compared with the rest of the protein.
The analysis indicates that both active sites are gradually expanding and contracting over the frames (Supplementary Fig. 2). The expansion of Mpro active site was already correlated with the substrate and ligand binding6,16, so it is likely that this variability is correlated with distinct moments of substrate anchoring. The conformational flexibility of Mpro active site during substrate and ligand binding has been previously observed using cryogenic and room-temperature X-ray crystallography6,17. Kneller et al. demonstrated that during ligand binding, secondary structural elements that form P2-P5 subsites are driven from original position for ligand/substrate accommodation up to 2.4 Å, shifting the shape and electrostatic potential of the site17. The expansion-contraction movements of the active site seem to be symmetrical for both Mpro protomers (Supplementary Fig. 2). This diverge significantly from multiple in-silico studies that have demonstrated that Mpro protomers are independently active18,19,20, but would help to explain why dimerization is critical for full enzyme efficiency. The active site plasticity of in-solution coronaviruses Mpro is critical to understanding the enzyme dynamics and should assist drug discovery campaigns, especially for those platforms relying on in-silico analysis of protein-ligand interaction, which often do not correlate with experimental data21.
The C145S Mpro sample was shown to be a valuable tool to investigate the in-vitro behavior of Mpro 6. Here we explore the capacity of our method to explore the study the effect of potent inhibitors on the enzyme maturation. Our first object was the competitive non-covalent inhibitor MAT-POS-e194df51-1, developed by the COVID Moonshot initiative, with a pIC50 of 7.5. Samples containing monomeric C145S Mpro (sample 1) or tetrameric C145S Mpro (sample 2) were incubated with two concentrations of MAT-POS-e194df51-1, and their oligomeric state was monitored over the course of 48 h by SEC-MALS and compared with control. For simplification, trimers were considered to be part of the tetramers pool.
At 0 h, sample 1 control presented a 1.0/0.0 (monomer/dimer) ratio, progressing to 0.88/0.12 at 24 h and 0.39/0.61 at 48 h (Fig. 5a). Regarding sample 1 containing 4:1 MAT-POS-e194df51-1/protein, it remained at a ratio of 1.0/0.0 monomer/dimer between 0 h and 24 h, only progressing to 0.004/0.996 by the 48 h timepoint (Fig. 5b). Reducing the MAT-POS-e194df51-1/protein molar ratio from 4:1 to 0.4:1 returned the behavior to closer to that of the control, reaching a monomer/dimer ratio of 0.89/0.11 at 24 h and 0.74/0.25 at 48 h. Sample 1 showed that unfolded monomers are basically forming fully mature dimers over time. MAT-POS-e194df51-1 to sample 1, both concentrations seem to inhibit the dimer maturation and cause the accumulation of unfolded monomers particles. With this experiment we were able to demonstrate a unique effect of MAT-POS-e194df51-1 on the enzyme cycle that was never saw, showing not only its capacity of competing with the substrate but also blocking the enzyme maturation cycle and accumulate uncleaved polyprotein. In 2018 Constant et a.l demonstrated for Dengue protease NS3 that the most effective inhibitors should specifically target cleavage sites that would accumulate uncleaved viral protein precursors22. In this case, this inhibited phenotype might trans-dominantly inhibit other in-cell viral phenotypes, possibly suppressing the generation of drug-resistant variants. More studies will be necessary to comprehend the effect of non-cleaved elements in the SARS-CoV-2 metabolism.
a Control reaction containing monomers at 0 h (top), after 24 h incubation (middle) and after 48 h (bottom). b Monomers conversion reaction in presence of non-covalent inhibitor MAT-POS-e194df51-1 at 0 h (top), after 24 h incubation (middle) and after 48 h (bottom). c Monomers conversion reaction in presence of covalent inhibitor Nirmatrelvir at 0 h (top), after 24 h incubation (middle) and after 48 h (bottom).
For control of sample 2, we saw the ratio of 0.43/0.34/0.23 monomer/dimer/tetramer protein mass detected at 0 h, changing to 0.09/0.89/0.02 at 24 h and 0.0/0.98/0.02 at 48 h (Fig. 6a). For sample 2 containing 4:1 MAT-POS-e194df51-1/protein, we saw the ratio of 0.49/0.45/0.05 monomer/dimer/tetramer protein mass detected at 0 h, changing to 0.49/0.51/0.0 at 24 h and 0.45/0.55/0.02 at 48 h (Fig. 6b). For sample 2 containing 0.4:1 MAT-POS-e194df51-1/protein, we saw the ratio of 0.46/0.47/0.06 monomer/dimer/tetramer protein mass detected at 0 h, changing to 0.26/0.73/0.0 at 24 h and 0.02/0.96/0.02 at 48 h.
a Control reaction containing tetramers at 0 h (top), after 24 h incubation (middle) and after 48 h (bottom). b Tetramers conversion reaction in presence of non-covalent inhibitor MAT-POS-e194df51-1 at 0 h (top), after 24 h incubation (middle) and after 48 h (bottom). c Tetramers conversion reaction in presence of covalent inhibitor Nirmatrelvir at 0 h (top), after 24 h incubation (middle) and after 48 h (bottom).
On the control of this experiment, we saw that the sample initiates as a mix of tetramers, dimers and monomers. Over the course of 48 h, both monomers and tetramers are extinguished, and dimers seen to became predominant. The presence of MAT-POS-e194df51-1 on sample 2 seem to cause the same effect that in sample 1 of blocking monomer consumption, but also seem to accelerate the consumption of tetramers. If we think about the tetramer samples as being the enzyme-substrate complex, it is logical that a competitive inhibitor would have this effect on the sample as it causes the dislocation of the unfolded substrate that is bound to Mpro active site. In parallel with sample 1, sample 2 monomers conversion to dimers was also blocked by the presence of MAT-POS-e194df51-1, revealing that the inhibitor not only blocks activity but also might prevent the maturation process, possibly enhancing the antiviral effect beyond its potency to the target.
The effect of covalent Mpro inhibitor Nirmatrelvir/PF-07321332 with a pIC50 of 7.710 was also tested with against C145S samples. The effect of this molecule was remarkably different from that with MAT-POS-e194df51-1 (Fig. 5c). For sample 1 containing 4:1 Nirmatrelvir/protein, we saw the ratio of 0.65/0.35 between monomers/dimers protein mass detected at 0 h, changing to 0.06/0.94 at 24 h and 0.02/0.98 at 48 h (Fig. 5c). For sample 2 containing 4:1 Nirmatrelvir/protein, we saw the ratio of 0.31/0.56/0.12 between monomers/dimers/tetramers protein mass detected at 0 h, changing to 0.0/0.88/0.12 at 24 h and 0.0/0.98/0.02 at 48 h (Fig. 6c).
In sample 1, the presence of Nirmatrelvir seen to strongly induce the formation of dimers from monomers even at zero hours and enhance the ratio of dimer formation over time significantly, with almost full conversion after 24 h (Fig. 5c). In sample 2, both monomers and tetramers equilibrium also seem to be dislocated to favor dimer formation (Fig. 6c). While for the tetramer increased consumption could be explained simple by the competition between Nirmatrelvir and the substrate-enzyme complex, the accelerated ratio of monomers conversion into dimers was completely unexpected. Yet, these unique results might shed light in the details of the first step of protein maturation, the N-terminal processing.
Our initial model for this step was that the N-terminal cleavage is a mix of cis and trans-events, and its proper cleavage would eliminate the steric hindrance that would be preventing dimerization6, which was in line with previous prosed models for SARS-CoV Mpro 12,13. However, the fact that the covalent inhibitor (but not a non-covalent) might induce dimerization in a non-cleaved sample suggests that the trigger of the dimerization is not the N-terminal processing, but the induced fit caused by the covalent linkage itself. This model would also explain why oligomers can be formed by the combination of non-cleaved and cleaved particles rather than only cleaved particles (Supplementary Fig.1).
To confirm this hypothesis, we crystalized Mpro C145S monomers in the presence of Nirmatrelvir. The crystal structure revealed that despite the C145S mutation, Nirmatrelvir is linked covalently to the protein. Notwithstanding, we also observed displacement of Mpro helixes αF, αH and αI from adjacent Domain III, which seem to be caused by the steric hindrance from non-cleaved N-terminal amino acids (Fig. 7). This is like our previous structure of immature Mpro, with the exception that we do not see the displacement of active site residues Phe140, Glu166, Pro168 and Gln1896. This suggested that the covalent linkage permit the proper reshaping of the active site, allowing the dimerization event even in the absence of proper N-terminal cleavage. One might infer that in order to prevent dimerization, an inhibitor must be non-covalent and successfully inhibit the monomeric form of Mpro binding to all the eleven distinct cleavage sites at the viral polyprotein.
Ser1 and Gln-1 alpha carbons are highlighted as red spheres. Native Mpro is shown as grey transparent cartoon, with Ser1 alpha-carbon highlighted as a green sphere.
If one combines the cryo-EM structure of C145S mutant bound to a uncleaved peptide with our previous X-ray structure of C145S (PDB 7N6N) chains A (cleaved peptide covalently bound to Ser145) and chain B (post-cleaved peptide), we can glimpse the structural modifications of Mpro during all steps of the enzymatic processing (Fig. 8). These structures reveal that the enzymatic process starts with key amino acids positioned in similar manner with than the apo-structure (Fig. 8a), but loop β166-171 must be pushed towards the enzyme core to accommodate substrate peptides P3-P5 (Fig. 8b). Next, during the covalent enzyme-substrate intermediary, this loop β166-171 must be pushed even further the enzyme core to allow a shorten covalent distance between C/S145 and the scissile-bond carbonyl C and proper anchoring to the active site, while helixes 43-53 and 62-65 show a loosen conformation (Fig. 8c). After cleavage, the product peptide is no more linked to C/S145, allowing the active site domain return to its apo conformation (Fig.8d). During this process, Mpro active site surface undergoes significant structural and electrostatic potential modifications to accommodate all enzyme-substrate intermediary conformations, now detailed in this series of structures (Fig. 8).
a Key active site residues (green sticks) of Mpro in apo form (top), cartoon view of active site in apo state in yellow (middle) and calculated electrostatic potential projected into surface of Mpro active site (bottom). b Key active site residues (green sticks) of Mpro C145S bound to intact peptide (top), cartoon view of active site from the respective form (middle) and calculated electrostatic potential projected of respective form (bottom). c Key active site residues (green sticks) of Mpro C145S covalently bound to cleaved peptide forming the enzyme-substrate intermediary complex (top), cartoon view of active site from the respective form (middle) and calculated electrostatic potential projected of respective form (bottom). d Key active site residues (green sticks) of Mpro C145S in complex with post-cleaved peptide (top), cartoon view of active site from the respective form (middle) and calculated electrostatic potential projected of respective form (bottom). The transparent sticks and cartoons (grey) in the top and middle figures represent the structural position from the relative elements of the previous step.
The SARS-CoV-2 Mpro is a three-domain 34 kDa protein that is essentially active as a dimer5,6. Since the first X-ray structure of SARS-CoV-2 Mpro was deposited at the Protein Data Bank (PDB) under the code 6LU7 in February 202023, there are nearly 450 structures up to date from Mpro available at the PDB up to 1.2 Å resolution (PDB 7K3T), making the Mpro one of the most well structurally characterized SARS-CoV-2 nsp's33. The Mpro was characterized by X-ray in complex with multiple new drug candidates5,24,25,26,27 and repurposing molecules24,28,29, fragments from large fragment-screening campaigns30, as well as endogenous peptides6,16,31 and peptide-mimetics drug candidates32. Moreover, room and cryo temperature X-ray structures and neutron crystallography were successfully used to provide valuable information for detailing the Mpro mechanism of action16,33 as well as conformational changes during substrate and ligand recognition6,17. Yet, the cryo-EM structure of Mpro reported here, reveals unique features of the enzyme-substrate recognition and in-solution dynamics.
We also showed that the Mpro N-terminal cleavage is important for the proper folding of protein, but not critical for the dimerization. Our data suggest that the dimerization event is governed by the induced fit of the protein during the formation of a covalent linkage during cleavage. This likely means that the cleavage of any of the eleven viral Mpro target sites would serve as a trigger for the dimerization, regardless of the state of the N-terminus. These observations diverge from previous proposed models, in which the cis-cleavage of N and C-terminal between two protomers are necessarily the initial step of the maturation process13,34. In fact, previous studies had showed that substrate-induced dimerization can occur in an in-vitro model of the viral polyprotein, where N and C-terminal are bound to other proteins35. Our model might also explain why Mpro studies that used other strategies for protein production that generated authentic N-terminus, such as the cases of MERS-CoV Mpro where authors opted for using thrombin at the N-termini (therefore, not allowing the protein to perform internal cleavage and complete maturation) had generated protein that behaves mostly as monomers 36. Curiously, authors have also reported that dimerization of MERS-CoV Mpro could be triggered by substrate, which also agrees with our proposed mode36. Still, more data is required to conclude nascent Mpro are processed and converted into mature, and if this process is dependent of processing event, the processed particle or a combination of both. These observations impact our understanding of the maturation process of coronaviruses Mpro, as well as the pharmacodynamics differences between covalent and noncovalent inhibitors.
Details of construct planning, cloning and protein expression are given elsewhere6. Briefly, the viral cDNA template (GenBank MT126808.1), kindly provided by Dr. Edison Durigon (University of São Paulo, São Paulo, Brazil), was used for cloning the coding region of coding region of Mpro (residues 3264-3569) using primers: Fw 5’ CAGGGCGCCATGAGTGGTTTTAGAAAAATGGCATTC 3’ and Rv 5’ GACCCGACGCGGTTATTGGAAAGTAACACCTGAGAC 3’. This sequence was inserted in pET_M11 vector using the LIC method37. This plasmid was then used as the template for the insertion of Mpro native N-terminal residues (GAMSAVLQ ↓ SGFRK) by inverse PCR using primers: Fw: 5’ GCTGCAGAGTGGTTTTAGAAAAATGGCATTC 3’ and Rv: 5’ ACGGCTGACATGGCGCCCTGAAAATA 3’. Then, C145S mutation was inserted in this plasmid by inverse PCR using primers Fw 5’ CCTTAATGGTTCATCTGGTAGTG 3’ and Rv 5’ AATGAACCCTTAATAGTGAAATTGG 3’, resulting in the pET_M11-C145S-Mpro. This construct contains a N-termini 6x histidine tag followed by a TEV cleavage site, and the nsp4 C-termini recognition sequence amino acids (Ser-4, Ala-3, Val-2, Leu-1, Gln-0 ↓ ) followed by the Mpro sequence containing C145S substitution6.
For protein production, pET_M11-C145S-Mpro was used to transform E. coli BL21 cells and cultured in ZYM-5052 media38 at 37 oC and 200 RPM to an OD600 of 0.8, followed by expression at 18 °C, 200 RPM for 16 h. Cells were harvested by centrifugation at 5000 g for 40 min at 4 °C, resuspended in lysis buffer (20 mM Tris pH 7.8, 150 mM NaCl, 1 mM DTT) and disrupted by sonication. Lysate containing soluble protein was clarified by centrifugation at 12,00x g for 30 min at 4 °C.
C145S Mpro was purified by immobilized metal chromatography (IMAC) using a 5 mL HisTrap FF column (GE Healthcare). After column washing with buffer A (20 mM Tris pH 7.8, 150 mM NaCl, 25 mM Imidazole), the protein was eluted with buffer A supplemented with 250 mM imidazole. Sample was buffer exchanged using a 5 mL HiTrap desalting column (GE Healthcare) equilibrated with buffer A. To remove the 6xHis-tag, 2 mg of TEV protease and 4 mM DTT were added to the sample and incubated overnight at 4 °C. Next day, non-cleaved protein and TEV were removed by a second step of IMAC in buffer A. The protein was then purified by size-exclusion chromatography (SEC) using a HiLoad 16/60 Superdex 75 column (GE Healthcare) equilibrated with gel filtration buffer (20 mM Tris pH 7.8, 150 mM NaCl, 1 mM EDTA, 1 mM DTT). The SEC profile of this sample showed multiple overlapping peaks of mixed oligomeric states of Mpro, like previously described6. Peaks with higher retention time are mainly composef (sample 1), while peaks with lower retention time are mainly composed by tetramers (sample 2). Protein purity was analyzed by SDS-PAGE and quantified using the theorical extinction coefficient of 32,890 M−1.cm−1 39. Fresh purified fractions of both peaks were aliquoted at 10 mg.mL−1 and immediately flash-frozen in liquid nitrogen.
For grid preparation, freshly purified C145S Mpro samples from the tetrameric of SEC peak (sample 2) were thawed in ice, spined and diluted in gel filtration buffer. Then, 3 µL of 0.25 mg.ml−1 of samples was applied on glow discharged Quantifoil 300 mesh Cu R2/2 grid. Grid was blotted using Vitrobot (FEI) for 2.5 s with blot force 1 at humidity 100% and 4 °C before plunge freezing into liquid ethane.
Data were collected on Titan Krios operated at 300 kV using a K3 detector in counted super-resolution mode with slit width of 20 eV at a nominal magnification of 130k x corresponding to a calibrated pixel size of 0.653 Å at the specimen level. Movies were recorded over 1 sec at dose rate of 18.5 e-/px/s and fractionated into 43 frames. Data acquisition was done using ThermoFisher Scientific EPU 2.12 with a defocus range of −0.5 to −2.5 µm.
For high-resolution structure determination, the 13,770 super-resolution movies collected at Titan were dose-weighted, aligned and two times binned using MotionCor240,41 implemented in RELION v3.141, resulting in a physical pixel of 0.653 Å. The contrast-transfer-function (CTF) of resulting micrographs was estimated using CTFFIND-v4.142.
1,956,496 particles were picked using SPHIRE-crYOLO43 and down sampled by a factor of 3, extracted with a box size of 72 px. Particles were filtered and sorted with 3 rounds of 2D classification. 569,725 good particles were selected for De novo 3D model. Particles were re-extracted with full resolution. Particles were further filtered and sorted with 3D classification. The particles from the best 3D class were selected for 3D refinement and Bayesian polishing in Relion v3.144.
Refine3D volume and the 227,534 shiny particles were then imported to cryoSPARC v3.2.0 for refinement45. For the deposited structure, map was initially refined using Homogeneous Refinement algorithm with 1 extra final pass, 7 Å initial lowpass resolution and C2 applied symmetry, resulting in a refined map with gold-standard Fourier shell correlation (gsFSC) of 3.75 Å46. The resulting map was used for the Non-uniform Refinement47 with 1 extra final pass, 8 Å initial lowpass resolution, C2 applied symmetry, per-particle defocus optimization and per-group CTF optimization, resulting in a refined map with gsFSC of 3.6 Å. The resulting map was than refined using Local Refinement algorithm with C2 imposed symmetry, resulting in a map with gsFSC of 3.5 Å. The same protocol was also tested with C1 symmetry, resulting in a nearly identical map with gsFSC of 3.76 Å. Therefore, the C2 map was used for further modelling and refinement. Map resolution was calculated using cryoSPARC Local Resolution and 3DFSC Processing Server48. Statistics of data collection, processing and model refinement for the C2 are available in Table 1. Schematic of the data processing steps is available in Fig. 2.
Initial docking of model into map was performed using PDB 7N6N and sharpened map from cryoSPARC using UCSF ChimeraX v1.2.549. For sharpening and enhance visualization of the high-resolution map, we used phenix auto-sharpen50. Model building and refinement was conducted with Phenix Real Space Refinement51 and Coot52, and validation was conducted with MolProbity53. Figures were prepared using PyMOL and ChimeraX v1.2.5. Statistics of model refinement are available in Table 1.
To analyze variability of SARS-CoV-2 Mpro C145S particles, we used the 3D Variability Analysis (3DVA) tool available in cryoSPARC v3.2.054. For that, the 569,725 full size particles used during cryo-EM data processing 3D classification (in Relion) were imported in cryoSPARC v3.2.0. These particles and Refine3D volume were incorporated in a symmetry-free homogeneous refinement cycle. Then, particles and mask were used to compute 4 eigenvectors of the covariance matrix of the data distribution using 3DVA54, with resolution filtered at 5 Å. Model series were generated using 3DVA display tool. Images were generated with ChimeraX v1.2.5.
Reversed-phase chromatography was performed in-line prior to mass spectrometry using an Agilent 1100 HPLC system (Agilent Technologies inc. – Palo Alto, CA, USA). One sample of freshly prepared C145S containing tetrameric peaks (sample 2) was diluted to 20 µg/ml in 0.1% formic acid, and fifty µl was injected on to a 2.1 mm × 12.5 mm Zorbax 5 µm 300SB-C3 guard column housed in a column oven set at 40 oC. The solvent system used consisted of 0.1% formic acid in LC-MS grade water (Millipore, solvent A) and 0.1 % formic acid in methanol (LC-MS grade, Chromasolve, solvent B). Chromatography was performed as follows: Initial conditions were 90 % A and 10 % B and a flow rate of 1.0 ml/min. After 15 s at 10 % B, a two-stage linear gradient from 10 % B to 80 % B was applied, over 45 s and then from 80% B to 95% B over 3 s. Elution then proceeded isocratically at 95 % B for 1 min 12 s followed by equilibration at initial conditions for a further 45 s. Protein intact mass was determined using an MSD-ToF electrospray ionisation orthogonal time-of-flight mass spectrometer (Agilent Technologies Inc. – Palo Alto, CA, USA). The instrument was configured with the standard ESI source and operated in positive ion mode. The ion source was operated with the capillary voltage at 4000 V, nebulizer pressure at 60 psig, drying gas at 350oC and drying gas flow rate at 12 L/min. The instrument ion optic voltages were as follows: fragmentor 250 V, skimmer 60 V and octopole RF 250 V.
For native mass spectrometry, one sample of freshly prepared sample 2 were held on ice and buffer exchanged into 75 µl of 50 mM ammonium acetate pH 7.5 by 3 rounds of gel filtration using BioGel P6 (Biorad) spin columns according to the manufacturer's instructions. Mass spectra were acquired using an Agilent 6530 QTOF operating in positive ion 1 GHz mode using a standard ESI source. Samples were introduced via a syringe pump at a flow rate of 6 µl/min. The ion source was operated with the capillary voltage at 3500 V, nebulizer pressure at 17 psig, drying gas at 325oC and drying gas flow rate at 5 L/min. The instrument ion optic voltages were as follows: fragmentor 430 V, skimmer 65 V and octopole RF 750 V. This same setup was used to describe the tetrameric native form of E. coli LacZ (theorical mass of 465,932 Da), with observed folded mass of 466,092 Da for tetramer55.
In solution oligomeric states of the purified samples were evaluated by size exclusion chromatography coupled with multi-angle light scattering (SEC-MALS). All assays were performed in running buffer composed by 20 mM Tris-HCl pH 7.8, 100 mM NaCl and 1 mM DTT, as described previously6. For that, one sample of 50 µL of each Mpro C145S oligomers at concentration of 50 µM were injected in a Waters 600 HPLC system (Waters) coupled in-line with an UV detector, a miniDAWN TREOS multi-angle light scattering apparatus (Wyatt Technology), a column Superdex 200 Increase 10/300 GL (GE Healthcare) with a flow rate of 0.5 mL/min, and a refractive index detector Optilab T-rEX (Wyatt Technology). The light-scattering detectors were normalized with bovine serum albumin (Sigma-Aldrich). Data were collected and analyzed with the ASTRA 7 integrated software, provided by Wyatt.
Samples analyzed contained 100 µg of C145S Mpro monomers or tetramers that were incubated with 1% DMSO (control) or ligand for 0 h, 24 h, or 48 h. Non-covalent inhibitor MAT-POS-e194df51-1, obtained from Moonshot COVID Consortium was tested in the inhibitor:protein ratio concentration of 4:1 and 0.4:1, and incubated for 0 h, 24 h, or 48 h incubation with each sample. Covalent inhibitor Nirmatrelvir was tested in the inhibitor:protein ratio concentration of 4:1, and incubated for 0 h, 24 h, or 48 h. Given the lack of sufficient resolution of the methods, trimers and tetramers peaks were summed and treated as tetramers. All generated data is available at Supplementary Fig. 3.
Monomers samples of C145S Mpro (sample 1) at 5 mg/mL were incubated with Nirmatrelvir in a 1:5 protein:compound ratio for 1 h at room temperature. Then, sitting drop crystallization plates were set using in condition 0.1 M MES pH 6.7, 5% DMSO, 8% PEG 4000 at 20 °C. Crystals were cryo-protected with 30% PEG 400, and data was collected at MANACA beamline (SIRIUS). Data was processed with XDS via Autoproc56,57. Data was scaled using Aimless via CCP458, and molecular replacement was performed with Phaser59, using PDB 7MBG as template. Data collection statistics are available in Table 2. Data was modelled with Coot and refined with Acedrg and Refmac52,60,61,62,63. Figures were made with ChimeraX, Pymol and BioRender.com. Electron density of this structure is visible in Supplementary Fig. 7.
For a confirmatory experiment using a different setup, native mass spectrometry was carried out using an Orbitrap Q-Exactive UHMR as previously described64. Briefly, analyte solutions were buffer exchanged into 200 mM ammonium acetate pH 7.4 (Sigma Aldrich) using Zeba micro spin desalting columns (7 kDa MWCO, Pierce). One sample containing 1-3 µL of analyte (in 200 mM ammonium acetate) was loaded into in-house prepared gold-coated glass capillaries pulled to ~1 µm point. The capillary was biased 1.0 – 1.2 kV relative to the heated metal capillary (100 °C) to generate positive ions by nanoelectrospray ionization. The mass spectrometer was operated in the positive ion mode using the manufacturers’ recommended settings for "low m/z" detection. The following parameters were manually adjusted: in-source trapping potential: 20 – 50 V, 4 ms; HCD potential: 1 – 10 V; pressure setting: 8.5. Native mass spectra were visualized in QualBrowser (Thermo Fisher Scientific) and OriginPro 2021 (OriginLab Corporation, Northampton, MA). M/z spectra here and from native ESI-TOF were analyzed using Masshunter B.07.00 (Agilent); ESIprot65 and using an ion table. Charge states were assigned manually. Results of this run are available in Supplementary Fig. 6.
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
All data associated with this study are publicly available. All collected movies and polished particles used for final structure determination are available at Electron Microscopy Public Image Archive (EMPIAR) under the code EMPIAR-10810. Cryo-EM maps and structural models are available at Protein Data Bank (PDB) under code 8EY2 and Electron Microscopy Data Bank (EMDB) under the code EMD-28666, and the X-ray model is available under PDB code 8EYJ. The structures used in this study are available in PDB database under the codes 7N5Z, 7KPH, 7N6N, 7K3T and 7DVP. A source data file is provided with this paper for all SEC-MALS and native mass spectroscopy. Source data are provided with this paper.
Wu, F. et al. A new coronavirus associated with human respiratory disease in China. Nature 579, 265–269 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382, 727–733 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bhatt, P. R. et al. Structural basis of ribosomal frameshifting during translation of the SARS-CoV-2 RNA genome. Science 372, 1306–1313 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Osipiuk, J. et al. Structure of papain-like protease from SARS-CoV-2 and its complexes with non-covalent inhibitors. Nat. Commun. 12, 1–9 (2021).
Article Google Scholar
Zhang, L. et al. Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved a-ketoamide inhibitors. Science 368, 409–412 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Noske, G. D. et al. A Crystallographic Snapshot of SARS-CoV-2 Main Protease Maturation Process. J. Mol. Biol. 433, 167118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gordon, D. E. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 583, 459–468 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Chodera, J., Lee, A. A., London, N. & von Delft, F. Crowdsourcing drug discovery for pandemics. Nat. Chem. 12, 581 (2020).
Article CAS PubMed Google Scholar
von Delft, F. et al. A white-knuckle ride of open COVID drug discovery. Nature 594, 330–332 (2021).
Article ADS Google Scholar
Owen, D. R. et al. An oral SARS-CoV-2 Mpro inhibitor clinical candidate for the treatment of COVID-19. Science 374, 1586–1593 (2021).
Article ADS CAS PubMed Google Scholar
The COVID Moonshot Consortium et al. Open Science Discovery of Oral Non-Covalent SARS-CoV-2 Main Protease Inhibitor Therapeutics. Preprint at https://www.biorxiv.org/content/10.1101/2020.10.29.339317v3 (2021).
Xia, B. & Kang, X. Activation and maturation of SARS-CoV main protease. Protein Cell 2, 282–290 (2011).
Article CAS PubMed PubMed Central Google Scholar
Hsu, M. F. et al. Mechanism of the maturation process of SARS-CoV 3CL protease. J. Biol. Chem. 280, 31257–31266 (2005).
Article CAS PubMed Google Scholar
Römer, R. A., Römer, N. S. & Wallis, A. K. Flexibility and mobility of SARS-CoV-2-related protein structures. Sci. Rep. 11, 1–13 (2021).
Article Google Scholar
Zimmerman, M. I. et al. SARS-CoV-2 simulations go exascale to predict dramatic spike opening and cryptic pockets across the proteome. Nat. Chem. 13, 651–659 (2021).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. et al. Crystallographic structure of wild-type SARS-CoV-2 main protease acyl-enzyme intermediate with physiological C-terminal autoprocessing site. Nat. Commun. 11, 5877 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Kneller, D. W. et al. Structural plasticity of SARS-CoV-2 3CL Mpro active site cavity revealed by room temperature X-ray crystallography. Nat. Commun. 11, 7–12 (2020).
Article Google Scholar
Chen, H. et al. Only one protomer is active in the dimer of SARS 3C-like proteinase. J. Biol. Chem. 281, 13894–13898 (2006).
Article CAS PubMed Google Scholar
Cho, E. et al. Dynamic Profiling of β-Coronavirus 3CL Mpro Protease Ligand-Binding Sites. J. Chem. Inf. Model. 61, 3058–3073 (2021).
Article CAS PubMed PubMed Central Google Scholar
Grottesi, A. et al. Computational Studies of SARS-CoV-2 3CLpro: Insights from MD Simulations. Int. J. Mol. Sci. 21, 1–18 (2020).
Article Google Scholar
Macip, G. et al. Haste makes waste: A critical review of docking-based virtual screening in drug repurposing for SARS-CoV-2 main protease (M-pro) inhibition. Med. Res. Rev. 42, 744–769 (2021).
Article PubMed PubMed Central Google Scholar
Constant, D. A., Mateo, R., Nagamine, C. M. & Kirkegaard, K. Targeting intramolecular proteinase NS2B/3 cleavages for trans-dominant inhibition of dengue virus. Proc. Natl Acad. Sci. USA 115, 10136–10141 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Jin, Z. et al. Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors. Nature 582, 289–293 (2020).
Article ADS CAS PubMed Google Scholar
Qiao, J. et al. SARS-CoV-2 Mpro inhibitors with antiviral activity in a transgenic mouse model. Science 371, 1374–1378 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Zaidman, D. et al. An automatic pipeline for the design of irreversible derivatives identifies a potent SARS-CoV-2 Mpro inhibitor. Cell. Chem. Biol. 12, 1795–1806 (2021).
Article Google Scholar
Han, S. H. et al. Structure-Based Optimization of ML300-Derived, Noncovalent Inhibitors Targeting the Severe Acute Respiratory Syndrome Coronavirus 3CL Protease (SARS-CoV-2 3CLpro). J. Med. Chem. 65, 2880–2904 (2021).
Article PubMed Google Scholar
Dampalla, C. S. et al. Structure-Guided Design of Conformationally Constrained Cyclohexane Inhibitors of Severe Acute Respiratory Syndrome Coronavirus-2 3CL Protease. J. Med. Chem. 64, 10047–10058 (2021).
Article CAS PubMed Google Scholar
Günther, S. et al. X-ray screening identifies active site and allosteric inhibitors of SARS-CoV-2 main protease. Science 372, 642–646 (2021).
Article ADS PubMed PubMed Central Google Scholar
Ma, C. et al. Boceprevir, GC-376, and calpain inhibitors II, XII inhibit SARS-CoV-2 viral replication by targeting the viral main protease. Cell. Res. 30, 678–692 (2020).
Article CAS PubMed PubMed Central Google Scholar
Douangamath, A. et al. Crystallographic and electrophilic fragment screening of the SARS-CoV-2 main protease. Nat. Commun. 11, 1–11 (2020).
Article Google Scholar
Zhao, Y. et al. Structural basis for replicase polyprotein cleavage and substrate specificity of main protease from SARS-CoV-2. Proc. Natl Acad. Sci. USA 119, e2117142119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bai, B. et al. Peptidomimetic α-Acyloxymethylketone Warheads with Six-Membered Lactam P1 Glutamine Mimic: SARS-CoV-2 3CL Protease Inhibition, Coronavirus Antiviral Activity, and in Vitro Biological Stability. J. Med. Chem. 65, 2905–2925 (2021).
Article PubMed Google Scholar
Kneller, D. W. et al. Unusual zwitterionic catalytic site of SARS–CoV-2 main protease revealed by neutron crystallography. J. Biol. Chem. 295, 17365–17373 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, S., Jonas, F., Shen, C. & Higenfeld, R. Liberation of SARS-CoV main protease from the viral polyprotein: N-terminal autocleavage does not depend on the mature dimerization mode. Protein Cell 1, 59–74 (2010).
Article CAS PubMed PubMed Central Google Scholar
Li, C. et al. Maturation mechanism of severe acute respiratory syndrome (SARS) coronavirus 3C-like proteinase. J. Biol. Chem. 285, 28134–28140 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ho, B. L. et al. Critical Assessment of the Important Residues Involved in the Dimerization and Catalysis of MERS Coronavirus Main Protease. PLoS One 10, e0144865 (2015).
Article PubMed PubMed Central Google Scholar
Aslanidis, C. & de Jong, P. J. Ligation-independent cloning of PCR products (LIC-PCR). Nucleic Acids Res. 18, 6069–6074 (1990).
Article CAS PubMed PubMed Central Google Scholar
Studier, F. W. Protein production by auto-induction in high density shaking cultures. Protein Expr. Purif. 41, 207–234 (2005).
Article CAS PubMed Google Scholar
Wilkins, M. R. et al. Protein identification and analysis tools in the ExPASy server. Methods Mol. Biol. 112, 531–552 (1999).
CAS PubMed Google Scholar
Zheng, S. Q. et al. MotionCor2: Anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife 7, e42166 (2018).
Article PubMed PubMed Central Google Scholar
A, R. & N, G. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Article Google Scholar
Wagner, T. et al. SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. Commun. Biol. 2, 1–13 (2019).
Article Google Scholar
J, Z., T, N. & SHW, S. A Bayesian approach to beam-induced motion correction in cryo-EM single-particle analysis. IUCrJ 6, 5–17 (2019).
Article Google Scholar
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. CryoSPARC: Algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article CAS PubMed Google Scholar
Scheres, S. H. W. & Chen, S. Prevention of overfitting in cryo-EM structure determination. Nat. Methods 9, 853–854 (2012).
Article CAS PubMed PubMed Central Google Scholar
Punjani, A., Zhang, H. & Fleet, D. J. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat. Methods 17, 1214–1221 (2020).
Article CAS PubMed Google Scholar
Tan, Y. Z. et al. Addressing preferred specimen orientation in single-particle cryo-EM through tilting. Nat. Methods 14, 793–796 (2017).
Article CAS PubMed PubMed Central Google Scholar
EF, P. et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
Article Google Scholar
Terwilliger, T. C., Sobolev, O. V., Afonine, P. V. & Adams, P. D. Automated map sharpening by maximization of detail and connectivity. Acta Crystallogr. D. Struct. Biol. 74, 545–559 (2018).
Article CAS PubMed PubMed Central Google Scholar
Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D. Struct. Biol. 74, 531–544 (2018).
Article CAS PubMed PubMed Central Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D. Struct. Biol. 66, 486–501 (2010).
Article CAS Google Scholar
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D. Struct. Biol. 66, 12–21 (2010).
Article CAS Google Scholar
Punjani, A. & Fleet, D. J. 3D variability analysis: Resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM. J. Struct. Biol. 213, 107702 (2021).
Article CAS PubMed Google Scholar
Chalk R. In Heterologous Gene Expression in E. coli: Methods and Protocols (ed. Burgess-Brown N. A.) 373–395 (Humana Press, New York City, 2017).
Kabsch, W. Acta Crystallogr. D. Struct. Biol. 66, 125–132 (2010). XDS.
Article CAS Google Scholar
Vonrhein, C. et al. Data processing and analysis with the autoPROC toolbox. Acta Crystallogr. D. Struct. Biol. 67, 293–302 (2011).
Article CAS Google Scholar
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr. D. Struct. Biol. 67, 235–242 (2011).
Article CAS Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Article CAS PubMed PubMed Central Google Scholar
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D. Struct. Biol. 67, 355–367 (2011).
Article CAS Google Scholar
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr. D. Struct. Biol. 69, 1204–1214 (2013).
Article CAS Google Scholar
Karplus, P. A. & Diederichs, K. Linking crystallographic model and data quality. Science 336, 1030–1033 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Long, F. et al. AceDRG: a stereochemical description generator for ligands. Acta Crystallogr. D. Struct. Biol. 73, 112–122 (2017).
Article CAS PubMed PubMed Central Google Scholar
El-Baba, T. J. et al. Allosteric Inhibition of the SARS-CoV-2 Main Protease: Insights from Mass Spectrometry Based Assays. Angew. Chem. Int. Ed. 59, 23544 (2020).
Article CAS Google Scholar
Winkler, R. ESIprot: a universal tool for charge state determination and molecular weight calculation of proteins from electrospray ionization mass spectrometry data. Rapid Commun. Mass SP 24, 285–294 (2010).
Article CAS Google Scholar
Download references
Authors acknowledge Diamond Light Source for access and support of the cryo-EM facilities at the UK national electron Bio-Imaging Centre (eBIC) through proposal BI27083 and NT29349, funded by the Wellcome Trust, MRC and BBSRC. Authors acknowledge COVID Moonshot Consortium for supplying compound and Prof. Carlos Alberto Montanari for kindly supplying Nirmatrelvir. ASG acknowledge Andressa Patricia Alves Pinto for support with SEC-MALS. This project was funded by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES – Project 88887.516153/2020-00, ASG) and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP projects 2013/07600-3, 2015/16811-3 and 2016/19712-9, GO). This work has received support from the EU/EFPIA/OICR/McGill/KTH/Diamond Innovative Medicines Initiative 2 Joint Undertaking (EUbOPEN grant n° 875510, RC). This research used facilities of the SIRIUS, part of the Brazilian Center for Research in Energy and Materials (CNPEM), a private non-profit organization under the supervision of the Brazilian Ministry for Science, Technology, and Innovations (MCTI). The MANACA beamline staff is acknowledged for the assistance during the experiments of proposal 20220605. Research reported in this publication was supported in part by NIAID of the National Institute of Health under award number U19AI171399. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Health. Research in the Robinson laboratory is supported by a Medical Research Council (MRC) program grant (MR/V028839/1). This research was also funded in part, by the Wellcome Trust, Grant No. 221795/Z/20/Z. TJE is a Junior Research Fellow at Linacre College and was supported by a Royal Society Newton International Fellowship. TJE and CVR are grateful for generous support provided by the University of Oxford COVID-19 Research Response fund and its donors (BRD00230).
These authors contributed equally: Gabriela Dias Noske, Yun Song.
A full list of members and their affiliations appears in the Supplementary Information.
Sao Carlos Institute of Physics, University of Sao Paulo, Av. Joao Dagnone, 1100 - Jardim Santa Angelina, Sao Carlos, 13563-120, Brazil
Gabriela Dias Noske, Rafaela Sachetto Fernandes, Glaucius Oliva & Andre Schutzer Godoy
Electron Bio-imaging Centre, Diamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot, OX11 0QX, UK
Yun Song & C. David Owen
Centre for Medicines Discovery, Oxford University, OX1 3QU, Oxford, UK
Rod Chalk, Haitem Elmassoudi & Lizbé Koekemoer
Physical and Theoretical Chemistry Laboratory, Department of Chemistry, University of Oxford, South Parks Road, OX1 3TA, Oxford, UK
Tarick J. El-Baba & Carol V. Robinson
The Kavli Institute for Nanoscience Discovery, Dorothy Crowfoot Hodgkin Building, South Parks Road, OX1 3QU, Oxford, UK
Tarick J. El-Baba & Carol V. Robinson
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
A.S.G. conceived this project. G.D.N. performed biochemical and biophysical experiments. Y.S. prepared grids and data collection. A.S.G. and Y.S. processed the cryo-EM data. A.S.G. and G.D.N. performed modelling and analysis. L.K., H.E., R.C., T.H.E. and C.V.R. performed MS experiments. A.S.G. wrote and conceived this manuscript. A.S.G., R.S.F., Y.S., C.D.O. and G.O. revised this manuscript.
Correspondence to Andre Schutzer Godoy.
The authors declare no competing interests.
Nature Communications thanks Jeffrey Agar, Stephen Harding and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Reprints and Permissions
Noske, G.D., Song, Y., Fernandes, R.S. et al. An in-solution snapshot of SARS-COV-2 main protease maturation process and inhibition. Nat Commun 14, 1545 (2023). https://doi.org/10.1038/s41467-023-37035-5
Download citation
Received: 01 August 2022
Accepted: 28 February 2023
Published: 20 March 2023
DOI: https://doi.org/10.1038/s41467-023-37035-5
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.