Volume 42, Issue 26 p. 1861-1872
Full Paper
Free Access

Identification of novel inhibitors of SARS-CoV-2 main protease (Mpro) from Withania sp. by molecular docking and molecular dynamics simulation

Surjeet Verma PhD,

Department of Plant & Soil Sciences, University of Pretoria, Pretoria, South Africa

Search for more papers by this author
Chirag N. Patel PhD,

Department of Botany, Bioinformatics and Climate Change Impacts Management, School of Sciences, Gujarat University, Ahmedabad, India

Search for more papers by this author
Muktesh Chandra PhD,

Corresponding Author

Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, Uttar Pradesh, India

Correspondence

Muktesh Chandra, Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, Uttar Pradesh, India.

Email: mchandra33@gmail.com

Search for more papers by this author
First published: 20 July 2021

Abstract

Since December 2019, coronavirus disease (COVID-19) has claimed the lives of millions of people across the globe. To date, no medicine is available for the responsible virus SARS-CoV-2. 3CLpro, that is, 3-chymotrypsin-like protease, the main protease (Mpro), has an important role in cleaving pp1a and pp1ab polyproteins. This Mpro serves as an important target in drug designing against COVID-19. Herein, the study includes the investigation, screening, and identification of potent leads from (Withania sps.), against SARS-CoV-2, using virtual screening, molecular docking, and molecular dynamics (MD) simulations. Seventy-three natural compounds from this important medicinal plant were screened. The Binding affinity was used to identify the most probable target to inhibit the Mpro, compounds 27-hydroxywithanolide F (W32, −11.5 kcal/mol), withanolide A (W56, −11.4 kcal/mol), and withacoagulin H (W30, −11.1 kcal/mol) showed highest binding energy. Lipinski's rule, followed by drug-likability and likeness screening, resulted in 36 molecules. Further, MD simulation of 50 ns predicted withacoagulin H possessing strong binding affinity and hydrogen-bonding interactions with the active site. The binding free energy calculation showed the most negative energy of withacoagulin H (−63.463 KJ/mol) compared to other selected compounds. The study also compared the bonding energy of already reported repurposed and newly synthesized drugs. Further, absorption, distribution, metabolism, and excretion predictions were made to found a good balance of potency. Hence the following screened compounds from Withania sps. could serve as the potential leads for drug development against COVID-19.

1 INTRODUCTION

The first case for novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was reported in the live wildlife market at Hubei province of China in December 2019.1 Since its inception, the virus has made unprecedented growth and affected the human population worldwide. Due to the increased number of cases worldwide, the World Health Organization declared it a pandemic on March 11, 2020. The novel SARS-CoV-2 infection creates a pneumonia-like condition known as COVID-19. Symptoms such as sore throat, fever, and dry cough are observed, causing complications like septic, pulmonary edema, organ failure, and Acute Respiratory Distress Syndrome.2 In such a devastating situation development of medication are much needed.

There are four classes of coronaviruses, namely alpha, beta, gamma, and delta. SARS and Middle East respiratory syndrome (MERS) belongs to the beta category. The causative agent for COVID-19 is SARS-CoV2.3 The virus got SARS-CoV-2, the name due to the 82% similarity of its RNA genome with SARS-CoV. The SARS-CoV-2 consists of a 30 kb long single-stranded RNA molecule with a 5′ cap and 3′ poly-A tail with positive polarity. Its virion consists of crown-shaped peplomers, with 80–160 nm in diameter.4 The CoVs consist of four structural proteins, namely spike (S protein), envelope (E) protein, membrane (M) protein, and nucleocapsid (N) protein. The N protein participates in RNA genome packaging, and S, M, E proteins are involved in forming a viral coat.5 The COVID-19 genome, for its replication and transcription, encodes two replicase proteins, pp1a and pp1ab.6 The main protease (Mpro) releases the functional peptides from the polyproteins through extensive proteolytic processing. Mpro is vital for the viral life cycle as it digests the polyproteins at 11 conserved sites by autolytic cleavage by pp1a and pp1ab.7 Moreover, the absence of homologs in humans makes it a perfect target for antiviral drug design.

As per the information, SARS-CoV-2 has a low fatality rate but spreads more efficiently than MERS and SARS-CoV, making it more difficult to stop its infection. For designing therapeutic to counter infection and devise pathology for COVID-19, it is necessary to understand how coronavirus infects the host and hijacks its machinery. The knowledge obtained can be used to design new drugs, repurpose the existing drugs, or test molecules of medicinal plants for their efficacy against the virus.

Medicinal plants serve as a vital source for many pharmacologically active structurally diverse metabolites, having their role in traditional medicinal systems and pharmaceutical industries. These metabolites, aka secondary metabolites, have an important functional and physiological function in plants. Metabolites differ among plants, for example, phenylpropanoids and terpenoids in Ocimum species, withanolides and withaferins in Withania species, and so on. Here, molecules from Withania sps were tested to get their efficacy and binding capability to the main protease of SARS-CoV-2.

In recent years, strategies such as drug repurposing have played a crucial role in speeding the drug designing against rapidly spreading infections, either be Ebola infection, Hepatitis C, Zika virus, or the SARS-CoV-2.8-10 Repurposing investigations revealed various medications, such as lopinavir, remdesivir, lopinavir-ritonavir, hydroxychloroquine, ivermectin, and numerous other natural products and natural product-derived compounds, had good effectiveness against SARS-CoV-2.11, 12 Moreover, few other repurposed synthetic molecules have shown good inhibitory activity against SARS-CoV-2 Mpro in-vitro.6, 13 Several molecules isolated from various plants such as Glycyrrhiza glabra, Asparagus rasemosus, Ocimum basilicum, Ocimum sanctum, Clonorchis sinensis, and Withania somnifera, and so forth, also exhibit antiviral properties.14-16 Recently, few phyto-molecules including oolonghomobisflavan-A, theasinensin-D, theaflavin-3-O-gallate, glycyrrhizic acid, asparoside-C, D, and F have been identified as potential inhibitors of SARS-CoV-2 Mpro and spike protein in several in-silico studies.17-19 The glycyrrhizic acid, its derivatives, and other triterpenes have shown great inhibition potential against SARS-CoV replication and cytopathic effect in-vitro. Due to the 82% of RNA genome similarity between SARS-CoV and SARS-CoV-2, Verma et al.11 have hypothesized that these compounds or similar compounds may potentially inhibit SARS-CoV-2 viral replication and infection as well. Furthermore, few similar steroidal glycosides such as abiraterone acetate, ouabain, digoxin, and digitoxin have shown promising in-vitro inhibitory activity against SARS-CoV-2.12

Following this hypothesis having the close structural similarity to these compounds, 73 withanolides isolated from five Withania species were selected from the literature and screened virtually. Further, the rule of five analyses reduced the list of molecules. In-silico studies, including molecular docking simulation, were done to screen the potential inhibitors for Mpro of SARS-CoV-2. Absorption, distribution, metabolism, and excretion (ADME) prediction provided the bioavailability score (ABS) and permeability of molecules which act as an essential factor for drug designing. Furthermore, we envisage that in-vitro and in-vivo studies should be conducted for better inhibitory potential on these selected molecules, serving as a template for drug design for Mpro from SARS-CoV-2.

2 MATERIALS AND METHODS

2.1 Compilation of dataset

The three-dimensional structure of SARS-CoV-2 Mpro, PDB ID 6Y2F, with a resolution of 1.95, was retrieved from the protein data bank.3 The structure of reference compounds (RC1-5) and 73 Withania compounds (W1-73) were collected from the literature.6, 11, 20 The local database of names and structures of all the compounds (Table S1) was created by drawing their structure through ChemDraw ultra12 and saved in SDF format. These 2D formats of molecules were transformed to the 3D coordinates and optimized using the GUI version of Openbabel2.3.2.21

2.2 Molecule preparation and grid generation

The crystal structure was prepared for binding analysis using Autodock Tools (ADT).22 Protein preparation included adding charges (Gasteiger charges), polar hydrogens, and optimizing the rotatable bonds. Prepared protein was thus saved in pdbqt format for further analysis. The idea of a binding site was obtained from already available information the coordinates around the centre atom of the active site considered for the generation of the grid box. ADT received the information of the grid box based on the above information. The grid box for PDB ID 6Y2F considered was with coordinates as 10.601, −0.613, and 21.174 for x, y, and z-axis, respectively. The size of the grid box was manually adjusted and set as 20.180 × 20.180 × 20.180. The configuration file was created using the receptor, ligand, and grid parameters information before the final execution of the molecular docking. In the virtual screening process, already reported drug molecules, lopinavir, and RC1-4 were used as control molecules throughout the study.

All the biomolecules and the control molecules were prepared using ADT by adding polar hydrogen and gasteiger charges. After adjusting the rotatable bonds and torsion angle, each of the molecules was saved in pdbqt format. The molecular docking output files were analyzed based on ligand and protein binding energy and interactions profile. The protein-ligand interaction profile was utilized to evaluate the interactions.

2.3 ADME risk and rule of five based screening

Lipinski's rule of five was considered the primary factor for screening the molecules.23 After screening, the molecules were subjected to ADME risk assessment software to ascertain their risk for drugability. Physiochemical properties were evaluated to understand the oral activity of molecules through Lipinski's rules.

2.4 Pharmacokinetics compliance evaluation

Most drugs fail in the discovery process in reaching clinical trials, which happens due to poor pharmacokinetics (PK). The PK properties were calculated for drug testing toward ADME.24 The study included physiochemical properties estimation (such as molecular weight [MW], topological polar surface area [TPSA]), lipophilicity (indicated by log P, and consensus log P), water solubility (indicated by log S, on a scale of −10 [insoluble] to 0 [highly soluble]), pharmacokinetics (gastrointestinal [GI] absorption, BBB permeant for blood–brain barrier, log Kp for skin permeability), drug-likeness (included Lipinski, Ghose, Egan, and bioavailability), and so forth. The process of excreting the drug from the body depends on log P and MW.

2.5 Molecular dynamics simulation studies

The Desmond (Schrodinger Release 2019-3) package was used for molecular dynamic (MD) simulations25 with the top three Withania compounds of the SARS-CoV-2 Mpro target. The experiment included simulations for 50 ns followed by plotting various matrices to check the stability of structures. The protein preparation wizard at the Schrodinger interface was used to create protein starting structures. For this bond orders assignments, the addition of hydrogens, and filling of missing amino acid chain and loops, and hydrogen bond assignment optimization. System Builder module applied to build the periodic simulation box and single point charge water model used for salvation, with Optimized Potentials for Liquid Simulations (OPLS) all-atom force field. The system was minimized using the steepest descent technique for 1000 iterations. After equilibration, the unrestrained production phase ran under NPT (number of atoms, pressure, and temperature were kept constant) ensemble for 50 ns at 300 K temperature and 1.01325 bar pressure. Nosé–Hoover thermostat (relaxation time = 1 ps) and the isotropic Martyna–Tobias–Klein barostat (relaxation time = 2 ps) were applied. Short-range interactions (cutoff = 9 Å) and long-range Coulombic interactions were evaluated using the smooth particle mesh Ewald (PME) method with the RESPA integrator. The conformations captured in the simulation trajectories were exported at every 5 ps. After completing simulations, the system's stability was assessed using root mean square deviation (RMSD), root mean square fluctuations (RMSF), Hydrogen bond analysis, the radius of gyration (rGyr), and the histogram for torsional bonds.

2.6 Binding free energy calculation (MM/GBSA)

To ascertain the binding free energy of protein-ligand complex calculations are made based on Molecular mechanics generalized Born surface area (MM/GBSA). A total 50 ns trajectory was submitted to the prime module of Schrödinger to perform MM/GBSA calculations.26 Estimations of free energy are based on the following equation
urn:x-wiley:01928651:media:jcc26717:jcc26717-math-0001
where, ΔG is free energy for the respective complex, receptor, and ligand.

3 RESULTS AND DISCUSSION

Literature survey helped in identifying the 73 phytochemicals isolated from Withania somnifera (L.) Dunal, Withania coagulans (Stocks) Dunal, Withania adpressa Coss, Withania aristata (Aiton) Pauquy, Withania frutescens (L.) Pauquy, and Withania obtusifolia Täckh. Few repurposed drugs/synthetic compounds, including ebselen (RC1), tideglusib (RC2), and two feline drugs, GC373 (RC3), and GC376 (RC4), have shown significant inhibitory potential against SARS-CoV-2 Mpro.6, 13 These compounds and lopinavir (RC5, an HIV protease inhibitor) were selected as reference compounds in this study (Figure S1).

3.1 Molecular docking studies

In the current scenario, virtual screening of phytochemicals plays an important part in the drug discovery process. Virtual screening performed based on pharmaco-informatics approach utilizing in-silico tools plays a pivotal role in discovering drug against the pathogen.27 Mpro of SARS-CoV-2 is a dimeric cysteine protease, with two promoters oriented at right angles to each other.28 SARS Mpro active site is similar to that of human CoV (HCoV) and porcine transmissible gastroenteritis virus, consisting of a Cys-His dyad with positional variations.28 Coronavirus utilizes chymotrypsin-like protease and papain protease to cleave long polyprotein precursors into individual functional Nsps, essential for virus life.5 Hence, Mpro becomes the main target for anti-coronavirus drug development. A set of 73 molecules were docked to the binding site of Mpro, and prioritization was made based on interaction energy scores. Molecule W48 (−13.0 kcal/mol), W32 (−11.5 kcal/mol), W56 (−11.4 kcal/mol), W30 (−11.1 kcal/mol), W60 (−11.0 kcal/mol), W52 (−10.9 kcal/mol), W34 (−10.8 kcal/mol), and W55 (−10.6 kcal/mol) binding affinity (Table 1 and Table S1) were more than that of RC1 (−5.8 kcal/mol), RC2 (−6.9 kcal/mol), RC3 (−7.7 kcal/mol), RC4 (−8.3 kcal/mol), and RC5 (−7.1 kcal/mol). The other molecules also showed good binding scores as defined (Table S1). The discovery studio visualizer examined the interacting residues between the active site and selected molecules. Re-docking of the initial inhibitor was conducted with SARS-CoV-2 Mpro and found 0.00 Å RMSD between the docked pose and crystal structure available RCSB database.

TABLE 1. Inhibition constant (Ki) in-silico obtained by molecular docking for the reference and top nine compounds
S. no. Ligands Binding affinityaa Docking scores (kcal/mol) corresponds to the virtual screening scores as provided through autodock tools.
Ki (Molar)bb Ki values are in-silico values in molar calculated using the anaconda python script.
1. RC1 −5.8 5.41E-05
2. RC2 −6.9 8.39E-06
3. RC3 −7.7 2.16E-06
4. RC4 −8.3 7.83E-07
5. RC5 −7.1 5.98E-06
6. W32 −11.5 3.46E-09
7. W56 −11.4 4.10E-09
8. W30 −11.1 6.82E-09
9. W34 −10.8 1.13E-08
10. W55 −10.6 1.59E-08
11. W62 −10.4 2.23E-08
12. W33 −10.0 4.39E-08
13. W36 −10.0 4.39E-08
14. W39 −9.9 5.21E-08
  • a Docking scores (kcal/mol) corresponds to the virtual screening scores as provided through autodock tools.
  • b Ki values are in-silico values in molar calculated using the anaconda python script.

Further, PDB ID: 7BRR and PDB ID: 7BQY, crystal structures of Mpro checked for binding energy with the screened molecules' significant activity with the same molecules was observed. Autodock-vina and Autodock are robust tools utilizing the Lamarckian algorithm to fit the molecule into an active site.22, 29 The binding score obtained by these tools acts as deciding factor for screening the large set of molecules and makes a positive impact in the process of drug discovery.10, 19, 30

3.2 Screening through the rule of five

Drug discovery requires passing out drug candidate molecules in subsequent preclinical, clinical, and further commercial phases of drug development. The development of automated and improved in-silico methods had increased the ability to dissect the fundamentals of ADME/PK. Undesirable and dispossessed pharmacokinetic properties become the primary reason for the compound to phase out from the process of drug discovery. Lipinski's rule was used to filter the compounds and assess the chemical properties related to oral bioavailability. Out of 73 natural biomolecules from Withania, only 36 could make it through the rule of five with zero violations (Table 2). These 36 compounds were further evaluated for pharmacokinetics compliance, including ADME. To avoid failure at later stages of drug development, compliance with pharmacokinetic parameters are of utmost importance. Despite having a high binding affinity, compounds W48, W51, W52, W54, W60, and W61 did not pass the rule of five and hence were not included in further testing.

TABLE 2. Compliance of screened molecules with Lipinski's rule
Code Compound name MW TPSA H-bond donors H-bond acceptors Rotatable bonds Rule of five violation
(20 ≤ X ≤ 500)g/mol (20 ≤ X ≤ 130)Å2 (≥4) (≥7) (≤9)
W1 (17S,20S,22R)-14α,15α,17β,20β-Tetrahydroxy-1-oxowitha-2,5,24-trienolide 486.6 124.29 4 7 2 0
W2 Withanolide F 470.6 104.06 3 6 2 0
W3 Withanolide J 470.6 104.06 3 6 2 0
W6 Witharistatin 468.58 96.36 2 6 3 0
W9 4β-Hydroxy-1-oxo-5β,6β-epoxywitha-2,24-dienolide 454.6 76.13 1 5 2 0
W10 4β-Hydroxy-1-oxo-5β,6β-epoxy-22R-witha-2,14,24-trienolide 452.58 76.13 1 5 2 0
W14 (20S,22R)-27-Hydroxy-1,4-dioxo-witha-2,5,16,24-tetraenolide 450.57 80.67 1 5 3 0
W15 (4S,22R)-4,16,27-Trihydroxy-1-oxo-witha-2,5,17(20),24-tetraenolide 468.58 104.06 3 6 2 0
W16 (4S,20S,22R)-4,27-Dihydroxy-1-oxo-witha-2,5,16,24-tetraenolide 452.58 83.83 2 5 3 0
W19 4β,27-Dihydroxy-1-oxo-witha-2,5,24-trienolide 454.6 83.83 2 5 3 0
W23 Obtusifonolide 496.64 89.9 1 6 5 0
W26 Isowithanone 470.6 96.36 2 6 2 0
W29 Withacoagulin G 470.6 104.06 3 6 3 0
W30 Withacoagulin H 468.58 104.06 3 6 3 0
W31 Withacoagulin I 470.6 104.06 3 6 2 0
W32 Ajugin E 486.6 124.29 4 7 3 0
W33 (20R, 22R-14α, 20α)-Dihydroxy1-oxowitha-2,5,16,24 tetraenolide 452.58 83.83 2 5 2 0
W34 Withacoagulin 452.58 83.83 2 5 3 0
W35 Withanolide H 470.6 104.06 3 6 3 0
W36 Withanolide G 454.6 83.83 2 5 2 0
W37 Coagulin C 452.58 72.83 1 5 1 0
W38 17β-Hydroxywithanolide K 470.6 104.06 3 6 2 0
W39 Deglucosylcoagulin L 488.61 124.29 4 7 2 0
W41 Withanolide L 452.58 83.83 2 5 2 0
W42 Withacoagulin A 452.58 83.83 2 5 2 0
W43 Withacoagulin C 486.6 124.29 4 7 2 0
W44 Withacoagulin D 486.6 124.29 4 7 3 0
W45 Withacoagulin E 454.6 83.83 2 5 2 0
W47 5β,6β,14α,15α-Diepoxy-4β,27-dihydroxy-1-oxowitha-2,24-dienolide 484.58 108.89 2 7 3 0
W55 (20S,22R)-3α,6α-Epoxy-4β,5β,27-trihydroxy-1-oxowitha-24-enolide 488.61 113.29 3 7 3 0
W56 Withanolide A 470.6 96.36 2 6 2 0
W57 (20S,22R)-4β,5β,6α,27-Tetrahydroxy-1-oxowitha-2,24-dienolide 488.61 124.29 4 7 3 0
W62 Viscosalactone B 488.61 116.59 3 7 3 0
W66 2,3-Dihydrowithaferin A 472.61 96.36 2 6 3 0
W68 17α-Hydroxywithaferin A 486.6 116.59 3 7 3 0
W73 Withaferin A 470.6 96.36 2 6 3 0
  • Abbreviations: H-bond, hydrogen bond; MW, molecular weight; TPSA, topological polar surface area.

3.3 Binding site analysis of selected molecules

In-silico inhibition constant (Ki) expresses the inhibitor concentration required to inhibit the chemical reaction 50% at a specific substrate concentration.31 Ki value of the top nine molecules and reference compounds were predicted as per docking score (Table 1). Molecular docking provides a stable complex comprising confirmation with the best-fit ligand in the substrate-binding pocket. The bonding pattern was established to substantiate the docking results. It was found that molecules formed the bonding with Cys-His dyad, either be through hydrogen bond (H-bond), van der Waals (VdW), and Pi-alkyl interactions, and so forth. After exploring the interactions, W32 manifests three H-bond with Gln192 and Thr190, two Pi-Alkyl bonding with His41 and Cys145, and VdW with Thr25, Met49, Asn142, Gly143, His164, Met165, Glu166, Pro168, Arg188, and Gln189. In W56 residue, Gly143 formed H-bond and residues His41, Met49, and Met165 formed Pi-alkyl interactions and VdW with residues Thr25, Thr26, Leu27, Asn142, Cys145, His164, Glu166, Asp187, Arg188, Gln189, Thr190, and Gln192. W30 manifests four H-bond with Gly143, Arg188, Thr190, and Gln192, while two Pi-alkyl bonds are established with His41 and Cys145 and VdW Thr25, Thr26, Leu27, Met49, Asn142, Ser144, His164, Met165, Gln166, and Gln189. W34 forms four H-bond with residues Thr190, Gln192, and Gly143, two Pi-Alkyl bonds with His41 and Cys145 and VdW with residues Thr25, Thr26, Leu27, Met49, Asn142, Ser144, His164, Met165, Arg188, and Gln189. Likewise W55 forms four H-bond with Asn142, Ser144, Cys145, and His163 and VdW with residues Thr24, Thr25, Thr26, Leu27, His41, Cys44, Thr45, Phe140, Gly143, Glu166, and His172 (Figure 1(A,B,D,G,H) and Table 3). Similarly, H-bond, Pi-alkyl bonds, and VdW by W62, W33, W36, and W39 (Figure 1(C,E,I) and Table 3). The in-vitro tested molecules were also analyzed for binding residues, and they also showed binding with the HisCys dyad. In RC5, the three H-bond formed with Gly143, Ser144, and Glu166, Pi-alkyl bond with Met49 and Met165, and VdW with Thr25, Leu27, Leu141, His164, Leu167, Pro168, Asp187, Arg188, Gln189, Thr190, and Gln192. Similar chemical molecule RC1 showed three H-bonds Gly143, Ser144, and Cys145, one Pi-sigma (Asn142), and one Pi-Alkyl bond (Cys145). RC2 manifested one H-bond with Glu166 and one three Pi-Alkyl interactions with His41, Met165, and Cys145, RC3 interaction through H-bonds with residues Phe140 and Glu166 and His41, and Cys145 forms Pi-Alkyl interactions. Likewise, in RC4, six H-bonds formed with His41, Gly143, Ser144, Cys145, and His164, one Pi-alkyl bond formed with His41 and VdW interactions were also manifested (Figure 2 and Table 3). Hydrogen bonds are intermolecular interactions, play an important role in determining molecular aggregations and conformations. Their presence becomes an important factor in governing the dynamics and functions.32 Binding interactions such as H-bond, carbon-hydrogen bonds, π-alkyl, alkyl, Pi-sigma, Van der Waals, and so forth interactions play an important role in the binding of the molecule to the substrate.

image
Interacting residues of top nine phytomolecules (2D format) with SARS-CoV-2 Mpro (i) withacoagulin H (W30), (ii) ajugin E (W32), (iii) (20R, 22R-14α, 20α)-dihydroxy1-oxowitha-2,5,16,24 tetraenolide (W33), (iv) withacoagulin (W34), (v) withanolide G (W36), (vi) deglucosylcoagulin L (W39), (vii) (20S,22R)-3α,6α-epoxy-4β,5β,27-trihydroxy-1-oxowitha-24-enolide (W55), (viii) withanolide A (W56), (ix) viscosalactone B (W62)
TABLE 3. Binding residues of top nine compounds and reference compounds, obtained through molecular docking
S. no. Compounds Interacting residuesaa Interacting residues shown are within the 4 Å vicinity of ligands.
1. RC1 Thr25, Thr26, Leu27, His41, Phe140, Leu141, Asn142, Gly143, Ser144, Cys145, Glu166
2. RC2 His41, Met49, Phe140, Leu141, Asn142, Gly143, Ser144, Cys145, His163, His164, Met165, Glu166, Asp187, Arg188, Gln189, Gln192
3. RC3 Thr25, His41, Met49, Phe140, Leu141, Asn142, Gly143, Ser144, Cys145, His163, His164, Met165, Glu166, His172, Arg188, Gln189, Thr190, Gln192
4. RC4 Thr25, Leu27, His41, Met49, Phe140, Leu141, Asn142, Gly143, Ser144, Cys145, His163, His164, Met165, Glu166, Leu167, Pro168, Asp187, Arg188, Gln189, Gln192
5. RC5 Thr25, Leu27, His41, Met49, Leu141, Asn142, Gly143, Ser144, Cys145, His164, Met165, Glu166, Leu167, Pro168, Asp187, Arg188, Gln189, Thr190, Gln192
6. W32 Thr25, His41, Met49, Asn142, Gly143, Cys145, His164, Met165, Glu166, Pro168, Arg188, Gln189, Thr190, Gln192
7. W56 Thr25, Thr26, Leu27, His41, Met49, Asn142, Gly143, Cys145, His164, Met165, Glu166, Asp187, Arg188, Gln189, Thr190, Gln192
8. W30 Thr25, Thr26, Leu27, His41, Met49, Asn142, Gly143, Ser144, Cys145, His164, Met165, Gln166, Arg188, Gln189, Thr190, Gln192
9. W34 Thr25, Thr26, Leu27, His41, Met49, Asn142, Gly143, Ser144, Cys145, His164, Met165, Arg188, Gln189, Thr190, Gln192
10. W55 Thr24, Thr25, Thr26, Leu27, His41, Cys44, Thr45, Phe140, Asn142, Gly143, Ser144, Cys145, His163, Glu166, His172
11. W62 His41, Met49, Phe140, Leu141, Asn142, Gly143, Ser144, Cys145, His163, His164, Met165, Leu167, Pro168, His172, Arg188, Gln189, Thr190, Gln192
12. W33 Met49, Phe140, Leu141, Asn142, Met165, Glu166, Leu167, Pro168, Arg188, Gln189, Thr190, Gln192
13. W36 His41, Met49, His164, Met165, Glu166, Leu167, Pro168, Asp187, Arg188, Gln189, Thr190, Ala191, Gln192
14. W39 Met49, Phe140, Leu141, Asn142, Ser144, Cys145, His163, His164, Met165, Glu166, Leu167, Pro168, His172, Arg188, Gln189, Thr190, Gln192
  • Note: Most common residues are highlighted in bold.
  • a Interacting residues shown are within the 4 Å vicinity of ligands.
image
Interacting residues of reference compounds (2D format) with SARS-CoV-2 Mpro (i) ebselen (RC1), (ii) tideglusib (RC2), (iii) GC373 (RC3), (iv) GC376 (RC4), (v) lopinavir (RC5)

3.4 Assessment of drug-likeness

Further, drug-like physicochemical properties were calculated by Lipinski's rule of five for the candidate compounds. Drug likeness is an essential consideration in the early phases of drug discovery for compound selection with desired bioavailability.33 Approaches such as a quantitative estimate of drug-likeness,32 oral PhysChem score,33 ABS,34 or Gaussian scoring function35 include physiochemical parameters into the drug-likeness score, assisting in compound screening. Physiochemical properties such as permeability, lipophilicity, solubility, affinity, and metabolic stability were also assessed. Screened compounds ABS was 0.55, which signifies that the compounds passed Lipinski's rule of five.23 ADME and molecular PK were also taken into consideration. Such as solubility with water, absorption by the intestine, hepatotoxicity, plasma protein binding, GI absorption, blood–brain penetration (BBB), and interaction with cytochrome P450 (CYP). Rules define, though, cannot predict the pharmacological activity of compounds. Yet, these rules in a well-defined stepwise manner play a crucial role in determining the significant lead molecules for protein specificity and biological activity. To adjudicate the significance of compounds, MW is considered the essential factor, that is, MW of the compound should be between 150 and 500 g/mol (Table 2). Lipophilicity is signified as the coefficient between n-octanol and water (log Po/w), that is, the ratio of solubility of the compound in octanol compared to its solubility in an aqueous medium. MW in collaboration with log P defined as an important factor for permeability. Similarly, water solubility determined by log S value which did not exceeded six for compound to be soluble. log S along with determinants such as MW, number of rotatable bonds, fraction of aromatic heavy atoms and log P, defines the compounds are soluble in aqueous medium (Table 4).

TABLE 4. Compliance of screened molecules with standard ranges of ADME and drug-likeness
Compounds log S (solubility) log Kp log P CYP1A2 BBB permeant
W33 −4.33 −7.02 3.65 No No
W36 −4.71 −6.61 3.9 No No
W56 −4.67 −6.86 3.39 No No
W32 −3.74 −8.05 2.67 No No
W30 −3.94 −7.59 2.96 No No
W39 −3.51 −8.41 2.41 No No
W34 −4.62 −6.61 3.77 No No
W55 −4.47 −7.26 2.73 No No
W62 −4.47 −7.26 2.8 No No
  • Note: All the values predicted are by Swiss-ADME analysis server.

The candidate compounds have high bioavailability because they do not violate Lipinski's rule of five, that is, their molecular mass is below 500 Da, they possess high lipophilicity (log P < 5), hydrogen donors (<5), hydrogen acceptors (<10), and, their molar refractivity lie in between 40 and 130. Moreover, when we calculated the TPSA for passive molecular transport through membranes, the result showed their values were <130 Å2. Its a general observation that compounds with TPSA>140 Å2 have low oral bioavailability. Furthermore, GI absorption values, BBB permeation calculated to see whether the compounds show active efflux through biological membranes, that is, through the GI wall to lumen or from the brain. Additionally, CYP interactions were calculated to understand drug elimination through biotransformation. CYP interaction plays an important step in clearing a drug or its metabolite from the body leading to low/high toxicity. The values of our probable compounds were within the range of standard drugs (Table 4).

As a result, the top nine molecules identified in this study that have the potential to inhibit SARS-CoV-2 Mpro include withacoagulin H (W30), 27-hydroxywithanolide F or ajugin E (W32), (20R, 22R-14, 20α)-dihydroxy1-oxowitha-2,5,16,24 tetraenolide (W33), withacoagulin (W34), withanolide G (W36), deglucosylcoagulin L (W39), (20S,22R)-3α,6α-epoxy-4β,5β,27-trihydroxy-1-oxowitha-24-enolide (W55), withanolide A (W56), viscosalactone B (W62) (Figure 3). As evident from Table 2, all of these compounds showed better binding affinity to SARS-CoV-2 Mpro than the reference compounds (RC1-5). Hence, these compounds could exhibit more potent inhibitory activity than the RC1-5 in in-vitro assessments. The in-vitro half-maximum inhibitory concentration (IC50) values for the reference compounds ebselen (RC1), tideglusib (RC2), GC373 (RC3), and GC376 (RC4) are quite low, 0.67, 1.55, 0.40, and 0.19 μM, respectively.6, 13 Few other similar compounds like digitoxin, digoxin, and ouabain have been reported to have in-vitro anti-SARS-CoV-2 activity with significantly lower IC50 values of 0.23, 0.19, and 0.097 μM, respectively. Another similar compound, abiraterone acetate, has been reported to exhibit anti-SARS-CoV-2 activity with 50% maximal effective concentrations (EC50) of 1.94 μM.12

image
Chemical structure top nine screened phytomolecules namely withacoagulin H (W30), ajugin E (W32), (20R, 22R-14α, 20α)-dihydroxy1-oxowitha-2,5,16,24 tetraenolide (W33), withacoagulin (W34), withanolide G (W36), deglucosylcoagulin L (W39), (20S,22R)-3α,6α-epoxy-4β,5β,27-trihydroxy-1-oxowitha-24-enolide (W55), withanolide A (W56), viscosalactone B (W62) and few similar molecules with in-vitro anti-SARS-CoV-2 activity (digitoxin, ouabain, digoxin, and abiraterone acetate)

Compounds W30-36 have been isolated from W. coagulans while W39 is a semisynthetic derivative of coagulin L (W4), Coagulin L has been isolated from W. adpressa as well as W. coagulans, and compounds W55-62 isolated from W. somnifera. Hence, from these results, it can be concluded that out of five species, W. adpressa, W. coagulans, and W. somnifera may have great potential to inhibit SARS-CoV-2 (COVID-19). However, the in-vitro assessment is still to be done for the confirmation of the same. Apart from this, these plant species have been well known for their immune-boosting potential,11 which is further advantageous in COVID-19 treatment. Compounds W30-36 have also been reported to show significant inhibition of nitric oxide (NO) production, tumor necrosis factor-α, and induced nuclear factor-kappa B (NF-κB). The infection with SARS-CoV-2 also activates transcription factors such as NF-κβ. Hence, these compounds might also help reduce such complications. Compound W39 exhibited significant inhibition of the postprandial rise in hyperglycemia and post-sucrose load in normoglycemic and streptozotocin-induced diabetic rats. Since COVID-19 has been well known for its secondary complications with diabetic patients, W39 and W. coagulans will be further advantageous for COVID-19 treatment in diabetic patients. Compounds W55 and 56 have also been reported for their significant neurite outgrowth activity. Compound W62 has been reported for its selective inhibitory activity against cyclooxygenase-2 (COX-2) enzyme and lipid peroxidation. Patients infected with SARS-CoV-2 have several inflammatory complications as well. Hence, these compounds and respective plant species might help reduce various other complications and inhibit viral replication.11

3.5 Molecular dynamics and simulation

MD simulation study of 50 ns time scale of top three docked complexes (W30, W32, and W34) conducted to check for the stability in the active site of main protease of SARS-CoV-2. Although the top nine compounds may be potential candidates for in-vitro analysis, only these three were chosen for MD simulation because they formed a His-Cys dyad. Results were analyzed on various parameters generated by MD simulation, such as RMSD, RMSF, and interaction studies.

The stability of the simulation system and protein backbone conformational perturbations due to simulation were measured using the RMSD value.34 From the RMSD graph, we predict that the protein backbone of the main protease was stable during the entire simulation time with a mean value of 2 Å. These lower fluctuations indicate the stable behavior of the protein backbone during the simulation. Comparing the docked complex RMSD trajectories, we inferred that W30 and W34 manifested fluctuations initially up to the 20 ns time scale. After 20 ns, the compound withacoagulin H did not reveal any significant RMSD fluctuations compared to the other compounds docked complex trajectories. The W30 docked complex trajectory was comparatively more stable with a mean value of 3 Å as compared to the other two compound trajectories. Moreover, the W30 trajectory is stable throughout the simulation, and changes in backbone were within the specified limits of RMSD (Figure 4(A)).

image
(A) Root mean square deviation (RMSD) plot of top three compounds as a function of time. RMSD evolution of protein shown on left Y-axis, Ligand RMSD shown on right Y-axis. Key for the respective color code is provided above of individual figure frame. (B) Root mean square fluctuations (RMSF) plot for top three compounds. B-factor shown on right Y-axis, and RMSF on left Y-axis. Alpha-helical and beta-strand regions are highlighted in red and blue backgrounds respectively. Protein residues that interact with the ligand are marked with green-colored vertical bars. withacoagulin H (W30), ajugin E (W32), and withacoagulin (W34)

Further, RMSF analysis was performed to understand the docked complexes' residue-wise fluctuation. RMSF behavior was generated using the docked complexes; a plot was made using the RMSF, B factor, and interactions (Figure 4(B)). Peaks indicate most protein fluctuations; it is understood that N- and C-terminal fluctuate more than any other part of the protein. Also, the secondary structure elements revealed that alpha helices and beta strands are rigid compared to the loops more fluctuated (Figure S2).

Different parameters were calculated for the ligand to get insights into the conformational strain the ligand undergoes to maintain the protein-bound complex. The RMSD plot of W30 shows peaks of around ~1.5 Å. The rGyr measured for extendedness of the ligand, equivalent to its moment of inertia, was almost similar for all three ligands. The numbers of internal hydrogen molecules with ligand were more in the case of W30 shown by intramolecular surface area (intraHB) than other ligands. The molecular surface area (MolSA) was around 368–376 Å in W30. The solvent-accessible surface area (SASA) and polar surface area were ranging from 120 to 240 Å (Figure 5(A)). Further, the protein-ligand interactions were also assessed in the form of H-bonds, water-bridges, Ionic interactions, hydrophobic interactions, and so forth, to check the stability of the docked complexes. Figure 5(B) plots these different intermolecular interactions made by each pocket residue with its bound ligand.

image
(A) Various measures of molecular dynamic simulations of top three compounds. Root mean square deviation (RMSD), radius of gyration (rGyr), intramolecular hydrogen bonds (intraHB), molecular surface area (MolSA), solvent accessible surface area (SASA), polar surface area (PSA). (B) Various intramolecular interactions made by top three compounds with Mpro, during molecular dynamics simulation. Color's code: Hydrogen bond (green), hydrophobic contacts (purple) and water-bridge (blue), withacoagulin H (W30), ajugin E (W32), and withacoagulin (W34)

3.6 Binding free energy calculations

The free energy of binding of W30, W32, and W34 with Mpro of SARS-CoV-2 was computed using MM/GBSA approach. The free energy calculations are widely used and commonly accepted to estimate the ligand-binding affinities in the protein system. The ΔG (MM/GBSA) free energy calculated was −63.463 KJ/mol for W30, −56.140 KJ/mol for W32, and −44.496 KJ/mol for W34. All the binding energy values were negative, indicating that all the compounds' hits with protein were favorable. These results showed that W30 possesses maximum negative energy (−63.463 KJ/mol) than other compounds (Table 5). Therefore, it can be stated that withacoagulin H with minimum binding energy and better binding affinity could prove to be a better inhibitor for Mpro of SARS-CoV-2.

TABLE 5. Computed ΔG (binding free energy change) of top three compounds against Mpro through MM/GBSA approach
S. no. Compound name MM/GBSA (KJ/mol)
1 Withacoagulin H (W30) −63.463
2 Ajugin E (W32) −56.140
3 Withacoagulin (W34) −44.496

W30 becomes the compound of choice among all the screened compounds as it possesses an excellent binding affinity compared to other compounds. Interaction pattern shows its interaction with Hys41-Cys145 dyad, in simulation studies also its interaction with substrate binding region was maintained. In RMSD analysis, the fluctuations were within the acceptable range, making it a probable candidate. Further, W30 showed the least binding energy on MM/GBSA analysis than W32 and W34. Hence, it becomes probable compound obtained from Withania to act as a likely inhibitor for the Mpro of SARS-CoV-2.

4 CONCLUSION

In conclusion, 73 withanolides and selected reference compounds were screened based on Lipinski's rule of five. The binding affinity of these screened compounds in the active sites of Mpro of SARS-CoV-2 was investigated further. Of these, nine molecules showed a significant docking score, even better than the reference compounds. ADME studies were also conducted for screened molecules to determine their drug-likeness and bioavailability. Since the molecules utilized here ADME was also determined, they can be the active lead for drug development of respective illness. The bioactive molecule from Withania showed good conformations with the Mpro of SARS-CoV-2, stable complex was formed; these complexes showed a more significant number of hydrogen bonds and van der Waals interactions than the complexes with reference compounds. Further, MD simulation studies confirm the stability of the compound W30 in the trajectory of the protein Mpro of SARS-CoV-2. MM/GBSA showed the least binding energy (−63.463 KJ/mol) for W30. Some more confirmatory analysis (both in-vitro and in-vivo) should be conducted for the screened molecules to ascertain their efficacy in the illness treatment.

CONFLICT OF INTEREST

The authors declare that they have no conflict of interest that could influence the work reported in this article.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.