|Year : 2016 | Volume
| Issue : 6 | Page : 445-453
A quantitative structure-activity relationship (QSAR) study of some diaryl urea derivatives of B-RAF inhibitors
Sedighe Sadeghian-Rizi1, Amirhossein Sakhteman2, Farshid Hassanzadeh3
1 Department of Medicinal Chemistry, School of Pharmacy and Pharmaceutical Sciences, Isfahan University of Medical Sciences, Isfahan, I.R. Iran
2 Department of Medicinal Chemistry, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, I.R. Iran
3 Department of Medicinal Chemistry and Novel Drug Delivery Systems Research Center, School of Pharmacy and Pharmaceutical Sciences, Isfahan University of Medical Sciences, Isfahan, I.R. Iran
|Date of Web Publication||8-Dec-2016|
Department of Medicinal Chemistry and Novel Drug Delivery Systems Research Center, School of Pharmacy and Pharmaceutical Sciences, Isfahan University of Medical Sciences, Isfahan
Source of Support: None, Conflict of Interest: None
In the current study, both ligand-based molecular docking and receptor-based quantitative structure activity relationships (QSAR) modeling were performed on 35 diaryl urea derivative inhibitors of V600E B-RAF. In this QSAR study, a linear (multiple linear regressions) and a nonlinear (partial least squares least squares support vector machine (PLS-LS-SVM)) were used and compared. The predictive quality of the QSAR models was tested for an external set of 31 compounds, randomly chosen out of 35 compounds. The results revealed the more predictive ability of PLS-LS-SVM in analysis of compounds with urea structure. The selected descriptors indicated that size, degree of branching, aromaticity, and polarizability affected the inhibition activity of these inhibitors. Furthermore, molecular docking was carried out to study the binding mode of the compounds. Docking analysis indicated some essential H-bonding and orientations of the molecules in the active site.
Keywords: QSAR; B-RAF inhibitors; Diaryl Urea; Docking; Multiple linear regressions; PLS-LS-SVM
|How to cite this article:|
Sadeghian-Rizi S, Sakhteman A, Hassanzadeh F. A quantitative structure-activity relationship (QSAR) study of some diaryl urea derivatives of B-RAF inhibitors. Res Pharma Sci 2016;11:445-53
|How to cite this URL:|
Sadeghian-Rizi S, Sakhteman A, Hassanzadeh F. A quantitative structure-activity relationship (QSAR) study of some diaryl urea derivatives of B-RAF inhibitors. Res Pharma Sci [serial online] 2016 [cited 2018 Aug 16];11:445-53. Available from: http://www.rpsjournal.net/text.asp?2016/11/6/445/194869
| Introduction|| |
RAF is among tyrosine kinase type receptors with serine/threonine kinase activity  . Its contribution is in mitogen activated protein kinase (MAPK) signaling pathway, which conducts signals from membrane-based receptors to the nucleus to mediate cell proliferation, differentiation, and survival  . Numerous cancers are related to the constitutive activation of the above signaling pathway  . B-RAF is one of the isoforms of the RAF kinase family that can regulate multiple downstream molecules and is also regulated by a variety of signaling molecules , . In about 7% of human cancers, the mutation of B-RAF has been detected ,,,, .
Some small molecule RAF kinase inhibitors by diverse scaffolds such as ureas, urea bioisosteres, imidazoles, benzamides, oxindoles, and aza-stilbenes have emerged in the recent past decades (11,12). But diaryl urea have been most extensively investigated because of sorafenib success in clinical for renal and hepatocellular carcinoma (13-15).
It is of great importance to introduce computer-aided drug design (CADD) approach to accelerate the time-consuming process of conventional drug discovery  . Quantitative structure activity relationships (QSAR) and molecular docking are two of the helpful methods of CADD for drug design and prediction of drug activity , . In QSAR large number of compounds are usually evaluated resulting in models that can predict the potency or activity of new or even non-synthesized compounds  .
When the three-dimensional structure of the target protein is available or can be modeled, molecular docking is often used for screening of compound libraries. Molecular docking predicts the conformation of a protein-ligand complex and calculates the binding affinity and investigates protein-ligand interactions (20,21).
In this study aimed to develop a robust and accurate model for the inhibitory activity of inhibitors in order to design potential B-RAF kinase inhibitor. We used different method to connect between structural parameters and B-RAF kinase inhibitory. These methods included multiple linear regressions (MLR) as linear method and partial least squares least squares support vector machine (PLS-LS-SVM) as a nonlinear approach. The latter method was used to carry out non-linear mappings on the physicochemical and biological descriptors of the molecules. In Support vector machines, nonlinear kernel based functions were used to solve both regression and classification problems. An advantage of this method is its reproducibility in data mapping  . Our aim in this study was to develop more examples of modeling based on this approach. Finally docking study was performed to suggest a binding mode for the inhibitors on B-RAF target.
| Materials and methods|| |
All calculations were made using an Intel Core i5 (CPU 2.6 GHZ) laptop running on windows 7 operating system. The m-files for MATLAB calculations were developed in our group.
Dataset and descriptor generation
The dataset used in this study was taken from the work of Menard, et al.  .Chemical structure of 35 studied compounds is provided in [Table 1]. This set contains diarylurea derivatives with inhibition potency against B-RAF kinase. The chemical structures of molecules were drawn and optimized by HyperChem 7.0 software (HyperCube Inc. USA). Energy minimizations for all compounds were performed by AM1 semi-empirical method with Polark-Ribiere algorithm until the root-mean-square gradient of 0.01 Kcal/mol was reached. The resulted geometries were transferred into Dragon program (developed by Milano Chemometrics and QSAR Group) to calculate descriptors. The physicochemical parameters were calculated utilizing HyperChem and Dragon softwares. Molecular descriptors computed using the Dragon software were constitutional, functional, topological, and geometrical groups. Hyperchem was used to obtain chemical descriptors such as Log P, hydration energy, polarizability, molar refractivity, molecular volume, and molecular surface area. Gaussian 98 W package was employed to use HF method at 6-31G* basis set for optimization and calculation of different quantum chemical descriptors including dipole moment, local charge on atoms, high-occupied molecular orbital (HOMO), and low-unoccupied molecular orbital (LUMO) energies. Indices of electronegativity, electrophylicity, hardness, and softness were calculated from the energies of HOMO and LUMO.
|Table 1: Chemical structure of B-RAF kinase inhibitor used in this study.|
Click here to view
The correlation of the calculated descriptors with each other was calculated and collinear descriptors (0.85) were specified. Those with higher correlation towards activity vector were retained and the others were eliminated.
Splitting of the matrix into calibration (train) and external (test) set was done using kenard-stone algorithm. Subsequently, stepwise multiple linear regressions and partial least square analysis were performed on the training set for MLR and support vector machine (SVM) methods. In case of PLS-LS-SVM, Gaussian RBF Kernel with two tuning parameters, γ (gama) and σ2 (sigma 2 ) were used. Latent variables of partial least squares (PLS) were entered to the models using stepwise method. The predicted residual error sum of squares (PRESS) was used to select the best models for validity evaluations.
Validation of QSAR models
To evaluate robustness of the models in the calibration subset, leave-one-outcross validation method was used. PRESS and Q 2 LOO values were thereafter calculated according to the equations 1 and 2.
where, SSY is the sum of squares of the response values. The models obtained from calibration set were then used to predict the response values of the test set. PRESS and R 2 of test set were calculated as described above.
Finally, chance correlation test was used by scrambling the response vectors for several times and recalculating error metrics (PRESS and R 2 ). Applicability domain for the best model was also done using Williams plot. For this purpose standard residuals were plotted against leverage for the data set  . Other parameters of validation including K and K' were also provided for both modeling approaches.
Docking was conducted using AutoDock 4.2 software package. For this purpose, the crystal structure of B-RAF was retrieved from Protein Data Bank (PDB code: 1UWH). Grid and docking parameter files was generated by AutoDockTools version 6.5. For the preparation of protein, the original ligand and water molecules were removed from the coordinates and Kollman charges together with polar hydrogens were added. For ligands, Gasteiger charges were assigned and non-polar hydrogens were merged. The grid map of 60 × 60 × 60 points in x, y, and z directions was centered on the binding site (Cys 531) with a spacing of 0.375 Å. Lamarckian genetic algorithm (LGA) was used for docking with the following settings: a maximum number of 25,000,000 energy evaluations, an initial population of 300, a maximum number of 27,000 generations, and 100 independent docking runs. Other parameters of .dpf file were remained as default. All parameters of docking were validated based on re-docking of the cognate ligand (BAY 43-9006) and measuring the root-mean-square deviation (RMSD) value between the best pose of the docked structure with its conformation at crystal state. The RMSD value of less than 2 Ε can validate the protocol of docking. Results were ranked according to the binding free energy and mode of interactions. Ligplot program was used to visualize hydrophobic and hydrogen-bonding interactions between the ligand and receptor  .
| Results|| |
Sixty two remained descriptors constructed an input file for stepwise selection based on MLR analysis. Comparison of statistical parameters was employed to result the best model as seen in the models bellow [Table 2]. The numbers of training set compounds in the first and second model were 31 and 28, respectively.
pIC 50 = -34.171 (± 5.597) - 0.006 (± 0.002). αzz - 0.091 (± 0.009). G (N...N) + 3.260 (± 0.379). TI2 - 0.110 (± 0.016). DISPm + 22.523 (± 10.470). PW3 + 7.682 (± 1.542). BLI + 50.35 (± 17.976). PW4 + 2.055 (± 0.750). (PJI3). (3)
where, pIC50 is -log (IC50), N = 31, F = 26.102, R  = 0.905, Se = 0.3131.
pIC 50 = -9.576 (± 8.537) + 0.062 (± 0.12). G (N...N) + 1.981 (± 0.891). DISPm + 66.775 (± 12.261). BLI - 34.512 (± 10.247). (FDI). (4)
where, N = 28, F = 13.441, R  = 0.837, Se = 0.5144.
In this equation, the values in the parentheses show the standard deviation of the coefficients. N, R, Se, and F are numbers of components, coefficient, standard error of regression, and Fisher's F-ratio, respectively.
As a result for PLS-LS-SVM, the first three latent variables resulted in a model with reasonable R  values (0.93 and 0.84) .
Internal validation of the models was revealed by means of leave-one-out cross-validation. The results for cross-validation studies of both modeling approaches are listed in [Table 2].
The more robustness of LS-SVM has been therefore shown based on both PRESS and R  values. Furthermore, LS-SVM was more resistant to the number of compounds in training and test set with respect to MLR. Chance correlation test by means of Y-permutation ensured that the presented models were not achieved randomly.
As depicted in [Figure 1]a, the molecules were not more than the cut-off h value of leverage. Applicability domain studies for the model based on PLS-LS-SVM have revealed that the model can be applied for structurally relevant compounds.
|Figure 1. (a) applicability domain based on williams plot, (b) interaction model of docked inhibitor 32 on B-RAF kinase|
Click here to view
Docking analysis revealed important H-bonding interactions between urea moiety with Glu500 and Asp593 residues in allosteric pocket. H-bonding between tow nitrogen atoms of imidazopyridine moiety with Cys531 residue of hinge binding region was also significant.
| Discussion|| |
R  , Q  , and F ratio of two models were compared, model 1 revealed the best statistical parameters R  = 0.905, Q  = 0.8145, F = 26.102 [Table 2]. The more power of LS-SVM compared to MLR has been verified based on external validation including K and K' as displayed in [Table 3]. SVM model has passed the validation criteria and showed to be a more powerful modeling approach.
The best MLR model contained 8 descriptors including 4 topological parameters (TI2, BLT, PW3, PW4), 3 geometrical parameters (G(N..N), PJI3, DISPm), and one chemical parameter (αzz). Based on descriptor coefficients, increasing αzz , G (N...N), and DISPm can decrease pIC 50 and increasing TI2, PJI3, BLI, PW3, and PW4 can increase the extent of pIC 50 of the structures.
TI2 is a topological descriptor belonging to the Mohar indices which can affect solubility. TIs can be sensitive to size, shape, symmetry, branching, and cyclicity.
G (N...N) is a 3D geometrical descriptor relating to the geometrical distance between nitrogen and nitrogen in molecules with more than one N-atom.
DISPm is related to the molecular geometry as well as molecular size and describes the conformational features of the molecules. Negative contribution of DISPm revealed that rigidity of molecules can increase inhibitory activity due to the absence of long and flexible substituents and the presence of unsaturated bonds. Kier benzene-likeliness index (BLI) is an aromaticity index calculated from molecular topology. Aromaticity might influence the transport or distribution of the chemical across cell membrane.
PW3 and PW4 as the path/walk count ratio are independent of molecular size. These descriptors can be considered as shape descriptors whose value increases with increased branching in the vertices.
PJI3 (Petitjean shape indices), a first Petitjean shape index, is a topological anisometry descriptor. αzz is the polarizability along zz axis of the molecule which represents how readily the molecular charge distribution is distorted by external electric fields. Almost 7 descriptors appeared in the SMLR equation belongs to the geometrical and topological groups of descriptors.
In Support vector machines, nonlinear kernel based functions are used to solve both regression and classification problems. To reduce the complexity of optimization, least squares support vector machines has emerged in the recent years. In the following regression equation, ω and b are the slope and the intercept of the equation, respectively.
The function φ(xi) is used to map the input space into a higher dimensional space.
investigate the role of descriptors on the first three variables, correlation of all descriptors with the loading values of the three variables were calculated. It was observed that the factors 1, 2, and 3 have higher loadings for TE, Ms, and FDI respectively. This result was found in both modes with 28 and 31 compounds. Total energy (TE) of a molecular system is among quantum chemical descriptors and is the sum of the total electronic energy (Eee) and the energy of intern clear repulsion (Enr).
The mean electro topological state (Ms) is among constitutional descriptor which is calculated by dividing Ss by the number of nonhydrogen atoms (nSK):
Folding degree index (FDI) is a geometrical descriptor. Negative contribution of TE revealed that decrease in total energy for molecular systems can increase their inhibitory activity. Whereas positive values of Ms and FDI descriptors show that the indicated descriptor contribute positive effect on the values of pIC50 . According to experimental QSAR results and docking binding energies performed in this study, the modification of the phenyl middle ring using hydrophobic group such as halogen, thiomethyl substitution (2,6 compounds) or replacement by naphthyl group (24-35, 22, 19, and 5 compounds) can enhance the inhibitory activity due to an increase in the potency of binding and this result was in accord with the previously reported SAR result  .
The binding energy of all compounds is reported in [Table 4] and the compound with naphthyl moiety have the best binding energy. In [Figure 1]b, orientation of the interactions for compound 32 with the lowest binding energy is depicted.
| Conclusion|| |
In this work, molecular docking and structure-based QSAR method were performed to explore structural features, orientations, and conformations of some inhibitors interacting with B-RAF kinase. A structure-based QSAR model using MLR and LS-SVM were developed. The result indicated that the performance of LS-SVM model was better than the MLR model.
The selected descriptors indicated that size, degree of branching, aromaticity, and polarizability affected the inhibition activity of these inhibitors. Docking analysis indicated some essential H-bonding and orientations in active site. This information will guide us through design of novel B-RAF kinase inhibitors.
| Acknowledgements|| |
The content of this paper is extracted from the PhD thesis (No. 394159) which was financially supported by the Isfahan University of Medical Sciences, Isfahan, Iran.
| References|| |
Leicht DT, Balan V, Kaplun A, Singh-Gupta V, Kaplun L, Dobson M, et al.
RAF kinases: function, regulation and role in human cancer. Biochim Biophys Acta. 2007;1773(8):1196-1212.
Peyssonnaux C, Eychene A. The RAF/MEK/ERK pathway: new concepts of activation. Biol Cell. 2001;93(1-2):53-62.
Malumbres M, Barbacid M. RAS oncogenes: the first 30 years. Nat Rev Cancer. 2003;3(6):459-465.
Roskoski R Jr. RAF protein-serine/threonine kinases: Structure and regulation. Biochem Biophys Res Commun. 2010;399(3):313-317.
Hagemann C, Rapp UR. Isotype-specific functions of RAF kinases. Exp Cell Res. 1999;253(1):34-46.
Huang T, Zhuge J, Zhang WW. Sensitive detection of BRAFV600E mutation byAmplification Refractory Mutation System (ARMS)-PCR. Biomark Res. 2013;1:3.
Lee B, Mukhi N, Liu D. Current management and novel agents for malignant melanoma. J Hematol Oncol. 2012;5:3.
Davies H, Bignell GR, Cox C, Stephens P, Edkins S, Clegg S, et al
. Mutations of the BRAF gene in human cancer. Nature. 2002;417(6892):949-954.
Domingo E, Laiho P, Ollikainen M, Pinto M, Wang L, French AJ, et al
. BRAF screening as a low-cost effective strategy for simplifying HNPCC genetic testing. J Med Genet. 2004;41(9):664-668.
Rahman MA, Salajegheh A, Smith RA, Lam AK. B-RAF mutation: A key player in molecular biology of cancer. Exp Mol Pathol. 2013;95(3):336-342.
Li HF, Lu T, Zhu T, Jiang YJ, Rao SS, Hu LY, et al.
Virtual screening for RAF-1 kinase inhibitors based on pharmacophore model of substituted ureas. Eur J Med Chem. 2009;44(3):1240-1249.
Wong KK. Recent developments in anti-cancer agents targeting the Ras/RAF/ MEK/ERK Pathway. Recent Pat Anticancer Drug Discov. 2009;4(1):28-35.
Pratilas CA, Solit DB. Targeting the mitogen-activated protein kinase pathway: physiological feedback and drug response. Clin Cancer Res. 2010;16(13):3329-3334.
Stenner F, Chastonay R, Liewen H, Haile SR, Cathomas R, Rothermundt C, et al.
A pooled analysis of sequential therapies with soRAFenib and sunitinib in metastatic renal cell carcinoma. Oncology. 2012;82(6):333-340.
Wu J, Zhu AX. Targeting insulin-like growth factor axis in hepatocellular carcinoma. J Hematol Oncol. 2011;4:30.
Zhang S. Computer-Aided Drug Discovery and Development. Methods Mol Biol. 2011;716:23-38.
Hansch C, Hoekman D, Gao H. Comparative QSAR: toward a deeper understanding of chemicobiological interactions. Chem Rev. 1996;96(3):1045-1076.
Foroumadi A, Sakhteman A, Sharifzadeh Z, Mohammadhosseini N, Hemmateenejad B, Moshafi MH, et al
. Synthesis, antituberculosis activity and QSAR study of some novel 2-(nitroaryl)-5-(nitrobenzylsulfinyl and sulfonyl)-1,3,4-thiadiazolederivatives. Daru. 2007;15(4):218-226.
Pirard B. Computational methods for the identification and optimisation of high quality leads. Comb Chem High Throughput Screen.2004;7(4):271-280.
Cavasotto CN, Orry AJ. Ligand docking and structure-based virtual screening in drug discovery. Curr Top Med Chem. 2007;7(10):1006-1014.
Kitchen DB, Decornez H, Furr JR, Bajorath J. Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov. 2004;3(11):935-949.
Khoshneviszadeh M, Sakhteman A. Exploring quantitative structure-activity relationship (QSAR) models for some biologically active catechol structures using PC-LS-SVM and PC-ANFIS. J Appl Biol Chem. 2016;59(3):433-441.
Menard D, Niculescu-Duvaz I, Dijkstra HP, Niculescu-Duvaz D, Suijkerbuijk BM, Zambon A, et al
. Novel potent BRAF inhibitors: toward 1 nM compounds through optimization of the central phenyl ring. J Med Chem. 2009;52(13):3881-3891.
Tetko IV, Sushko I, Pandey AK, Zhu H, Tropsha A, Papa E, et al
. Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection. J Chem Inf Model. 2008;48(9):1733-1746.
Wallace AC, Laskowski RA, Thornton JM. LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions. Protein Eng. 1995;8:127-134.
[Table 1], [Table 2], [Table 3], [Table 4]