qsar analysis of n myristoyltransferase inhibitors antifungal

ORI GINAL RESEARCH

QSAR analysis of N-myristoyltransferase inhibitors:antifungal activity of benzofurans

Hemantkumar S. Deokar ÆPurushottamachar Puranik Æ Vithal M. Kulkarni

Received: 8 April 2008 / Accepted: 18 June 2008

� Birkhauser Boston 2008

Abstract Benzofurans are a class of antifungal agents reported to act by selective

inhibition of the N-myristoyltransferase (Nmt) enzyme in fungal cells. A three-

dimensional quantitative structure–activity relationship (3D-QSAR) was performed

on a series of 29 molecules to find correlation between various physicochemical

descriptors and Nmt inhibition. QSAR equations were evaluated using a training set

24 molecules and an external test set of 5 molecules. The statistical quality of the

QSAR models was evaluated using the parameters r2, rcv2 , and the rpred

2 measure. The

results obtained indicated that the enzyme inhibitory activity of benzofuran ana-

logues strongly depends on structural factors as expressed by hydrogen-bond

acceptors and spatial factors as expressed by the principle moment of inertia in the X(PMI_X) and Y directions (PMI_Y).

Keywords 3D-QSAR � GFA � Antifungal � N-myristoyltransferase �Benzofuran analogues

Introduction

Invasive fungal infections have increased dramatically in recent years to become an

important cause of morbidity and mortality in patients. Currently available

antifungal drugs for such infections essentially have three molecular targets:

H. S. Deokar � V. M. Kulkarni (&)

Department of Pharmaceutical Chemistry, Poona College of Pharmacy, Bharati Vidyapeeth

University, Pune 411038, India

e-mail: [email protected]

P. Puranik

Department of Pharmacology and Experimental Therapeutics,

University of Maryland School of Medicine, 685 West Baltimore Street,

Baltimore, MD 21201-1559, USA

Med Chem Res

DOI 10.1007/s00044-008-9120-5

MEDICINALCHEMISTRYRESEARCH

cytochrome P-450-dependent lanosterol 14a-demethylase (CYP-P45014aDM), ergos-

terol, and 1,3-beta-glucan synthase. The first is a fungistatic target vulnerable to

resistance development; the second, while a fungicidal target, is not sufficiently

different from the host to ensure high selectivity; and the third, a fungistatic

(Aspergillus) or fungicidal (Candida) target, has limited activity spectrum and

potential host toxicity that might preclude dose escalation. Myristoyl CoA:protein

N-myristoyltransferase (Nmt; EC 2.1.3.97) is a cytosolic monomeric enzyme that

catalyzes the transfer of the cellular fatty acid myristate (C14.0) from myristoyl-

CoA to the N-terminal glycine amine of a variety of eukaryotic proteins (Johnson

et al., 1994) Genetic studies have established that Nmt is essential for vegetative

growth and survival of pathogenic fungi such as Sacchromyces cerevisiea. Nmt has

also been proven to be essential for the viability of fungi, including the medically

important Candida albicans (C. albicans) and Cryptococcus neoformans (Duronio

et al., 1989; Weinberg et al., 1995; Lodge et al., 1994). Human Nmt has also been

cloned and characterized. C. albicans Nmt has 451 amino acid residues, with a

sequence identity of 45% to the human Nmt enzyme. Clear difference in the

peptide–substrate specificity between fungal and human Nmts has been exploited,

and Nmt has been identified as a potential chemotherapeutic target for antifungal

agents (Wilcox et al., 1987; Johnson et al., 1994). Hence Nmt is an attractive target

for the design and development of novel antifungal agents with a new mode of

action without disrupting host Nmt. Because of its novel mechanism of action, Nmt

inhibitors are expected to have advantages over azole antifungals in terms of

activity against azole-resistant fungal strain and lack of drug–drug interactions,

which are drawbacks of azole antifungal agents (Benedetti and Bani, 1999).

Although several peptidomimetic inhibitors of C. albicans Nmt (Ca Nmt) have been

reported, their antifungal effect is only marginal (Devadas et al., 1997; Nagarajan

et al., 1997). Hence, there is a need for novel nonpeptidic enzyme inhibitors.

In our laboratory, molecular modeling studies have been performed on various

antifungal targets like such as P45014aDM (Talele et al., 1999) and dihydrofolate

reductase (Gokhale and Kulkarni, 2000b) enzyme for the rational design of new

antifungal agents. We have generated a 3D pharmacophore model using peptidic

Nmt inhibitors and successfully utilized it for the design of nonpeptidic inhibitors

(Karki and Kulkarni, 2001b, 2001c). Recently, Tatsuo et al. reported a novel series

of benzofuran analogs as nonpeptidic Nmt inhibitors (Masubuchi et al., 2001;

Ebiike et al., 2002; Sogabe et al., 2000). Previously we have reported 3D-QSAR on

series of benzofuran analogs as antifungals using comparative molecular field

analysis (CoMFA) (Purushottamachar and Kulkarni, 2003). In order to understand

better the relationship between structure and activity, we report herein a QSAR

using genetic function approximation (GFA), as developed by Rogers (Rogers and

Hopfinger, 1994) for the analysis of fungal N-myristoyltransferase active against C.albicans. GFA is a genetic algorithm which generates a population of equations

rather than a single equation in correlating biological activity with physiochemical

descriptors. GFA considers a combination of Friedman’s multivariate adaptive

regression splines (MARS) algorithm with Holland’s genetic algorithm to evolve a

population that best fits the training data set (Kawakami et al., 1996). This is done as

follows:

Med Chem Res

(1) The initial population of equations is generated by a random choice of

descriptors, and the fitness of each equation is scored by a lack-of-fit (LOF)

measure, LOF ¼ LSE

1� cþd�pÞ=m½ �f g2 where LSE is the least square error, c is the

function in the models, d is the smoothing parameter which controls the

number of features contained in all terms of the models, p is the number of

features contained in all basis functions, and m is the number of samples in the

training set.

(2) A pair from the population of equations is chosen at random and crossovers are

performed and progeny equations are generated.

(3) The fitness of each progeny equation is assessed by the LOF measure.

(4) If the fitness of the new progeny equation is better, then it is preserved.

GFA models provide useful information such as the relevance of particular

descriptors to the model and activity prediction; we have been using GFA methods

effectively and have recently applied them to generate QSAR models for antifungal,

antibacterial, antitubercular, anticancer, and antidiabetic agents (Gokhale and

Kulkarni, 2000a; Karki and Kulkarni, 2001a; Kharkar et al., 2002; Wagh et al.,2006; Sachan et al., 2007). A combination of this robust statistical technique

coupled with the use of different descriptors has resulted in the successful prediction

of novel antifungals (Chandavarkar et al., 2003; Chandavarkar et al., 2005).

Results

QSAR models were generated using a training of set 24 molecules (molecules 1–24,

Table 1). Test set molecules (molecules 25–29, Table 1) with regularly distributed

biological activities were used to assess the predictive ability of the generated

QSAR models. Biological activity was used in terms of log (1/IC50) against Ca Nmt.All the molecules used in the study were subjected to conformational search using a

random sampling method, and the lowest-energy conformers were aligned using a

minimum common substructure. Molecules were superimposed onto the lowest-

energy conformer of molecule 23, having the highest biological activity (Fig. 4).

The alignment resulted in the orientation of the molecules in such a way that the

substituent at C2 of the benzofuran ring system oriented towards the X-axis and the

C4-substituted alkylamine group oriented to the Y-axis.During the generation of QSAR models, it was observed that 20,000 crossovers

with a smoothing factor of d = 1.0 resulted in optimum internal and external

predictivity as assessed from the LOF value, rcv2 , and variable usage graphs. Thus

the number of crossovers was set to 20,000.

Since equations generated without restricting the number of the descriptors

(infinite chain length) are usually not relevant, interpretation of QSAR model with

more terms becomes difficult in designing newer molecules. Hence equations with

not more than five terms including a constant were generated.

A total of 32 descriptors (Table 2) were used for the QSAR model generation.

All the statistically significant QSAR models are shown in Table 3. A set of

generated QSAR equations were evaluated for their predictive ability. Observation

Med Chem Res

Table 1 Structure and activities of the molecules in the training (1–24) and test (25–29) sets

O O

O

R

Compd R Observed activity Predicted

activity

Residuals

IC50 Log I/IC50

1 O(CH2)2NHC(CH3)3 50 -1.6989 -0.870 -0.8289

2 O(CH2)2OHCH2NHCH(CH3)2 0.98 0.0088 -0.163 0.1718

3 O(CH2)3NHC(CH3)3 1.7 -0.2304 -0.821 0.5906

4 O(CH2)4NHC(CH3)3 4.4 -0.6434 -1.378 0.7346

5

OCH2CH2CH2 N

1.6 -0.2041 -0.713 0.5089

6 O(CH2)5NHC(CH3)3 1.5 -1.1760 -1.664 0.4880

25 O(CH2)2OHCH2NHCH(CH3)2 4.4 -0.6434 -1.369 0.7256

26 O(CH2)2OHCH2NHC(CH3)3 1.2 -0.0792 0.209 -0.2882

NH

X

O

R1O

R

Compd R R1 X Observed activity Predicted

activity

Residuals

IC50 Log I/IC50

7 CH2CH2Ph CH3 N 1.2 -0.0791 -0.142 0.0629

8 CONHPh CH3 N 2.2 -0.3424 0.494 -0.8364

9 CH2SPh CH3 N 0.62 0.2076 0.846 -0.6384

10 COOC2H5 c-C3H5 N 4.4 -0.6434 -0.874 0.2300

11 COOC2H5 CH3 CH 3.3 -0.5185 0.178 -0.6965

12 COOC2H5 CH3 N 1.0 1.0000 0.594 0.4060

21 COOC2H5 Et N 10 -1.0000 0.605 -1.6050

22 COOC2H5 i-propyl N 83 -1.9190 -0.929 -0.9900

29 COOC2H5 H N 79 1.8976 -0.775 -1.1226

Med Chem Res

from variable usage indicated that PMI_X, HbondAcc, Density, AlogP, PMI_Y,

MolRef, RotlBonds, MW, Apol, and LUMO contributed more significantly than all

other descriptors. The frequency distribution of the variables used is shown in

Table 4. The variable terms which appeared in the equation clearly indicate that

Table 1 continued

Compd R R1 X Observed activity Predicted

activity

Residuals

IC50 Log I/IC50

23

O

N

N

CH3 N 0.001 3.0000 1.797 1.2030

24

OO

CH3 N 0.003 1.5228 1.596 -0.0732

NH

N

O

O

O R

Compd R Observed activity Predicted

activity

Residuals

IC50 Log I/IC50

13 2-Cynophenyl 0.017 1.7695 1.328 0.4415

14 Phenyl 0.072 1.1426 0.510 0.6326

15 3-Flurophenyl 0.110 0.9586 1.845 -0.8864

16 2,3-Diflurophenyl 0.0037 2.4317 1.870 0.5670

17 2-Flurophenyl 0.0083 2.0809 1.334 0.7469

18 2,3,4-Triflurophenyl 0.0057 2.2441 2.295 -0.0509

19 2,4-Diflurophenyl 0.0075 2.1249 2.133 -0.0081

20 4-Flurophenyl 0.0052 2.2839 1.155 1.1289

27 4-Chlorophenyl 0.073 1.366 1.343 -0.2069

28 4-Cynophenyl 0.0094 2.0268 1.397 0.6298

Med Chem Res

structural and spatial descriptors played an important role in the biological activity

of these molecules. The selection of the best model was based on the values of rcv2 ,

LOF, and rpred2 .

Equation 3 shows better internal predictivity (rcv2 = 0.635), external predictivity

(rpred2 = 0.733), and bootstrap r2 (Bsr2 = 0.731) than all the other equation. The

variable usage graph (Table 4) also indicates the use of the descriptors contained in

Eq. 3. Hence, this equation was selected as the best QSAR model to explain the

Table 2 Descriptors used in the present study

No Descriptor Type Description

1 Vm Spatial Molecular volumea

2 Area Spatial Molecular surface areaa

3 Density Spatial Molecular densitya

4 RadOfGyration Spatial Radius of gyrationa

5 PMI–mag Spatial Principle moment of inertiaa

6 PMI–X Spatial Principle moment of inertia X-component

7 PMI–Y Spatial Principle moment of inertia Y-component

8 PMI–Z Spatial Principle moment of inertia Z-component

9 MW Structural Molecular weighta

10 Rotlbonds Structural Number of rotatable bondsa

11 Hbond acceptor Structural Number of hydrogen bond acceptorsa

12 Hbond donor Structural Number of hydrogen bond donorsa

13 AlogP Thermodynamic Logarithm of partition coefficienta

14 MolRef Thermodynamic Molar refractivitya

15 Dipole–mag Electronic Dipole momenta

16 Dipole–X Electronic Dipole moment–X-component

17 Dipole–Y Electronic Dipole moment–Y-component

18 Dipole–Z Electronic Dipole moment–Z-component

19 Charge Electronic Sum of partial chargesa

20 Apol Electronic Sum of atomic polarizabilitiesa

21 HOMO Electronic Highest occupied molecular orbital energy

22 LUMO Electronic Lowest unoccupied molecular orbital

energy

23 Sr Electronic Superdelocalizability

24 Foct Thermodynamic Desolvation free energy for octanol

25 Fh2o Thermodynamic Desolvation free energy for water

26 Hf Thermodynamic Heat of formation

27 DIFFV MSA Difference volume

28 COSV MSA Common overlap steric volume

29 Fo MSA Common overlap volume ratio

30 NCOSV MSA Non-common overlap steric volume

31 Shape RMS MSA RMS to shape reference

32 SR Vol MSA Volume of shape reference molecule

a Default descriptor

Med Chem Res

observed variation in the biological activity of the molecules under study. Terms

involving structural and spatial descriptors contribute to this equation and explain

about 73% variance in the biological activity. The observed and calculated values

for the training and test set molecules are given in Table 1. The plot of calculated

activity values for the training set molecules and the predicted versus the observed

activity values for the test set molecules are shown in Figs. 1 and 2, respectively,

Equations 3, 4, 7, and 8 (Table 3) contain reasonable statistical terms, but Eq. 3 in

Table 3 QSAR equations generated using genetic function approximation for the training set

No. Equation LOF r2 arcv2 bBS r2 F-value rpred

2

Model 1 BA = - 2.7589 + 0.0092 PMI_X

– 0.3074 Rotlbonds + 0.4388 Hbond Acc

0.914 0.736 0.630 0.737 18.569 0.682

Model 2 BA = – 1.8205 – 0.5509 Rotlbonds

+ 0.2954 Fh2O + 0.4177Dipole–X

0.914 0.735 0.558 0.738 18.526 0.239

Model 3 BA = - 4.2699 + 0.0070 PMI–X

– 0.0013 PMI–Y + 0.4998 Hbond Acc

0.934 0.730 0.635 0.731 18.013 0.773

Model 4 BA = - 1.4397 + 0.0093PMI_X

– 0.4557 Rotlbonds – 0.1098 Fh2O

0.994 0.727 0.610 0.729 17.763 0.719

Model 5 BA = - 5.2823 + 1.2823 LUMO

– .0019 PMI_Y + 0.8235 Hbond Acc

0.953 0.725 0.599 0.702 17.537 0.742

Model 6 BA = - 21.6399 + 0.0061 PMI_X

+ 0.4200 Sr + 17.9082 Density

0.971 0.719 0.599 0.721 17.084 0.019

Model 7 BA = - 0.6065 + 0.0112 PMI_X

0.0635 MolRef – 0.1123 Foct

0.979 0.717 0.617 0.718 16.879 0.726

Model 8 BA = - 3.3993 + 0.0118 PMI_X

– 0.0002 Apol + 0.3438 HbondAcc

0.986 0.715 0.608 0.716 16.719 0.753

Model 9 BA = - 6.1863 + 0.0088 PMI_X

+ 0.4585 HbondAcc + 0.320 Sr

0.990 0.714 0.593 0.715 16.617 0.482

Model 10 BA = - 0.9557 + 0.0077 PMI_X

– 0.4638 Rotlbonds – 0.1026 Foct

0.992 0.713 0.594 0.715 16.582 0.755

BA in all the equations represents log (1/IC50) in terms of lM, a rcv2 is cross-validated r2, b BS r2 is the

bootstrapping r2, and rpred2 is the predictive r2

Table 4 Frequency distribution

of the variablesDescriptor Frequency

PMI_X 85

HbondAcc 40

Density 22

AlogP 19

PMI_Y 17

MolRef 16

RotlBond 14

MW 11

Apol 10

LUMO 10

Med Chem Res

comparison with the other three equations can be characterized by slightly higher rcv2

and rpred2 values. Therefore in the present study Eq. 3 was chosen as a model that

best describes the inhibition of Nmt enzyme of C. albicans by benzofuran analogs.

The statistical significance of the relationship between enzyme inhibition and

physicochemical descriptors was further demonstrated by the randomization

procedure. The r2 values for the 50 trials using permuted data are shown in

Fig. 3. The mean of r2 for 49 randomization tests was used to assess the reliability

of the model; its value was 0.269. It is evident that the r2 value of the original model

is much higher than that of the trials with the permuted data, suggesting robustness

of the model.

Discussion

The C. albicans Nmt inhibition by the series of benzofuran antifungals is thus a

function of PMI_X, PMI_Y, and HbondAcc. PMI_X signifies the principle moment

-2

-1

0

0.5

1

1.5

2

-2 -1 0 1 2

Cal

cula

ted

act

ivit

y

-0.5

-1.5

-2.5 -1.5 -0.5 0.5 1.5 2.5

Observed activity

Fig. 2 Plot of predicted versus observed activity of the molecules of the test set

-2

-1

0

1

2

-2 -1 0 1 2 3 4

Observed activity

2.5

1.5

0.5

-0.5

-1.5

Cal

cula

ted

act

ivit

y

Fig. 1 Plot of calculated versus observed activity of the molecules of the training set

Med Chem Res

of inertia of the molecule along X-axis. Positive correlation indicates that the

component of the molecule oriented along the X-axis is significant for better

activity. Similarly, the negative correlation of PMI_Y indicates that the component

of the molecule oriented along the Y-axis should not be bulky. HbondAcc is a

structural parameter and the positive correlation of this term with activity indicates

that, the higher the number of H-bond acceptor groups in the molecule, the more

active it will be. From Table 4 it is evident that the descriptors PMI_X, PMI_Y, and

HbondAcc have been used maximum times. This indicates that these three

descriptors have higher relevance to the prediction of activity.

PMI_X is a spatial descriptor and indicates the orientation and conformational

rigidity of the molecule; its value depends on the total mass distribution within the

Fig. 4 Alignment of the training set molecules

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49

r2 o

f G

FA

mo

del

s

Trials

Fig. 3 GFA randomized tests. The first bar shows the r2 value for the model based on the actual data.The other 49 bars show the r2 value for the model based on permuted data

Med Chem Res

molecule. Thus the orientation of the substructure of the molecule at C2 position

with its hydrogen bonding acceptor functional group is important for enzyme

inhibition. In the most active molecule (23) the imidazole ring nitrogen is accessible

for hydrogen-bond interaction with active site residues. In the case of molecules 13–20, 22, 27, and 28, the C2 substituent with ether linkage are slightly away from the

direction of the X-axis. This explains the lower activity of these molecules as

compared with 23. Molecule 8, with amide linkage, and molecules 1–6, 25, and 26,

containing ethyl ester, are highly inactive because these groups are oriented away

from the X-axis and also because of the lower hydrogen-bonding capacity of the

amide and ester linkage than the ether functional group.

PMI_Y indicates the conformational and rigidity of the substituent at the C4

position of benzofuran ring, which is important for biological activity. As PMI also

represents the mass distribution of the functional group oriented in the direction of

the Y-axis within the molecule, its negative correlation hints that the substitution at

C4 oxygen should be rigid, not bulky, and optimal, as in molecule 23. Molecule 23

shows that an increase in the space between the C4 oxygen and the amine of the

alkyl group will be detrimental to activity. For example, molecules 4 and 5, which

contain longer alkyl side chains, are inactive. Therefore this indicates the optimum

spacer length to be three. It is observed that molecules containing rigid functional

groups such as 3-pyridylmethyl on the nitrogen of the alkyl amine at the C4 position

of the benzofuran ring are more active than that of nonrigid functional groups such

as isopropyl and tertiary butyl. This may be due to the nonrigid nature of the

isopropyl and tertiary butyl groups, which makes the alkylamino side chain orient

away from the Y-axis, thereby making the molecules lose vital interaction at the

active site of enzyme. The observed variance in the inhibitory activity among 3-

pyridylmethyl substituted molecules must be due to the influence of other

descriptors.

Positive correlation of HbondAcc with the enzyme inhibitory activity indicates

that molecules should have more hydrogen acceptor groups to show potent activity.

Functional groups such as ether, ring nitrogen (tertiary ‘N’), ring oxygen, etc. are

known to act by forming hydrogen bonds with the donor residues in the enzyme

active site. Molecules with the above functional groups are highly active (13–20, 22,

23, 24, 27, and 28), whereas molecules 1–12, 21, 22, 25, 26, and 29 with few

hydrogen bond acceptors groups at the C4 and C2 positions than the most active

molecule are less active. In most of the active molecules ether linkage oxygen has

more hydrogen bond acceptors capacity than thioether, ester oxygen, and amide

carbonyl functional group of inactive molecules (hydrogen bond acceptors capacity:

–N= [ –O– [ COO [ –S–[ CONH [ –C–). Additionally in active molecules, the

pyridyl nitrogen at the C4 side chain also contributes towards activity by its

hydrogen-bond interaction with the enzyme active site (enzyme–ligand interaction,

explained at the end of discussion). Among the inactive molecules, molecules

containing ester oxygen exhibit better activity than those of the thioether-, amide-,

and alkyl linkage-containing molecules 7, 8, and 9, respectively. The importance of

3-pyridylmethylgroups as an additional hydrogen-bond acceptor functional group is

explained by examining molecules 1–6, 12, 25, and 26. In all of these molecules the

ester functional group at the C2 position is common. Whereas the observed potent

Med Chem Res

activity of 12 is due to the presence of the 3-pyridylmethyl group. The moderate

activity of the molecule 24, even though it contains ether and 3-pyridylmethyl

groups, may be due to the inaccessibility of the ether functional group to the enzyme

active site.

Thus the overall activity will depend on the combination of the three descriptors.

The results are complimentary to the preliminary structure–activity relationship

(SAR) reported by Tatsuo et al. (Masubuchi et al., 2001; Ebiike et al., 2002; Sogabe

et al., 2000).

The X-ray crystal structure of Ca Nmt (PDB: 1IYL) with the nonpeptidic

inhibitor (23) shows that the pocket of the inhibitor-binding site is composed of

aromatic residues Tyr107, Phe115, Phe117, Tyr119, Phe196, Tyr225, Phe240,

Tyr335, Phe339, and Tyr354. The benzofuran ring is stacked parallel to Tyr225 and

perpendicular to Tyr354, in the proximity Phe117 and Phe339. These residues are

presumed to be important not only for the architecture of the binding site but also

for inhibitor binding. His227 is located in the proximity of the oxygen atom of the

benzofuran ring. The hydrogen-bond interaction of the benzofuran ring with the

imidazole ring of His227 and the geometry of the benzofuran ring may contribute to

the inhibitor binding.

The secondary amine group of the substituent at position C4 of the benzofuran

ring makes a hydrogen bond with the C-terminal carboxylate of Leu451. Thus its

orientation accounts for structure activity. Tyr119 is in the proximity of the pyridine

ring. The change of the substituent from a t-butyl group to a 3-pyridylmethyl group

improves the inhibitory activity due to the additional hydrogen-bond interaction

with the side chain of Asn392 positioned in close proximity to Phe240.

The descriptors of the selected Eq. 3 are consistent with the interaction of the

molecule with the enzyme active site. The spatial factor PMI_Y (secondary amine

in the molecules) is crucial for enzyme inhibitory activity. Any substituent which

ensures proper orientation of hydrogen-bond acceptor groups in the molecule for

potent enzyme inhibitory activity. Any substituent which has hydrogen-bond

acceptor capacity at positions C4 and C2, which can form hydrogen bond with

Tyr119 and Asn392, respectively, with proper steric bulk and orientation, will

increase the potency of molecule. The benzofuran scaffold can be successfully

replaced with bicyclic or tricyclic rings with proper hydrogen-bond acceptor atoms

to interact with His227 to increase the potency. At the ether linkage containing

molecules at C2 position, the hydrogen bond forming ether oxygen should be near

to the Asn392 and its bulky substituent should be properly oriented towards

hydrophobic aromatic residues Phe115, Phe240 and Phe339 of enzyme active site.

Conclusions

A QSAR model was derived using GFA for a series of benzofuran antifungals. The

best model generated correlates with the antifungal activity with PMI_X, PMI_Y,

and HbondAcc, explaining the significance of the orientation of the molecule along

the X and Y-axes and of hydrogen-bond acceptors. The model has moderate internal

and external predictivity, as shown by the values of rcv2 of 0.635 and of rpred

2 of

Med Chem Res

0.773. The statistical significance and robustness of the model has been confirmed

by a randomization study. Thus in the series of benzofuran antifungals, molecules

with substituents with proper orientation towards the X- and Y-axes and hydrogen-

bond accepting capacity may enhance enzyme inhibition activity and, in turn,

antifungal activity. On the basis of the developed QSAR models, novel molecules

can be designed as potential Nmt inhibitors.

Materials and methods

Data set

Twenty-nine molecules selected for the present study were taken from the published

work (Masubuchi et al., 2001; Ebiike et al 2002; Sogabe et al., 2000). These

molecules have been reported as C. albicans Nmt enzyme inhibitors. The negative

logarithm of the 50% inhibitory concentration (IC50) expressed in lM was used as

the biological activity in the 3D-QSAR study, thus correlating the data linearly to

the free energy change. A training set of 24 four molecules (molecules 1–24,

Table 1) was used to generate the QSAR models. The training set molecules were

selected in such a way that they contained information in terms of both their

structural features and biological activity ranges. The most active molecules,

moderately active, and less active molecules were included, to spread out the range

of activities (Golbraikh et al., 2003). To assess the predictive power of the model, a

set of five molecules (molecules 25–29, Table 1) was arbitrarily set aside as the test

set. The test molecules were selected in such way that they truly represent the

training set.

Model building

All molecular modeling studies were carried out using Cerius2 (Version 3.5 and

4.10L) running on Silicon Graphics O2 R5000 and Intel Pentium IV 3.0 GHz

workstations. Structures were constructed and partial charges assigned using the

charge equilibrium method within Cerius2; throughout the study universal force

field 1.02 was used (Rappe et al., 1992). The molecules were subsequently

minimized using the smart minimizer until a root-mean-square deviation of

0.001 kcal/mol A was achieved before use in the study (Rappe and Goddard, 1991).

Conformational sampling

The local minimized geometry was used as the initial structure for conformational

analysis. Conformational ensembles were generated by random sampling using a

rotation increment of 10 kcal/mol for all the torsional angles. In order to restrict the

number of conformers generated to a maximum of 500, conformers with an energy

threshold value of more than 10 kcal/mol above that of the local minimized

structures were rejected, thus selecting only energetically stable conformers.

Med Chem Res

Calculation of descriptors

Different physicochemical descriptors were calculated for each molecule in the

study table using the default settings within Cerius2. The descriptors included are

electronic, spatial, structural, thermodynamic, and molecular shape analysis (MSA)

descriptors. All molecules were aligned on the lowest-energy conformer (molecule

23) with energy 210.346 kcal/mol, which was taken as the reference for calculation

of MSA. A complete list of the descriptors used in the study is given in Table 2. For

quantum-mechanical descriptors, the highest occupied orbital energy (HOMO),

lowest unoccupied molecular orbital (LUMO), super delocalizabilty (Sr), etc., were

calculated using the PM3 Hamiltonian.

Generation of QSAR models

QSAR has been a method to derive analysis establishes relationship between

physiochemical properties and biological activity of molecules. In the present study,

QSAR model generation was carried out using the GFA technique using 20,000

crossovers, a smoothness value of 1.00, and other settings at their default values.

GFA was asked to consider not more than four terms in the equation. The set of

equations generated was evaluated on the following basis:

a. LOF

b. Variable term in the equation.

c. Predictivity of the equation (predictive r2 value)

Validation of QSAR models

The overall objective of a QSAR procedure is to derive a model that is optimally

predictive. A model that does a good job of predicting the activities of molecules on

which it is based must be tested to see if any of the data in the test set are data that

affect the model excessively. This is done using the QSAR validation procedure. It

is useful to assess the reliability and significance of QSAR models. Validation can

be done by using cross-validation test, the bootstrap test, and by randomization test.

The cross-validation process repeats regression many times on subsets of data.

Usually each molecule is left out in turn, and the r2 value is computed using the

predicted values of the missing molecules (the cross-validated r2). If molecules are

removed N at a time from a total set of M, then N 9 M regressions are performed.

Bootstrapping involves the generation of many new data sets from the original data

set by randomly choosing samples from the original data set (Cramer III et al.,1988). The bootstrap r2 (Bsr2) value is the average squared correlation coefficient. A

Bsr2 value is computed from the subset of variables used one at a time for the

validation procedure. A bootstrap r2 value can be used more than once in computing

the r2 statistic. The randomization test is done as follows: (1) repeatedly permuting

the activity values of the data set, (2) using permuted values to generate QSAR

model, and (3) comparing the resulting mean r2 with the r2 value of the original

QSAR model generated from nonrandomized activity values (Verma et al., 2008). If

Med Chem Res

the original QSAR model is statistically significant, its r2 should be significantly

better than the r2 from permuted data.

Predictivity of the equation

Each QSAR equation was assessed by predicting the activities of the test set

molecules and these predictions were expressed as predictive r2 (rpred2 ) values,

r2predicative ¼

SD� PRESS

SD

where SD is the sum of the squared deviation between biological activities of the

test set molecules and mean activity of the training set, and PRESS is the sum of the

squared deviation between the predicted and actual activities of the test set.

Observed and predicted activities of the molecules along with residuals are

presented in Table 1.

Acknowledgements One of the authors (H.S.D.) is thankful to CSIR, New Delhi for awarding a Senior

Research Fellowship (SRF). The authors would like to thank Dr. Shivajirao S. Kadam Vice-Chancellor,

Bharati Vidyapeeth University, Pune for encouragement.

References

Benedetti MS, Bani M (1999) Metabolism-based drug interactions involving oral azole antifungals in

humans. Drug Metab Rev 31:665–717. doi:10.1081/DMR-100101941

Chandavarkar MA, Kulkarni VM, Shivkumar P, Shetty RS, Bapat UR (2003) 1,2,4-Triazol-1-yl-propan-

2-ol derivatives as anti-fungal agents. World Patent WO 068142.

Chandavarkar MA, Kulkarni VM, Shivkumar P, Shetty RS, Bapat UR (2005) Novel antifungal agents.

U.S. Pat Appl 0020653:A1

Cramer RD III, Bunce JD, Petterson DE, Frank IE (1988) Crossvalidation, bootstrapping, and partial least

squares compared with multiple regression in conventional QSAR studies. Quant Struct Act Relat

7:18–25. doi:10.1002/qsar.19880070105

Devadas B, Freeman SK, Zupec ME, Lu HF, Nagarajan SR, Kishore NS et al (1997) Design and synthesis

of novel imidazole-substituted dipeptide amides as potent and selective inhibitors of Candida

albicans myristoylCoA:protein N-myristoyltransferase and identification of related tripeptide

inhibitors with mechanism-based antifungal activity. J Med Chem 40:2609–2625. doi:10.1021/

jm970094w

Duronio RJ, Towler DA, Heuckeroth RO, Gordon JI (1989) Disruption of the yeast N-myristoyl

transferase gene causes recessive lethality. Science 243:796–800. doi:10.1126/science.2644694

Ebiike H, Masubuchi M, Liu P, Kawasaki K, Morikami K, Sogabe S et al (2002) Design and synthesis of

novel benzofurans as a new class of antifungal agents targeting fungal N-myristoyltransferase. Part

2. Bioorg Med Chem Lett 12:607–610. doi:10.1016/S0960-894X(01)00808-3

Gokhale VM, Kulkarni VM (2000a) Understanding the antifungal activity of terbinafine analogues using

quantitative structure–activity relationship (QSAR) models. Bioorg Med Chem 8:2487–2499. doi:

10.1016/S0968-0896(00)00178-4

Gokhale VM, Kulkarni VM (2000b) Selectivity analysis of 5-(arylthio)-24-diaminoquinazolines as

inhibitors of Candida albicans dihydrofolate reductase by molecular dynamics simulations. J Comput

Aided Mol Des 14:1573–4951. doi:10.1023/A:1008189724803

Golbraikh A, Shen M, Xiao Z, Xiao Y, Lee K, Tropsha A (2003) Rational selection of training and test

sets for the development of validated QSAR models. J Comput Aided Mol Des 17:241–253. doi:

10.1023/A:1025386326946

Johnson DR, Bhatnagar RS, Knoll LJ, Gordon JI (1994) Genetic and biochemical studies of protein

N-myristoylation. Annu Rev Biochem 63:869–914. doi:10.1146/annurev.bi.63.070194.004253

Med Chem Res

http://dx.doi.org/10.1081/DMR-100101941

http://dx.doi.org/10.1002/qsar.19880070105

http://dx.doi.org/10.1021/jm970094w

http://dx.doi.org/10.1021/jm970094w

http://dx.doi.org/10.1126/science.2644694

http://dx.doi.org/10.1016/S0960-894X(01)00808-3

http://dx.doi.org/10.1016/S0968-0896(00)00178-4

http://dx.doi.org/10.1023/A:1008189724803

http://dx.doi.org/10.1023/A:1025386326946

http://dx.doi.org/10.1146/annurev.bi.63.070194.004253

Karki RG, Kulkarni VM (2001a) Three-dimensional quantitative structure—activity relationship

(3D-QSAR) of 3-aryloxazolidin-2-one antibacterials. Bioorg Med Chem 9:3153–3160. doi:10.1016/

S0968-0896(01)00186-9

Karki RG, Kulkarni VM (2001b) A feature based Pharmacophore for Candida albicans MyristoylCoA:

protein N-myristoyltransferase inhibitors. Eur J Med Chem 36:147–163. doi:10.1016/S0223-

5234(00)01202-2

Karki RG, Kulkarni VM (2001c) Computer-aided design and synthesis of Candida albicansN-myristoyltransferase inhibitors as antifungal agents. Indian Drugs 38:406–413

Kawakami Y, Inoue A, Kawai T, Wakita M, Sugimoto H, Hopfinger AJ (1996) The rationale for E2020

as a potent acetylcholinesterase inhibitor. Bioorg Med Chem Lett 4:1429–1446. doi:10.1016/0968-

0896(96)00137-X

Kharkar PS, Desai B, Gaveria H, Varu B, Loriya R, Naliapara Y et al (2002) Three-dimensional

quantitative structure-activity relationship of 14-dihydropyridines as antitubercular agents. J Med

Chem 45:4858–4867. doi:10.1021/jm020217z

Lodge JK, Machelski E, Toffaletti DL, Perfect JR, Gordon JI (1994) Targeted gene replacement demonstrates

that myristoyl-CoA: protein N-myristoyltransferase is essential for viability of Cryptococcus

neoformans. Proc Natl Acad Sci USA 91:12008–12012. doi:10.1073/pnas.91.25.12008

Masubuchi M, Ken-ichi K, Ebiike H, Ikeda Y, Tsujii S, Sogabe S et al (2001) Design and synthesis of

novel benzofurans as a new class of antifungal agents targeting fungal N-myristoyltransferase Part 1.

Bioorg Med Chem Lett 11:1833–1837. doi:10.1016/S0960-894X(01)00319-5

Nagarajan SR, Devadas B, Zupec ME, Freeman SK, Brown DL, Lu HF et al (1997) Conformationally

constrained [p-(omega-aminoalkyl)phenacetyl]-L-seryl-L-lysyl dipeptide amides as potent pepti-

domimetic inhibitors of Candida albicans and human myristoyl-CoA:protein N-myristoyl

transferase. J Med Chem 40:1422–1438. doi:10.1021/jm9608671

Purushottamachar P, Kulkarni VM (2003) 3D-QSAR of N-myristoyltransferase inhibiting antifungal

agents by CoMFA and CoMSIA methods. Bioorg Med Chem 5:3487–3497. doi:10.1016/S0968-

0896(03)00305-5

Rappe AK, Casewit CJ, Colwell KS, Goddard WA, Skiff WM (1992) UFF a full periodic table force field

for molecular mechanics and molecular dynamics simulations. J Am Chem Soc 114:10024–10039.

doi:10.1021/ja00051a040

Rappe AK, Goddard WA (1991) Charge equilibration for molecular dynamics simulations. J Phys Chem

95:3358–3363. doi:10.1021/j100161a070

Rogers D, Hopfinger AJ (1994) Application of genetic function approximation to quantitative structure-

activity relationships and quantitative structure-property relationships. J Chem Inf Comput Sci

34:854–866. doi:10.1021/ci00020a020

Sachan N, Kadam SS, Kulkarni VM (2007) Human protein tyrosine phosphatase 1B inhibitors: QSAR by

genetic function approximation. J Enzyme Inhib Med Chem 22:267–276. doi:10.1080/1475636060

1051274

Sogabe S, Masubuchi M, Sakata K, Fukami TA, Morikami K, Shiratori Y et al (2000) Crystal structures

of Candida albicans N-myristoyltransferase with two distinct inhibitors. Chem Biol 9:1119–1128.

doi:10.1016/S1074-5521(02)00240-5

Talele TT, Kulkarni SS, Kulkarni VM (1999) Development of pharmacophore alignment models as input

for comparative molecular field analysis of a diverse set of azole antifungal agents. J Chem Inf

Comput Sci 39:958–966. doi:10.1021/ci990020o

Verma J, Khedkar VM, Prabhu AS, Khedkar SA, Malde AK, Coutinho EC (2008) A comprehensive

analysis of the thermodynamic events involved in ligand—receptor binding using CoRIA and its

variants. J Comput Aided Mol Des 22:91–104. doi:10.1007/s10822-008-9172-0

Wagh NK, Deokar HS, Juvale DC, Kadam SS, Kulkarni VM (2006) 3D-QSAR of histone deacetylase

inhibitors as anticancer agents by genetic function approximation. Indian J Biochem Biophys

43:360–371

Weinberg RA, McWherter CA, Freeman SK, Wood DC, Gordon JI, Lee SC (1995) Genetic studies reveal

that myristoylCoA: protein N-myristoyltransferase is an essential enzyme in Candida albicans. Mol

Microbiol 16:241–250. doi:10.1111/j.1365-2958.1995.tb02296.x

Wilcox C, Hu JS, Olson EN (1987) Acylation of proteins with myristic acid occurs cotranslationally.

Science 238:275–278. doi:10.1126/science.3685978

Med Chem Res

http://dx.doi.org/10.1016/S0968-0896(01)00186-9

http://dx.doi.org/10.1016/S0968-0896(01)00186-9

http://dx.doi.org/10.1016/S0223-5234(00)01202-2

http://dx.doi.org/10.1016/S0223-5234(00)01202-2

http://dx.doi.org/10.1016/0968-0896(96)00137-X

http://dx.doi.org/10.1016/0968-0896(96)00137-X

http://dx.doi.org/10.1021/jm020217z

http://dx.doi.org/10.1073/pnas.91.25.12008

http://dx.doi.org/10.1016/S0960-894X(01)00319-5

http://dx.doi.org/10.1021/jm9608671

http://dx.doi.org/10.1016/S0968-0896(03)00305-5

http://dx.doi.org/10.1016/S0968-0896(03)00305-5

http://dx.doi.org/10.1021/ja00051a040

http://dx.doi.org/10.1021/j100161a070

http://dx.doi.org/10.1021/ci00020a020

http://dx.doi.org/10.1080/14756360601051274

http://dx.doi.org/10.1080/14756360601051274

http://dx.doi.org/10.1016/S1074-5521(02)00240-5

http://dx.doi.org/10.1021/ci990020o

http://dx.doi.org/10.1007/s10822-008-9172-0

http://dx.doi.org/10.1111/j.1365-2958.1995.tb02296.x

http://dx.doi.org/10.1126/science.3685978

qsar analysis of n myristoyltransferase inhibitors antifungal

Documents