mark mackey, cresset, 'meet molecular architect, a new product for understanding sar and...

39
Meet Molecular Architect Dr Mark Mackey Chief Scientifc Officer

Upload: cresset

Post on 26-Jan-2015

104 views

Category:

Technology


0 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Meet Molecular Architect

Dr Mark MackeyChief Scientifc Officer

Page 2: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Outline

> Fields, Field points and the good things you can do with them

> The alignment problem

> 3D-QSAR using Fields

> Examples> SARS PLpro – small data set, known xtal structure

> NK3 – large data set, unknown xtal structure

Page 3: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NN

Br

F FF

SH2NO

O

Field Points

Condensed representation of electrostatic, hydrophobic and shape properties (“protein’s view”)

> Molecular Field Extrema (“Field Points”)

3D Molecular Electrostatic

Potential (MEP)

Field Points= Positive = Negative= Shape= Hydrophobic

2D

Page 4: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Field Points have lots of applications

> Virtual screening

> Alignment

> Pharmacophore elucidation

> Bioisosteres

> etc

Page 5: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Field Points have lots of applications

> Virtual screening

> Alignment

> Pharmacophore elucidation

> Bioisosteres

> etc

> What about 3D QSAR?

Page 6: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

The Alignment Problem

> Historically very difficult

> Early approaches template-based> Issues with side chain orientations

> Some success with docked data sets

> Easy to fool yourself> Correlation/causation

Page 7: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Alignment issues

> Ligand-centric view vs protein-centric

Cramer, JCAMD, 2010, DOI 10.1007/s10822-010-9403-z

NN

Br

F FF

SH2NO

O

NN

Br

F FF

SH2NO

O

Page 8: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Which is better?

> “The superior statistical qualities of 3D-QSAR models based on poses that superimpose presumably critical ligand features, rather than docked conformations.” Clark R., JCAMD 2007, p587

Doweyko, J. Comp-Aided Mol. Des., 2004, p 587

Free alignment adds signal, but also noise. Worse statistics, better predictability?

Page 9: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

N-methyl acetamideImidazole

Field Alignment

H3N

N

OO

HN

O

NH

NH2H2N

O

NH

NH

HN

SOO

Page 10: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

N-methyl acetamideImidazole

Field Scoring

Cheeseright et al, J. Chem Inf. Mod., 2006, 665

To score a particular alignment, we use the field points of molecule 1 to sample the actual field of molecule 2

Page 11: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Field Scoring

N-methyl acetamideImidazole

Cheeseright et al, J. Chem Inf. Mod., 2006, 665

To score a particular alignment, we use the field points of molecule 1 to sample the actual field of molecule 2 and vice-versa

Page 12: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Field Sampling

Field-point based QSAR descriptors

Page 13: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Field Sampling

Field-point based QSAR descriptors

Page 14: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Advantages

> Many fewer sample points than grid-based methods

> E.g. Vegfr2 data set

Field Sampling Grid Sampling Filtered Grid Sampling

q2 0.62 0.42 (0.48) 0.40

Number of descriptors

466 3940 1243

Du et al., J Mol Graph Model. 27 (2009) 642-652

Page 15: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Advantages

> Many fewer sample points than grid-based methods

> Sample points physically rather than statistically chosen

> Gauge invariant

> Consistent framework for alignment and QSAR

Page 16: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Initial validation

> Tested against literature CoMFA datasets> 15 datasets with alignments available

> CoMFA average cross-validated RMSE is 0.72

> Field QSAR using CoMFA alignments is 0.74

> Simple model (volume indicator variable) is 0.83

> Data sets re-aligned using field alignment> RMSE 1.00

Page 17: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Interpretability

Electrostatic Steric Variance

Page 18: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

SARS PLpro

Page 19: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

The target

> PLpro (Papain-like protease) is a DUB target which is critical for the replication of the coronavirus responsible for SARS

> Crystal structures available with bound ligands from 2 series of compounds: structurally related (PDB entries 3E9S and 3MJ5)

> Small number of analogues – challenge to see if we can use 3D-QSAR for small data sets

Page 20: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Alignment

Page 21: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Sampling points

Page 22: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Model

3.50 4.00 4.50 5.00 5.50 6.00 6.50 7.003.50

4.00

4.50

5.00

5.50

6.00

6.50

7.00

R² = 0.838410392100944

R² = 0.987121600034644Training SetLinear (Training Set)Test SetLinear (Test Set)

PLS Components = 5 RMSE = 0.09 RMSEP = 0.38

Page 23: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Summary

> Able to build a predictive 3D-QSAR model based on small number of analogues

> Guided (by volume of Xtal structure) alignment worked best. Free alignment was OK, but noisier.

Page 24: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK3 antagonists

Page 25: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK3 example

> GPCR target (Tachykinin receptor 3) – selectively binds Neurokinin B – target for treatment of neurological disorders such as schizophrenia

> Three series of inhibitors from Euroscreen> Scaffold-1 – 81 compounds with pIC50 (radioligand

binding) in range 4.6-8.7

> Scaffold-2 – 80 compounds with pIC50 in range 4.8-7.7

> Errors in radioligand binding data c. ± 0.4

Page 26: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK3 binding mode

> For a 3D method you need a 3D alignment

> FieldAlign can align to a reference

> FieldTemplater generates the reference

O

N+

NH

N

HO

O

O

OHO

H

N

N+

N

O

N

N

O

H

F

F F

F

F

O

HN N+

H

H

N N

NH

FieldTemplater

Page 27: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK3 binding mode prediction

> FieldTemplater> Selection of 3 highly active scaffold-1 compounds

plus 2 structurally dissimilar literature NK3 actives (Talnetant and SB-218795).

> Generated Templates filtered and candidate selected

> Conformation of most active scaff-1 structure then used as alignment target for other structures

Page 28: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

3D-QSAR details

> Alignment> Free alignment to template conformation

> Field selection> Generated Field points for both steric and electrostatic

fields, with both sets at independent locations.

> 80/20 training/test split > Most active and least active training set

> 2nd most active, 2nd least active test set

> Random distribution of remaining compounds

Page 29: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Initial models problematic

> When all else fails, talk to the chemists

> “Are you using the right tautomer?”

N N

N

N

H

N N

N

N

Page 30: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK3 Series 1

RMSE 0.19, RMSEpred 0.64

4.5 5 5.5 6 6.5 7 7.5 8 8.5 94.5

5

5.5

6

6.5

7

7.5

8

8.5

9

Training Set

Test Set

Page 31: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK3 Series 1

Electrostatics Sterics

Page 32: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Extend to scaff-2?

4 5 6 7 8 9 10 114

5

6

7

8

9

10

11

Actual vs Predicted - scaff2 compounds on scaff1 model

> Complete lack of predictivity

> Visual analysis suggests a shift in binding mode for scaff-2

> Cross-series QSAR difficult

> Requires consistent binding modes!

Page 33: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

NK1 Scaffold 2

4 4.5 5 5.5 6 6.5 7 7.5 8 8.54

4.5

5

5.5

6

6.5

7

7.5

8

8.5

TrainingTest

Activity

Pred

icted

Acti

vity

RMSEpred 0.60

Page 34: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Summary

> Able to generate models based on alignment to predicted active conformation by templating

> Independent models within each of two series show reasonable predictivity and can be used to guide further work

> Cross-series analysis suggests different binding modes for the two series

Page 35: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Molecular Architect

Page 36: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Molecular Architect

> Initially FieldAlign + QSAR

> Align your molecules

> Build models

> Test models

> Fit new compounds to models

> Interactive feedback

> Add additional alignment options

Page 37: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Molecular Architect

> One tool for molecule designers> Align

> QSAR

> Pharmacophore elucidation

> Bioisosteres

> What do I make next?

> Beta Q4 2011

Page 38: Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understanding SAR and gaining better activity predictions

Acknowledgements

> Cresset> Andy Vinter

> Tim Cheeseright

> James Melville

> Chris Earnshaw

> Euroscreen> Hamid Hoveyda

> Julien Parcq