library compound design methods for custom ... - chemaxon

49
Solutions for Cheminformatics 21-25 Nov, 2010, Hyderabad Library Compound Design Methods for Custom Library Synthesis

Upload: others

Post on 29-Oct-2021

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Library Compound Design Methods for Custom ... - ChemAxon

Solutions for Cheminformatics

21-25 Nov, 2010, Hyderabad

Library Compound Design Methods for Custom

Library Synthesis

Page 2: Library Compound Design Methods for Custom ... - ChemAxon

Offers

Page 3: Library Compound Design Methods for Custom ... - ChemAxon

Usage

Page 4: Library Compound Design Methods for Custom ... - ChemAxon

Library design by ChemAxon

DB

DB

Databases

Reactions

Molecules

Markush structures

Queries

Compound selection

Similarity searches

Substructure searches

Enumeration

Fuse fragments

R-group composition

Reaction enumeration

Markush enumeration

Library analysis

Clustering

2D similarity screen

3D Shape similarity

screen

Fragmentation

R-group decomposition

Fragmentation

Reagent clipping

Page 5: Library Compound Design Methods for Custom ... - ChemAxon

Library design by ChemAxon

ChemAxon Technology

Chemical data storage JChemBase

JChem Cartridge for Oracle

Chemical data search JChem search technology

Chemical data visualization JChem for Excel – Marvin

Instant Jchem - Marvin

Chemical data characterization Calculator plugins – logP, pKa ...

Enumeration Reactor – reaction enumeration

Markush enumeration

R-group composition

Fragment fusion

Fragmentation Fragmenter

R-group decomposition

Analysis JKlustor

Screen 3D - Screen 2D

Page 6: Library Compound Design Methods for Custom ... - ChemAxon

Databases: displaying content on your desktop

JChem for Excel

Instant JChem

Page 7: Library Compound Design Methods for Custom ... - ChemAxon

Databases: displaying content: JSP application

• Search technology

• Descriptors

• Alignments

• Chemical Terms filter

• Import / Export /Edit

• AJAX in JChem Webservices

ONLINE TRYOUT

https://www.chemaxon.com

Page 8: Library Compound Design Methods for Custom ... - ChemAxon

Building blocks for library enumeration

Instant JChem

- Fragmentation

JChem for Excel

- R-group decomposition

Command line

- Fragmentation

- R-group decomposition

Page 9: Library Compound Design Methods for Custom ... - ChemAxon

0.47 0.55

0.57

0.28

0.20

0.06

Which fragments?

regular Tanimoto

optimized Tanimoto

Optimization of Similarity search metrics:

ECFP/FPCP/Chemical FP/ Pharmacophore FP

Page 10: Library Compound Design Methods for Custom ... - ChemAxon

Similarity searching statistics

1

10

100

1000

10000

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

Number of Active Hits

Num

ber

of

Hits

Tanimto Euclidean Optimized Ideal

Page 11: Library Compound Design Methods for Custom ... - ChemAxon

Enumeration

Output files ChemAxon technology

Fragments Fragment fusion

Markush structures Markush enumeration

(search without enumeration)

Reactants – generic

reactions

Reaction enumeartion

R-tables R-group composition

Page 12: Library Compound Design Methods for Custom ... - ChemAxon

Enumeration

R-table Markush structure

Page 13: Library Compound Design Methods for Custom ... - ChemAxon

ChemAxon in Knime

Page 14: Library Compound Design Methods for Custom ... - ChemAxon

Reaction Enumeration

EXCLUDE: match(reactant(1), "[Cl,Br,I]C(=[O,S])C=C") or

match(reactant(0), "[H][O,S]C=[O,S]") or

match(reactant(0), "[P][H]") or

(max(pka(reactant(0), filter(reactant(0),

"match('[O,S;H1]')"), "acidic")) > 14.5) or

(max(pka(reactant(0), filter(reactant(0),

"match('[#7:1][H]', 1)"), "basic")) > 0)

Page 15: Library Compound Design Methods for Custom ... - ChemAxon

ChemAxon in Knime

Page 16: Library Compound Design Methods for Custom ... - ChemAxon

Library analysis

• Characterisation of library:

– Fragments - Fragmenter

– Molecular descriptors – Calculator plugins

Page 17: Library Compound Design Methods for Custom ... - ChemAxon

Library analysis – 3D shape similarity search

Test on DUD

1% Enrichment

0

5

10

15

20

25

30

35

40

ADA CDK2 DHFR ER FXA HIVRT NA P38 thrombin TK trypsin

Perc

en

t o

f th

e a

cti

ves f

ou

nd

Surflex-sim

ROCS

FlexS

ICMsim

CXN-H

Giganti et al. J. Chem. Inf. Model. 2010, 50, 992

Page 18: Library Compound Design Methods for Custom ... - ChemAxon

Library analysis

Wide range of methods• Unsupervised, agglomerative

clustering

• Hierarchical and non-hierarchical

methods

• Similarity based and structure

based techniques

Flexible search options• Tanimoto and Euclidean metrics,

weighting

• Maximum common substructure

identification

• chemical property matching

including atom type, bond type,

hybridization, charge

Page 19: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 20: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 21: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 22: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 23: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 24: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 25: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 26: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 27: Library Compound Design Methods for Custom ... - ChemAxon

Use cases

Page 28: Library Compound Design Methods for Custom ... - ChemAxon

Target-focused libraries:

rapid selection of potential PDE inhibitors from

multi-million compounds’ repositories

Why do we need rapid selection of target- focused libraries?

Design inputs

2D similarity searching strategy

Property-based filtering

Seed/ chemotype representation (diversity)

Conclusion/ Proposals

TargetEx Ltd.,

György Dormán

Page 29: Library Compound Design Methods for Custom ... - ChemAxon

Target -focused libraries via Virtual Screening

H-bond Acceptor

AromaticH-bond donor

Cation

Docking Target structure

Source CompoundsCommercial Samples

Combinatorial Libraries/Historical collectionsDe Novo Compounds

Known Active Compounds

2DSubstructure-

Similarity SearchingPartitioningData FusionClustering

KernelsSVM

3D Pharmacophore

Shape Similarity3D/4D-QSAR

Final Visual InspectionAcquisition

Plating

FilteringADMET

Lead-likeness

Biological testing2D fingerprint

Focused library

TargetEx Ltd.,

György Dormán

Page 30: Library Compound Design Methods for Custom ... - ChemAxon

2D similarity selection

January 5, 2011

Page 31: Library Compound Design Methods for Custom ... - ChemAxon

Similarity searching strategy: execution

• Setting the starting similarity level (dependent on the

fingerprint S/W, T= 60-75 % for ChemAxon)

• Iteration based on the results (scenarios):

• the number of virtual hits are between 50 and 500,

OK

• the number of virtual hits are <50 or >500

– if <50 lower the similarity treshold with 5 %

– if >500 increase the similarity treshold with 5 %

– This can be continued until the optimal range achieved

– If 5 % decrease results in >500 compounds the search can

be refined by 2% (alternatively a diversity selection would

be needed, but that is not available)

– Duplications can be removed when merged the resulting

DBs

Page 32: Library Compound Design Methods for Custom ... - ChemAxon

How to reduce the number of the hits?

Normally screening companies would like to buy 100-1000

compounds

• Since from the various vendor DBs we can obtain 2000-

10.000 virtual hits their number can be reduced

• 1. Applying the reference property space (Lipinski and Veber

rules) (IJchem OK)

• 2. There are overrepresented seeds thus virtual hits coming

from those seeds can be reduced (IJchem OK)

• 3. Applying an optimal distribution of the resulting chemotypes

(removing the overrepresented compounds) (Limited with

Jklustor)

• 4. Simple diversity analysis (JKlustor)

Page 33: Library Compound Design Methods for Custom ... - ChemAxon

TargetEx Ltd., György Dormán

1. Applying the reference property space: Structural determinants: H-bond donor/ acceptor, hydrophobic interactions

(property space determination)

Pharmacophore fingerprints requires more computation and time consuming

In simple similarity search pharmacophore features can only be

considered as statistical features (not connected to structures)

The similarity search results can be filtered based on the physico-chemical

parameter space of the seed compounds (+10/-10 % range applied)

Page 34: Library Compound Design Methods for Custom ... - ChemAxon

Results and further reduction

• Similarity search results: 8655

• After property filtering: 2009

• 2. There are overrepresented seeds thus virtual

hits coming from those seeds can be reduced

• When combining the similarity search the

contribution of the seeds can be controlled (or set

the number of analogues derived from certain

seeds)

Page 35: Library Compound Design Methods for Custom ... - ChemAxon

2. Overrepresented seeds

Seeds leading to highest number of similar hits

HN

NN

N

O

CH3

CH3

CH3

S

N

N

H3C

O

O

O

#4 (Sildenafil)

238 analogues

(60 % similarity or above)

H3C

O

S

O

O

N

O

N

O

HN

CH3

#13

328 analogues

(60 % similarity or above)

H3C

O

S

O

O

N

O

N

O

HN

CH3

#18 (desantafil)

4494 analogues

(60- 80 % similarity)

Page 36: Library Compound Design Methods for Custom ... - ChemAxon

2. Overrepresented seeds

Seeds leading to highest number of similar hits

O

CH3

SO O

N

N

H3C

HN

N

N

O

N

CH3

CH3

#27

237 analogues

(60 % similarity or above)

N

N

HN

O

CH3

O

CH3

NN

#28

272 analogues

(60 % similarity or above)

N

N

Cl

HN

N

OH

O

O

O

#30

466 analogues

(60 % similarity or

above)NH

N

O

O

N

O

OCH

3

#44

2726 analogues

(60 % similarity or above)

Page 37: Library Compound Design Methods for Custom ... - ChemAxon

Recurring structural motifs in the seed structures

Page 38: Library Compound Design Methods for Custom ... - ChemAxon

Recurring structural motifs in the similarity search results

Page 39: Library Compound Design Methods for Custom ... - ChemAxon

3. Applying an optimal distribution of the resulting chemotypes

Proposed application of JKlustor/LibMCS

• Taking into consideration of the substructure

where the maximum number of connection

(bond) is found

– it can be an option

– Maybe difficult to define

• Using such option the „real” core structure can be

found easier

Page 40: Library Compound Design Methods for Custom ... - ChemAxon

Use caseIan Berry

Evotec

Page 41: Library Compound Design Methods for Custom ... - ChemAxon

Evotec Library Profiler

• Aim is to be able to select from a large virtual library either:

– A combinatorial subset

• Typically small focussed libraries

– A non-combinatorial subset

• Medicinal chemistry projects

• Desirable to allow access to all scientists

– Creativity

– Share ideas

– Security aspect

• Interactive

• Subsets need to satisfy multiple criteria

Page 42: Library Compound Design Methods for Custom ... - ChemAxon

Workflow

Enumerate

Virtual

Library

Export to fileImport into

Esma

Property

calculation

Filtering /

analysisExport to file

Import into

Jchem for

Excel

Filtering /

analysis

Import into

Spotfire

Page 43: Library Compound Design Methods for Custom ... - ChemAxon

Workflow using the Library Profiler

Enumerate

Virtual

Library

Select

properties

to calculate

Filtering /

analysis

Export to fileFurther

analysis

Page 44: Library Compound Design Methods for Custom ... - ChemAxon

5-Jan-1144

Page 45: Library Compound Design Methods for Custom ... - ChemAxon

Charting – Scatter plot

Page 46: Library Compound Design Methods for Custom ... - ChemAxon

Pivot View - Properties

Page 47: Library Compound Design Methods for Custom ... - ChemAxon

Using ChemAxon tools

• High usage of Marvin View and Sketch

– Easy to integrate

• JChem cartridge for filtering

– Experience in using cartridge

• JChem tools for many of the property calculations

– HBD, HBA, ROT, AMW, TPSA, Veber Bioavailability, BBB

distribution, undesirable functional groups, Andrews

AVERAGE energy, Bioavailability score, Ligand binding

efficiency, PGP Substrate prediction, pKa, protonated

atom count, non-H atom count

Page 48: Library Compound Design Methods for Custom ... - ChemAxon

WORKSHOP AT 14:00HANDS-ON SESSION

Focused and diverse library

generation by ChemAxon

technology

Page 49: Library Compound Design Methods for Custom ... - ChemAxon

Visit other technical presentations

www.chemaxon.com