combinatorial markush structure handling at chemaxon: us ugm 2008

13
Combinatorial Markush structures at ChemAxon: from drawing to analysis Szabolcs Csepregi Solutions for Cheminformatics

Upload: chemaxon

Post on 18-Nov-2014

2.295 views

Category:

Technology


0 download

DESCRIPTION

Markush structure handling at ChemAxon range of products will be presented, including drawing, visualization and generation of Markush structures, Markush enumeration techniques and searching. It will be shown how libraries more complex than 10^30 library size are handled, with the generic description including R-groups, atom and bond lists, link nodes and position variation. Further developments towards patent Markush structure handling will be also discussed. For latest developments/presentations see: http://www.chemaxon.com/product/markush_search.html

TRANSCRIPT

Page 1: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Combinatorial Markush

structures at ChemAxon:

from drawing to analysis

Szabolcs Csepregi

Solutions for Cheminformatics

Page 2: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Outline

• Combinatorial and patent Markush structures

• Drawing Combinatorial Markush structures with

Marvin

• Markush Enumeration

• Markush registration & searching in a database

• What is coming in Marvin / JChem 5.1

• Future plans – towards patents

2

Page 3: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Markush structuresGeneric notation for describing many molecules

(= Markush library) in a compact form.

Main usage:

– Combinatorial chemistry: similar steps of synthesis

– Chemistry-related patents: to claim part of the chemical

space for a particular purpose.

3

Page 4: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Markush structures

Combinatorial Markush

• Smaller libraries

• Usually simpler

constructs:

– R-groups

– Link nodes

– Atom lists

• Suitable to describe

simpler patents

Patent Markush

• The goal is as wide

coverage as possible

• Uses more

sophisticated methods

– Homology variation

(Alkyl, Aryl, etc.)

– Position variation

– Etc.

• Extra conditions to

avoid overlap with

existing patents4

Page 5: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Drawing with Marvin• Easy R-group drawing & zoom functions

• Atom list, link node, bond list

• Position variation

5

New in 5.1

Page 6: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Enumeration

• Full enumeration

• Selected parts only

• Random enumeration

• Calculate library sizeexact size of huge

Markush libraries

– arbitrary precision or

– magnitude

6

Page 7: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Enumeration• Coloring: scaffold and each R-group parts get different colors

• Alignment: as original scaffold

7

New in 5.1

Page 8: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Markush database tables

• Available in JChem Base and Instant JChem

• Search in the Markush library space of

combinatorial Markush structures

– No enumeration involved – can handle very complex

Markush structures (tested up to 1040, but no explicit limits

were built in.)

– Search types:

• Substructure

• Exact structure (contained in the Markush library)

• Exact fragment

• Perfect (same Markush structure)

– Stereochemistry, query atoms, bonds, query properties:

• Aromatic/aliphatic atom, ring atom and bond, chain bond,

number of bonds handled

8

Page 9: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Markush database tables

Markush structure reduction to display substructure hits:

All matchings of the query to the Markush structure are reduced

into less generic structures. (Generic parts overlapping the hit

are expanded.)

Example

Markush structure Reduced Markush structures

with hit coloring

+ Query

9

Page 10: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Integration in Instant JChem 2.3• Markush tables available: create, import, insert, search

• Show / hide R-groups for Markush table views

• Markush enumeration / hit reduction dialog

10

New in 2.3

Page 11: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

What is coming in 5.1

• Drawing

– Easier sketching of position

variation (from 5.0.2)

• Markush features

– Position variation

• Enumeration:

– R-group coloring

– Scaffold alignment

• Markush search:

– Abbreviations (superatom s-groups) can be included in

Markush structures (from 5.0.2)

– Position variation in both query and database

11

New in 5.1

Page 12: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Longer term plans

Further developments towards patents

• Homology variation (Alkyl, Aryl, Protecting group, etc.)

– properties (# of atoms, branching points, # of heteroatoms, etc.)

• Multiple graphical attachment points for R-groups

• Larger repeating groups

• Bridged definition of multiple R-atoms R1, R2= H, CH3, NO2 or together form a ring

12

Page 13: Combinatorial Markush structure handling at ChemAxon: US UGM 2008

Thank you for your attention!

For more information please visit

www.chemaxon.com

13