markush claims: representation, search, analysis and construction - where are we and how to go...

27
MARKUSH CLAIMS: REPRESENTATION, SEARCH, ANALYSIS & CONSTRUCTION Árpád Figyelmesi 27th ICIC International Conference for the Information Community Nice 2015 Where Are We and How to Go Forward?

Upload: dr-haxel-congress-and-event-management-gmbh

Post on 28-Jan-2018

1.063 views

Category:

Internet


0 download

TRANSCRIPT

Page 1: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

MARKUSH CLAIMS: REPRESENTATION,

SEARCH, ANALYSIS & CONSTRUCTION

Árpád Figyelmesi

27th ICIC International Conference for the Information Community

Nice 2015

Where Are We and How to Go Forward?

Page 2: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

OVERVIEW

History and current state of Markush claims

Page 3: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Origins

Dr. Eugene A. Markush

1888 Budapest, Hungary

1968 New York, USA

US1506316A

”The process for manufacture of

dyes…”colorantshistory.org

Page 4: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Importance

• Between 1994 and 2013

• 3,704,996 US patent

• 468,262 with Markush claims

• Every eighth patent contains

Markush claims

Joseph J. Mallon, 2014

Page 5: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

The real value in Patents not in Drugs…

You Need Good Markush Technology

or

Lot of manual work (with unavoidable mistakes)

Drug Discovery workflow

Find relevant

documents

Analyze prior art

and invent

something new

Create your

own Patents

Page 6: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Variation types

• Substituent variation

• Position variation

• Frequency variation

• Homology variation

• Variation inside variation (nested)

• Additional logical constraints

Page 7: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Patent Markush

• Nested R-groups

• Homologies

• Additional logical constraints

Combinatorial Library

• No nested R-groups

• No Homologies, Repeating

units and Position variation

Markush types

Page 8: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Markush chemical space size

Zhengwei Peng, 2014

Page 9: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Existing databases

Thomson Markush database (MMS)

2.4 million patents

1.6 million Markush structures

2 million specific compounds

CAS Markush database (MARPAT)

0.5 million patent

1 million Markush structures

Page 10: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

REPRESENTATION

Markush representation techniques and challenges

Page 11: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

● R-groups

● Atom lists

● Bond list

● Position variations

● Repeating units

● Homology groups

Markush Representation

Page 12: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

US5948793A

Claimed structure represented

with multiple structures

Workarounds

Page 13: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

CONSTRUCTION

Sketching and automatic generation of Markush structures

Page 14: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

General structure editors

Page 15: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Markush Editor

R-group definitions

Tree view Scaffold

Structure checker

Nesting view & Preview

Page 16: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Markush Composer

Automatic Markush generation from compound list

Page 17: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

DOCUMENT CURATION

Extracting Markush structures from Patents

Page 18: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Representing Covered Chemical Space

● Document processing (XML,

PDF, HTML)

● Name to structure

● NLP technologies

● OSR (CLiDE & OSRA)

Page 19: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Automatic Markush extraction

ChemProspektor

InfoChem

Theseus research project founded

by the Federal Ministry of

Economics and Technology

Dr. Josef Eiblmaier, ACS National Meeting, Philadelphia, August 19 - 22, 2012

Page 20: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

ChemCurator Markush extraction view

Markush editor

Example structures

Annotated document

Selected structures

Structure checker

Page 21: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

MARKUSH SEARCH & ANALYSIS

Understanding covered chemical space and comparison

Page 22: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Markush Search & Hit Visualization

● Substructure

● Full structure

● (Similarity)

● Hit visualization

● Non-Hit visualization

Page 23: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Markush Enumeration

• Full enumeration

• Random enumeration

• Partial enumeration

• Library size calculation

• Biased enumeration

• (Property distribution

characterization)

Page 24: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Markush Overlap

Overlapping chemical space

calculation

Results:

● Percentage of overlap

● Overlapping Markush

Benefits:

● No enumeration

● No size limitations

Page 25: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

SUMMARY

Page 26: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

Summary

• Important breakthroughs in Markush technology– Search & hit visualization

– Comparison

– Construction

– Curation

• Active development in challenging areas– Similarity search

– Characterization

Page 27: Markush Claims: Representation, Search, Analysis and Construction - Where Are We and How to Go Forward

THANK YOUÁrpád Figyelmesi

[email protected]

27th ICIC International Conference for the Information Community

Nice 2015