automation of cheminformatics tasks using pipeline pilot

30
ask more of your data Automation of Cheminformatics Tasks using Pipeline Pilot

Upload: others

Post on 12-Sep-2021

6 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Automation of Cheminformatics Tasks using Pipeline Pilot

ask more of your data

Automation of Cheminformatics Tasks using Pipeline Pilot

Page 2: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Outline

• Challenges in Cheminformatics• An Introduction to Data Pipelining• Examples

1. Data integration, mining and reporting2. Deployment through end-user clients

• Summary

Page 3: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Challenges in Cheminformatics

• Integrating diverse data & applications

• Automating data processing & analysis

• Capturing & deploying best-practice processes

Page 4: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Pipelining

• A powerful paradigm for data processing• Pipelines guide the flow of data through a series

of modular computational components

Page 5: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Execution…

Page 6: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Results…

Page 7: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Pipelining Enables

• Integration of data from multiple disparate data sources

• Integration of disparate applications

• Automated execution of routine processes

• Capture and deployment of best practice

Page 8: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Pipelining Enables

• Integration of data from multiple disparate data sources

• Integration of disparate applications

• Automated execution of routine processes

• Capture and deployment of best practice

Page 9: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Pipelining Enables

• Integration of data from multiple disparate data sources

• Integration of disparate applications

• Automated execution of routine processes

• Capture and deployment of best practice

Page 10: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Pipelining Enables

• Integration of data from multiple disparate data sources

• Integration of disparate applications

• Automated execution of routine processes

• Capture and deployment of best practice

Page 11: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Integration Example

Primary Data

PKTox

Discovery Biology

Repository

Functionalassay data

DiscoveryChemistry Repository

CompoundsProperties

Inventory

Other Input

Decisions

Page 12: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data Integration Example

Chemistry in ISIS

Biology in Assay Explorer

Page 13: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Step 1: Retrieve and MergeRetrieve chemistry data

- One row per molecule- Convert text string CTAB to Pipeline Pilot

molecule

Merge on commonkey (sample id)

Retrieve biology data- One row per assay- Pivot to get one row per

molecule

Page 14: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Data pivoting• Data is typically organized for storage

– Long-skinny format

• Pivot converts to a short-wide format– View single compound multiple assay

results– Make the data more compact– Suitable for joining to chemistry

Page 15: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Results so far…

Page 16: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Workflow Example

Page 17: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Results so far…

Page 18: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Interactive Visualization

Run Demo

Page 19: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Interactive Scatter Plot

Page 20: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Using Spotfire as a Filter

Page 21: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Hit elaboration

Select similar molecules from Maybridge using Accord Cartridge similarity search

Page 22: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Reporting…

Page 23: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Report OutputHTMLPDF

Page 24: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Deployment to End Users

• Deployment of best-practice processes through an application that end-users are already familiar with speeds uptake and cuts training, for example– Accord for Excel– SpotFire– WebBrowser

Page 25: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Accord for Excel – Reagent Searching

Page 26: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Results

Page 27: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Under the hood…

OR?

OR?

OR?

Page 28: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Out-of-the-box integration of 3rd parties

Other

Cheminformatics

Bioinformatics

Modeling

Page 29: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

Conclusion• A flexible platform for data and application

integration• Best practice workflows can be deployed to end-

users through familiar clients• Many data sources and applications already

integrated– others easy to add

Page 30: Automation of Cheminformatics Tasks using Pipeline Pilot

…ask more of your data

With thanks to

• MDL – for software, data and protocols– David Evans– Gerd Blanke

• Accelrys– Jim Clark and the Accord for Excel team

• Spotfire• SciTegic

– Andrei Caracoti – Spotfire integration– Phil Cochrane – Scatter plot viewer– Rahim Lila