collaboratory for multi-scale chemical science

1
Collaboratory for Multi-scale Collaboratory for Multi-scale Chemical Science Chemical Science Abstract The goal of the CMCS project is to enable chemical scientists to conquer barriers to rapid sharing of validated information and open new paradigms for multi-scale science. The emerging vision for meeting these requirements is a chemical science ‘knowledge grid,’ which incorporates advances being made in semantic web, informatics, collaboratory, and grid communities. This is being accomplished by developing and publicly deploying an adaptive informatics infrastructure that integrates a set of key collaboration tools, chemistry-specific applications, data resources, and services, such as a Chemical Science Portal enabling data-centric project- and community-level collaboration, XML data/metadata management services enabling annotation and data discovery, and tools for security, notification, and collaboration. The CMCS environment is currently used by several pilot groups in the combustion research community and other areas of chemistry. The capabilities of CMCS, together with several new scientific results enabled by CMCS were showcased at SC02 and SC03. SciDAC PI Meeting Charleston, SC March 22-24, 2004 CMCS Team Thomas C. Allison, 6 Sandra Bittner, 3 Brett Didier, 2 Michael Frenklach, 8 William H. Green, Jr., 7 Darrian Hale, 1 Mihael F. Hategan- Marandiuc, 3 Carina Lansing, 3 Gregor von Laszewski, 3 David Leahy, 1 James D. Myers, 2 Michael Minkoff, 3 David Montoya, 5 Luwi Oluwole, 7 Carmen Pancerella, 1 Reinhardt Pinzon, 3 William Pitz, 4 Larry Rahn, 1 Jane Riese, 5 Branko Ruscic, 3 Karen Schuchardt, 2 Albert F. Wagner, 3 Theresa Windus, 2 Christine Yang, 1 and Ginger Young 5 X (cm) R (cm) 0 2 4 6 8 10 0 1 2 3 4 1 2 3 5 6 8 9 10 0 4 2 7 11 Chemical Science Application Areas SciDAC Reacting Flow Simulations Detection and tracking of features in large-scale simulation data sets for BES SciDAC reacting flow simulations Contact: David Leahy, SNL PrIMe: Process Informatics Model International collaboration for generation of predictive kinetic models based on the best current experimental and theoretical data Contact: Michael Frenklach, UCB Chemical Model Reduction Web-service hosted for computational reduction of chemical models against specific ranges of validity Contact: Bill Green, MIT HCCI Consortium A Multi-University Consortium formed to address Homogenous Charge Compression Ignition (HCCI) Engines Contact: Bill Pitz, LLNL IUPAC International Union of Pure & Applied Chemistry Task Group on Thermochemistry of Radicals Contact: Branko Ruscic, ANL Quantum Chemistry BES SciDAC Chemistry, EMSL Chemistry Program, NW Chem, ECCE Contact: Theresa Windus, PNNL Real Fuels Chemistry NIST project on chemistry of realistic fuels Contact: Tom Allison, NIST http:// cmcs.org CMCS Objectives Architect and build an adaptive informatics infrastructure enabling multi-scale science XML data/metadata management services Chemical Science Portal enabling data-centric project- and community-level collaboration Middleware and tools for security, notification, collaboration Pilot project within combustion research community Enable rapid exchange of multi-scale data/pedigree Integrate chemical science tools that generate, use and archive metadata Demonstrate the power of adaptive infrastructure to existing and new areas as CMCS evolves Development environment for an evolving set of collaborative cross-scale science tools Develop collaborative data pedigree/annotation tools Explore and develop a prototype ‘knowledge grid’ capability Gain adoption and continued support by science community participation Document success and continuation path The Multi-scale Challenge Impact of Chemical Science relies upon flow of information across many physical scales Data from smaller scales supports models at larger scales New knowledge is assimilated from different data and tools at each scale Critical science lies at scale interfaces Molecular properties, transport Validated chemical mechanisms, reduced mechanisms Chemistry - turbulence interactions Impact through industrial application is mostly at larger scales, but industrial R&D draws on all scales Multi-scale information is complex and its pedigree matters The propagation of data pedigree across scales is difficult Data is validated and annotated in post-publication processes Multi-scale science faces barriers Normal publication route is slow Data and meta-data are not easily available Incompatible data formats and undocumented metadata thwart information exchange The researchers and facilities involved are distributed geographically and among multiple disciplines Complexity of multi-scale science can lead to unnecessary duplication and impede investment of research dollars REACTIONLAB 1 Sandia National Laboratories, Livermore, CA 2 Pacific Northwest National Laboratory, Richland, WA 3 Argonne National Laboratory, Argonne, IL 4 Lawrence Livermore National Laboratory, Livermore, CA 5 Los Alamos National Laboratory, Los Alamos, NM 6 NIST, Gaithersburg, MD 7 Massachusetts Institute of Technology, Cambridge, MA 8 University of California, Berkeley, CA CMCS Development Partnerships CIENTIFIC NNOTATION IDDLEWARE S A M CMCS is Funded by National Collaboratory Program Java Storage Interface JDBC Filesystem XML conf. JDBC GridFTP DAV/DASL Notebook Services JMS External Security Security BFD SOAP XSLT Processing Client API ELN API Subscriber Web Service Metadata generation/ translation Configurable security framework SAM APIs/Interfaces/Components Extends Jakarta Slide, uses OpenJMS, Jaxen, JDOM, COG Supports WebDAV protocol standards for data/metadata access, search, … Supports external authentication and authorization mechanisms Data Store interface allows data/metadata to be stored in files, relational DBs, DataGrids, … Client-side protocol, API, and component-level interfaces Browser accessible graphical user interface for access to resources and visualization Community tools – group formation, announcements, chat, threaded discussion, calendar, task list management Portlet structure accommodates knowledge management too, research applications, and other specialized portal functions Resource Name: CH3OOqueryResult.xml Title: ATcT Thermochemistry Data Table for Methylperoxy radical Creation Date: 2003-11-10 Creator: Branko Ruscic Contributors: Reinhardt Pinzon, Albert F. Wagner, Melita L.Morton, Gregor von Laszewski, Sandra Bittner, Sandeep Nijsure Keywords: Thermodynamics, molecule, species MIME Type: text/xml-activetables-thermochemistry Text Whiteboard Sound Equations Plot View (text/html) JANAF format (text/plain) O Atom Reference – ATcT AtomicLexicon in MainLibrary (1.027) ATcT Bibliography in Main Library (1.027) ATcT NetworkEncyclopedia ATcT SpeciesDictionary hastranslations hasinputs IUPAC issanctionedby references ATcT PolyatomicRRHOLexicon pitzNotesBibliography references ATcT PreferredEnthalpiesCompendium This diagram represents the major conceptual elements of the CMCS Informatics Infrastructure. A portal serves as the web interface for scientists and is powered by the CHEF Architecture shown on the left. Other web interfaces enable remote web services as well as direct access to data from desktop clients. The shared data services infrastructure takes advantage of a variety of standards and open-source technologies. This infrastructure is powered by the Scientific Annotation Middleware also shown to the left. Web-based standards make tools and resources available to collaborators Annotations CMCS Metadata Structure Provenance Relationships Content Metadat a Chemical Science Informatics Architecture

Upload: robyn

Post on 17-Jan-2016

38 views

Category:

Documents


2 download

DESCRIPTION

CIENTIFIC NNOTATION IDDLEWARE. S A M. Client API. ELN API. Subscriber. DAV/DASL. Notebook Services. JMS. Metadata generation/ translation. BFD. Configurable security framework. Processing. SOAP. Web Service. External Security. Security. XSLT. Java Storage Interface. JDBC. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Collaboratory for Multi-scale Chemical Science

Collaboratory for Multi-scale Chemical ScienceCollaboratory for Multi-scale Chemical Science

Abstract

The goal of the CMCS project is to enable chemical scientists to conquer barriers to rapid sharing of validated information and open new paradigms for multi-scale science. The emerging vision for meeting these requirements is a chemical science ‘knowledge grid,’ which incorporates advances being made in semantic web, informatics, collaboratory, and grid communities. This is being accomplished by developing and publicly deploying an adaptive informatics infrastructure that integrates a set of key collaboration tools, chemistry-specific applications, data resources, and services, such as a Chemical Science Portal enabling data-centric project- and community-level collaboration, XML data/metadata management services enabling annotation and data discovery, and tools for security, notification, and collaboration. The CMCS environment is currently used by several pilot groups in the combustion research community and other areas of chemistry. The capabilities of CMCS, together with several new scientific results enabled by CMCS were showcased at SC02 and SC03.

SciDAC PI MeetingCharleston, SC March 22-24, 2004

CMCS TeamThomas C. Allison,6 Sandra Bittner,3 Brett Didier,2

Michael Frenklach,8 William H. Green, Jr.,7 Darrian Hale,1

Mihael F. Hategan-Marandiuc,3 Carina Lansing,3 Gregor von Laszewski,3 David Leahy,1 James D. Myers,2

Michael Minkoff, 3 David Montoya,5 Luwi Oluwole,7 Carmen Pancerella,1 Reinhardt Pinzon,3 William Pitz,4

Larry Rahn,1 Jane Riese,5 Branko Ruscic,3 Karen Schuchardt,2 Albert F. Wagner,3 Theresa Windus,2

Christine Yang,1 and Ginger Young5

X (cm)

R(c

m)

0 2 4 6 8 100

1

2

3

4 1

2

3

5

68

9

1004

27

11

Chemical Science Application AreasSciDAC Reacting Flow Simulations

Detection and tracking of features in large-scale simulation data sets for BES SciDAC reacting flow simulations

Contact: David Leahy, SNL

PrIMe: Process Informatics ModelInternational collaboration for generation

of predictive kinetic models based on the best current experimental and theoretical data

Contact: Michael Frenklach, UCB

Chemical Model ReductionWeb-service hosted for computational

reduction of chemical models against specific ranges of validity

Contact: Bill Green, MIT

HCCI Consortium A Multi-University Consortium formed to

address Homogenous Charge Compression Ignition (HCCI) Engines

Contact: Bill Pitz, LLNL

IUPAC International Union of Pure & Applied

Chemistry Task Group on Thermochemistry of Radicals

Contact: Branko Ruscic, ANL

Quantum Chemistry BES SciDAC Chemistry, EMSL Chemistry

Program, NW Chem, ECCEContact: Theresa Windus, PNNL

Real Fuels Chemistry NIST project on chemistry of realistic fuelsContact: Tom Allison, NIST

http://cmcs.org

CMCS ObjectivesArchitect and build an adaptive informatics

infrastructure enabling multi-scale scienceXML data/metadata management servicesChemical Science Portal enabling data-centric project- and

community-level collaborationMiddleware and tools for security, notification, collaboration

Pilot project within combustion research communityEnable rapid exchange of multi-scale data/pedigree Integrate chemical science tools that generate, use and

archive metadata

Demonstrate the power of adaptive infrastructure to existing and new areas as CMCS evolves Development environment for an evolving set of

collaborative cross-scale science toolsDevelop collaborative data pedigree/annotation toolsExplore and develop a prototype ‘knowledge grid’ capability

Gain adoption and continued support by science community participation

Document success and continuation path

The Multi-scale ChallengeImpact of Chemical Science relies upon flow of

information across many physical scalesData from smaller scales supports models at larger scalesNew knowledge is assimilated from different data and tools

at each scaleCritical science lies at scale interfaces

Molecular properties, transportValidated chemical mechanisms, reduced mechanismsChemistry - turbulence interactions

Impact through industrial application is mostlyat larger scales, but industrial R&D draws on all scales

Multi-scale information is complex and its pedigree mattersThe propagation of data pedigree across scales is difficultData is validated and annotated in post-publication

processes

Multi-scale science faces barriersNormal publication route is slowData and meta-data are not easily available Incompatible data formats and undocumented metadata

thwart information exchangeThe researchers and facilities involved are distributed

geographically and among multiple disciplinesComplexity of multi-scale science can lead to unnecessary

duplication and impede investment of research dollars

REACTIONLAB

1Sandia National Laboratories, Livermore, CA 2Pacific Northwest National Laboratory, Richland, WA 3Argonne National Laboratory, Argonne, IL4Lawrence Livermore National Laboratory, Livermore, CA5Los Alamos National Laboratory, Los Alamos, NM6NIST, Gaithersburg, MD7Massachusetts Institute of Technology, Cambridge, MA8University of California, Berkeley, CA

CMCS Development PartnershipsCIENTIFIC

NNOTATION

IDDLEWARE

SAM

CMCS is Funded by

National Collaboratory Program

Java Storage Interface

JDBC FilesystemXML conf.

JDBCGridFTP

DAV/DASLNotebookServices

JMS

ExternalSecuritySecurity

BFD

SOAP

XSLT

Processing

Client API ELN API Subscriber

WebService

Metadatageneration/translation

Configurablesecurity

framework

SAM APIs/Interfaces/Components

Extends Jakarta Slide, uses OpenJMS, Jaxen, JDOM, COG Supports WebDAV protocol standards for data/metadata

access, search, … Supports external authentication and authorization

mechanisms Data Store interface allows data/metadata to be stored in files,

relational DBs, DataGrids, … Client-side protocol, API, and component-level interfaces

Browser accessible graphical user interface for access to resources and visualization

Community tools – group formation, announcements, chat, threaded discussion, calendar, task list management

Portlet structure accommodates knowledge management too, research applications, and other specialized portal functions

Resource Name: CH3OOqueryResult.xmlTitle: ATcT Thermochemistry Data Table for Methylperoxy radicalCreation Date: 2003-11-10 Creator: Branko Ruscic Contributors: Reinhardt Pinzon, Albert F. Wagner, Melita L.Morton, Gregor von Laszewski, Sandra Bittner, Sandeep NijsureKeywords: Thermodynamics, molecule, species MIME Type: text/xml-activetables-thermochemistry

Text Whiteboard Sound Equations

Plot View (text/html)

JANAF format (text/plain)

O Atom Reference – ATcT AtomicLexicon in MainLibrary (1.027)

ATcT Bibliography in Main Library (1.027)

ATcT NetworkEncyclopedia

ATcT SpeciesDictionary

hastranslations

hasinputs

IUPAC

issanctionedby

references

ATcT PolyatomicRRHOLexicon

pitzNotesBibliography

references

ATcT PreferredEnthalpiesCompendium

This diagram represents the major conceptual elements of the CMCS Informatics Infrastructure. A portal serves as the web interface for scientists and is powered by the CHEF Architecture shown on the left. Other web interfaces enable remote web services as well as direct access to data from desktop clients. The shared data services infrastructure takes advantage of a variety of standards and open-source technologies. This infrastructure is powered by the Scientific Annotation Middleware also shown to the left.

Web-based standards make tools and resources available to collaborators

Annotations

CMCS Metadata Structure

ProvenanceRelationships

Content

Metadata

Chemical Science Informatics Architecture