teragrid information services jp navarro, lee liming university of chicago teragrid architecture...

20
TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

Upload: frederick-scott

Post on 28-Dec-2015

218 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

TeraGrid Information Services

JP Navarro, Lee Liming

University of Chicago

TeraGrid Architecture MeetingSeptember 20, 2007

Page 2: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 2

People InteractionsIdentify available resources and request allocation

User Documentation, User Portal, POPS

Learn how to use resources and their statusUser Documentation, Knowledge BaseUser PortalConferences and other events

Ask for helpE-mailTelephone

(User) Software InteractionsStandard service interfaces (login, move data, run jobs,

WAN file-systems, manage data, etc.)Coordinated software (Grid clients, development tools,

science workflow, etc.)Coordinated Unix interaction (standard variables, SoftEnv)

Grid in what sense?

Page 3: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 3

Provide an Information Services Infrastructure:– Applying grid concepts to information publishing– Have RP/GIG/partner operated information

services– Centrally indexing and/or aggregated for

discovery– Primarily focused on public information– Primarily accessible thru software interfaces– Using standards based interfaces– Reliable, scalable, and fast– Publishing TeraGrid information– And partner/community information

TeraGrid Information Services Vision

Page 4: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 4

TG Information Services IS [NOT]

IS NOT ISA central database (Data Warehouse)

A central index/aggregation (Google)

A new user interface A way user interfaces access information

A single implementation/tool Includes several tools

A single software interface Accessed using several useful interfaces

A specific set of data Phased growing set of data

Changed data ownership Ownership maintained as appropriate

Way to manage scientific information

Way to manage Grid meta-data

A data management system (database)

An information publishing system

A coordinated way to index and publish public [Tera]Grid information using software interfaces.

Page 5: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 5

Clients

Clients

Clients

High-Level Components

CacheCache

WS/RESTHTTP GET

WS/SOAP

WS MDS4

TomcatWebMDS

Apache 2.0

TeraGrid Wide Services

TeraGrid Repositories

Partners

WS/SOAPWS MDS4

Resource Provider Services

Page 6: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 6

RP Information ServicesContent:

RP owned and maintained informationData can originate in local systems

Infrastructure:2 scheduling MDS services: authenticated and public (merging)1 general purpose MDS service

TeraGrid Wide Information ServicesContent:

Aggregate/index RP Information Services contentAdditional central information (TGCDB, GIG operated services,

…)

Infrastructure:Several redundant servers Information caching (persistence)Several MDS4 services (WS/SOAP)WebMDS/Tomcat, Apache 2.0, … (WS/REST)Content published in: HTML, XHTML/XML, XML, Atom, RSS, …

TeraGrid Wide vs RP Services

Page 7: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 7

WS/* (Tomcat 5.0, Apache 2.0)Benefits

Very common web services platformSupports several web service interfaces (including simple)Supports multiple styles like REST, Web 2.0Can be highly scalable

ContentMany formats: HTML, XHTML/XML, XML, Atom, RSS, …

WS/SOAP (Globus 4.0.5/VDT 1.7.1 MDS4)Benefits

Indexing, TriggerRegistration, Publish, SubscribeSecurity/AuthorizationRobust WSRF interface

ContentXML

WebMDSBenefits

XPath supportXSLT transforms

Tools

Page 8: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 8

High-Availability Design

info.dyn.teragrid.org

info.teragrid.org

TeraGrid Dynamic DNS

Dynamically direct clients to one or more servers

Set by Information Services administrators

Changes propagate globally fast (TTL = 15 minutes)

Clients

Dynamically Changes

Doesn’t Change

RP/partner services

TG wide servers

(Patrick Dorn & NCSA NetEng)

Page 9: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 9

Information Services Users

User Documentation(Michael, Diana)

User Portal(Maytal & team)

Database?Database?

Gateways

Peer Grids

User Applications

info.teragrid.org

Inca(Kate & team)

Page 10: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 10

Completed Milestones

Infrastructure information services are in production– RP Information Services– TeraGrid Wide Information Services

Scheduling information published to User Portal (since Spring)– Scheduler load– Queue contents

CTSS 4 capabilities published in publicly (Since August)– Accessed by

• User Documentation• Inca system

– Content • Which capability kits are available on each resource• What software is available in each kit on each resource• What services are available from each kit on each resource

Page 11: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

Queue Contents in User Portal

Page 12: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

Resources and Available Kits

Page 13: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

Where are the GridFTP services?

Page 14: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 14

CTSS 4 Kit Capabilities

For each kit on each resource– Current support level, and target support level

• Development/prototype, Testing/pre-production, Production– Support organization– Inca status URL– Multiple version of a kit with different support levels

For each kit software component on each resource– Name, version, how to access it– Multiple versions of a single component

For each kit service on each resource– Name, type, version, and Endpoint (contact location)– GSI OpenSSH, GridFTP, SRB servers, GRAM, MDS4– Multiple services of the same type

The coordinated way TeraGrid publishes available CTSS capabilities.

Page 15: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 15

Open Forward Process - Requirements

Requirements Analysis Design & Develop Deploy & Support

WhatUse cases (who needs it)?What type of information (attributes)?Who owns and provides the data (ownership, source)?Is any of it already available thru information services (gap analysis)?WhoEveryone: users, staff, partnersWhereDiscussion: [email protected] results: wiki documents currently linked here:

http://www.teragridforum.org/mediawiki/index.php?title=Information_Services#Potential_Future_Content

Page 16: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 16

Open Forward Process - Develop

Requirements Analysis Design & Develop Deploy & Support

WhatRegister a provider and/or publish data?Design/adopt a schema?Develop publishing adaptors, services.Choose and/or develop query interfaces.WhoSeveral groups: information users, owner, consumers, SI facilitatedWhereDiscussion: [email protected]

Page 17: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 17

Open Forward Process - Deploy

Requirements Analysis Design & Develop Deploy & Support

WhatDeploy adapters or publishing services.Deploy query interfaces.Update information services documentation.WhoSeveral groups: information providers, RPs, SIWhereDiscussion: [email protected], gateways, docgroup, …

Page 18: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 18

Deploy & SupportUpgrade/expand/merge scheduling information services

Part of WS GRAM 4.0.5 upgradeDoesn’t address Scheduling WG requirements yet

Implement 99.5% availability TeraGrid wide information services

Improve information services documentation

Design & DevelopPublish TGCDB resource and organization informationPrototyping REST/Web 2.0 interfaces:

Publish HTML (staff/developer views), XHTML/XML, Atom, RSS, etc.

Prototype universal command line client

Requirements AnalysisHPC (non-CTSS) softwareResource hardware specificationsScheduling information (BQP, Scheduling WG)Data movement information (Data WG)

Current Activity Areas

Page 19: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 19

GatewaysUser Services gateway database informationGateway published, capabilities, software, and services

Data collectionsUser Services data collections database informationData collections access method, service Endpoint, paths

Community software areasWhich resources have each CSAWhat software is available in each CSA, how to access it

Community accountsWhich resources is each account active on

OutagesPlanned and unplanned outage information (gateways could use)

Resource ProviderPolicies

Peer grids/interoperabilityResources, services available on peer grids (OSG, EGEE, …)

…….

[Not so] Farfetched Possibilities

Page 20: TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007

September 2007 TeraGrid Architecture Meeting 20

Discuss Information Services content, requirements, and design:E-mail list [email protected]

View current Information Services contentUser Portal (scheduler load & queue contents):

https://portal.teragrid.org:443/gridsphere/gridsphere?cid=resources

User Documentation (CTSS 4 kits, software, services):http://www.teragrid.org/userinfo/software/ctss.php

Staff and developer WebMDS viewshttp://www.teragridforum.org/mediawiki/index.php?title=WebMDS_Views

Useful URLs:Will link to other areas:

http://info.teragrid.org/

Information Services activities, plan, etc,http://www.teragridforum.org/mediawiki/index.php?

title=Information_Services

More Information