grid middleware & tools session summary

34
Ian Bird, CERN Rob Gardner, University of Chicago

Upload: oscar-gordon

Post on 30-Dec-2015

32 views

Category:

Documents


2 download

DESCRIPTION

Ian Bird, CERN Rob Gardner, University of Chicago. Grid Middleware & TOOLS session summary. Introduction. 82 abstracts submitted, 36 oral presentations (7 sessions), 44 posters, [2 withdrawn] Categories: cover a broad range Experiment experiences Data Management Workload Management - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Grid Middleware & TOOLS   session summary

Ian Bird, CERN

Rob Gardner, University of Chicago

Page 2: Grid Middleware & TOOLS   session summary

Introduction 82 abstracts submitted,

36 oral presentations (7 sessions), 44 posters, [2 withdrawn]

Categories: cover a broad rangeExperiment experiencesData ManagementWorkload ManagementMonitoring, Information, AccountingSecurity & AuthorizationFabric & Deployment

Page 3: Grid Middleware & TOOLS   session summary
Page 4: Grid Middleware & TOOLS   session summary

Grid reliability – Pablo Saiz

Page 5: Grid Middleware & TOOLS   session summary

Grid efficiency during CMS data challenges – Oliver Gutsche

Page 6: Grid Middleware & TOOLS   session summary

D0 – reprocessing on OSGAmber Boehnlein

Common theme: making sites reliable requires debugging sites/systems one by one

Page 7: Grid Middleware & TOOLS   session summary

Alien grid environment- Pablo Saiz

Job agents – pilot jobsMonitoring

Page 8: Grid Middleware & TOOLS   session summary
Page 9: Grid Middleware & TOOLS   session summary

SRM v2.2 – Flavia Donno

18 month effort to agree, build, test, deploy new version

Page 10: Grid Middleware & TOOLS   session summary

dCache – one of several MSS systems

-Patrick Fuhrmann – overview of dCache developments-- Gerd Behrmann – distributed instance for NDGF

Page 11: Grid Middleware & TOOLS   session summary

LCG Data management tools

LFC, DPM, FTS – Markus Schulz

Page 12: Grid Middleware & TOOLS   session summary

Examples of services that consider deployment & management issues

Page 13: Grid Middleware & TOOLS   session summary

CORAL – distributed database access

Dirk Duellmann

Page 14: Grid Middleware & TOOLS   session summary
Page 15: Grid Middleware & TOOLS   session summary
Page 16: Grid Middleware & TOOLS   session summary

Pilot jobs?

Page 17: Grid Middleware & TOOLS   session summary

Pilot jobs – and variants:

Such a good idea – everyone wants one …

Page 18: Grid Middleware & TOOLS   session summary

Stuart Paterson – optimizations in DIRAC

Marianne Bargiotti Integrity checking in DIRAC

Page 19: Grid Middleware & TOOLS   session summary

Pilots can move intelligence into the jobPaul Nilsson – Panda experience

Page 20: Grid Middleware & TOOLS   session summary

gLite WMS developments

Marco Cecchi

Page 21: Grid Middleware & TOOLS   session summary

CHEP'07, Victoria 21

Igor Sfiligoi – comparison of WMS

Page 22: Grid Middleware & TOOLS   session summary
Page 23: Grid Middleware & TOOLS   session summary

Experiment dashboardsJulia Andreeva

Monitoring from VO/user perspective

Page 24: Grid Middleware & TOOLS   session summary

GridICE – monitoringGuido Cuscela

Permits different views of running jobs

Page 25: Grid Middleware & TOOLS   session summary

James Casey

Advances in monitoring of grid services

Page 26: Grid Middleware & TOOLS   session summary

Stephen Burke – 6 years experience with GLUE schema

Martin Flechl – details on integration of information systems

Page 27: Grid Middleware & TOOLS   session summary
Page 28: Grid Middleware & TOOLS   session summary

David Groep - glExec

Supporting pilot jobs

Page 29: Grid Middleware & TOOLS   session summary
Page 30: Grid Middleware & TOOLS   session summary

Greig CowanUsing DPM over the WAN

Page 31: Grid Middleware & TOOLS   session summary

Addressing failover for core operations services – Alfredo Pagano

Various strategies

Page 32: Grid Middleware & TOOLS   session summary

Platform LSF – Robert StoberIntegrating heterogeneous clusters

Page 33: Grid Middleware & TOOLS   session summary

Observations Solutions exist for most needs now –

Certainly not all perfect yetExperiment layer relatively deep Plethora of workload management systemsNot so many for data management …

Service management issues starting to be addressed by some services (DPM, LFC, FTS, Gridsite, Coral)But in general little thought on how site managers

should manage services

Interoperability / interoperation

Page 34: Grid Middleware & TOOLS   session summary

Observations Workload management

Everyone wants pilot (aka glidein) jobs (and everyone has written a system to submit them)

Commonality – to reach a reliable service experiments need to systematically debug sites being used: D0, CMS, dashboards, …

Sophisticated systems to monitor, debug, recover Dirac, dashboards, grid service monitoring, etc., To improve reliability and help debug the system