

SweGrid in practice, Grid(s) in Sweden

TREFpunkt Karlshamn, 20-21 April 2005

Balázs Kónya, Lund University

NorduGrid Collaboration

The Grid, as seen by Ursula Wilby, Sydsvenskan, 10.2.2002


Outline

• Grid Computing as of Today
  - The Grid Vision
  - How Grids work
  - Production Grids

• SweGrid
  - Hardware
  - Operation & Services
  - Middleware
  - Users & Applications

• (Grids in Scandinavia)

• Summary


Some history: TREFpunkt 2002

• Expression of interest: Development of GRID testbed in Sweden (Swedish informal GRID consortium, T Ekelöf).
  - SweGrid has been put into production

• Design and implementation of the NorduGrid Middleware Architecture (started February 2002)
  - Advanced Resource Connector (ARC) from the NorduGrid Collaboration has become one of the major grid middlewares


Grid Computing as of Today: An old vision ...

The “Grid book” 1998: Computational Power Grid
  - A future infrastructure of computing and data management
  - A new utility, next to the existing water, heating, electricity, ...

Computing from the tap

source: IBM


Grid Computing as of Today: Turning into reality...

• Europe is building its E-Infrastructure
  - FP5: ~50 million Euro
  - FP6: ~140 million Euro

• Grids become part of the European computing infrastructure

• Production quality Grids are being used on a daily basis to solve scientific problems

• Vision: the grid layer should be seamlessly “integrated” with the network

“The Grid, for Europe, is far more than resource sharing. It is a big step forward to build the Cyberinfrastructure for a united research community tackling the grand challenges of our universe. It is a coordinated, single economic engine preparing to compete with Asia and the United States.” Wolfgang Gentzsch


Grid Computing as of Today

• The Grid phenomenon (or hype) continues to infect academics, the IT industry and governments (bureaucrats).

• Grid computing has become the Holy Grail of distributed computing and a major marketing tool for the IT sector.

• The next BIG thing promised after the internet:
  - World Wide Web: access to information
  - World Wide Grid: access to computing capacity and beyond ...

• Meanwhile a lot of research/development has been carried out:
  - Matured middlewares & production grids


The rationale behind the Grids remains the same

• Network vs. computer performance
  - Computer speed doubles every 18 months
  - Network speed doubles every 9 months
  - Even data storage outperforms CPU

• Science is pursued in collaborations
  - Teamwork: large experiments, instruments, data sets
  - Need for dynamic, flexible, secure, coordinated resource sharing
  - Need for sharing of geographically distributed resources

• The world gets super-connected

Moore’s Law vs. storage improvements vs. optical improvements. Graph from Scientific American (Jan-2001) by Cleo Vilett, source Vinod Khosla, Kleiner, Caufield and Perkins.
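The doubling times quoted on this slide imply that network capacity pulls ahead of CPU speed exponentially. A minimal Python sketch of that arithmetic, assuming idealized steady doubling (the function names are illustrative and not from the talk):

```python
def growth_factor(months: float, doubling_period_months: float) -> float:
    """Growth factor after `months` for a quantity that doubles
    every `doubling_period_months`."""
    return 2.0 ** (months / doubling_period_months)


def network_gain_over_cpu(years: float,
                          cpu_doubling_months: float = 18.0,
                          network_doubling_months: float = 9.0) -> float:
    """How many times more the network has grown than the CPU after `years`."""
    months = 12.0 * years
    return (growth_factor(months, network_doubling_months)
            / growth_factor(months, cpu_doubling_months))


if __name__ == "__main__":
    for years in (1.5, 3.0, 6.0):
        ratio = network_gain_over_cpu(years)
        print(f"after {years} years: network/CPU ratio grew {ratio:.0f}x")
```

Under these assumptions the network-to-CPU ratio itself doubles every 18 months, which is the quantitative argument for moving computation to wherever resources are available.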


Grid Computing as of Today: middlewares

The middleware is the operating system of the Grid
  - A collection of software which implements grid functionalities
  - No complete solution

Major middlewares (in alphabetic order):

1) EDG-line: EDG/LCG/gLite

2) Globus Toolkit (incompatible versions)
   - GT v2 (pre-WS Globus)
   - GT v3 (deprecated)
   - Globus Toolkit v4
     - WS-RF Framework
     - alpha/beta quality, first release in May

3) Grid3/OSG (GT2 + Condor)

4) NorduGrid/ARC

5) Unicore

Standards & Interoperability ???


How Grid works: overview of a Grid session

The Grid middleware:

• Finds convenient places for the scientist's "job" (computing task) to be run

• Optimizes use of the widely dispersed resources

• Organizes efficient access to scientific data

• Deals with authentication to the different sites that the scientists will be using

• Interfaces to local site authorization and resource allocation policies

• Runs the jobs

• Monitors progress

• Recovers from problems

• ... and ...

• Tells you when the work is complete and transfers the result to the requested location
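The steps listed on this slide can be illustrated with a toy simulation. The sketch below is not how ARC or any other middleware is implemented. It only assumes a simple policy (pick the authorized cluster with the most free CPUs, fall back to the next one on failure), and all class names, cluster names and the output URL are hypothetical:

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass
class Cluster:
    name: str
    free_cpus: int
    authorized_users: FrozenSet[str]
    healthy: bool = True  # False simulates a site where the job fails


@dataclass
class Job:
    owner: str
    cpus: int
    output_location: str


def find_candidates(job: Job, clusters: List[Cluster]) -> List[Cluster]:
    """Matchmaking: clusters that authorize the owner and have enough free CPUs."""
    return [c for c in clusters
            if job.owner in c.authorized_users and c.free_cpus >= job.cpus]


def run_session(job: Job, clusters: List[Cluster]) -> List[str]:
    """Submit to the candidate with the most free CPUs; on failure try the next."""
    log: List[str] = []
    ranked = sorted(find_candidates(job, clusters),
                    key=lambda c: c.free_cpus, reverse=True)
    for cluster in ranked:
        log.append(f"submitted to {cluster.name}")
        if not cluster.healthy:
            # Recovery step: record the failure and move on to the next site.
            log.append(f"{cluster.name} failed, resubmitting")
            continue
        log.append(f"finished on {cluster.name}, "
                   f"output sent to {job.output_location}")
        return log
    log.append("no suitable cluster found")
    return log


if __name__ == "__main__":
    sites = [
        Cluster("alpha", 40, frozenset({"alice"}), healthy=False),
        Cluster("beta", 10, frozenset({"alice"})),
        Cluster("gamma", 80, frozenset({"bob"})),
    ]
    job = Job("alice", 4, "gsiftp://example.org/results")
    for line in run_session(job, sites):
        print(line)
```

In the demo, "gamma" is skipped because the user is not authorized there, "alpha" is tried first and fails, and the job completes on "beta".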


Grid session (animated)

[Animated diagram; labels: RSL, program, input, output, Cluster (x3), infosys]


Production Grids: EGEE

• The largest Grid project, funded by the EU FP6
  - http://eu-egee.org
  - Time span: April 2004 to April 2006, with a planned 2-year extension
  - Partners: 70+ institutes worldwide
  - Major activities:
    - Raise Grid awareness
    - Provide resources and operational support via Grid technologies for scientists
    - Maintain and improve Grid software

• Is a follow-up to the European DataGrid project (EDG), inheriting large parts of its Grid solutions
  - Middleware: gLite, based on Globus and Condor

• Is based on the resources contributed to the LHC Computing Grid (LCG)

• The first release of the gLite middleware is to come out this month

• Is widely expected to become the largest production Grid

• Sweden is participating with SweGrid


Production Grids: Grid3

• Grid3: originally provided infrastructure and a simple Grid-like solution for High Energy Physics computing in the USA
  - http://www.ivdgl.org/grid2003/
  - Collaboration of several research centers, active 2003-2004
  - Uses Globus and Condor, plus a few own developments

• Was proven to be able to provide reliable services to other applications

[Figure labels: CMS DC04, ATLAS DC2]


Production Grids: Open Science Grid

• Continuation and extension of Grid3 achievements
  - http://www.opensciencegrid.org/
  - Consortium, aims at creating a national US Grid infrastructure
  - Focus on general services, operations, end-to-end performance
  - Takes over Grid3 in Spring 2005 (NOW)


Production Grids: NorduGrid

• Collaboration of Nordic researchers, developing its own Grid middleware solution (ARC) since 2001
  - http://www.nordugrid.org

• A Grid based on ARC-enabled sites
  - Driven (so far) mostly by the needs and resources of the LHC experiments
  - Dozens of other applications

• Assistance in Grid deployment outside the Nordic area

• SweGrid is part of this Grid


Production Grids: GRID.IT

• National Grid infrastructure in Italy
  - http://www.grid.it/
  - Funded for 2003-2005
  - Also triggered by HEP community needs, but expands to many other applications

• Like EGEE, heavily based on the EDG Grid middleware
  - Does specific developments, most notably portals and monitoring tools

• Contributes to EGEE


Production Grids: SweGrid basic facts

• SweGrid is a computational Grid consisting of six dedicated clusters

• A Swedish national computational resource
  - The hardware is funded by a grant from the Wallenberg foundation
  - Operational costs and personnel for support and maintenance are funded by the Swedish Research Council

• Official inauguration: March 2004
  - It was the first dedicated Grid resource in Scandinavia

• SweGrid has operated in production mode since then
  - Runs on the reliable ARC middleware
  - Offers support and maintenance services

www.swegrid.se


SweGrid: Architecture

• Six dedicated clusters located at the Swedish academic computing centres.

• Each of the 6 clusters consists of 100 computing nodes and 2 TB disk storage

• Homogeneous hardware to simplify initial development and deployment

• The sites are connected through the 10 Gb/s GigaSunet network

• The OS installed differs between the clusters: RedHat Linux 7.3, Fedora Core 1, Debian 3.0

• The primary Grid middleware is ARC; LCG/gLite is also being deployed


SweGrid: Grid node

• 100 compute nodes
  - IA-32, 1 processor/node
  - 2.8 GHz Intel P4
  - 2 GByte memory
  - 875P chipset
  - 800 MHz FSB
  - dual memory channel

• 2 TByte temporary storage
  - FibreChannel for bandwidth
  - 14 x 146 GByte 10000 rpm

• 1 Gigabit internal interconnect
  - Not full bisectional bandwidth

• Access server
  - Limited login

• 1 Gigabit to SUNET, directly attached
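The per-site figures above can be combined with the six-cluster layout to estimate the size of SweGrid as a whole. A small Python sketch of that arithmetic, using only numbers quoted on the slides (the constant and function names are illustrative, and the storage figure is raw disk capacity before any RAID or filesystem overhead, which the slides do not specify):

```python
# Figures quoted on the slides.
CLUSTERS = 6
NODES_PER_CLUSTER = 100
CPUS_PER_NODE = 1
MEMORY_GB_PER_NODE = 2
DISKS_PER_CLUSTER = 14
DISK_SIZE_GB = 146


def cluster_raw_storage_gb() -> int:
    """Raw disk capacity of one cluster: 14 disks of 146 GByte each."""
    return DISKS_PER_CLUSTER * DISK_SIZE_GB


def swegrid_totals() -> dict:
    """Aggregate CPU count, memory and raw storage over all six clusters."""
    nodes = CLUSTERS * NODES_PER_CLUSTER
    return {
        "cpus": nodes * CPUS_PER_NODE,
        "memory_gb": nodes * MEMORY_GB_PER_NODE,
        "raw_storage_gb": CLUSTERS * cluster_raw_storage_gb(),
    }


if __name__ == "__main__":
    print("raw storage per cluster (GB):", cluster_raw_storage_gb())
    print("SweGrid totals:", swegrid_totals())
```

The 14 disks give 2044 GByte per cluster, consistent with the "2 TByte" quoted above, and the six sites together provide 600 CPUs.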


SweGrid middleware: ARC

• One of the few matured Grid solutions
  - Continuous development
  - Good support

• Attractive for resource owners
  - Non-intrusive
  - Portable (variety of OS, LRMS)
  - Simple installation procedure
  - Scalable, reliable
  - Performs well in ATLAS Data Challenges
  - Scalable: serves a grid of ~50 sites and 5000 CPUs, 50000 jobs/month

• Attractive for users
  - Robust, portable
  - Relatively feature rich
  - Client can be installed everywhere by anyone
  - Plenty of documentation

IANA registered Grid ports: 2135, 2811


SweGrid Users: Resource allocation

• SweGrid is part of the Swedish National Infrastructure for Computing (SNIC)

• Potential users must apply for resource allocations (CPU quotas)

• Resource allocations for users are done on a peer-review basis via the Swedish National Allocation Committee (SNAC), upon request
  - 1/3 is (pre-)allocated for LHC computing
  - The rest is distributed between chemistry, genomics, meteorology etc. (whoever applies)
  - Allocation is done twice a year
  - 20 applications in the last round requested 153740 "CPU-hours"; 18 were granted, for a total of 82300 "CPU-hours"
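To put the last allocation round in perspective, the figures above can be turned into rates. A short Python sketch, using only the numbers quoted on the slide (the names are illustrative):

```python
# Figures quoted on the slide for the last allocation round.
APPLICATIONS_RECEIVED = 20
APPLICATIONS_GRANTED = 18
CPU_HOURS_REQUESTED = 153_740
CPU_HOURS_GRANTED = 82_300


def approval_rate() -> float:
    """Fraction of applications that received an allocation."""
    return APPLICATIONS_GRANTED / APPLICATIONS_RECEIVED


def granted_share() -> float:
    """Fraction of the requested CPU-hours that was actually granted."""
    return CPU_HOURS_GRANTED / CPU_HOURS_REQUESTED


if __name__ == "__main__":
    print(f"applications approved: {approval_rate():.0%}")
    print(f"CPU-hours granted:     {granted_share():.1%}")
```

Most applications were approved, but only a little over half of the requested CPU-hours could be granted. The slide does not say how much the two rejected applications had asked for, so the per-project cut cannot be derived from these numbers.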


SweGrid services

• Operation, Support, Maintenance
  - Six assigned SweGrid system administrators, one per site
  - Detect and resolve hardware failures
  - Installation and upgrades of software
  - Monitoring and performance enhancements
  - Tutorials
  - Helpdesk
  - Application support

• SweGrid Accounting System (SGAS)
  - Developed within the SweGrid project
  - Resource allocation and enforcement system
  - Grid bank
  - Usage tracking
  - Test deployment on SweGrid
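The idea of a "Grid bank" that tracks usage and enforces allocations can be illustrated with a toy ledger. The sketch below is not SGAS and does not reflect its design or interfaces. It assumes a single in-memory account per project, and every class and method name is hypothetical:

```python
class QuotaExceeded(Exception):
    """Raised when a charge would exceed the project's remaining allocation."""


class GridBank:
    """Toy in-memory ledger of CPU-hour allocations and usage per project."""

    def __init__(self) -> None:
        self._allocated = {}  # project -> CPU-hours granted
        self._used = {}       # project -> CPU-hours consumed so far

    def grant(self, project: str, cpu_hours: float) -> None:
        """Resource allocation: add CPU-hours to a project's account."""
        self._allocated[project] = self._allocated.get(project, 0.0) + cpu_hours
        self._used.setdefault(project, 0.0)

    def remaining(self, project: str) -> float:
        """CPU-hours still available to the project."""
        return self._allocated.get(project, 0.0) - self._used.get(project, 0.0)

    def charge(self, project: str, cpu_hours: float) -> None:
        """Usage tracking with enforcement: refuse charges beyond the quota."""
        if cpu_hours > self.remaining(project):
            raise QuotaExceeded(
                f"{project}: {cpu_hours} CPU-hours requested, "
                f"{self.remaining(project)} remaining")
        self._used[project] = self._used.get(project, 0.0) + cpu_hours


if __name__ == "__main__":
    bank = GridBank()
    bank.grant("climate", 100.0)
    bank.charge("climate", 60.0)
    print("remaining:", bank.remaining("climate"))
    try:
        bank.charge("climate", 50.0)
    except QuotaExceeded as err:
        print("rejected:", err)
```

A real accounting service additionally has to work across sites and authenticate both users and resources, which this single-process sketch leaves out.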


SweGrid Users: Application areas & Research

• SweGrid is designed mainly for throughput computation, to quickly process large numbers of loosely coupled non-parallel computations

• Users come from various fields of science: climate research, material science, physics, chemistry and biology
  - www.pdc.kth.se/grid/swegrid-vo/volist.txt

• SweGrid also hosts research activities in various fields of IT, such as
  - Distributed databases
  - Scheduling, brokering
  - Data management, replication

• SweGrid has also been developing the SweGrid Accounting System (SGAS)


SweGrid Users: Main User Group

• The High Energy Physics (HEP) community has been the driving force behind Grid

• Initially the main customers of SweGrid
  - 1/3 of SweGrid is reserved for HEP

• Data Challenges
  - ~110 000 jobs in the second half of 2004


SweGrid: a multidisciplinary grid

Date: 04/15/2005 02:38 PM
To: [email protected]
Subject: [NG-disc] Multidisciplinary grid

Right now bluesmoke has jobs running belonging to:

- climate simulation
- astrophysics
- bioinformatics
- materials science

Actually no HEP at all, at the moment.

I think we might be underestimating the importance and coolness of this.

--
Leif Nixon - Systems expert
National Supercomputer Centre - Linkoping University


Grid of Grids

• Eesti Grid

• Finnish activities

• SweGrid

• NorGrid

• DCGC

• NDGF

• Germany

• Switzerland

• Slovenia

• Slovakia


Estonian Grid

• Technical details:
  - 122 CPUs in 9 clusters
  - 3.8 TB storage
  - NorduGrid ARC middleware
  - GÉANT connection 622 Mbit/s, between the clusters 1 Gbit/s

• Operations:
  - EG CA
    - EG CA is a member of EUGridPMA
  - Technical support and coordination group:
  - Steering committee established at the Ministry of Education and Research of Estonia

• Challenges:
  - Estonian electronic ID-card infrastructure on the Grid (700 000 valid electronic ID-cards issued in Estonia!)
  - Local experiences with E-money and rental software
  - Interoperability of ARC and UNICORE


DCGC

Danish Center for Grid Computing

• Danish Center for Grid Computing, established in August 2003
  - http://www.dcgc.dk

• 3-year project, aiming to
  - Provide Grid access to test facilities (including HPC)
  - Host development activities
  - Provide user support

• Aims at getting industrial partners and users involved

• Uses ARC for middleware


Finland: Material Sciences Grid (M-grid)

• First large initiative to put Grid middleware into production use in Finland: http://www.csc.fi/proj/mgrid/

• Based on ARC and Linux clusters, currently 443 CPUs, targeted for serial and “pleasantly parallel” applications; clusters can be accessed both locally and via ARC

• Joint project between seven Finnish universities, Helsinki Institute of Physics and CSC, funded by the universities and the Academy of Sciences

• Users mainly from the physics and chemistry departments in the partner universities
  - Material physicists, particle physicists, chemists, some bioscientists
  - Typical applications: Gromacs, Gaussian, Dalton


NORGRID

• Norwegian project for Grid competence building, January – December 2004
  - http://norgrid.uio.no
  - ca. 3 FTE
  - Partners: NTNU, UiB, UiO, UiT, UNINETT
  - Funding: 50% NFR and 50% partners

• Objectives:
  - Competence building on Grid middleware and related technologies in Norway
  - Prepare a middleware infrastructure for the next HPC project (starting 2005)
  - Emphasis on distributed data management, metascheduling, portals

• Presently uses ARC
  - Aims to evaluate it against UNICORE, GT4.x and LCG2/gLite


NDGF

• Nordic Data Grid Facility pilot project was launched in spring 2003
  - Initial success of NorduGrid provided grounds for a Nordic Grid facility
  - http://www.ndgf.org
  - Funds for 1 director + 4 postdocs in each country
  - Strong emphasis on portal development and storage facilities
  - Aimed to evaluate various Grid solutions, uses ARC

• Will produce recommendations for the Nordic Grid facility
  - Aims to harness all the resources in the Nordic countries
  - Grid of Grids with a large centralized storage facility
  - This facility is expected to become the Nordic Tier1 candidate


NGN

• Nordic Grid Neighborhood is a networking project funded by the Nordplus program, started in September 2004
  - http://www.nicpb.ee/NordicGrid/

• Expands to the Baltic states and North-West Russia (St. Petersburg)
  - 20 partners
  - Supports and strengthens contacts in the field of Grid technologies
  - Activities cover education, reciprocal knowledge transfer and Grid research and development

• Plans to set up a testbed to deploy and demonstrate ARC and AliEn
  - The scope is to attract and educate users that can benefit from Grid, e.g., medical applications


Summary

• With the creation of SweGrid, Sweden made the first dedicated Scandinavian investment in a Grid computing infrastructure. The SweGrid model has been followed by other Scandinavian countries

• SweGrid has offered a production-quality grid infrastructure since the beginning of 2004

• SweGrid is part of the Swedish National Infrastructure for Computing and is heavily used by a multidisciplinary user base

• Through the SweGrid resources, Sweden joins major world-class Grids: NorduGrid and EGEE

• STAC (Swedish Technical Advisory Committee) is determined that the SweGrid project will have a continuation