egee a large-scale production grid infrastructure

56
EGEE-II INFSO-RI- 031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE A Large-scale Production Grid Infrastructure Erwin Laure EGEE Technical Director ISSGC06 July 16-28, 2006 Ischia, Italy

Upload: zola

Post on 14-Jan-2016

49 views

Category:

Documents


0 download

DESCRIPTION

EGEE A Large-scale Production Grid Infrastructure. Erwin Laure EGEE Technical Director. ISSGC06 July 16-28, 2006 Ischia, Italy. Lost in Definitions?. Defining the “Grid”: Access to (high performance) computing power Distributed parallel computing - PowerPoint PPT Presentation

TRANSCRIPT

EGEE – A large-scale production Grid infrastructureErwin Laure
EGEE-II INFSO-RI-031688
Distributed parallel computing
Increased storage provision
Interconnection of arbitrary resources
Corresponding security
CoreGrid Definition
GGF Definition
EGEE-II INFSO-RI-031688
Defining the Grid
A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user.
This interconnection of users, resources, and services for jointly addressing dedicated tasks is called a virtual organization.
Comparison between Grids and Networks:
Networks realize message exchange between endpoints
Grids realize services for the users higher level of abstraction
EGEE - A Large-scale Production Grid Infrastructure
Unter einem Grid versteht man die Gesamtheit von vernetzten Ressourcen (z.B. Rechner, Instrumente, Sensoren, ...) sowie die darauf bereitgestellte Vermittlungsschicht (Grid Middleware), die für Anwender als Grid Dienste zur Verfügung stehen. Dieser Zusammenschluss von Benutzern, Ressourcen und Diensten zur gemeinsamen Bewältigung einer Aufgabe wird als virtuelle Organisation bezeichnet. Im Unterschied zum Internet, wo prinzipiell Nachrichten zwischen zwei Punkten ausgetauscht werden, realisiert ein Grid somit eine höhere Abstraktionsstufe, durch die der Benutzer Dienstleistungen (in Form von Grid Services) in Anspruch nehmen kann. Die Dienstleistungen werden von der Vermittlungsschicht weitgehend automatisch durchgeführt, wobei zwischen der Basis-Grid-Middleware - mit möglichst allgemein gültigen Diensten - und höheren Grid Diensten - mit entsprechender Spezialisierung auf bestimmte Dienstleistungen - unterschieden wird.
EGEE-II INFSO-RI-031688
of networked resources and the corresponding middleware, which provides services for the user.
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
The EGEE Project
Aim of EGEE:
“to establish a seamless European Grid infrastructure for the support of the European Research Area (ERA)”
EGEE
71 partners in 27 countries, federated in regional Grids
EGEE-II
Expanded consortium
91 partners
Challenge: large consortium needs clearly defined management structure and reporting lines
tested in EGEE, expanded in EGEE-II
maintain expertise from EGEE partners
EGEE-II
focus on providing a service to users and less on building up the infrastructure
increased work with industry: now also direct input in technical management on same level as scientific applications
Involvement of industry also in the technical work – as EGEE business associates or through collaboration with the CERN openlab project (www.cern.ch/openlab)
SA = Service Activities: operation and management of the Grid as well as provision of network resources working, sustainable infrastructure: EGEE-II as core service provider
new sites/countries joining the infrastructure
new activity (SA3) responsible for software integration and testing
NA = Networking Activities: management and coordination of all the communication aspects of the project, application support Growth of infrastructure user base + international collaboration
increased support for applications
reinforced dissemination and training: closer links to industry; extend coverage to all regions
JRA = Joint Research Activities: Re-engineering and integration of Grid middleware components Middleware becoming more mature, needs less development
also includes quality assurance for the whole project and security
New coordination bodies:
Technical Coordination Group
User Information Group
EGEE-II website: www.eu-egee.org
of networked resources and the corresponding middleware, which provides services for the user.
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
EGEE Infrastructure
~ 25 000 CPUs
> 10 PB storage
> 60 Virtual Organizations
At Feb review:
6 domains and
Scaling up the infrastructure with resource centres around the globe
Stable, well-supported infrastructure, running only well-tested and reliable middleware
Pre-production service
Run in parallel with the production service (restricted nr of sites)
First deployment of new versions of the gLite middleware
Test-bed for applications and other external functionality
T-Infrastructure (Training&Education)
and application (Testbed, CA,
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Regional Operation Centers
Tools are developed/hosted at different sites:
GOC DB (RAL), SFT (CERN), GStat (Taipei), CIC Portal (Lyon)
Grid operator on duty
CERN, IN2P3, INFN, UK/I, Ru,Taipei
Crucial in improving site stability and management
Expanding to all ROCs in EGEE-II
Operations coordination
Nov 04, May 05, Sep 05, June 06
Procedures described in Operations Manual
Introducing new sites
Site downtime scheduling
Suspending a site
Escalation procedures; etc.
Evolving and maturing procedures
Procedures being in introduced into and shared with the related infrastructure projects
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
of networked resources and the corresponding middleware, which provides services for the user.
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Strict software process
Software configuration management, version control, defect tracking, automatic build system, …
Conservative approach in what software to use
Avoid “cutting-edge” software
Deployment on over 100 sites cannot assume a homogenous environment – middleware needs to work with many underlying software flavors
Avoid evolving standards
Evolving standards change quickly (and sometime significantly cf. OGSI vs. WSRF) – impossible to keep pace on > 100 sites
Long (and tedious) path
from prototypes to production
EGEE-II INFSO-RI-031688
EGEE generic middleware
March 4, 2006:
EGEE-II INFSO-RI-031688
After gLite 3.0:
As needed by users and as made available by developers
Major releases provide a “check-point”
In general in coincidence with major application challenges
Continuing development to
Improve functionality
Increase robustness
Increase usability
EGEE-II INFSO-RI-031688
Grid Interoperability
Incubator for new Grid
Strengthening contacts with industry
EGEE-II INFSO-RI-031688
Platform
Infrastructure
Unix
Windows
JVM
TCP/IP
MPI
EGEE-II INFSO-RI-031688
Platform
Infrastructure
Unix
Windows
JVM
TCP/IP
MPI
EGEE-II INFSO-RI-031688
Middleware structure
Higher-Level Grid Services may or may not be used by the applications
should help them but not be mandatory
Foundation Grid Middleware is deployed on the infrastructure
should not assume the use of Higher-Level Grid Services
must be complete and robust
should allow interoperation with other major grid infrastructures
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
EGEE-II INFSO-RI-031688
Job submission
Information System
EGEE-II INFSO-RI-031688
SA1 Pre-Production
Scalability Tests
Pre-Production Deployment
EGEE-II INFSO-RI-031688
of networked resources and the corresponding middleware, which provides services for the user.
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
EGEE Applications
>20 applications
Applications now moving from testing to routine and daily usage
EGEE - A Large-scale Production Grid Infrastructure
At Feb review:
6 domains and
2185
2796
7617
10312
11151
9096.5
8629.7419354839
10924.6774193548
10717.4
18084.8709677419
23259.5333333333
22880.8709677419
29417.4516129032
32331.6428571429
30691.8064516129
26448.3
Sheet1
Normalised CPU time [units 1K.SI2K.Hours] by VO and DATE (Excluded dteam VO)
VO
Jun-05
Jul-05
Aug-05
Sep-05
Oct-05
Nov-05
Dec-05
Jan-06
Feb-06
Mar-06
Apr-06
May-06
Total
Jobs / day
CSV Dump from Grid Operations Centre accounting portal for the period 06/2005 to 05/2006
Values shown are Total number of jobs run
(Units are Number of physical jobs per site per VO per month)
Jun-05
Jul-05
Aug-05
Sep-05
Oct-05
Nov-05
Dec-05
Jan-06
Feb-06
Mar-06
Apr-06
May-06
20053
2
0
0
0
0
0
0
0
0
0
0
0
aegis
0
0
0
0
0
0
0
0
517
1527
1774
517
alice
9999
2577
169
2226
18339
19716
9227
24557
12113
2086
40856
3982
alicegrd
0
0
0
0
0
0
0
0
0
1
0
0
alicesgm
0
0
1
211
8537
26635
7147
1974
3
0
4301
3953
ams
0
0
0
0
0
9
1
1
0
0
0
0
argo
0
0
0
1
0
0
0
0
0
0
0
0
asci
0
0
0
0
1
0
0
0
0
0
0
0
astron
0
0
0
0
0
0
0
0
0
0
0
0
atlas
92195
136118
119347
91664
200932
237862
209226
378028
436221
518646
357668
48836
atlasgrid
806
1852
679
2584
2269
5204
1654
4956
696
155
202
3
atlaslcl
0
0
0
1
1
13
0
0
5
0
0
0
atlassgm
0
0
0
19
0
59
22
12
5
536
1991
176
auvergrid
0
0
0
0
0
0
0
23
44
11
19
19
babar
5209
2938
2554
7256
13191
5125
12078
7879
5817
9079
15992
3467
baltic
0
0
0
0
0
0
0
0
0
186
669
45
balticgrid
0
0
0
0
0
0
19
135
1753
2662
1622
305
becms
0
0
0
0
0
0
0
0
0
2
6409
0
belle
0
0
0
0
0
0
0
0
38
71
428
13
betest
0
0
0
0
0
0
0
0
0
182
219
31
bfactory
7053
12191
14472
15677
14504
14853
31609
37450
16907
27995
20465
153
bfg
0
0
0
0
0
0
44
4
22
54
17
0
bg
0
0
0
0
0
0
0
0
40
208
16
0
bio
1382
5681
1809
578
149
1293
11020
6876
929
9540
856
92
biomath
0
0
0
0
0
0
0
0
0
253
1809
0
biomed
10967
14849
24880
12279
11419
18976
22509
42154
15169
32592
24331
6556
biomedsgm
0
0
0
0
0
0
0
0
0
9
11
0
bmed
2308
2932
1711
571
530
330
733
889
576
1349
1379
170
calice
0
0
0
0
0
0
0
0
11
2
0
0
cdf
2
0
0
714
1028
698
1633
2637
1693
5566
1840
232
cesga
0
18
4
27
10
37
11
241
106
437
473
0
cms
68614
42878
31827
60907
152142
158833
121852
144785
187380
148459
135239
11056
cmsgrid
1
44
164
76
82
91
298
1071
93
236
6677
19
cmssgm
0
0
0
9
1
1
108
171
112
534
960
1830
cns
0
0
0
0
0
0
1
1
3
0
31
68
compass
0
0
0
0
0
0
0
0
16
2
5
0
compchem
1
0
1
31
2
4
2
2
33
342
190
3
cosmo
1
1
8
0
2
20
0
5
341
925
11034
2048
d0
7
34
4589
5
0
3818
9932
11341
3204
3584
2250
775
dapatlas
0
0
0
0
9
0
0
0
0
0
0
0
dapdteam
0
0
0
0
2
7
0
0
0
0
0
0
dcms
78
124
21
86
118
226
1295
1278
4736
3735
462
0
dech
0
0
30
395
1
0
2
10
0
0
0
0
delphi
0
0
0
0
0
0
0
0
0
0
0
2
demo
0
0
0
0
0
3
1
0
0
0
0
0
desy
0
0
9
0
0
0
0
0
0
0
0
0
dteamsgm
0
0
0
54
19
341
483
449
208
385
332
103
dzero
19
105
152
130
418
2296
3240
7474
13373
6771
7463
1595
edinburgh
0
0
0
0
0
5
0
0
1
0
0
0
eearth
5
9
0
0
0
0
0
0
0
0
0
0
eela
0
0
0
0
0
0
0
0
0
1
25
0
egeode
10
0
122
197
114
309
541
172
138
124
107
0
egrid
0
0
0
0
0
0
0
0
0
6
0
0
eimamagi
0
0
0
0
0
0
0
0
0
0
3
0
elis
0
0
0
0
0
0
0
0
0
972
909
0
emutd
0
0
0
0
3
0
0
0
0
0
0
0
enea
0
0
0
4
13
4
5
1
1
1
0
1
esr
918
305
355
2541
1451
1810
567
1001
563
532
283
128
eumed
0
0
0
0
0
0
0
0
0
22
1
3
fusion
0
0
0
0
122
10324
0
44
1947
1124
22
3
fusn
0
0
0
0
0
0
0
0
0
0
0
6
gamess
0
0
0
0
0
0
0
0
0
0
2
0
geant
0
0
0
0
0
3
2852
229
1041
4
362
0
geant4
0
0
0
0
0
401
2570
1169
1041
5
6
0
geantsgm
0
0
0
0
0
776
3041
358
0
0
0
0
gear
0
0
0
0
0
0
0
0
0
4
33
338
gearsgm
0
0
0
0
0
0
0
0
0
13
6
0
gene
0
0
0
0
0
0
0
0
0
0
0
0
ghep
0
5
2
0
0
31
12
2
22
1
0
0
gilda
0
0
0
0
0
0
0
0
0
0
0
0
gks
0
0
0
0
0
0
0
0
0
0
0
0
grid3
0
0
0
0
0
0
0
0
0
0
0
0
gridit
0
0
95
3495
1973
755
3428
6379
5420
1712
131
75
gridtest
0
0
0
0
0
0
0
0
0
4
0
0
grif
0
0
0
0
0
0
0
0
0
3
0
0
grycap
0
94
0
3
2
8
1
20
3
7
0
3
gtlastudents
0
0
0
0
0
0
0
0
0
439
2252
5
gvmuam
0
0
0
0
0
0
0
0
0
0
0
0
h1
388
1236
1322
702
307
411
295
296
11
154
5
18
hera1
0
0
0
0
0
0
0
6
0
0
0
0
herab
0
0
0
0
0
1
0
0
0
0
0
0
hgdemo
0
0
0
0
0
0
0
59
4
2
0
0
hone
473
1016
816
1046
336
2514
811
1973
1297
751
39
28
hungrid
0
0
0
0
0
0
0
0
32
2
1
0
icecube
0
0
0
2
0
0
0
0
0
0
0
0
ific
1554
559
0
443
0
0
7
59
174
368
410
191
ilc
20
140
5637
2116
1089
676
1097
348
1833
2571
1884
165
ilcgrid
10
24
233
0
0
0
62
0
2
35
41
2
ildg
2
0
0
4
0
0
0
0
25
103
55
0
inaf
0
0
2
80
2
705
37
1
1
146
1
1
infngrid
18
209
574
2178
2156
1446
677
521
3301
5121
1741
3
inforet
0
0
0
0
2
3
5
0
0
0
0
0
ingv
0
0
1212
1532
486
788
1966
5098
5585
0
6
0
intec
0
0
0
0
0
0
0
0
0
53
103
0
iteam
0
0
0
0
0
0
0
0
0
0
0
0
ivdgl
0
0
0
0
0
0
0
0
0
0
0
0
jku
0
0
0
0
0
0
0
0
15
218
14
4
lal
0
0
0
0
0
26
288
0
3
1
0
0
lalice
0
0
0
0
0
0
0
0
0
0
0
26
latlas
0
0
0
0
0
0
0
0
0
0
70
27
lbiomed
0
0
0
0
0
0
0
0
0
0
22
4
lcgatlas
0
0
0
0
0
0
0
0
219
788
862
1147
lcgcdf
0
0
0
0
0
0
0
0
0
0
18512
0
lcgdteam
0
0
0
0
0
0
0
0
477
470
388
120
lcms
0
0
0
0
0
0
0
0
0
0
6
0
ldteam
0
0
0
0
0
0
0
0
0
0
57
10
lhcb
42912
24317
120043
100975
107522
171507
219373
197761
130424
109627
73622
15354
lhcblcl
0
0
0
0
1
2
1
0
0
0
0
0
lhcbsgm
0
0
0
0
0
9
219
266
4
980
6801
1322
libi
0
0
0
0
0
0
0
0
0
1
0
0
llhcb
0
0
0
0
0
0
0
0
0
0
89
20
lt2-alice
0
0
0
0
0
0
0
0
0
1
0
1
lt2-atlas
0
0
0
0
0
0
0
0
0
1113
1918
151
lt2-biomed
0
0
0
0
0
0
0
0
0
1485
2071
573
lt2-cms
0
0
0
0
0
0
0
0
0
854
1051
194
lt2-dteam
0
0
0
0
0
0
0
0
0
441
375
105
lt2-dzero
0
0
0
0
0
0
0
0
0
35
191
0
lt2-geant4
0
0
0
0
0
0
0
0
0
2
1
0
lt2-ilc
0
0
0
0
0
0
0
0
0
191
38
32
lt2-lhcb
0
0
0
0
0
0
0
0
0
1092
1929
14
lt2-ltwo
0
0
0
0
0
0
0
0
0
42
1
6
lt2-pheno
0
0
0
0
0
0
0
0
0
0
19
0
lt2-zeus
0
0
0
0
0
0
0
0
0
0
97
0
ltwo
0
0
0
0
0
6
9
0
4
1
0
0
magic
64
2138
125
1185
66
0
1172
2690
4241
1984
236
0
marine
0
0
0
1
0
0
0
0
0
0
0
0
minos
0
0
0
0
0
0
0
0
2
6
35
0
mis
0
0
0
0
0
0
0
0
0
0
0
0
na48
0
0
0
21
16
13
278
7
60
31
1
0
ncf
0
0
0
0
0
0
0
0
0
0
0
0
nw_ru
0
0
0
0
5
0
66
0
82
76
2
5
ops
0
0
0
0
0
0
0
0
0
2
234
126
oxg
0
0
0
0
0
0
0
0
0
14
0
2
pamela
0
0
0
0
0
0
0
0
3
0
0
1
pdc
0
0
0
0
0
0
0
0
0
11
2
0
pheno
0
27
35
686
2428
5
8
0
18
0
86
13
phicos
0
0
0
0
43
0
0
0
0
0
0
0
photon
0
1
0
0
0
36
62
0
0
0
3
1
picard
0
0
0
0
0
0
0
0
0
30
3
0
planck
0
0
0
5
0
0
0
1
0
7
0
0
pvier
0
0
0
0
22
0
0
0
0
0
0
0
rdteam
76
0
0
0
0
0
0
0
0
0
0
0
rgstest
48
54
942
79
61
0
0
0
0
0
3
0
scailcg
0
0
15
18
0
1
4
123
18
7
0
0
see
3961
2382
157
96
153
370
895
1944
777
972
2180
215
seegrid
114
118
59
197
742
1051
7261
1751
8843
11904
9327
2915
sixt
0
0
0
0
0
0
0
0
0
0
0
0
skgrid
0
0
0
0
0
0
0
1555
63
14390
2166
0
solovo
0
67
237
0
0
170
0
0
0
36
0
0
spbprod
0
0
0
0
0
0
0
46
0
0
0
0
ssf
0
0
0
0
0
0
0
0
0
1
5
0
swetest
645
675
639
30
44
405
931
3737
2959
7480
6861
52
t2k
0
0
0
0
0
0
0
0
7
0
0
0
theophys
0
0
1
0
31
99
117
935
284
1103
208
97
trgrida
0
0
0
0
0
0
0
0
0
149
145
84
trgridb
0
0
0
0
0
0
0
0
0
6
40
26
trgridd
0
0
0
0
0
0
0
0
0
4
50
0
trgride
0
0
0
0
0
0
0
0
0
32
129
13
twgrid
0
0
0
0
2
0
0
378
770
26
114
38
UNKNOWN
0
0
0
0
0
0
0
0
0
0
26
0
usatlas
0
0
0
0
0
0
0
0
0
0
0
0
uscms
0
0
0
0
0
0
0
0
0
0
0
0
virgo
0
0
102
133
589
83
0
152
701
216
302
97
vldbi
0
0
0
0
68
0
0
0
0
0
0
0
vlefi
0
0
0
0
4
0
0
0
0
0
0
0
vlemed
0
0
0
0
0
0
0
0
0
0
0
0
vlibu
0
0
0
0
3
0
0
0
0
0
0
0
voce
302
97
245
215
1093
184
717
629
2096
2432
1455
912
webcom
5
4
3
116
587
1404
0
10
129
68
30
4
zeus
22726
11703
3265
7921
15389
4995
15785
7817
28481
715
5254
1875
zh
0
0
0
0
0
0
0
0
0
0
0
0
272895
267522
338665
321522
560631
697786
709307
911941
905286
951446
793449
112605
9096.5
8629.7419354839
10924.6774193548
10717.4
18084.8709677419
23259.5333333333
22880.8709677419
29417.4516129032
32331.6428571429
30691.8064516129
26448.3
22521
lcg
214527
207786
272230
258671
489825
619919
569127
753581
767051
781259
628317
86531
7150.9
6702.7741935484
8781.6129032258
8622.3666666667
15800.8064516129
20663.9666666667
18358.935483871
24309.064516129
27394.6785714286
25201.9032258065
20943.9
17306.2
others
1945.6
1926.9677419355
2143.064516129
2095.0333333333
2284.064516129
2595.5666666667
4521.935483871
5108.3870967742
4936.9642857143
5489.9032258064
5504.4
5214.8
Jun-05
9097
7151
Jul-05
8630
6703
Aug-05
10925
8782
Sep-05
10717
8622
Oct-05
18085
15801
Nov-05
23260
20664
Dec-05
22881
18359
Jan-06
29417
24309
Feb-06
32332
27395
Mar-06
30692
25202
Apr-06
26448
20944
May-06
22521
17306
instruments ever built to
Mont Blanc
(4810 m)
Downtown Geneva
EGEE-II INFSO-RI-031688
EGEE-II INFSO-RI-031688
The accelerator generates 40 million particle collisions (events) every second at the centre of each of the four experiments’ detectors
The LHC Accelerator
EGEE-II INFSO-RI-031688
LHC DATA
This is reduced by online computers that filter out a few hundred “good” events per sec.
Which are recorded on disk and magnetic tape
at 100-1,000 MegaBytes/sec ~15 PetaBytes per year
for all four experiments
EGEE-II INFSO-RI-031688
event filter
EGEE-II INFSO-RI-031688
grid infrastructures ….
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Example: HEP
Preparing for LHC start-up
Emphasis on providing a service
Computing needs of experiments
E.g. LHCb: ~700 CPU years in 2005 on the EGEE infrastructure
E.g. ATLAS: over 10,000 jobs per day
ATLAS
LHCb
ATLAS
EGEE - A Large-scale Production Grid Infrastructure
LHC = Large Hadron Collider http://lhc.web.cern.ch/lhc/
The Large Hadron Collider (LHC)  is being built in a circular tunnel 27 km in circumference, 50 to 175 m underground, at CERN near Geneva, Switzerland.
It is designed to collide two counter rotating beams of protons or heavy ions. Proton-proton collisions are foreseen at an energy of 7 TeV per beam with a planned start-up in 2007.
The beams will be stored at high energy for hours. During this time collisions take place inside the four main LHC experiments.
LHC Experiments:
ALICE http://aliceinfo.cern.ch/
Dedicated heavy-ion detector to exploit the unique physics potential of nucleus-nucleus interactions at LHC energies. ALICE will study the physics of strongly interacting matter at extreme energy densities, where the formation of a new phase of matter, the quark-gluon plasma, is expected.
ATLAS http://atlas.web.cern.ch/Atlas/index.html
ATLAS will explore the fundamental nature of matter and the basic forces that shape the universe in the particle collisions of the LHC. The debris of the collisions reveal fundamental particle processes. The energy density in these high energy collisions is similar to the particle collision energy in the early universe less than a billionth of a second after the Big Bang.
CMS http://cmsinfo.cern.ch/outreach/
The CMS magnet will be the largest solenoid ever built, providing a magnetic field of 4 Tesla, to detect muons (elementary particles). CMS will explore physics at the TeV scale, try do discover the Higgs boson, look for evidence of supersymmetry, and be able to study aspects of heavy ion collisions.
LHCb http://lhcb-public.web.cern.ch/lhcb-public/
LHCb will obtain precise measurements of CP violation – to understand why there is more matter than antimatter in the universe.
LHC Service Challenges: http://lcg.web.cern.ch/LCG/PEB/Planning/deployment/Grid%20Deployment%20Schedule.htm
0.00214091
0.0069607899
0.0057056899
0.000009
0.00000568
0.0017521
0.0077871999
0.0255217797
0.00000271
0.0014767
0.0306849297
0.00002683
0.0075596499
0.00106039
0.0070514099
0.00281041
0.0135743499
0.00031858
0.1096024289
0.0052848399
0.0067615599
0.00011906
0.1319628487
0.00385136
0.002423
0.00282398
0.00102915
0.0051505199
0.00009788
0.0055125399
0.00476466
0.00030862
0.0170788698
0.00073245
0.0104690799
0.00348635
0.00225919
0.00170558
0.00055654
0.0116963799
0.00001224
0.00088264
0.00022263
0.0124450299
0.0414257796
0.0007595
0.00033438
0.0079210099
0.0089140499
0.00287364
0.00472434
0.0143603599
0.0679600093
0.0156931398
0.00284529
0.0076966599
0.00069495
0.0514004095
0.0046485
0.00174955
0.0121413799
0.0236628898
0.00277535
0.0204131698
0.00120543
0.0640676794
0.0093765199
0.095175029
0.0216801998
0.0067504199
0.00094277
0.0145504399
0.00342746
0.00105311
0.0145532199
0.0185348298
Sheet1
Emerging diseases know no frontiers. Time is a critical factor
Avian influenza:
human casualties
Early detection
Epidemiological watch
EGEE-II INFSO-RI-031688
WISDOM focuses on drug discovery for neglected and emerging diseases.
Summer 2005: World-wide In Silico Docking On Malaria
46 million ligands docked in 6 weeks
~1 million virtual ligands selected
1TB of data produced
Spring 2006: drug design against H5N1 neuraminidase involved in virus propagation
impact of selected point mutations on the efficiency of existing drugs
identification of new potential drugs acting on mutated N1
N1
H5
EGEE-II INFSO-RI-031688
300,000 Chemical compounds:
Data challenge on EGEE,
In vitro
EGEE-II INFSO-RI-031688
Example: Pharmacokinetis
A lesion is detected in an MRI study of a patient
– start with virtual biopsy
The process requires obtaining
breath-holds.
Before analyzing the variation of each voxel, images must be co-registered to minimize deformation due to different breath holds.
The total computational cost of a clinical trial of 20 patients is around 100 CPU days.
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Example: Determining
earthquake mechanisms
magnitude, mechanism
10 times faster on the Grid than on local computers
Results
Different location (different part of fault line further south)
Different mechanism
Peru, June 23, 2001
EGEE-II INFSO-RI-031688
Permanent or
periodically updated
EGEE-II INFSO-RI-031688
ITU-BR developed a system for RRC 2006
Run compatibility and
Provide more CPU power
Gain experience on how to access
large and reliable computing resources
‘on demand’
EGEE used a subset of its Grid for RRC 2006
Over 400 PCs
Compatibility analysis < 1h
EGEE-II INFSO-RI-031688
Dissemination and outreach
Training and education
Increasing the number of applications by improving application support and middleware functionality
Improved usability through high level grid middleware extensions
Increasing the grid infrastructure
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
summer schools across many countries
>3000 people trained
Material archive online with ~250 presentations
Public and technical websites
4 conferences organized (~ 460 @ Pisa)
Next conference: September 2006 in Geneva ~600 participants
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Links related industry projects (NESSI, BEinGRID, …)
Works with EGEE’s Technical Coordination Group
Collaboration with CERN openlab project
IT industry partnerships for hardware and software
development
Industry Forum
Organises industry events and disseminates grid information
e.g. this Wednesday here at the school
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Dissemination and outreach
Training and education
Increasing the number of applications by improving application support and middleware functionality
Improved usability through high level grid middleware extensions
Increasing the grid infrastructure
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Platform
Infrastructure
Unix
Windows
JVM
TCP/IP
MPI
EGEE-II INFSO-RI-031688
EGEE-II INFSO-RI-031688
Example: Biomedicine
Parallel simulation
EGEE-II INFSO-RI-031688
EGEE-II INFSO-RI-031688
Scientific Visualization
Sony PSP – PlayStation Portable
EGEE-II INFSO-RI-031688
Not only portals
Portals are a good way to bring computing power to end-users
In most cases domain specific
Application programmers (and portal programmers) need more powerful interfaces
Workflow engines
Programming environments (gEclipse)
EGEE-II INFSO-RI-031688
Dissemination and outreach
Training and education
Increasing the number of applications by improving application support and middleware functionality
Improved usability through high level grid middleware extensions
Increasing the grid infrastructure
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
EGEE-II INFSO-RI-031688
Related Infrastructures
EGEE-II INFSO-RI-031688
Dissemination and outreach
Training and education
Increasing the number of applications by improving application support and middleware functionality
Improved usability through high level grid middleware extensions
Increasing the grid infrastructure
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Maintain Europe’s leading position in global science Grids
Ensure a reliable and adaptive support for all sciences
Independent of project funding cycles
Modelled on success of GÉANT
Infrastructure managed centrally in collaboration
with national bodies (in EGEE-II: JRUs)
EGEE - A Large-scale Production Grid Infrastructure
Expand the idea and problems of the JRU
EGEE-II INFSO-RI-031688
Sample of National Grid projects:
Austrian Grid Initiative
OMII; GridPP
Average of 180 M€ per year since 2002 (national + EC)
EGEE-II INFSO-RI-031688
EGEE-II INFSO-RI-031688
Grids represent a powerful new tool for science
Today we have a window of opportunity to move grids from research prototypes to permanent production systems (as networks did a few years ago)
EGEE offers …
… a mechanism for linking together people, resources and data of many scientific community
… a basic set of middleware for gridfying applications with documentation, training and support
… regular forums for linking with grid experts, other communities and industry
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Summary
Success will lead to the adoption of grids as the main computing infrastructure for science
If we succeed then the potential return to international scientific communities will be enormous and open the path for commercial and industrial applications
EGEE - A Large-scale Production Grid Infrastructure
EGEE-II INFSO-RI-031688
Demos
25-29 September 2006
Data sources
Hydrological simulation
Hydraulic simulation