the osg open facility: an on-ramp for opportunistic ... · the osg open facility: an on-ramp for...

14
The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab) Rob Gardner (Chicago) Mats Rynge (ISI/USC) Frank Wuerthwein (UCSD) CHEP 2016 San Francisco October 10, 2016

Upload: others

Post on 29-May-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

The OSG Open Facility: An on-ramp for opportunistic scientific computing

Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)Rob Gardner (Chicago)Mats Rynge (ISI/USC)

Frank Wuerthwein (UCSD)

CHEP 2016San Francisco

October 10, 2016

Page 2: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

The Open Science Grid

2

• Goal: Advance science through open distributed high throughput computing (DHTC) in the US−Operates as a partnership between resource providers (sites) and stakeholders (scientific

communities of users)−OSG project provides middleware, services, and support

Advancing the state of the art in High Throughput Computing−Over 120 sites (computing and storage elements)▪Mix of university and national labs; LHC experiment computing sites and campus clusters

− Uses the Virtual Organization (VO) model ▪Most VOs correspond to large HEP experiments and university communities

• Funded by the US Dept. of Energy and National Science Foundation− Second 5 year period of support

Page 3: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Over 1 billion served

3

Over the last 12 months 150 Million jobs consumed 1.2 Billion hours of computing involving 1.9 Billion data transfers to move 218 Petabytes

This aggregate was accomplished by

federating 129 clusters that contributed 1h to 100M hours each

In the past 30 days 113 Million Core hours

http://display.grid.iu.edu

1-Oct-2016

Page 4: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Growth of the OSG

4

Past 12 months: Averaging 100M hours/month

Page 5: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Reaching beyond HEP

5

ATLAS

CMS

other,physics

life,sciences

other,sciences 2015

Science other than the LHC makes up ~34% of OSG hours Science other than physics makes up ~20% of OSG hours

Page 6: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Opportunistic computing on the OSG

• Considerable effort to extract opportunistic resources across OSG− Nearly 200M hours in 2015• Most are available through the OSG VO− Allows any researcher from US institutions− Accessible through multiple mechanisms (including XSEDE

allocation)−Most OSG sites support the OSG VO• Despite LHC Run 2 demand, opportunistic OSG resources

continue to grow− In part due to university clusters not affiliated with HEP

experiments offering opportunistic resources

6

OSG VO Growth 2011-2015

Wal

l Hou

rs

0E+00

4E+07

7E+07

1E+08

1E+08

Year2011 2012 2013 2014 2015

400,568 3,168,025

47,931,106

97,009,409

125,607,605

ATLAS CMS Others Since start of LHC Run2

Page 7: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Access to the Open Facility

7

OSG-XD login node [Indiana]

OSG Connect login node [Chicago] OSG VO

Flocking (frontend)

OSG DHTC Fabric: >100 sites

Campus OSGConnect clients (e.g. Duke)

Campus login/submit hosts (e.g. MIT)

GlideinWMS factory [UCSD]

XSEDE allocations on OSG

Legacy OSG direct

OSG Connect users

Researcher

OSG operated

Campus/lab operated

Stash (data)

CVMFS (software)

Page 8: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

From big science…

8

20M hours since 10/2015

16M hours since 3/2016

35M hours in 2015

8M hours in 2015

Page 9: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

… to individual researchers

9

Optical data communication and compression David Mitchell (NM State) • Important for digital space/

satellite communication • Simulate whole system

(transmitter, decoder, receiver)

Randomness in evolution Joshua Plotkin (Penn State) • Understand evolution at

molecular scale in DNA • Uses a combination of

mathematical modeling and simulation

Gene interaction graph construction Alex Feltus (Clemson) • Discovery of gene sets

underlying complex traits in organisms

• Applied to agricultural genomics to find traits that may accelerate crop growth

Study of Z2 gauge theory coupled to fermionic fields Snir Gazit (UC Berkeley) • Quantum Monte Carlo to

study confinement transitions

Just a few of the researchers who use 1M+ hours annually on the OSG!

Rice

Maize

COMPLEX GENETIC SYSTEMS WOVEN INTO GRAPHS

TECHNOLOGY: GPU Alignment Algorithm Optimization Seconds to align with GPUs (not 1-3 days); (Feltus/M. Smith/K. Sapra)

BIOLOGY: Conserved gene interaction topology.

Page 10: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

… to individual researchers

10March 15th, 2016

Serving individuals & small groups �

8

AE8�bVMaRQ���?�4BG�V^dab�������

96 projects across 81 institutions Most prolific individual consumed

~34M hours in 2015

Page 11: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

HPC resources on the OSG

• OSG jobs have run on two NSF HPC facilities

• LIGO utilized an allocation at TACC Stampede by OSG job submission− ~4M hours utilized in 2015− Stampede OSG “site” in testing for allocations for CMS and MINOS

experiments

• SDSC Comet access via its Virtual Cluster (VC) interface − Helped commission the VC interface− Delivered 3.5M hours in 2016− From users’ perspective “just another site” on the OSG

11

Page 12: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

New technologies and growth

• Solving the data movement conundrum: Stashcache− XRootD-based caching system for O(1TB) data movement − See B. Bockelman (#501) on Thursday• Easing the integration of campus clusters: HTCondorCE-Bosco− Simplified computing element layer (which can be administered/hosted offsite) needing only SSH

access to cluster− See D. Weitzel (#503) on Wednesday• Virtual computing centers using OSG technology− Single access point accessing campus clusters and OSG open facility − See C. Paus (#276) on Thursday• Site-in-a-box using OSG technology− Initial deployment at Univ. of California campuses for LHC T3s− See J. Dost (#269) on Thursday

12

Page 13: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Conclusions

• Opportunistic resources and their consumption continue to grow on the OSG− On track to delivering over 200M hours opportunistically in 2016

• A wider range of users, including large collaborations, use these resources

• A key to this growth is the expansion of available resources into university HPC clusters and large allocation-based HPC resources

• The OSG is committed to remaining one of the largest open gateways for scientific computing in the US− This is from O(1k) collaborations down to individual researchers− OSG resource providers remain committed to sharing

13

Page 14: The OSG Open Facility: An on-ramp for opportunistic ... · The OSG Open Facility: An on-ramp for opportunistic scientific computing Bo Jayatilaka, Tanya Levshina, Chander Sehgal (Fermilab)

Bo Jayatilaka October 10, 2016

Backup: Universities and national labs

14March 15th, 2016

Universities and National Labs �

9

DOE Labs: BNL, FNAL, SLAC, NERSC, LLNL

Universities with an LHC T2

Other

This includes large Campus clusters independent

of the LHC.

This includes Universities that have no LHC involvement