big data from the lhc commissioning: practical lessons from big science - simon metson (cloudant)

69
Big Data from the LHC Commissioning Practical Lessons from Big Science Simon/@drsm79

Upload: jaxlondonconference

Post on 15-Jul-2015

534 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Big Data from the LHC Commissioning

!

Practical Lessons from Big Science

Simon/@drsm79

Page 2: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Hello!

Page 3: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)
Page 4: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)
Page 5: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)
Page 6: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Bristol University Cloudant

Time at places I’ve worked

Page 7: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

0

25

50

75

100

2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013

Python Perl Bash C++ Java Javascript Fortran

Page 8: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The formula

G * E

Page 9: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The formulaFixed

Fixed Usually fixed

G* E

Page 10: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The formula

Grant * Effectiveness

Page 11: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC data1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Page 12: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC data1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Page 13: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

D i g b i g t u n n e l s

Page 14: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

C h a i n u p s e r i e s o f “ a t o m s m a s h e r s ”

Page 15: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

P u t s e n s i t i v e c a m e r a s i n a w k w a r d p l a c e s

Page 16: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

R e c o r d e v e n t s

Page 17: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Process data on high end machines

http://www.chilton-computing.org.uk

Page 18: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC data1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Page 19: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

CMS online data flow

We have a big digital camera

Page 20: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

It takes photos of this

courtesy of James Jackson

Page 21: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

which come out like this

courtesy of James Jackson

Page 22: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

CMS online data flow

We have a big digital camera

Which goes into lots of computers (the HLT)

Page 23: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

CMS online data flow

We have a big digital camera

Which goes into lots of computers (the HLT)

Which goes into lots of disk (the Storage Manager)

Page 24: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

CMS data flow

We have a big digital camera

Which goes into lots of computers (the HLT)

Which goes into lots of disk (the Storage Manager)

Write to HLT at ~200GB/s

Write to Storage Manager at ~2GB/s

Write to T0 at ~2GB/s

Page 25: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC data1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Page 26: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

1 0 P B o f d a t a / y e a r

Page 27: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC data1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Page 28: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

1PB/week

Page 29: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Why transfer so much data?

Page 30: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

To process all the data taken in one year on one computer would take ~64,000 years

Page 31: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)
Page 32: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC data1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Page 33: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Analysis

• Each analysis is ~unique

• Query language is C++

• Runs on distributed system and local resources

• Series of “cut” selections to identify interesting events

• Data in the final plot may be substantially reduced from the original dataset

Page 34: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Workflow ladderLarge datasets (>100 TB) Complex computation

Private datasets (0.1-10 GB) Simple computation

Work on laptop/desktop machine, store resulting datasets to Grid storage

Use Grid compute and storage exclusively

Shared datasets (0.1-10 GB) Simple computation

Large datasets (>100 TB) Simple computation

Shared datasets (10-100 GB) Simple computation

Work on departmental resources, store resulting datasets to Grid storage

Shared datasets (10-500 GB) Complex computation

Shared datasets (>500 GB) Complex computation

}}}

Number of users

Page 35: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The life of LHC simulated data

1. Simulated by experimentalists at T0/T1/T2 sites

2. Transferred to T1 sites, archived possibly reconstructed & skimmed

3. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

4. Written into locally analysable files, put on laptops

5. Turned into a plot in a paper

Page 36: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Most events get cut

Page 37: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

!“We are going to die, and that makes us the lucky ones. Most people are never going to die because they are never going to be born.”

!- Richard Dawkins

Page 38: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)
Page 39: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Adoption & Use

Page 40: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Setup

• Maybe a bit different to other people

• Many sites (>100) with >100’s TB storage, 10000’s worker nodes

• Global system

• Why not at one site?

• politics, power budget, cost

Page 41: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The grid

Page 42: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

We Have a “Big Data” Problem

Page 43: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

We Have a Big “Data Problem”

Page 44: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Do what you do best, out source the rest

Page 45: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

What's interesting is that big data isn't

interesting any more

Page 46: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

NIH

Page 47: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Define and refine workflows

Page 48: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Our situation

• Expert users, who are not interested in infrastructure

• Will work around things they perceive as unnecessary limitations

Page 49: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Disruptive users

Page 50: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

How to engage disruptive users?

Page 51: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Open access

Page 52: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

1PB/week

Page 53: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Open access

Page 54: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Our situation

• Limited resources for integration/testbed style activities

• Strange organisation

Page 55: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Data temperature

Page 56: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

There is no such thing as now

Page 57: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Keep things as local as possible

Page 58: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Defining monitoring is difficult

Page 59: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Small files are bad, m'kay

Page 60: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Compartmentalise metadata

Page 61: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Recognise, embrace and communicate failures

Page 62: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

People are harder than computers

Page 63: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

People are important

Page 64: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

The formula

���64

Page 65: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Consequences

• Automate all the things

• Learn to love a configuration management system

• Make sure everyone in the team knows how to interact with it

• Simple human solutions go a long way

Page 66: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Build good abstractions

Page 67: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Encourage collaboration

Page 68: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Workflow ladderLarge datasets (>100 TB) Complex computation

Private datasets (0.1-10 GB) Simple computation

Work on laptop/desktop machine, store resulting datasets to Grid storage

Use Grid compute and storage exclusively

Shared datasets (0.1-10 GB) Simple computation

Large datasets (>100 TB) Simple computation

Shared datasets (10-100 GB) Simple computation

Work on departmental resources, store resulting datasets to Grid storage

Shared datasets (10-500 GB) Complex computation

Shared datasets (>500 GB) Complex computation

}}}

Number of users

Page 69: Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Summary