sebasen goasguen –sebgoa@clemson - fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf ·...

59
August 20 th , Fermi Lab 1 Sebas&en Goasguen –[email protected] School of Compu,ng Clemson University, Clemson, SC Scien,fic Associate at CERN Summer 2009 and Summer 2010

Upload: duongthien

Post on 28-Apr-2018

220 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 1

Sebas&enGoasguen–[email protected]

SchoolofCompu,ng

ClemsonUniversity,Clemson,SC

Scien,ficAssociateatCERN

Summer2009andSummer2010

Page 2: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 2

Outline

•  CloudBasics•  BuildingaCloudProvider

– Lxcloud@CERN

•  VOCsandClouds– ResearchdoneatClemson

Page 3: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 3

WhatisCloudCompu&ng?

Page 4: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 4

Afewreferences

"Above the clouds: A Berkeley view of cloud computing"http://berkeleyclouds.blogspot.com/

"A break in the clouds: towards a cloud definition"L.M Vaquero et al. SIGCOMM computer communication review, 2008. http://portal.acm.org/citation.cfm?id=1496100

"An EGEE Comparative Study - Grid cloud comparative study"M-Elian Begin, 2009

Page 5: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 5

OntheHypecurve

• NowprobablyatthetopoftheHype–Oct09

Page 6: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 6

Trendy…

• Source:hOp://www.google.com/trends

Page 7: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 7

Cloudforma&on

• SlideadaptedfromRichWolski,UCSB

Page 8: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 8

WhiteHouseisgoingtotheCloud• Reducecosts…SeeApps.gov

Page 9: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 9

DOEandNASAtoo(Checknovacc.org)

Page 10: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 10

EverythingismovingtotheCloud…StayonEarththough!

• hOp://contactdubai.com/tag/saas‐soTware‐or‐storage‐as‐a‐service

Page 11: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 11

An“Old”idea:OSI/AnatomyoftheGrid/Windowsarchitectures…

Page 12: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 12

WhatistheCloud?The*aaS

•  SaaS–SoVwareasaService‐•  PaaS–PlaYormasaService‐

•  Iaas–InfrastructureasaService–•  Servicecomposi,onatalllayersofdistributedsystem.Buildsasystemofsystem

•  SoTwareandhardwarereuse

•  Tendencyforthe*aaS‐itusbutthesethreearethemainones

Page 13: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 13

SoVwareasaService

Page 14: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 14

Skyisthelimit…

•  Phoneapps…FermiVoice?

Page 15: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 15

PlaYormasaService

Page 16: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 16

InfrastructureasaService/Comingofageofvirtualiza&on

Page 17: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 17

WhatistheCloud?The*aaS•  SaaS–SoVwareasaService–

– EasyAccesstohostedapplica,onsoverthenetwork.MostlikelyusingyourBrowser

– APItotheseapplica,ons•  PaaS–PlaYormasaService–

– Environmenttodeploynewapplica,ons– Restrictedcapabili,esoffered– APItothispla]ormandaccesstoSaaSAPI

•  Iaas–InfrastructureasaService–– AccesstoHardwareresources– APItomakeresourcealloca,onrequests

Page 18: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 18

KeyFeatures•  Youdon’tknowwhat’sbehindbutitworks

– Transparency•  YouPaywhatyouuse

– U,litypricing•  Yougetwhatyouaskfor(On‐demand)

– ReadthefineprintsoftheSLAs…•  Itscalesifyouneedmore

– Howfardoesitscale?– Doesn’tthismeantheunderlyingresourcesareunderu,lized?

Page 19: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 19

Whynow?Evolu&onoftheMashupRevolu&onthanks

toanAPI“explosion”

Page 20: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 20

Whynow?•  BigInternetcompaniesfacedalotofdatatoanalyze:weblogs…

•  Developedinhouse:Newfilesystem(Hadoop),newanalysisframework(Map‐Reduce)

•  Massiveamountofresourcesallacrosstheplanet:>500,000coresforGoogle?

•  Higherneedstoconsolidate:virtualiza,on,energycosts.

•  Newdevices:iPhone/G1•  Atrulyinter‐connectedplanet

Page 21: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 21

Afewinteres&ngthings…tos&rthepot•  Industryisleading.Isacademiabehind?

•  Whocaresaboutstandards?(>20bodiesworkingoncloudstandards…)

•  Weshouldswitchparadigmandrewriteapplica,onsoncetheyare6monthsold.

Page 22: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 22

Outline

•  CloudBasics•  BuildingaCloudProvider

– Lxcloud@CERN(Incollabora&onwithUlrichSchwickerath,EwanRoche,BelmiroMoreiraandRomainWartel)

•  VOCsandClouds– ResearchdoneatClemson

Page 23: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 23

IaaSlevel•  Forconsolida,ngservices

– UsedinITforawhilenow– FermiGridservicesrunninginXenVMs

•  Forofferingon‐demandservices– E.gVOBoxes,replacehardwarerequest

•  Forvirtualizinglargescaleservices– Clusterson‐demand

•  Virtualiza,onisakeyenablerforIaaS

Page 24: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 24

BatchVirtualiza,on

  RunbatchjobswithinVirtualMachines   BeOerapplica,onenvironment

  Custommadebyuser   Increasedsecurity   BeOercontrolonresourcesharing

  Mul,‐coreapps   Increasedflexibilityontheadminside

  CanrunapreferredOSonthemetal

Whyvirtualizing“Batch”?

Page 25: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 25

BatchVirtualiza,on

Type1:Runmyjobs(inyourVM)

Type2:RunmyjobsinmyVM

Howtovirtualize“Batch”…smoothly?

Page 26: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 26

Type3:Givememyinfrastructurei.eaVMorabatchofVMs

Movingtothecloud:

Page 27: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 27 3/23

Deployment Models Innovation in Cloud Computing Architectures

Page 28: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 28

Maincomponents/characteris&csSetofHypervisors•  Physicalmachineswithavirtualmachinemonitor•  XenorKVM...orHyper‐V...orVMwareESx...VMprovisioningsystem•  OpenNebula•  Nimbus•  Eucalyptus•  Pla]ormISF•  oreventradi,onalschedulerslikePBS/Maui.Imagedistribu&onmechanism•  Sharedfilesystem(e.gNFS,AFS,PVFS,Lustre...)•  Copyimages(e.gscp,wget,BiOorent)Networking•  Private/Publicbridged•  NAT

Page 29: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 29

ThoughtsforOSG…tos&rthepotagain•  Sitesneedtohavehypervisors,that’sastar,ngpoint.Withoutit/themtherewon’tbeOSGclouds.

•  WhatVMM/HypervisortheyusedoesnotmaOer…butmyguessisthat80%willuseKVM

•  WhatprovisioningsystemtheyuseisamaOeroflocaltechnicalsetup,tasteandrela,onships

•  Sitescandothisnow•  Thehardproblemisintheimagetransferandtrust…SeeHEPiXvirtualiza,onworkinggroup

Page 30: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 30

CERN'sLXCLOUDarchitecture

•  ImagerepositorywithGoldennodes.

•  VMinstancesnotquaOormanagedhavefinitelife,me

•  SpecificIP/MACsarepinnedtohypervisors

•  Currentlytes,ngtwoprovisioningsystem:OpennebulaandPla]ormISF.

Page 31: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 31

Page 32: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 32

ProvisioningsystemOpenNebulaandPla]ormISFarecurrentlybeingevaluated.ResultsshowninthistalkwereobtainedwithOpenNebula.

OpenNebulaoutoftheUniversityCompultenseofMadrid•  C/C++corewithRubydriversandcommandlineinterface•  MysqlandSqlitebackends•  Usesshascommunica,onbetweenfrontendandhosts•  XML‐RPCAPI•  •  SupportforLVMcontributedbyCERN•  EnablesHybridclouds(i.einstan,a,ononremotecloudproviders)

•  ImplementssubsetofEC2interfaceaswellasupcomingOCCIinterfaceforPubliccloudinterface.

Page 33: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 33

ComparisonwithSimilarTechnologiesOpenNebula - Architecture, Current Status & Roadmap

Platform ISF VMware Vsphere Eucalyptus Nimbus OpenNebula

Virtualization Management VMware, Xen VMware Xen, KVM Xen Xen, KVM,

VMware

Virtual Network Management Yes Yes No Yes Yes

Image Management Yes Yes Yes Yes Yes

Service Contextualization No No No Yes Yes

Scheduling Yes Yes No No Yes

Administration Interface Yes Yes No No Yes

Hybrid Cloud Computing No No No No Yes

Cloud Interfaces No vCloud EC2 WSRF, EC2 EC2 QueryOGF OCCI

vCloud

Flexibility and Extensibility Yes No Yes Yes Yes

Open Source No No GPL Apache Apache

Page 34: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 34

CERN'sLXCLOUDdetails

•  AlogscpandbiOorrentimagedistribu,onhasbeenimplemented

•  Hypervisorsrunu,li,estodetectwhatVMtheyareallowedtorunandwhichimagestheyneedtodownload

•  OpenNebulatriggersinstan,a,onviassh

•  InstancesbasedonLVMsnapshots

Page 35: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 35

Page 36: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 36

ImageDistribu&on

Push:•  Sequen,alSCP•  logarithmicSCP(scp‐wave)•  hOp://code.google.com/p/scp‐wave/

Pull:•  wgetviaanhOpbasedrepository(locally)•  BiOorrent(RomainWartel,BelmiroMoreira@CERN)

SharedFS•  NFS•  PVFS,Lustre...

Page 37: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 37

Imagedistribu&onresults(thxtoBelmiro)

Page 38: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 38

Guidingtheprovisioning

•  Definepoliciestocomposethebatchfarm•  Automatetheprovisioningofthevirtualmachinessuchthatthepoliciesareenforced.

•  e.gInspectthejobqueueanddeducethebestcomposi,onofthebatchfarm.IntermsofSMPVMs,OS...

•  AsizerisusedtomonitorthepoolofVMinstancesandevaluatethepolicies.

•  Currentlyonlyonepolicy:"KeepthepoolfullwiththepropersharesofVMtypes"

•  SeeICAC2010andCCGRID2009papers

Page 39: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 39

AutonomicProvisioningResults

Page 40: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 40

EarlyResultsofsizer

Page 41: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 41

JoiningtheBatchsystem...Acontextualiza=onproblem...

CONTEXT = [ vmid = "$VMID", TTL = "3", AFS = "off", files = "/opt/vmimage/init.sh /opt/vmimage/etchosts /opt/vmimage/etcsysconfigifcfg /opt/vmimage/id_rsa.pub /o pt/vmimage/lsfcontext.conf /opt/vmimage/etcsysconfignetwork", target = "xvdb" ]

• FilesandvariablesarestoredinaISOcreatedonthefly.

• StartupscriptmountsthisISOandrunscontextualiza,onscript.

• VMsaresetupasdynamichostsintheLSFpool.

Page 42: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 42

ScalabilityTests...7,500slotsinLSFviaOpennebula

Page 43: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 43

Tes&ngLSFscalability

Page 44: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 44

IaaSatClemson•  Thereallyeasyway:

– KVMonaregularHPCcluster

– NATnetworking(everyVMgetsitsownNAT)

– BaseimageonNFSserver– KVMsnapshotmodecreatestemporarydiskinscratch,diskdiscardedonceinstanceisshutdown

– SubmitVMsasPBSjobs

IMAGE=/home/sebgoa/kvm/star5.img

export TMPDIR=/local_scratch

kvm -hda $IMAGE -net nic,model=e1000 -net user -m 1280 -snapshot -nographic;

Page 45: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 45

IaaSatClemson

•  But…– NosharedFSbetweenVMs

– LookslikeeachVMhasthesameIP

– Can’tuseregularjobmanagementsystemstorunjobsinthoseVMs(needglidein/proxylikesolu,on)

•  ThissetuphasbeenoneofthekeydriversforourdevelopmentofKestrel:AnXMPPbasedjobmanagementsystem

Page 46: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 46

Kestrel

•  AjobmanagementframeworkusingtheXMPPprotocol

•  Startedasastudentproject

•  UsesInstantMessagingconceptsofno,fica,ons

•  Prac,calinadversenetworkcondi,ons

hop://wiki.github.com/legastero/Kestrel/hops://twiki.grid.iu.edu/bin/view/CampusGrids/InstallingKestrel

Page 47: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 47

Boo&ngVMsisextremelyfast(20VMs/sec)

Page 48: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 48

STARSuccesswithClemsonIaaSandKestrel

•  “Buttosimulatetheequivalentsampleof12.2BillionMonte‐Carloeventswith~10MillionacceptedbyeventtriggeringaTerfulleventreconstruc,on,wewouldhavetaken3yearsatBNLon50machinesThisMonte‐Carloeventgenera,onwouldessen,allynothavebeendone.Withtheresourcesfromcloud,wetook3‐4weeks.”–JeromeLauretBNL.

Page 49: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 49

Conclusions•  TheCloudisherelet’shopeitgetssunny•  APIexplosionopensuppossibili,es•  FocusingonIaaSlayers,LXCLOUDandClemson’sclustershavebeendeveloped/enhancedtoprovisionVMs.

•  GreatscalabilitywithOpenNebula•  KVMshowsgreatpromiseespeciallywiththesnapshotmode

•  PerformancewillgetevenbeOer• MayneedspecializedjobmanagementsystemstomakeuseofCloudsacrossmul,‐site

Page 50: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 50

ThankstoNSF,DOEandOSGThankstoLanceStout,MikeMurphy,

MichaelFenn,LintonAbrahamandalltheotherstudents…

ThankstoCERNandtheIT/PES‐PSgroupThankstoJeromeLauret,MaohewWalker

Ques&ons?:[email protected]://cirg.cs.clemson.edu

Page 51: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 51

Outline

•  CloudBasics•  BuildingaCloudProvider

– Lxcloud@CERN(Incollabora,onwithUlrichSchwickerath,EwanRoche,BelmiroMoreiraandRomainWartel)

•  VOCsandClouds– ResearchdoneatClemson

Page 52: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 52

VOC:VirtualOrganiza&onCluster(JGC+FGCSpapers)

Page 53: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 53

WhyVOCsakaClouds?•  Observa,onthatwhatpeoplewantisresourceswiththeirownOS/Appsandcentralscheduling:Pilotjobframeworks.

•  AcloudisaclusteroverWAN

•  Thereforethereisaneedfor– Awaytorequest/startthenodes– Awaytocreateavirtualnetwork– Awaytorunjobsinthem

•  VerysimilartoglideinWMSbutthepilotsasktostartVMs

Page 54: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 54

Mul&‐SiteOverlay(ICAC2010)

Page 55: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 55

VOCImplementa&on

•  Mul,pleconfigura,ons:– Type1:SharedheadnodeonPhysicalcluster,VOisunawareofVOC(e.gLXCLOUD)

– Type2:VOprovidesvirtualheadnodesonmul,plegridsites.

– Type3:VOusesanoverlaynetworkwithasingleheadnode(e.gSTAR).

Page 56: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 56

Type1:Implementa&on

•  KVMvs.Xenforeaseofuse

•  NormalClusteru,li,es/techniques

•  NFSshare•  AndPVFSsetup

•  KVMoffersasnapshotmodethatgivesusabilitytouseasingleimagefile.Writesaretemporary

Page 57: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 57

Load‐DrivenProvisioning(CCGRID09)•  DynamicProvisioningisdoneviatheuseofaWatchdogontheVOCheadnode

•  WatchdogmonitorsincomingjobsontheOSGgatekeeper(Condorjobmanagerisused)

•  Whenjobsareinthelocalschedulerqueue,thewatchdogstartsaVMonaphysicalhost(sta,cmappingbetweenhostandguestcurrently).XML‐RPCsystem

•  WhenVMstarts,CondorinsidetheVMstartsandadver,zesitspresencetothecentralmanager‐>Jobsrun.

Page 58: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 58

ExperimentalResults•  EngageVOonOSG•  SiteClemson‐BirdnestonOSGProduc,on

•  Clustersizerespondstoload,Simula,onResultsconfirm(PendingIPDPSpaper,simulator:simVOCavailableathOp://cirg.cs.clemson.edu/soTware/simvoc)

Page 59: Sebasen Goasguen –sebgoa@clemson - Fermilabcd-docdb.fnal.gov/0040/004050/001/cloud-fermi.pdf · Sebasen Goasguen ... (Hadoop), new analysis framework (Map ... Thanks to CERN and

August 20th, Fermi Lab 59

From: ACAT 2010, February 22-27th Jaipur/India

Engage VO on OSG