Transcript
Page 1: Cloud Computing and Challenges

StratusLab is co-funded by theEuropean Community’s Seventh

Framework Programme (Capacities)Grant Agreement INFSO-RI-261552

Cloud Computing and Challenges

C. Loomis (CNRS/LAL)Seminar (RAL)

13 October 2010

Page 2: Cloud Computing and Challenges

2

Biography

High-Energy Physicist (<2001) E735 (Duke), Fermilab Tevatron CDF (Rutgers), Fermilab Tevatron ALEPH (CERN), LEP ATLAS (UCSC), LHC

Grid & Cloud Computing (2001—) European DataGrid: Integration and deployment EGEE, EGEE-II, EGEE-III: Coordinator for user-facing activities Quattor: Principal developer for pan language compiler StratusLab: Project Coordinator

SixSq Sàrl (2007—) Founded in 2007 by 3 partners, based in Geneva Provides consulting and tools for automated systems testing

Page 3: Cloud Computing and Challenges

3

Contents

Cloud Technology

StratusLab Project Vision and Benefits Early Results

Challenges Meeting Expectations Sharing Trust

Conclusions

Page 4: Cloud Computing and Challenges

4

“The” Cloud

Cloud is the convergence of several ideas: Maturity of virtualization technologies Appearance of simplified APIs (REST, XMLRPC, …) Excess of commercial computing resources

Cloud marketing hype Many different, often

incompatible definitions Used to sell existing

software/services Even so, interesting,

useful ideas at the core

Page 5: Cloud Computing and Challenges

5

Cloud Taxonomy

Infrastructure as a Service (IaaS)

Platform as a Service (PaaS)

Software as a Service (SaaS)

Vend

or L

ock-

In

Flex

ibili

ty

Page 6: Cloud Computing and Challenges

6

Architecture Remote access to

virtual “hardware”

Advantages Customized execution

environment Instantly accessible via a simple

API Complete control,

pay-as-you-go

Disadvantages Non-standard interfaces, vendor

lock-in Creation of virtual machines is

difficult

Infrastructure as a Service (IaaS)

Infrastructure as a Service (IaaS)

Platform as a Service (PaaS)

Software as a Service (SaaS)

Page 7: Cloud Computing and Challenges

7

ArchitectureFramework and infra. for

web-based applications

AdvantagesLoad balancing,

redundant services, etc. Programmers free from

low-level “plumbing”

DisadvantagesRequires specific language

or development frameworkNon-portable applications

Platform as a Service (PaaS)

Infrastructure as a Service (IaaS)

Platform as a Service (PaaS)

Software as a Service (SaaS)

Page 8: Cloud Computing and Challenges

8

ArchitectureEssentially web “hosting”

AdvantagesVery simple use: no

software installation, interface web

Very accessible: laptops, smart phones, …

DisadvantagesQuestions about data

access, ownership, sustainability, etc.

Integration often difficult

Software as a Service (SaaS)

Infrastructure as a Service (IaaS)

Platform as a Service (PaaS)

Software as a Service (SaaS)

Page 9: Cloud Computing and Challenges

9

Grid Infrastructures

Architecture CPU: conceived as a large, distributed batch system Data: core services deal with files Network: essentially no real management of network resources

Advantages Uniform security model Sharing of resources, algorithms, and expertise through VOs

Disadvantages Tendency toward complexity (APIs, services, etc.) Porting of applications that are not batch-oriented is difficult Inhomogeneous environments increases job failure rates Impossible to deploy VO and user-level services

Page 10: Cloud Computing and Challenges

10

Complementary Technologies

Grid: Federate distributed resources through uniform interfaces

Uniform security model Sharing of resources, algorithms, expertise through VOs

Cloud: Instantaneous deployment of customized resources

Environment is dynamic, elastic, and customized Several levels of abstractions (IaaS, PaaS, et SaaS) Based strongly on virtualization technologies

Page 11: Cloud Computing and Challenges

11

StratusLab Project

Facts 1 June 2010—31 May 2012 (2 yrs.) Partners: 6 from 5 countries Budget: 3.3 M€ (2.3 M€ EC)

Goal Provide coherent, open-source

private cloud distribution

Contacts Website: http://stratuslab.eu/ Twitter: @StratusLab

CNRS (FR) UCM (ES)

GRNET (GR) SIXSQ (CH)

TID (ES) TCD (IE)

Page 12: Cloud Computing and Challenges

12

Grid Services Over Cloud Resources

Grid Resource Center

StratusLabDistribution

PrivateCloud

Cloud API

Grid Services

PublicClouds

users

Page 13: Cloud Computing and Challenges

13

Vital IaaS Features

CPU Virtual machines (appliances) created by users Repository of virtual machines to share software, expertise, …

Data Management The cloud must have mechanisms for efficient data management Minimum: file management, disk management

Network Dynamic control of inbound and outbound ports/connectivity Ability to have a public IP address associated with a machine

Existing virtual machine management tools: Nimbus, Eucalyptus, OpenNebula

Page 14: Cloud Computing and Challenges

14

Benefits

Ease of Deployment Prepared virtual machine images for grid services Decouple OS on physical systems from grid requirements

Increased Reliability/Robustness Migration for load balancing or for avoid hardware failures Better isolate user jobs/machines from others

Customized Environments Responsibility of environments rests with VOs Flexible environment appeals to more diverse scientific community

Page 15: Cloud Computing and Challenges

15

Integration Strategy

Agile Software Processes Short project, need to evolve quickly Focus on user and administrator requirements Planning meeting, 3 week sprint, Demo meeting, Mgt. meeting

Integration Over Development Small project with limited effort (15—20 people) Many existing services that can be integrated Only develop to fill gaps or to expand functionality

Open to Multiple Implementations Identify clean interfaces between components Standardize those interfaces where possible

Page 16: Cloud Computing and Challenges

16

Early Results

Appliance Repository at TCD Based on standard Apache web server Contains stock images for supported operating systems Will contain grid service images (“appliances”)

Complete Grid Sites at GRNET Two StratusLab clouds deployed: Ubuntu 10.04 and CentOS 5.5 Functional (pre-production) grid site on each cloud

Worker Node Tests at LAL Worker nodes deployed via Quattor in StratusLab cloud Run in production for >1 month without problems

Page 17: Cloud Computing and Challenges

17

User and Administrator Surveys

Surveys for Users and Administrators Collect use cases and requirements from target communities 248 and 608 unique views, 22 and 56 completed Analysis: D2.1 (http://stratuslab.eu/doku.php?id=deliverables)

User Feedback 33% of users are regular cloud users Most respondents from CS/eng., bioinformatics, and physics Most planned for “real” use within 2 years

Administrator Feedback 68% of administrators will deploy virtualization/cloud technologies Half have already deployed; remainder within 1 year Reluctance for administrators to trust user-generated VMs

Page 18: Cloud Computing and Challenges

18

Availability2-3 November 2010

ScopeBasic cloud distributionGenerate feedback from

users and administrators

WarningAll APIs, commands, etc.

are subject to change!

Documentation User tutorial Description of reference deployment Installation guide

Services Appliance repository User and administrator support

Software Repackaged/patched OpenNebula Command line tools (user and

administrator) Standard images

(ttylinux 9.4, Ubuntu 10.04, CentOS 5.5)

Web monitor Manual and Quattor-based

installation

Initial Public Release

Page 19: Cloud Computing and Challenges

19

Appliance Repository

Appliance Virtual machine created to provide a single service or a few tightly

coupled services Appliances can be used to exchange software and knowledge

Appliance repository Repository allows virtual machines to be found and shared Current implementation just a simple web server Contains base images, will have grid images as well

Future directions Worthwhile to use metadata schema like Dublin Core? Publish metadata via RDF for others to use? Federation of separate appliance repositories?

Page 20: Cloud Computing and Challenges

20

Site Management Tools

Quattor Use to demonstrate integration with site management tools Chosen because of partner experience and use within EGI Nearly complete automated installation with Quattor available

Virtual machine metadata Information about virtual machine contents critical (esp. when shared) Tools like Quattor can be used to document those virtual machines Can be used as mechanism for generating images as well

Page 21: Cloud Computing and Challenges

21

Meeting Expectations

Bare Metal Performance Virtualization tax near zero from technological improvements IO still issue, but also on physical resources as well!

Instantaneous (~10s) Availability of Machines Fast copies, caching of machine images Client site preparation of contextualization image, … Optimization of OS boot sequence

Fully Elastic Clouds Users expect infinite resources, machine room reality is different! How to prevent users from seeing a queue? Allow resource searching/matchmaking like the grid? Site-level, automated bartering through hybrid clouds?

Page 22: Cloud Computing and Challenges

22

Sharing Resources

Possible to bring grid’s collaborative spirit into the fundamentally “selfish” cloud paradigm?

Authentication/Authorization Mechanisms Reuse worldwide trust network and its technologies

Machine Images and Appliances Great way to share algorithms, software, packaged data, … Issues of transport, caching, and access control

Data Management Like grid, issues of access, persistency, versioning, … Additional issue of service state, logging, accounting, …

Page 23: Cloud Computing and Challenges

23

Building (More) Trust

Reluctance to Run User-Defined Images Realizing full potential of cloud infrastructures will require building

more trust between users, VOs, and system administrators Trust will vary with actors involved sliding scale that balances level of

trust with allowed capabilities

Requirements Define security requirements for images (HEPiX) Matchmaking: appliance metadata vs. site requirements Enhanced monitoring (e.g. ports and connections) Additional trust between firewall “controllers” and site administrators

Page 24: Cloud Computing and Challenges

24

Impact in Scientific Communities

No shoehorn needed! No need to port application to specific framework or to create adaptors for

different environments Sharing without requiring specific OSes, software, versions, …

Specialized platforms AND efficient sharing Platforms exist now with static deployments and allocations Reconfigurations impede sharing of underlying resources Cloud makes process more agile and dynamic

Greater accessibility Smaller groups can “outsource” IT deployment/maintenance Modest calculations done quickly: no procurement, installation

Greater responsibility for IT engineering & development

Page 25: Cloud Computing and Challenges

25

Conclusions

Grid and cloud technologies are complementary Easier deployment, maintenance of sites for administrators More flexible, capable, and dynamic infrastructure for users

StratusLab Early results: surveys, functioning grid services over cloud, appliance

repository Release: expected in a couple of weeks, accessible cloud

infrastructures for tests

Challenges for cloud community Meeting expectations of users and administrators Sharing easily the diversity of resources in communities Building more trust between actors

Page 26: Cloud Computing and Challenges

Copyright © 2010, Members of the StratusLab collaboration: Centre National de la Recherche Scientifique, Universidad Complutense de Madrid, Greek Research and Technology Network S.A., SixSq Sàrl, Telefónica Investigación y Desarrollo SA, and The Provost Fellows and Scholars of the College of the Holy and Undivided Trinity of Queen Elizabeth Near Dublin.

This work is licensed under the Creative CommonsAttribution 3.0 Unported Licensehttp://creativecommons.org/licenses/by/3.0/


Top Related