efficient resource management for cloud computing environment
TRANSCRIPT
Survey resource management for
Cloud computing environment (*)
Nguyễn Quang Hùng
Email: [email protected]
1Nguyễn Quang Hùng
Outline• What is Cloud computing (CC)?
• Cloud computing and Grid computing (GC) on 3600 comparision
• State-of-the-art
• Challenges on efficient resource management
• Future works
• Conclusions
• References
2Nguyễn Quang Hùng
What is Cloud computing?• Definitions for the Cloud computing:
o …a “cloud” refers to an “Infrastructure-as-a-
Service” (IaaS) cloud, such as Amazon EC2,
where IT infrastructure is deployed in a cloud
provider’s datacenter in the form of virtual
machines, Ian Foster, et.al. (2010)
4Nguyễn Quang Hùng
What is Cloud computing?
• Definitions for the Cloud computing:o …a “cloud” refers to an “Infrastructure-as-a-Service” (IaaS) cloud, such as Amazon EC2, where IT
infrastructure is deployed in a cloud provider’s datacenter in the form of virtual machines, Ian
Foster, et.al. (2010) [26]
o “…a Cloud is a type of parallel and distributed
system consisting of a collection of
interconnected and virtualised computers that
are dynamically provisioned and presented as
one or more unified computing resources based
on service-level agreements established through
negotiation between the service provider and
consumers.”, Rajkumar Buyya, et.al. (2009) [2].
5Nguyễn Quang Hùng
What is Cloud computing?• Definitions for the Cloud computing [26]:
o …a “cloud” refers to an “Infrastructure-as-a-Service” (IaaS) cloud, such as Amazon EC2, where IT
infrastructure is deployed in a cloud provider’s datacenter in the form of virtual machines, Ian
Foster, et.al. (2010) [26]
o “…a Cloud is a type of parallel and distributed system consisting of a collection of interconnected
and virtualised computers that are dynamically provisioned and presented as one or more unified
computing resources based on service-level agreements established through negotiation
between the service provider and consumers.”, Rajkumar Buyya, et.al. (2009) [27][2].
o “A large-scale distributed computing paradigm
that is driven by economies of scale, in which a
pool of abstracted, virtualized, dynamically-
scalable, managed computing power, storage,
platforms, and services are delivered on demand
to external customers over the Internet….” [1]
6Nguyễn Quang Hùng
Cloud computing has …(con’d)
• Taxonomies of Cloud Computing [12, 20]:o Infrastructure-as-a-Service (IaaS)
• Amazon EC2 [6], …
o Platform-as-a-Service (PaaS)
• Google AppEngine [17], …
o Software-as-a-Service (SaaS)
• Gmail, SalesForce,…
o Data-as-a-Service (DaaS)
• Strikeiron.com, Kognitio.com
7Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree comparedGrid computing/Grid systems Cloud computing/Cloud systems
- aims to “enable resource sharing andcoordinated problem solving in dynamic, multi-institutionalvirtual organizations” *1+
- Economy driven.- Compute & storage is packed as metered services can be charge pay-as-you-go as electricity, telephone network.
- Lesser scale than Supercomputer and Cloud
- Large scale and Web 2.0 based wholely.
- Distributed paradigm or infrastructure spans across multiple virtual orgranizations.
- Single organization (e.g. Amazon S3 & Amazon EC2)
8Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree compared
9Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree comparedCriterias Grids Clouds
Business model - Project-oriented, limited CPU hours.
- Unlimited use, pay-as-you-go as electricity, gas, telephone
Architecture -1990s- integration of network commodity resources -Five-layer Grid Architecture: Fabric layer, Connectivity Layer, Resource Layer, Collective Layer, Application Layer
-2000s-Developed to address Internet-scale computing problem. i.e. a large pool of compute and storage resources can be access by common protocols (e.g. web services)
10Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree compared
Grid Protocol Architecture Cloud Architecture
11Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree compared (cont’d)Criterias Grids Clouds
Resource Management:
- Compute Model Batch-job management-Portable Batch System (PBS), Condor, SGE, LSF,…- Dedicated to queuing system
Shared resources by all users at the same time
- Data Model -Data Grid -Data security & privacy, big data problem (TB),…
- Data Locality -Even harder than Cloud- Shared data stores on NFS/GPFS/PVFS/Luster - Need scheduler to be data-aware
-Hardly-Google’s MapReduce on Google File System-Need scheduler to be data-aware
12Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree compared (cont’d)Criterias Grids Clouds
Resource Management:
- Combining compute and data management
In progress of works on schedulter data-aware
Not yet
- Virtualization -Not need as much as Cloud on Virtualization
-Need Virtualization
- Monitoring - Mostly physicalresource
-Hard to fully monitor resources and services on cloud systems
- Provenance -Built-in workflowssystems: Chimera, Swift, Keepler, Tavena,…
-More difficult than in Grids
13Nguyễn Quang Hùng
Cloud computing and Grid computing
360-degree compared (cont’d)Criterias Grids Clouds
Resource Management:
- Programming Models -Similar to parallel & distributed computing: MPI, MPICH-G2, GridRPC, Pop-C++,…
-MapReduce-Mash-up and scripting
- Security Models -Across many VOs
-Single Sign-On
-Clouds mostly is dedicated data centers belong to one orgranization.- SSL based
14Nguyễn Quang Hùng
Challenges on resource
management in Clouds
• On Cloud provider’s view:o Provision resources on both HPC batch-job requests w/wo
deadlines and advanced reservation requests on the same
system.
o Energy efficient resource management in Data Centers.
• On Cloud service provider’s view:o Renting cheapest resources on performance-constraints
o QoS to their cloud users
• On Cloud user’s view:o Renting cheapest resources on performance-constraints
o Cloud provider guaratees Service Level Agreement
15Nguyễn Quang Hùng
Problems
• Problem 1:o Provision resources on both HPC batch-job requests w/wo
deadlines and advanced reservation requests on the same
system.
• Problem 2:o Energy efficient resource management in Data Centers.
Nguyễn Quang Hùng 16
The problem of provisioning
computational resources• B. Sotomayor, “Provisioning Computational Resources Using Virtual
Machines and Leases,” PhD Thesis submited to The University of
Chicago, 2010.
• B. Sotomayor, K. Keahey, and I. Foster, “Combining batch execution
and leasing using virtual machines,” Proceedings of the 17th
international symposium on High performance distributed computing
- HPDC ’08, New York, New York, USA: ACM Press, 2008, p. 87.
• B. Sotomayor, K. Keahey, I. Foster, and T. Freeman, “Enabling Cost-
Effective Resource Leases with Virtual Machines,” Hot Topics session
in HPDC 2007, Monterey Bay, CA (USA): 2007, pp. 16-18.
• B. Sotomayor, S. Montero, and I. Mart, “Capacity Leasing in Cloud
Systems using the OpenNebula Engine,” Workshop on Cloud
Computing and its Applications 2008 (CCA08), Chicago, Illinois, USA:
Chicago, Illinois, USA, 2008.
Nguyễn Quang Hùng 17
The problem of provisioning
computational resources
• Problem:o Short time need for computational resources
o Conflicts on optimized functions: Batch-jobs and
Advanced Reservation (AR)
Nguyễn Quang Hùng 18
Some cases on need for
computational resources
• The need for computational resources
many is in short time [3], for examples:o “A scientist requires a large number of computers to run a
simulation in hours”
o “A college instructor needs a cluster to teach MPI labs”
o “A company to host a website which can scale up/down
on traffics”
o Etc.
Nguyễn Quang Hùng 19
Challenges on provisioning
computational resources
• How to provision shared
computational resources efficiently?
• How is resource provisioning model
and architecture for multiple scenarios
efficiently and simultaneously?
Nguyễn Quang Hùng 20
State-of-the-art on provisioning
computational resources
• VM-based approaches:o Virtual Clusters (Nishimura et al, Yamasaki et al, Nimbus
toolkit,…)
• Batch-jobs approacheso PBS, SGE, Condor,…
• Lease based approaches [3-6]o B. Sotomayzor (2010)
• Energy-efficient resource
management [7-8]o Green Cloud
Nguyễn Quang Hùng 21
B. Sotomayor (2010)
• A lease-based model: [3-6][9]o Lease as a fundamental abstraction
o VM as a implementation vehicle
o Concern on overhead in using VM
o Single domain administration
• High performance computing driven
• Batch-job and AR simulationeouslyo Preemptive jobs with/without deadlines
o Non preemptive AR requests
Nguyễn Quang Hùng 22
B. Sotomayor (2010)
Resource and Leasing model
• Assumptions:o Computational resources in a site
o P = {Set of leasable computer, nodes, in a site}
o Leasable resources within a node can be allocated to one
or more leases, up to maximum capacity, and may include
processors, memory, disk space, network bandwidth, etc.
o R = {memory, disk,…} : set of the types of leasable resources in a site
o r : quantity of a resource
o A site define as: R={proc, mem, disk, net-in, net-out}
Nguyễn Quang Hùng 23
Lease state machine
Nguyễn Quang Hùng 24
Scheduling algorithms
Nguyễn Quang Hùng 25
• B. Sotomayor, K. Keahey, and I. Foster, “Combining batch execution and leasing using virtual machines,” Proceedings of the 17th international symposium on High performance distributed computing - HPDC ’08, New York, New York, USA: ACM Press, 2008, p. 87.
• B. Sotomayor, K. Keahey, I. Foster, and T. Freeman, “Enabling Cost-Effective Resource Leases with Virtual Machines,” Hot Topics session in HPDC 2007, Monterey Bay, CA (USA): 2007, pp. 16-18.
• B. Sotomayor, M. Ignacio, and I. Foster, “Resource Leasing and the Art of Suspending Virtual Machines,” The 11th IEEE International Conference on High Performance Computing and Communications (HPCC-09), Seoul, Korea: 2009.
Problem 2:
Nguyễn Quang Hùng 26
Energy-efficent resource
management for Data Center
• Challenges:
• Data center owner:o Minimize power/energy consumption with in QoS (e.g.
performance) constraints.
27Nguyễn Quang Hùng
State-of-the-art on Enegy-Efficient
resource management• A.J. Younge, G.V. Laszewski, L. Wang, S. Lopez-alarcon, and W. Carithers,
“Efficient Resource Management for Cloud Computing Environments,” To
appear in the Work in Progress in Green Computing with the IEEE International
Green Computing Conference (IGCC), I.A. Behrooz Shirazi, ed., Chicago, IL
USA: IEEE, 2010.
• A. Beloglazov and R. Buyya, “Energy efficient allocation of virtual machines in
cloud data centers,” 2010 10th IEEE/ACM International Conference on Cluster,
Cloud and Grid Computing, IEEE, 2010, p. 577–578.
28Nguyễn Quang Hùng
Virtualization
• One of the most important underlying
technology in cloud is the use of
virtualization.
29Nguyễn Quang Hùng
Green computing
• Dynamic voltage and frequency
scaling (DVFS)
J. Younge, G.V. Laszewski, L. Wang, S. Lopez-alarcon, and W. Carithers, “Efficient Resource Management for Cloud Computing Environments,” To appear in the Work in Progress in Green Computing with the IEEE International Green Computing Conference (IGCC), I.A. Behrooz Shirazi, ed., Chicago, IL USA: IEEE, 2010.
30Nguyễn Quang Hùng
Green Cloud framework
31Nguyễn Quang Hùng
Power-aware scheduling
• Minimize the server’s total power.
32Nguyễn Quang Hùng
Power-aware scheduling
• Change in power consumption
decreases.
• Greedy-based algorithm
33Nguyễn Quang Hùng
Greed-based algorithm
34Nguyễn Quang Hùng
VM image management
• Idle physical machines can be
dynamically shutdown and restarted.
• Live migration
• Wake on LAN
35Nguyễn Quang Hùng
VM image design
• VM size
• Boot time: light weight VM
36Nguyễn Quang Hùng
The result
37Nguyễn Quang Hùng
Conclusions
• This slides have presented the state-of-
the-art on resource management on
cloud computing. Focus on:o B. Sotomayor’s work on provision computational resources
using Virtual Machines and Leases
o Energy-effient resource management for cloud computing environment
• Overview on Cloud computing and
Cloud & Grid 360-degree comparision.
Nguyễn Quang Hùng 38
References[1] I. Foster, Y. Zhao, I. Raicu, and S. Lu, “Cloud Computing and Grid Computing 360-Degree
Compared,” 2008 Grid Computing Environments Workshop, Nov. 2008, pp. 1-10.[2] R. Buyya, C.S. Yeo, S. Venugopal, J. Broberg, and I. Brandic, "Cloud Computing and
Emerging IT Platforms : Vision , Hype , and Reality for Delivering Computing as the 5th Utility.“
[3] B. Sotomayor, “Provisioning Computational Resources Using Virtual Machines and Leases,” PhD Thesis submited to The University of Chicago, 2010.
[4] B. Sotomayor, K. Keahey, and I. Foster, “Combining batch execution and leasing using virtual machines,” Proceedings of the 17th international symposium on High performance distributed computing - HPDC ’08, New York, New York, USA: ACM Press, 2008, p. 87.
[5] B. Sotomayor, K. Keahey, I. Foster, and T. Freeman, “Enabling Cost-Effective Resource Leases with Virtual Machines,” Hot Topics session in HPDC 2007, Monterey Bay, CA (USA): 2007, pp. 16-18.
[6] B. Sotomayor, S. Montero, and I. Mart, “Capacity Leasing in Cloud Systems using the OpenNebula Engine,” Workshop on Cloud Computing and its Applications 2008 (CCA08), Chicago, Illinois, USA: Chicago, Illinois, USA, 2008.
[7] A.J. Younge, G.V. Laszewski, L. Wang, S. Lopez-alarcon, and W. Carithers, “Efficient Resource Management for Cloud Computing Environments,” To appear in the Work in Progress in Green Computing with the IEEE International Green Computing Conference (IGCC), I.A. Behrooz Shirazi, ed., Chicago, IL USA: IEEE, 2010.
[8] A. Beloglazov and R. Buyya, “Energy efficient allocation of virtual machines in cloud data centers,” 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, IEEE, 2010, p. 577–578.
[9] B. Sotomayor, R.S. Montero, I.M. Llorente, and I. Foster, “Virtual Infrastructure Management in Private and Hybrid Clouds,” IEEE Internet Computing, vol. 13, Sep. 2009, pp. 14-22.
39Nguyễn Quang Hùng
Thank you!
• Questions?
40Nguyễn Quang Hùng