intelligent datacenter placement


Upload: francesc-lordan-gomis

Post on 20-Jun-2015

DESCRIPTION

Intelligent placement of datacenters for Internet services

TRANSCRIPT

Page 1: Intelligent Datacenter placement

Execution Environments for Distributed Computing

Intelligent placement of datacenters for Internet Services

EEDC 34330

Master in Computer Architecture, Networks and Systems - CANS

Homework number: 6

Group number: EEDC-32

Francesc Lordan [email protected]

Page 2: Intelligent Datacenter placement

Introduction

Popular Internet companies offer services to millions of users every day.

These services are hosted in geographically distributed datacenters.

There is no public information about how these companies select the datacenter locations.

Page 3: Intelligent Datacenter placement

Introduction

Austin
– PUE: 1.39
– Land: 0.394 $/SF
– Energy: 0.066 $/kWh
– Water: 0.40 cents/gal
– CO2: 569 g/kWh

Page 4: Intelligent Datacenter placement

Introduction

Bismarck
– PUE: 1.20
– Land: 0.434 $/SF
– Energy: 0.062 $/kWh
– Water: 0.32 cents/gal
– CO2: 869 g/kWh

Page 5: Intelligent Datacenter placement

Introduction

Los Angeles
– PUE: 1.41
– Land: 0.638 $/SF
– Energy: 0.099 $/kWh
– Water: 0.33 cents/gal
– CO2: 286 g/kWh

Page 6: Intelligent Datacenter placement

Introduction

New York
– PUE: 1.29
– Land: 3.460 $/SF
– Energy: 0.096 $/kWh
– Water: 0.35 cents/gal
– CO2: 960 g/kWh

Page 7: Intelligent Datacenter placement

Introduction

Orlando
– PUE: 1.42
– Land: 0.272 $/SF
– Energy: 0.081 $/kWh
– Water: 0.23 cents/gal
– CO2: 541 g/kWh

Page 8: Intelligent Datacenter placement

Introduction

Seattle
– PUE: 1.19
– Land: 0.987 $/SF
– Energy: 0.041 $/kWh
– Water: 0.65 cents/gal
– CO2: 120 g/kWh

Page 9: Intelligent Datacenter placement

Introduction

St. Louis
– PUE: 1.32
– Land: 0.264 $/SF
– Energy: 0.047 $/kWh
– Water: 0.21 cents/gal
– CO2: 806 g/kWh
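
The per-location figures on the preceding slides are easier to compare side by side. The following is a minimal sketch, not part of the original slides: it assumes the "Energy" values are prices in $/kWh, and it uses the definition of PUE (total facility energy divided by IT energy) to scale price and emissions to one kWh delivered to the IT equipment. The names `LOCATIONS` and `per_it_kwh` are hypothetical.

```python
# Per-location figures transcribed from the Introduction slides.
# Assumption: the "Energy" value is a price in $/kWh.
LOCATIONS = {
    # name: (PUE, land $/SF, energy $/kWh, water cents/gal, CO2 g/kWh)
    "Austin":      (1.39, 0.394, 0.066, 0.40, 569),
    "Bismarck":    (1.20, 0.434, 0.062, 0.32, 869),
    "Los Angeles": (1.41, 0.638, 0.099, 0.33, 286),
    "New York":    (1.29, 3.460, 0.096, 0.35, 960),
    "Orlando":     (1.42, 0.272, 0.081, 0.23, 541),
    "Seattle":     (1.19, 0.987, 0.041, 0.65, 120),
    "St. Louis":   (1.32, 0.264, 0.047, 0.21, 806),
}


def per_it_kwh(name):
    """Return (energy cost in $, CO2 in g) per kWh delivered to IT equipment."""
    pue, _land, price, _water, co2 = LOCATIONS[name]
    # PUE scales IT energy up to total facility energy.
    return pue * price, pue * co2


# Rank locations by effective energy cost, cheapest first.
by_energy_cost = sorted(LOCATIONS, key=lambda n: per_it_kwh(n)[0])
# Rank locations by effective CO2 emissions, cleanest first.
by_emissions = sorted(LOCATIONS, key=lambda n: per_it_kwh(n)[1])

for name in by_energy_cost:
    cost, grams = per_it_kwh(name)
    print(f"{name:12s} {cost:.4f} $/kWh(IT) {grams:7.1f} g CO2/kWh(IT)")
```

Under these assumptions Seattle is both the cheapest and the cleanest location per IT kWh, while land price points in a different direction, which is why a single criterion is not enough to choose a site.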

Page 10: Intelligent Datacenter placement

Framework for placement - Parameters

Cost: Capital Expenses (CAPEX): investments made upfront and depreciated over the lifetime of the datacenter

– CAP_ind: independent of the number of servers.
  • Bringing electricity and external networking to the site
– CAP_max: depends on the maximum number of servers that can be hosted.
  • Land acquisition
  • Datacenter construction
  • Purchasing and installing power delivery infrastructure
  • Cooling infrastructure
  • Backup infrastructure
– CAP_act: purchasing the servers and internal networking gear.

Page 11: Intelligent Datacenter placement

Framework for placement - Parameters

Cost: Operational Expenses (OPEX): costs incurred during the operation of the datacenters

– OP_act: maintenance and administration of the equipment, and external networking bandwidth.
  • Dominated by staff compensation.
– OP_utl: electricity and water costs involved in running the servers.

Lower taxes and incentives reduce these costs.
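
To make the cost parameters concrete, here is a minimal sketch of how they might combine into a cost for one location. The slides do not give the formula, so the linear form below is an assumption: every term is taken to be amortized to the same period (for example, dollars per month), CAP_max scales with the provisioned capacity, and CAP_act, OP_act and OP_utl scale with the servers actually hosted. The class name, function name and all numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class LocationCosts:
    cap_ind: float  # CAPEX independent of the number of servers
    cap_max: float  # CAPEX per server of provisioned capacity
    cap_act: float  # CAPEX per server actually hosted
    op_act: float   # OPEX per hosted server (staff, external bandwidth)
    op_utl: float   # OPEX per hosted server (electricity and water)


def location_cost(costs, max_servers, servers):
    """Assumed linear cost of one location for a given capacity and load."""
    if servers > max_servers:
        raise ValueError("cannot host more servers than the provisioned capacity")
    if max_servers == 0:
        return 0.0  # no datacenter is built at this location
    per_hosted_server = costs.cap_act + costs.op_act + costs.op_utl
    return costs.cap_ind + costs.cap_max * max_servers + per_hosted_server * servers


# Invented figures, for illustration only.
example = LocationCosts(cap_ind=100_000, cap_max=50, cap_act=60, op_act=20, op_utl=15)
total = location_cost(example, max_servers=10_000, servers=8_000)
print(total)
```

Separating capacity from load matters because a datacenter sized for redundancy pays CAP_max for servers it may never host.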

Page 12: Intelligent Datacenter placement

Framework for placement - Parameters

Response time: latency between a population center and a location.

– Latency(c, d): latency between a population center c and a location d.
– P_cd: number of servers at location d that serve requests from center c.
– Servers(c): number of servers required by center c.

Consistency delay: time required for state changes to reach all mirrors.

– Latency(d1, d2): one-way latency between locations d1 and d2.

Availability: depends on the network availability of all the datacenters.

CO2 emissions: determined by the type of electricity consumed.

– Emissions(d): carbon emissions (g/kWh) at location d.
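
The slides name the quantities but not the formulas that combine them. As a hedged sketch, one plausible reading is that the response time seen by a center is the latency to each location weighted by the share of its servers placed there, and that the consistency delay is the worst one-way latency between any two datacenters. Both formulas are assumptions, and the function names and toy values are hypothetical.

```python
def average_latency(center, assignment, latency, servers_required):
    """Server-weighted mean latency (ms) for one population center.

    assignment[(c, d)] plays the role of P_cd,
    latency[(c, d)] the role of Latency(c, d),
    servers_required[c] the role of Servers(c).
    """
    weighted = sum(
        count * latency[(c, d)]
        for (c, d), count in assignment.items()
        if c == center
    )
    return weighted / servers_required[center]


def consistency_delay(datacenters, dc_latency):
    """Worst one-way latency (ms) between any ordered pair of datacenters."""
    return max(
        (dc_latency[(a, b)] for a in datacenters for b in datacenters if a != b),
        default=0.0,  # a single datacenter has no mirrors to update
    )


# Invented toy values.
assignment = {("c1", "d1"): 60, ("c1", "d2"): 40}
latency = {("c1", "d1"): 20.0, ("c1", "d2"): 50.0}
servers_required = {"c1": 100}
dc_latency = {("d1", "d2"): 30.0, ("d2", "d1"): 30.0}

mean_ms = average_latency("c1", assignment, latency, servers_required)
delay_ms = consistency_delay(["d1", "d2"], dc_latency)
print(mean_ms, delay_ms)
```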

Page 13: Intelligent Datacenter placement

Framework for placement – Formulation

Inputs:
– Maximum number of servers
– Expected average utilization of the servers
– Number of users that each server can accommodate
– Amount of redundancy
– Latency and availability constraints
– CAPEX and OPEX for each location
– Latencies between each population center and each location
– Latencies between any two locations

Page 14: Intelligent Datacenter placement

Framework for placement – Formulation

Outputs:
– Optimal cost
– Maximum number of servers at each location
– Number of servers at each location that serve each population center

Page 15: Intelligent Datacenter placement

Framework for placement – Solutions

Simple linear programming (LP0)
– Simplifies the equation so that it only checks whether a datacenter must be placed at a location and which centers it serves. It then assigns the maximum number of servers proportionally and computes the network costs with the original equation.

Pre-set linear programming (LP1)
– Presets whether a location contains a datacenter and its size, which removes the variable that records which centers each location serves.

Brute force (Brute)
– Generates all the possible configurations and tests each one using the LP1 approach.
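
The brute-force idea can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' implementation: the `evaluate` function is a simple stand-in for LP1 (it sends each center's servers to the cheapest open datacenter within the latency bound and ignores capacity limits), and the toy instance is invented.

```python
from itertools import combinations


def evaluate(open_dcs, centers, latency, fixed_cost, server_cost, max_latency):
    """Cost of a preset configuration, or None if some center is out of range.

    Stand-in for LP1: each center's servers go to the cheapest open
    datacenter within the latency bound (no capacity limits modeled).
    """
    total = sum(fixed_cost[d] for d in open_dcs)
    for center, servers in centers.items():
        reachable = [d for d in open_dcs if latency[(center, d)] <= max_latency]
        if not reachable:
            return None
        total += servers * min(server_cost[d] for d in reachable)
    return total


def brute_force(candidates, centers, latency, fixed_cost, server_cost, max_latency):
    """Try every non-empty subset of candidates; return (cost, subset)."""
    best_cost, best_subset = None, None
    for size in range(1, len(candidates) + 1):
        for subset in combinations(candidates, size):
            cost = evaluate(subset, centers, latency, fixed_cost,
                            server_cost, max_latency)
            if cost is not None and (best_cost is None or cost < best_cost):
                best_cost, best_subset = cost, subset
    return best_cost, best_subset


# Invented toy instance: two candidate locations, two population centers.
CANDIDATES = ["A", "B"]
CENTERS = {"x": 10, "y": 10}  # servers required per center
LATENCY = {("x", "A"): 10, ("x", "B"): 80, ("y", "A"): 80, ("y", "B"): 10}  # ms
FIXED = {"A": 100, "B": 100}
PER_SERVER = {"A": 5, "B": 5}

tight = brute_force(CANDIDATES, CENTERS, LATENCY, FIXED, PER_SERVER, max_latency=50)
loose = brute_force(CANDIDATES, CENTERS, LATENCY, FIXED, PER_SERVER, max_latency=100)
print(tight, loose)
```

With the 50 ms bound both locations must be opened, while relaxing the bound to 100 ms lets a single datacenter serve both centers at lower cost. The number of subsets doubles with every extra candidate location, which is why the exhaustive search is only practical on small inputs.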

Page 16: Intelligent Datacenter placement

Framework for placement – Solutions

Heuristic based on LP (Heuristics)
– Generates 10 possible datacenter networks for each number of datacenters using LP0, applies the LP1 algorithm to each of them, and sorts the results in increasing order of cost. Finally, it runs the brute-force method on a small set of solutions to obtain the most efficient one.

Simulated annealing plus LP1 (SA+LP1)
– SA starts with a configuration that fulfills the constraints and evaluates its neighbors using LP1. The solution is selected when there is no cost improvement within an interval of iterations.

Optimized SA+LP1 (OSA+LP1)
– Adjusts the results of LP1: when no servers are assigned to a datacenter, that datacenter is removed.
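
The simulated-annealing loop described above can be sketched as follows. This is a generic SA skeleton under stated assumptions rather than the authors' code: a configuration is a set of open locations, a neighbor is obtained by opening or closing one location, the `cost` callable stands in for the LP1 evaluation (returning `None` for infeasible configurations), and the temperature schedule, parameter defaults and toy cost function are all invented.

```python
import math
import random


def simulated_annealing(initial, candidates, cost, temp=1.0, cooling=0.95,
                        patience=50, seed=0):
    """Search over sets of open locations; stop after `patience` iterations
    without a new best cost. Returns (best_configuration, best_cost)."""
    rng = random.Random(seed)
    current = frozenset(initial)
    current_cost = cost(current)
    if current_cost is None:
        raise ValueError("the initial configuration must fulfill the constraints")
    best, best_cost = current, current_cost
    stale = 0
    while stale < patience:
        stale += 1
        # Neighbor: toggle one randomly chosen candidate location.
        neighbor = current ^ {rng.choice(candidates)}
        neighbor_cost = cost(neighbor)  # stand-in for the LP1 evaluation
        if neighbor_cost is not None:
            delta = neighbor_cost - current_cost
            # Always accept improvements; accept worse moves with a
            # probability that shrinks as the temperature drops.
            if delta <= 0 or rng.random() < math.exp(-delta / temp):
                current, current_cost = neighbor, neighbor_cost
                if current_cost < best_cost:
                    best, best_cost = current, current_cost
                    stale = 0
        temp = max(temp * cooling, 1e-9)
    return best, best_cost


def toy_cost(config):
    """Invented cost: 100 per open location, plus 50 if location A is closed."""
    if not config:
        return None  # at least one datacenter is required
    return 100 * len(config) + (0 if "A" in config else 50)


start = {"A", "B", "C"}
best, best_cost = simulated_annealing(start, ["A", "B", "C"], toy_cost)
print(sorted(best), best_cost)
```

The result is never worse than the starting configuration because the best configuration seen is tracked separately from the current one, but like any annealing run it can stop in a local optimum. The OSA+LP1 adjustment would correspond to dropping any location that the evaluation leaves without servers before the next neighbor is generated.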

Page 17: Intelligent Datacenter placement

Placement tool

The user only specifies:
– Area of interest
– Granularity of the potential datacenters
– Location of existing datacenters
– Max number of servers
– Ratio of users per server
– Max latency between
– Max delay
– Min availability

The toolkit obtains the missing data and computes the best datacenter network that fulfills the user's constraints.

Page 18: Intelligent Datacenter placement

Placement tool

Page 19: Intelligent Datacenter placement

Placement tool

60k servers, latency < 60 ms, delay <= 85 ms, availability >= 0.99999

[Figure: resulting placement. Values shown on the figure: 31789, 22712, 5501]

Page 20: Intelligent Datacenter placement

Exploring datacenter placement tradeoffs

Latency
– Latencies > 70 ms all have the same cost
– Latency = 50 ms is the best tradeoff between latency and cost
– Latencies < 35 ms double the cost of 50 ms

Availability
– The lower the Tier level of the datacenters, the more datacenters are needed
– It is cheaper to achieve an availability level with more low-Tier datacenters than with fewer high-Tier datacenters
– Tier II datacenters are the best option
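
The availability tradeoff can be illustrated with a short calculation. This sketch rests on assumptions that do not come from the slides: datacenter failures are independent, the service counts as available while at least one datacenter is up, and the per-Tier availabilities are the commonly cited Uptime Institute figures. The function names are hypothetical.

```python
# Commonly cited per-Tier availabilities (assumption, not from the slides).
TIER_AVAILABILITY = {"I": 0.99671, "II": 0.99741, "III": 0.99982, "IV": 0.99995}


def network_availability(single, count):
    """Probability that at least one of `count` independent datacenters is up."""
    return 1.0 - (1.0 - single) ** count


def datacenters_needed(single, target):
    """Smallest number of datacenters whose combined availability meets target."""
    if not 0.0 < single < 1.0 or not 0.0 < target < 1.0:
        raise ValueError("availabilities must lie strictly between 0 and 1")
    count = 1
    while network_availability(single, count) < target:
        count += 1
    return count


TARGET = 0.99999  # "five nines"
needed = {tier: datacenters_needed(a, TARGET) for tier, a in TIER_AVAILABILITY.items()}
print(needed)
```

Under these assumptions no single datacenter of any Tier reaches 0.99999 on its own: Tier I needs three datacenters, and Tiers II, III and IV each need two. Since two datacenters are required either way, the cheaper Tier II construction is a natural candidate, although this calculation says nothing about cost by itself.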

Page 21: Intelligent Datacenter placement

Exploring datacenter placement tradeoffs

Consistency delay
– Low consistency delays and low latency are conflicting goals
– Low consistency delays imply fewer datacenters and lower costs

Green datacenters
– When latencies can be relatively high, a green datacenter is less expensive than $100K a month.

Chiller-less datacenters
– Water chillers increase energy consumption by 20% and building costs by 30%. They are necessary in locations with an average temperature over 20 ºC.
– Avoiding chillers is feasible when latencies are over 70 ms. It reduces costs by 8%.

Page 22: Intelligent Datacenter placement

Questions