kaleidoscope: cloud micro-elasticity via vm state...

Kaleidoscope: Cloud Micro-Elasticity via VM State ColoringRoy Bryant*, Alexey Tumanov†, Olga, Irzak*, Adin Scannell*, Kaustubh Joshi§, Matti Hiltunen§, H. Andrés Lagar-Cavilla§, Eyal de Lara**University of Toronto, †Carnegie Mellon University, §AT&T Labs Research

• Long instantiation times• Coarse granularity of footprint• Network bandwidth & latency overhead

• Common practice: dedicated migration bandwidth!• Wide-range variability in instantiation times

• Fresh workers → cold state• Number of servers with runtime > month – doubled

• Strong need for elasticity• 15.3% of peak capacity –

average usage• Wide-range short-term

variability in workload• Elastic workers – short lived

• 85% of workers – live < 1hr• 10 min → mean worker lifetime

• Allocate resources proportionally to the working set• Stateful replication of VMs (inherit expensive warm state)• Color-guided state propagation

• proactive prefetch• reactive (on-demand)

• Continuum between full copy and minimal/on demand copy

1. VM memory state coloring• Semantically-aware state propagation• Architectural & introspective runtime memory state

information2. Time & space efficient implementation of coloring3. Micro-elastic cloud server

• Color-aware replication & sharing• Near-ideal post-clone QoS• Resource consumption proportional to demand

4. Real world datacenter savings• Fundamental tradeoff:

• Instantiation time vs. warmup period• Continuum between lazy and

eager replication• Continuum between minimal

and maximal footprint• Architecture-based coloring

• Performed on translated page table pages undergoing mandatory canonicalization at clone time

• Extracting the NX bit (executable)• Identifying the kernel/user-space split

• Introspective coloring• Performed on a "frozen" memory image of the master• File cache radix trees• Frame page structure: used/unused physical pages

• State prefetch of consecutive pages in color space• Significant reduction in bandwidth consumption

• Per-color differentiation in the width of the prefetch window• Insight: reduced window size for executable state• User/kernel data windows : 3-4 times wider than executable

• Color-directed memory deduplication• Per-color probabilities of inter-VM page similarities• One-time, color guided calculation of hashes for the master

memory state • Definition: ability to consume physical resources at sub-VM granularity, tracking effective resource demand• Physical host memory• Network bandwidth

• Description: Kaleidoscope workers allocate memory as necessary, an advantage when spikes are short lived.

• Blocking time reduction• Elastic footprint / resource preservation• Scalability• Runtime• QoS

200018001600140012001000800600400200

0 0 10 20 30 40 50 60

100.00%

80.00%

60.00%

40.00%

20.00%

Clone Lifetime (min)

FrequencyCumulative %

footprintmax

SnowFlock

livemigration

replication

eagerKaleidoscope

0 10 20 30 40 50 60 70 80Frac

Time (seconds)

SupportBanking

EcommerceHttperfOLAP

0 20 40 60 80 100 120

Time (seconds)

0 1 2 3 4 5 6 7 8

Simultaneous New Workers

KaleidoscopeCold Standby

Current Elasticity Models: Issues

10 15 20 25 30 35 40 45

Support Ecommerce Banking

Cold StandbyMinimal Clone

Conservative CloneAggressive Clone

KaleidoscopeWarm Static

Kaleidoscope Design Space

Uses of Color

Evaluation Categories

Motivation

Kaleidoscope Contributions

VM Memory State Coloring

Fractional Footprint

Evaluation Results

Kaleidoscope Vision: Cloud u-elasticity

OLAP Httperf

Cold StandbyMinimal Clone

Conservative CloneAggressive Clone

KaleidoscopeBlocked

Warm Static

Httperf Support Ecommerce Banking OLAP

FreeFile System Cache

User DataKernel DataUser Code

Kernel Code

0 5 10 15 20 25 30 35 40

Time (seconds)

Kaleidoscope Worker Blocked for File System CacheUser Data

Kernel DataUser Code

Kernel CodeFree

Reduction of page faults and blocking Near-optimal QoSPrefetching reduces the length of time that prototypes fail to meet SPECweb's minimum acceptable QoS.

Resource preservationKaleidoscope workers avoid unnecessary pages. Unfetched pages save precious bandwidth and are mostly file system cache and a free list.

RuntimeRuntime approaches near-optimal (warm static) as VCPU blocking and page faults are anticipated by colordirected prefetch and eliminated from the critical path.

ScalabilityThe database's OLAP throughput scales well with the number of simultaneous clones

Before

Traces in this figure include transferred state plus new allocations.

kaleidoscope: cloud micro-elasticity via vm state...

Documents

chapter 3: elasticity price elasticity –demand –supply...

ukrainian kaleidoscope

kaleidoscope 1989

kaleidoscope 2015

the kaleidoscope

urban kaleidoscope

katalyst kaleidoscope

summary of year-end kaleidoscope play & learn participant...

elasticity price elasticity demand supply cross...

kendall kaleidoscope

hhs kaleidoscope 2009

april kaleidoscope

architektura+systemu+ opencontrail+ - semihalf ·...

bengkel kaleidoscope

kaleidoscope ii

a kaleidoscope

cicada - usenix · 1 vm 2 vm 3 vm 4 vm 5vm 6 vm 7 vm 8 vm 9...

kaleidoscope: cloud micro-elasticity via vm state...

kdl kaleidoscope

kaleidoscope 2012