Trace-Driven Analysis of Power Proportionality in Storage Systems


Sara Alspaugh and Arka Bhattacharya

Why trace-driven analysis

• Lots of published proposals

• Giant design space

Some related work

Design dimensions: Block Device / RAID Level; File System Level; Fixed Threshold; Predictive; Erasure Codes (RAID5); Mirroring (RAID1); Write Logging; Access Frequency-Based Layout; Solid State Devices; Multi-speed Disks; Hybrid / Tiered

Schemes (one X per dimension covered):

DIV-ACC   X X X X
EERAID1   X X X X
EERAID5   X X X X X
RIMAC     X X X X
PARAID    X X X X
PDC       X X X
PA-LRU    X X X
PB-LRU    X X X X
HIBERN    X X X X X X
DPRM      X X X X
WOL       X X X X X X
MAID      X X X X
SSD-RAID  X X X X X X
EED       X X X X X X
SIERRA    X X X X
RABBIT    X X X X X

Method

Evaluation: Laboratory, Production. Implementation is infeasible when considering many system types.

Analysis: Components + Traces + Algorithms → ?
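
A minimal sketch of what such a trace-driven analysis loop might look like: replay a trace against a power-management algorithm over modeled components and tally active time as an energy proxy. The interfaces and names here are illustrative assumptions, not the authors' actual framework.

# Sketch: replay a trace through a power-management algorithm over a set of
# modeled components. The algorithm interface and energy proxy are assumed.

def replay(trace, algorithm, components):
    """trace: list of (timestamp, request) pairs, sorted by time.
    algorithm(request, components) returns the set of components that must
    be active to serve the request; that set is charged until the next request."""
    events = list(trace)
    active_time = 0.0
    for (t, request), (t_next, _) in zip(events, events[1:]):
        active = algorithm(request, components)
        active_time += (t_next - t) * len(active)   # component-seconds active
    return active_time                              # proxy for energy consumed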

Trace              | Type             | Citation
Wikipedia          | HTTP             | SOCC '10
NetApp, Harvard    | NFS              | USENIX '08, LISA '03
MSR Cambridge      | Block Device     | FAST '08
Facebook Analytics | Hadoop MapReduce | EuroSys '11
Google             | Web Search       | ISCA '11

Analysis: Components, Traces, Algorithms

Characteristics: Request Rate, Interarrival Times, Read-Write Mix, ...
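
A minimal sketch of how these characteristics might be computed from a request trace. The (timestamp, op, size) record layout is an assumption for illustration; each trace source above uses its own format.

# Sketch: basic workload characteristics from a request trace.

def characterize(trace):
    """trace: list of (timestamp_sec, op, size_bytes), sorted by time."""
    timestamps = [t for t, _, _ in trace]
    duration = timestamps[-1] - timestamps[0] or 1.0
    interarrivals = [b - a for a, b in zip(timestamps, timestamps[1:])]
    reads = sum(1 for _, op, _ in trace if op == "read")
    return {
        "request_rate": len(trace) / duration,                          # requests per second
        "mean_interarrival": sum(interarrivals) / max(len(interarrivals), 1),
        "read_fraction": reads / len(trace),                            # read-write mix
    }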

Quantifying Inherent Opportunity

• gain = (peak × length − sum(bandwidth)) / (peak × length)

• waste factor = (peak × length) / sum(bandwidth)

• waste factor = peak : avg, i.e., the peak-to-average bandwidth ratio
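
A minimal sketch of these two metrics, assuming bandwidth is given as a per-interval series; the function and variable names are illustrative.

# Sketch: inherent-opportunity metrics from a per-interval bandwidth series
# (bandwidth[i] = bytes/s observed in interval i).

def gain_and_waste(bandwidth):
    peak = max(bandwidth)
    length = len(bandwidth)                    # number of intervals
    total = sum(bandwidth)                     # sum(bandwidth) in the slide's notation
    gain = (peak * length - total) / (peak * length)
    waste_factor = (peak * length) / total     # equals the peak:average ratio
    return gain, waste_factor

# Example: a mostly idle trace with one busy interval.
print(gain_and_waste([10, 10, 100, 10, 10]))   # gain ≈ 0.72, waste factor ≈ 3.57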

[Figures: bandwidth vs. time for example traces; bandwidth requirements (B/s) vs. data set size (B) for several workloads]

bw_app >> bw_component, cap_app < cap_component

bw_app <= bw_components, cap_app >> cap_component
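
In either regime, the number of units needed is set by whichever resource binds: bandwidth (replicate or stripe) or capacity (partition). A small sketch, using illustrative per-unit figures in the spirit of the per-disk numbers that follow:

import math

# Sketch: minimum number of units (disks or servers) so that both the
# application's bandwidth and capacity demands fit. Per-unit figures below
# are illustrative, echoing the ~500 GB / ~50 MB/s per-disk example.

def units_needed(bw_app, cap_app, bw_unit, cap_unit):
    for_bandwidth = math.ceil(bw_app / bw_unit)    # replicate / stripe to reach bw_app
    for_capacity = math.ceil(cap_app / cap_unit)   # partition to hold cap_app
    return max(for_bandwidth, for_capacity)

# A 2 TB data set served at 400 MB/s on ~500 GB, ~50 MB/s disks:
print(units_needed(bw_app=400e6, cap_app=2e12, bw_unit=50e6, cap_unit=500e9))  # 8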

unit = disks

[Figure: bandwidth (bytes/s) vs. capacity (bytes); per-disk scale ~ 500 GB, ~ 50 MB/s; partition vs. replicate; example systems: laptop, NFS filer, DB server]

unit = servers

[Figure: bandwidth (bytes/s) vs. capacity (bytes); per-server scales ~ 12 TB (disk), ~ 32 GB (RAM), ~ 200 MB/s, ~ 1 GB/s; partition vs. replicate; example systems: memory cache, DFS, DB server]

[Plots: bandwidth requirements (B/s) vs. data set size (B), one per workload class:]

NAS / NFS (NetApp), disk arrays

web farms (Wikipedia)

data analytics, DFS (Hadoop)


bw_app >> bw_component, cap_app < cap_component

bw_app <= bw_components, cap_app >> cap_component

Challenges

• Case 1: writes
• Case 2: latency to inactive components
• Case 3: both of the above; a set cover problem

• write through: to all components (even if it requires waking some)
• write offloading: to active components only (propagate on wake)
• write log: propagate when ~full
• reaper: to all components, but only wake when the queue is full
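
A minimal sketch contrasting the first two policies above. Components are modeled only as dicts with "name", "active", and "data" keys; this is illustrative, not how any of the surveyed schemes actually implements them.

# Sketch: write-through vs. write-offloading over a set of components.

def write_through(block, components):
    # Send the write to every component, waking any that are inactive.
    for c in components:
        if not c["active"]:
            c["active"] = True            # wake before writing
        c["data"].add(block)

def write_offload(block, components, offload_log):
    # Write only to currently active components; remember the block so it
    # can be propagated to inactive components when they wake.
    for c in components:
        if c["active"]:
            c["data"].add(block)
        else:
            offload_log.setdefault(c["name"], set()).add(block)

def on_wake(component, offload_log):
    # Propagate offloaded writes once a sleeping component becomes active.
    component["active"] = True
    component["data"] |= offload_log.pop(component["name"], set())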

[Figure: bandwidth vs. time, showing the request load and the number of active units under write-offloading vs. write-through]

Next steps

• data not pictured here: latencies, ramp times, unit sizes, etc.
• ways to slice it
• how to visualize it
• more workloads
• go back to related work to compare
• case 3: object popularity

Questions? The End.
