embedded system lab. jung youngjin janus: optimal flash provisioning for cloud storage workloads c....

17
Embedded System Lab. Jung YoungJin [email protected] Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely, M. Waliji, F. Labelle, N. Coehlo, X. Shi, and C. E. Schrock. In Proceedings of the annual conference on USENIX Annual Technical Conference, ATC ’13, Berkeley, CA, USA, 2013. USENIX Association.

Upload: rosa-preston

Post on 19-Jan-2018

220 views

Category:

Documents


0 download

DESCRIPTION

정 영 진 Embedded System Lab. Introduction System description Workload characterization Economics and provisioning Optimizing the flash allocation for workloads Optimization with bounded write rates Evaluation Conclusion Contents

TRANSCRIPT

Page 1: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

Embedded System Lab.

Jung [email protected]

Janus: Optimal Flash Provisioning for Cloud Storage Workloads

C. Albrecht, A. Merchant, M. Stokely, M. Waliji, F. Labelle, N. Coehlo, X. Shi, and C. E. Schrock.

In Proceedings of the annual conference on USENIX Annual Technical Conference, ATC ’13, Berkeley, CA, USA, 2013. USENIX Association.

Page 2: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Page 3: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Introduction System description Workload characterization Economics and provisioning Optimizing the flash allocation for

workloads Optimization with bounded write rates Evaluation Conclusion

Contents

Page 4: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

HDD & SSD Disks are slow, even as their capacities grow We can compensate for this by adding flash storage

Large cloud environment Many user Many workload Distributing the available flash capacity

uniformly between the workloads is not ideal

Introduction

Page 5: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Janus? Provides flash storage allocation recommendations for

workloads in a distributed file system Used in distributed file system, GFS, Colosus …

Google workload Analyzed workload characterizations

Most I/O accesses -> recently created files 28% of read operations -> 1% data

Files are placed in the flash upon creation

Introduction

Page 6: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Recommendation Runs periodically to adjust Many read operation -> flash storage Key input

Age of data Read rate of the data by age

Janus work step Collect age of data and characterization of how cacheable

each workload Allocate flash amongst the workloads Coordination with the distributed file system

System description

HybridStorage

Colosus orGFS

Page 7: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Workload A large application have many job

Need to define a metric that lets us compare how many read operations would be served

Workload characterization

Page 8: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Cacheability functions FIFO eviction instance

How much data there is of a given age How many reads there are to files of a given age

LRU eviction instance Amount of data with a given temporal locality Rate of reads to files with that temporal locality(time gap)

Workload characterization

SIGELMAN, B. H., ET AL. Dapper, a large-scale distributed systems tracing infrastructure. rep., Google, Inc., 2010.

Page 9: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Obtaining instance From file system metadata From trace sample

Function input/output Input : size of data Output : the number of read operations

Workload characterization

Page 10: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Peak IOPS and capacity requirementsEconomics and provisioning

Cost effective to put workloads

entirely in flashCost effective to hot portions ofthe data on flash

Page 11: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Determine the best flash allocation for each workload

Primary goal Find maximize the aggregate rate of read operations

Instance Workloads with cacheability function Total flash capacity

Task Allocate flash to workloads to maximize the weighted flash

read rate

Optimizing the flash allocation for workloads

Page 12: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Secondary goal Bound the flash write rate to reduce flash wear

Instance Workloads with cacheability function and write rate Bound on the flash write rate Total flash capacity

Task Allocate flash to workloads and determine write probability

for each workload to maximize the flash read rate

Optimization with bounded write rates

Page 13: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Flash hit rate during trainingEvaluation

Page 14: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Flash usage and flash read rate for one workload over time

Evaluation

Page 15: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Comparison of flash hit rates for alternative allocation methods

Evaluation

Allocation MethodCell A

(low workload variance)

Cell B(high workload

variance)Optimized 28% 74%

Proportional to read rate 26% 64%

Single FIFO 19% 42%Proportional to data size 14% 15%

Page 16: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

2013 USENIX Annual Technical Conference, Presentation video

Reference

Page 17: Embedded System Lab. Jung YoungJin Janus: Optimal Flash Provisioning for Cloud Storage Workloads C. Albrecht, A. Merchant, M. Stokely,

정 영 진 Embedded System Lab.

Q & A