reliability analysis for an energy-aware raid system
DESCRIPTION
Reliability Analysis for an Energy-Aware RAID System. S. Yin, M. I. Alghamdi, X.-J. Ruan, Y. Tian, J. Xie, X. Qin, and M. Qiu, Proc. the 30th IEEE International Performance Computing and Communications Conference (IPCCC), Nov. 2011.TRANSCRIPT
![Page 1: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/1.jpg)
Reliability Analysis of An Energy-Aware RAID System
Shu YinXiao Qin
Auburn University
![Page 2: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/2.jpg)
Presentation Outline
• Motivation;• Related Work;• MREED Model;• Experimental Result;• Conclusion/Future Work.
2
![Page 3: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/3.jpg)
Mobile Multimedia
Data-Intensive Applications
3
Motivation
Bio- Informatics
3D Graphic Weather Forecast
![Page 4: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/4.jpg)
Cluster System
4
Cluster in Data Center
![Page 5: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/5.jpg)
Problem: Energy Dissipation
EPA Report to Congress on Server and Data Center Energy Efficiency, 2007 5
![Page 6: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/6.jpg)
Problem: Energy Dissipation (cont.)
• Using 2010 Historical Trends Scenario– Server and Data Centers Consume 120 Billion
kWh per year;– Assume average commercial end user is
charged 9.46 kWh;– Disk systems can account for 27% of the
computing energy cost of data centers. Server and data centers may
have an electrical cost of
10.4 Billion Dollars!!!
6
![Page 7: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/7.jpg)
• Software- directed Power Management• Dynamic Power Management• Redundancy Technique• Multi- speed Setting
Existing Energy Conservation Techniques
How Reliable Are They ?
7
![Page 8: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/8.jpg)
Contradictory of Energy Efficiency and Reliability
8
Energy Efficiency
Example: Disk spin up and down
Reliability
![Page 9: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/9.jpg)
MREED Model
9
R= RBaseValue[1]*τ+α*R(f)[2]
[1] E. Pinheiro, W.-D. Weber, and L.A. Barroso. Failure trends in a large disk drive population. Proc. USENIX Conf. File and Storage Tech., February2007.
[2] IDEMA Standards. Specification of hard disk drive reliability.
R(f)=1.51e-6f2 – 1.09e-5f + 1.39e-2
Baseline Failure Rate Derived from Disk Utilization
Temperature Factor
Coefficient to RBaseValue, α=1 in our research
![Page 10: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/10.jpg)
MREED Model(Temperature Factor τ[3])
10
Temperature(˚C)
Acceleration Factor
De-rating Factor
Adjusted MTBF
25 1.000 1.00 232.140
26 1.0507 0.95 220.553
30 1.2763 0.78 181.069
34 1.5425 0.65 150.891
38 1.8552 0.54 125.356
42 2.2208 0.45 104.463
46 2.6465 0.38 8.123
[3] G. Cole, “Estimating Drive Reliability in Desktop Computers and Consumer Electronics Systems” Seagate Personal Storage Group, 2000
![Page 11: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/11.jpg)
MREED Model(MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT RAID SYSTEMS)
11
![Page 12: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/12.jpg)
MREED Model(MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT RAID SYSTEMS)
12
Energy-Conservation RAID TechniqueEnergy-Conservation RAID Technique
Weibull DistributionAnalysis
Weibull DistributionAnalysis
Access PatternAccess Pattern
FrequencyFrequency
TemperatureTemperature
Annual Failure Rate
Annual Failure Rate
System ReliabilitySystem Reliability
System Level ReliabilitySystem Level Reliability
![Page 13: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/13.jpg)
Weibull Analysis
13
• A Leading Method for Fitting Life Date
• Advantages:• Accurate• Small Samples• Widely Used
![Page 14: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/14.jpg)
MREED Model(Energy Conservation Techniques- PARAID)
Power-Aware RAID (PARAID)[4] System Structure
[4] Charles Weddle, Mathew Oldhan, Jin Qian, An-I Andy Wang. PARAID- A Gear-Shifting Power-Aware RAID. USENIX FAST 2007.
14
Softstate
RAID
Gears
![Page 15: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/15.jpg)
Model Validation
15
• Techniques• Run the Systems for A Couple of Decades
• The Event Validity Validation Techniques[5]
[5] R.G. Sargent, “Verification and Validation of Simulation Models”, in Proceedings of the 37 th conference on Winter Simulation, ser. WSC’05 Winter Simulation Conference, 2005.
![Page 16: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/16.jpg)
Model Validation
16
• Challenges• Unable to Monitor PARAID Running for Years
• Sample Size is Small from A Validation Perspective (e.g. 100 Disks for Five Years)
![Page 17: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/17.jpg)
Model Validation(DiskSim[6] Simulation)
17
[6] S.W.S John, S. Bucy, Jiri Schindler and G.R. Ganger, “The DiskSim Simulation Environment Version 4.0 Reference Manual”, 2008
Input Trace(File Level)
File to Block Mapper
Simulate File(Block Access)
DiskSim(Block Level)
File to Block Level Converter Outline
![Page 18: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/18.jpg)
Model Validation(DiskSim Simulation)
18
Diagram of the Storage System Corresponding to the DiskSim RAID-0
Driver 0
Bus 0
CTLR 2
BUS 2
Driver 2
CTLR 3
BUS 3
Driver 3
CTLR 4
BUS 4
Driver 4
CTLR 1
BUS 1
Driver 1
CTLR 0
BUS 0
Driver 0
![Page 19: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/19.jpg)
Model Validation(Result)
19
Utilization Comparison Between MREED and DiskSim Simulator
![Page 20: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/20.jpg)
Model Validation(Result)
20
Gear Shifting Comparison Between MREED and DiskSim Simulator
![Page 21: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/21.jpg)
Reliability Evaluation(Experimental Setup)
21
Disk Type Seagate ST3146855FC
Capacity 146 GB
Cache Size Sata 16MB
Buffer to Host Transfer Rate
4Gb/s (Max)
Total Number of Disks 5
File Size 100 MB
Number of Files 1000
Synthetic Trace Poisson Distribution
Time Period 24 Hours
Interval Time (Time Phase)
1 Hour
Power On Hour Per Year 8760 Hours
![Page 22: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/22.jpg)
Reliability Evaluation(Disk Utilization Comparison)
22
Disks Utilization Comparison Between PARAID-0 and RAID-0 at A Low Access Rate (20 Times Per Hour)
![Page 23: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/23.jpg)
23
Disks Utilization Comparison Between PARAID-0 and RAID-0 at A High Access Rate (80 Times Per Hour)
Reliability Evaluation(Disk Utilization Comparison)
![Page 24: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/24.jpg)
24
AFR Comparison Between PARAID-0 and RAID-0 at A Low Access Rate (20 Times Per Hour)
Reliability Evaluation(AFR Comparison)
![Page 25: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/25.jpg)
25
AFR Comparison Between PARAID-0 and RAID-0 at A High Access Rate (80 Per Hour)
Reliability Evaluation(AFR Comparison)
AF
R
![Page 26: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/26.jpg)
Future Work
• Extend the MREED Model Power-Aware RAID-5;– Data Stripping
• Investigate Trade-off Between Reliability & Energy-Efficiency ;
• Evaluate and Compare an array of energy-saving techniques with respect to specific application domains;
26
![Page 27: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/27.jpg)
Conclusion
• A Reliability Model (MREED) for Power-Ware RAID;
• Weibull Distribution Analysis to MREED;
• Validation of MREED;
• Impacts of the Gear-shifting on Reliability of PARAID.
27
![Page 28: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/28.jpg)
Thanks
![Page 29: Reliability Analysis for an Energy-Aware RAID System](https://reader033.vdocuments.net/reader033/viewer/2022061218/54b7b5fe4a7959bf688b46fa/html5/thumbnails/29.jpg)
Questions
?