OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads

Authors: Tao Li, L. K. John. Published in: Performance, Computing, and Communications Conference, 2006. IPCCC 2006. 25th IEEE International. Date of Conference: 10-12 April 2006. Group 31: 9962111 邱鎮浩, 9962306 蘇育毅, 9962145 吳修億

TRANSCRIPT

Page 1: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Tuning

Improving Instruction Cache Energy Efficiency on System Workloads

Authors: Tao Li, L. K. John

Published in: Performance, Computing, and Communications Conference, 2006. IPCCC 2006. 25th IEEE International

Date of Conference: 10-12 April 2006

Group 31: 9962111 邱鎮浩, 9962306 蘇育毅, 9962145 吳修億

Page 2: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Outline

1. Introduction

2. Experimental Methodology

3. User/OS I-Cache Accesses Characterization

4. OS-aware I-Cache Tuning

5. Power and Performance Evaluation

6. Conclusions

Page 3: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

1. Introduction

Dynamic power dissipation: due to charging and discharging highly capacitive bit lines and sense amps

Static power dissipation: as transistors keep shrinking, they start to conduct on their own (leakage)

Page 4: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Semiconductor specifications affect energy consumption

Page 5: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Energy consumption table for the 15 applications

Page 6: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Dynamic power saving: OS-aware cache way lookup

Static power saving: OS-aware cache set drowsy mode

Page 7: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

2. Experimental Methodology

Simulator: modified version of SoftWatt (SimOS)

OS: SGI IRIX 5.3 (commercial version)

Hardware: 8-issue superscalar processor, split L1 instruction and data caches, unified L2 cache, memory

Measured on 15 applications

Page 8: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

The 15 applications

Vortex: database manipulation code

Gcc: compiler code

Sendmail: SMTP, 1KB~1.5MB

Fileman: cp, rm, chmod, tar -cvf…

……

OS activity ranges from 6% in compress to 92% in fileman

Page 9: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

3. User/OS I-Cache Accesses Characterization

Dual mode

OS instructions / User instructions

Page 10: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Page 11: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Way Lookup

A conventional 4-way set-associative cache requires four tag comparisons and four data array read-outs for each cache access.

During user execution, however, performing tag comparisons and data array read-outs for OS cache lines is unnecessary and wastes dynamic power.
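
To make the wasted work concrete, here is a minimal Python sketch that counts how many ways are probed (one tag comparison plus one data array read-out each) for a single access to one 4-way set. It is an illustration under assumptions, not the authors' hardware design: the `Way` record, its per-way `mode` field and the `lookup` function are hypothetical names, and the sketch assumes every cached line belongs to exactly one mode (user or OS).

```python
from dataclasses import dataclass

USER, OS = 0, 1  # processor modes (illustrative constants)


@dataclass
class Way:
    valid: bool = False
    tag: int = 0
    mode: int = USER  # hypothetical per-way mode bit: which mode owns the line


def lookup(ways, tag, cpu_mode, os_aware):
    """Return (hit, ways_probed) for one access to a single cache set."""
    hit = False
    probed = 0
    for way in ways:
        # OS-aware lookup skips valid ways that hold the other mode's lines.
        if os_aware and way.valid and way.mode != cpu_mode:
            continue
        probed += 1  # one tag comparison + one data array read-out
        if way.valid and way.tag == tag:
            hit = True
    return hit, probed


# One set holding two user lines and two OS lines.
example_set = [
    Way(True, 0x10, USER),
    Way(True, 0x20, OS),
    Way(True, 0x30, OS),
    Way(True, 0x40, USER),
]

print(lookup(example_set, 0x40, USER, os_aware=False))  # (True, 4)
print(lookup(example_set, 0x40, USER, os_aware=True))   # (True, 2)
```

In this example the user-mode fetch still hits, but only the two user-owned ways are probed instead of all four.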

Page 12: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Way Lookup

Cache way mode bit

Processor status register (PSR)

Page 13: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Way Lookup

Page 14: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Way Lookup

Once generated, they remain unchanged unless a cache line is replaced or the processor switches mode.
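
The two events named on this slide (a cache line replacement and a processor mode switch) can be sketched as the only points where a per-set lookup mask is regenerated. This is a rough Python model under assumptions: the slide does not say what "they" refers to, so treating it as a stored way-enable mask is a guess, and `CacheSet`, `regenerate_mask`, `fill` and `switch_mode` are hypothetical names.

```python
USER, OS = 0, 1  # processor modes (illustrative constants)


class CacheSet:
    """One set of a set-associative I-cache with a mode bit per way."""

    def __init__(self, n_ways=4):
        self.tags = [None] * n_ways       # None marks an empty way
        self.mode_bits = [USER] * n_ways  # hypothetical cache way mode bits
        self.mask = [True] * n_ways       # ways enabled for lookup
        self.mask_updates = 0             # how often the mask was regenerated

    def regenerate_mask(self, cpu_mode):
        # Enable empty ways and ways whose mode bit matches the current mode.
        self.mask = [
            tag is None or mode == cpu_mode
            for tag, mode in zip(self.tags, self.mode_bits)
        ]
        self.mask_updates += 1

    def fill(self, way, tag, cpu_mode):
        # Cache line replacement: the new line is owned by the fetching mode.
        self.tags[way] = tag
        self.mode_bits[way] = cpu_mode
        self.regenerate_mask(cpu_mode)

    def switch_mode(self, new_mode):
        # Processor mode switch, e.g. on a system call or a return to user code.
        self.regenerate_mask(new_mode)


s = CacheSet()
s.fill(0, 0xA, USER)
s.fill(1, 0xB, OS)
print(s.mask)          # [False, True, True, True]
s.switch_mode(USER)
print(s.mask)          # [True, False, True, True]
print(s.mask_updates)  # 3
```

Ordinary hits never touch the mask in this model, which is the point of the slide: between replacements and mode switches the enable signals stay fixed.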

Page 15: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Way Lookup

Page 16: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

Due to CMOS technology scaling, static power caused by leakage current is becoming an increasingly important part of I-cache power dissipation.

This makes leakage control essential for keeping I-cache power in check on current and future generations of processors.

Page 17: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

State-preserving drowsy cache techniques (L1)

Gated-Vdd technique (L2)

Cache way mode bit
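
As a hedged illustration of how the cache way mode bits could drive a set-level drowsy decision, the Python sketch below puts a set into the state-preserving drowsy mode when every line resident in it belongs to the other processor mode. This policy is only one possible reading of the slides, not the paper's confirmed algorithm. The function names are hypothetical, and `DROWSY_LEAK` (a drowsy set leaking 10% of an active set) is a made-up figure chosen for the arithmetic.

```python
USER, OS = 0, 1     # processor modes (illustrative constants)
DROWSY_LEAK = 0.1   # hypothetical: a drowsy set leaks 10% of an active set


def drowsy_sets(set_mode_bits, cpu_mode):
    """set_mode_bits[i] lists the mode bits of the lines resident in set i.

    A set is marked drowsy when it holds lines and none of them belongs
    to the mode the processor is currently running in.
    """
    return [
        len(bits) > 0 and all(bit != cpu_mode for bit in bits)
        for bits in set_mode_bits
    ]


def relative_leakage(set_mode_bits, cpu_mode):
    """Leakage relative to a cache where every set stays fully active."""
    flags = drowsy_sets(set_mode_bits, cpu_mode)
    drowsy = sum(flags)
    active = len(flags) - drowsy
    return (active + drowsy * DROWSY_LEAK) / len(flags)


cache = [
    [USER, USER, USER, USER],  # user-only set
    [OS, OS, OS, OS],          # OS-only set
    [USER, OS, USER, OS],      # mixed set
    [OS, OS, OS, OS],          # OS-only set
]

print(drowsy_sets(cache, USER))       # [False, True, False, True]
print(relative_leakage(cache, USER))  # about 0.55 with the assumed leak ratio
```

Because the drowsy state preserves the stored lines, the OS-only sets in this example can be woken up again when the processor returns to OS mode without refetching their contents.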

Page 18: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

Page 19: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

Page 20: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

Page 21: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

Access-biased classification

Page 22: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

OS-aware Cache Set Drowsy Mode

Page 23: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

5. Power and Performance Evaluation

Share of dynamic power saved with OS-aware cache way lookup:

User execution: 29% saved

OS execution: 30% saved

Overall: 30% of dynamic power saved

Page 24: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

5. Power and Performance Evaluation

The dynamic power saving is correlated with the reduction in cache way accesses

OS-aware cache way lookup saves about 30% of dynamic power without affecting performance

Page 25: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

5. Power and Performance Evaluation

In testing, the residency-based drowsy mode scheme saves about 5% - 50% of leakage power

The access-based drowsy mode scheme reduces leakage power further by putting part of the cache into the drowsy state, saving about 37% of leakage power on average

Page 26: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

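5. Power and Performance Evaluation

With the OS-aware cache set drowsy mode method, execution time is only slightly higher than normal (<1%), because:

The cost of mistakenly putting a cache set that is about to be used into drowsy mode is relatively low

Applying the cache set drowsy policies greatly reduces the probability of accessing a drowsy cache set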
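
The two reasons above multiply: the slowdown is roughly the fraction of fetches that land on a drowsy set times the wake-up penalty per such fetch. The Python sketch below shows that back-of-envelope estimate. All numbers in the example call are hypothetical and are not measurements from the paper, `slowdown` is an illustrative helper, and the model pessimistically assumes that wake-up cycles are never hidden by other work.

```python
def slowdown(fetches, drowsy_fraction, wake_cycles, base_cycles):
    """Relative execution-time increase caused by waking drowsy sets.

    fetches          -- number of I-cache accesses in the run
    drowsy_fraction  -- share of those accesses that hit a drowsy set
    wake_cycles      -- extra cycles needed to wake one drowsy set
    base_cycles      -- execution time in cycles without drowsy mode
    """
    extra_cycles = fetches * drowsy_fraction * wake_cycles
    return extra_cycles / base_cycles


# Hypothetical run: 1M fetches, 0.5% of them reach a drowsy set,
# 1 cycle to wake a set, 1M cycles of baseline execution time.
overhead = slowdown(1_000_000, 0.005, 1, 1_000_000)
print(f"{overhead:.1%}")  # 0.5%
```

With a small wake-up penalty and a policy that keeps the drowsy access rate low, the estimate stays under one percent, which matches the order of magnitude reported on the slide.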

Page 27: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Page 28: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

6. Conclusions

To improve I-cache energy efficiency, we start from how user and OS code access the I-cache during system execution and examine the power-saving potential under dual-mode operation

Page 29: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

6. Conclusions

OS-aware cache way lookup lowers dynamic power consumption by reducing parallel tag comparisons and data array read-outs

OS-aware tuning effectively reduces the extra static power consumption that comes with CMOS process scaling

OS-aware tuning lowers both dynamic and static power consumption and requires only minor hardware changes

Page 30: OS-aware Tuning Improving Instruction Cache   Energy Efficiency on System Workloads

Thank you for listening