Download presentation
Presentation is loading. Please wait.
Published byElisa Rackley Modified over 10 years ago
1
OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and Communications Conference, 2006. IPCCC 2006. 25th IEEE International Date of Conference : 10-12 April 2006 31 9962111, 9962306, 9962145
2
1.Introduction 2.Experimental Methodology 3.User/OS I-Cache Accesses Characterization 4.OS-aware I-Cache Tuning 5.Power and Performance Evaluation 6.Conclusions 9962111 9962306 9962145 / 30 2 Outline
3
Dynamic power dissipation Due to charging and discharging highly capacitive bit lines and sense amps Static power dissipation -> 9962111 9962306 9962145 / 30 3 1.Introduction
4
9962111 9962306 9962145 / 30 4
5
9962111 9962306 9962145 / 30 5 15 app
6
Dynamic : OS-aware cache way lookup Static : OS-aware cache set drowsy mode 9962111 9962306 9962145 / 30 6
7
Modified vision of SoftWatt SimOS OS: SGI-IRI 5.3 : 8-issue superscalar processor split L1 Instruct.,data cache unified L2 cache memory 15 application 9962111 9962306 9962145 / 30 7 2.Experimental Methodology
8
Vortex : database manipulation code Gcc : compiler code Sendmail : SMTP, 1KB~1.5MB Fileman : cp, rm, chmod, tar –cvf… …… OS activity ranges from 6% in compress,92% in fileman 9962111 9962306 9962145 / 30 8 15 application
9
9962111 9962306 9962145 / 30 9 3.User/OS I-Cache Accesses Characterization OS instruc. User instruc.
10
9962111 9962306 9962145 / 30 10
11
A conventional 4-way set associative cache requires four tag comparisons and four data array read-outs for a cache access. Nevertheless, during user execution, performing tag comparisons and data array read-outs for OS cache lines are unnecessary and waste extra dynamic power. 9962111 9962306 9962145 11 / 30 OS-aware Cache Way Lookup
12
Cache way mode bit Processor status register(PSR) 9962111 9962306 9962145 12 / 30 OS-aware Cache Way Lookup
13
9962111 9962306 9962145 13 / 30 OS-aware Cache Way Lookup
14
Once generated, they remain unchanged unless a cache line replacement. The processor switches mode. 9962111 9962306 9962145 14 / 30 OS-aware Cache Way Lookup
15
9962111 9962306 9962145 15 / 30 OS-aware Cache Way Lookup
16
Due to CMOS technology scaling, static power due to leakage current is gaining in importance in I-cache power dissipation. These make efforts at leakage control essential to maintain control of I-cache power on current and next generations of processors. 9962111 9962306 9962145 16 / 30 OS-aware Cache Set Drowsy Mode
17
State-preserving drowsy cache techniques.(L1) Gated-V dd technique.(L2) Cache way mode bit 9962111 9962306 9962145 17 / 30 OS-aware Cache Set Drowsy Mode
18
9962111 9962306 9962145 18 / 30 OS-aware Cache Set Drowsy Mode
19
9962111 9962306 9962145 19 / 30 OS-aware Cache Set Drowsy Mode
20
9962111 9962306 9962145 20 / 30 OS-aware Cache Set Drowsy Mode
21
Access-biased classification. 9962111 9962306 9962145 21 / 30 OS-aware Cache Set Drowsy Mode
22
9962111 9962306 9962145 22 / 30 OS-aware Cache Set Drowsy Mode
23
OS-aware cache way lookup dynamic power user execution 29% OS execution 30% 9962111 9962306 9962145 / 30 23 5.Power and Performance Evaluation 30% dynamic power
24
dynamic power cache way access OS-aware cache way lookup 30% dynamic power power 9962111 9962306 9962145 / 30 24 5.Power and Performance Evaluation
25
Residency-based drowsy mode scheme 5% - 50% leakage power Access-based drowsy mode scheme cache drowsy state leakage power 37% leakage power 9962111 9962306 9962145 / 30 25 5.Power and Performance Evaluation
26
OS-aware cache set drowsy mode method <1% cache set drowsy mode cache set drowsy policies drowsy cache 9962111 9962306 9962145 / 30 26 5.Power and Performance Evaluation
27
9962111 9962306 9962145 / 30 27
28
I-cache user/OS I-cache dual-mode operation 9962111 9962306 9962145 / 30 28 6.Conclusions
29
OS-aware cache way lookup parallel tag comparison data array readout dynamic power OS-aware tuning CMOS static power OS-aware tuning dynamic power static power 9962111 9962306 9962145 / 30 29 6.Conclusions
30
Thanks for your listening 9962111 9962306 9962145 / 30 30
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.