CS 7810 Lecture 12 Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors D. Brooks et al. IEEE Micro, Nov/Dec 2000
Power/Energy Basics Energy = Power x time Dynamic Power = C V 2 f switching activity factor C capacitances being charged V voltage swing f processor frequency Current trends: f and C are rising, V is dropping, overall dynamic power is increasing Leakage energy is also increasing
Processor Breakdowns Alpha Caches 16% O-o-o Issue Logic 19% Mem management unit 9% FP unit 11% Integer unit 11% Clock power 34% Pentium Pro
Metrics Performance f 1/D (D is delay or execution time) Delay of a circuit 1/(V – Vt) ; lower frequency tolerates longer delays, hence, can reduce voltage Power = a C V 2 f ; since f is roughly proportional to voltage, P V 3 f 3 Since V and f are variable, remove it from the expression: PD 3 = constant (regardless of V and f) This is the best metric to compare processors; any other metric (say, perf/watt) can be “fudged” by changing voltage or frequency
Metric Example Proc-A Proc-B V = 1.25; f = 1GHz Perf 1000 MIPS 800 MIPS Power 100W 80W V = 1.0; f = 0.8GHz Perf 800 MIPS 640 MIPS Power 51.2W 41W V= 1.5; f = 1.2GHz Perf 1200 MIPS 960 MIPS Power 172.8W 138.2W MIPS/W = 10 MIPS/W = 15.6 MIPS/W = 6.9 Power/f 3 =
Metrics PD 3 gives ratio of power if two processors were tuned* to yield the same performance (PD 3 ) 1/3 gives ratio of performance if two processors were tuned* to yield the same power *Tuning is done through voltage and frequency scaling and it is assumed that a linear relationship exists between V and f – note that in modern processors, this is not true and PD x is the right metric, where x > 3 (x can be 1 or 2 in markets where performance is not very critical)
Commercial Examples
Global Power Saving Strategies Dynamic frequency scaling – trivially reduces power, worsens performance, no effect on energy If off-chip components (memory) dominate, there will be an energy reduction with DFS Leakage power is unaffected by DFS, so if leakage dominates, overall energy increases Montecito: 20MHz changes in frequency can happen in a single cycle
Global Power Saving Strategies Dynamic voltage scaling – since we are changing frequency, can also combine it with voltage scaling as each circuit has longer slack – has a more than quadratic effect on dynamic power, a linear effect on leakage power, and a more than linear effect on energy Intel Xscale: roughly 50 s to scale from V DVS opportunities are reducing: lower voltage margins, error rates may increase
Localized Power Saving Strategies When a processor structure is not used in a cycle, gate off its clock for that cycle – gating can happen in a single cycle; increase in complexity Leakage energy can be reduced by gating off supply voltage V during periods of inactivity – takes more time to effect Body biasing can also reduce leakage power
Localized Power Saving Strategies Dynamically adjust frequency/voltage and size for each domain, based on thruput rates
Leakage Power Leakage is a linear function of supply voltage, a linear function of the number of transistors, and an exponential function of threshold voltage From Butts and Sohi, MICRO’00
Power-Performance Trade-Offs
Caches, bpreds are doubled at each point below, while the x-axis represents the sizes of issue queues, registers, ROB, etc. Argues against going to wider/larger superscalars
Other Observations Clustered architectures have better power scalability (since the complexity of each cluster remains unchanged) CMP and SMT can employ complexity-effective designs – power consumption is low (little wasted work) and multi-threaded performance continues to be high From ISPASS’06
Title Bullet