Lev Finkelstein ISCA/Thermal Workshop 6/ Overview 1.Motivation (Kevin) 2.Thermal issues (Kevin) 3.Power modeling (David) 4.Thermal management (David) 5.Optimal DTM (Lev) 6.Clustering (Antonio) 7.Power distribution (David) 8.What current chips do (Lev) 9.HotSpot (Kevin)
Lev Finkelstein ISCA/Thermal Workshop 6/ What current chips do Power and thermal management
Lev Finkelstein ISCA/Thermal Workshop 6/ Controllers Inputs (power, temperature, etc.) Response time Tuning Simplicity of implementation Performance, reliability Power management / thermal management
Lev Finkelstein ISCA/Thermal Workshop 6/ Controllers (cont’d) A non-trivial tradeoff PerformanceReliability Cost
Lev Finkelstein ISCA/Thermal Workshop 6/ Real processors: IBM* PowerPC* G3/G4 Cache throttling AMD* PowerNow!* Technology Transmeta* Longrun* technology Intel® SpeedStep® technology Enhanced Intel® SpeedStep technology * Other names and brands may be claimed as the property of others
Lev Finkelstein ISCA/Thermal Workshop 6/ PowerPC G3 Microprocessor On-chip temperature sensor (junction temperature) –Based on differential voltage change across 2 diodes of different sizes –Implemented in PowerPC G3/G4 processors OS required for control Instruction Cache Throttling used to dynamically lower junction temperature From Micro-35 tutorial
Lev Finkelstein ISCA/Thermal Workshop 6/ Transmeta LongRun ** LongRun power management –Code Morphing* software (processor- internal) –Performance demands are determined by sampling the idle time Crusoe * processor *** –Voltage changes in steps of 25 mV –Frequency changes in steps of 33 MHz *Other names and brands may be claimed as the property of others ** Source: *** Data dated 2001
Lev Finkelstein ISCA/Thermal Workshop 6/ Transmeta LongRun (cont’d) Idle time decrement V&f Activity increment V&f (if possible) Performance mode V&f adjustment Source:
Lev Finkelstein ISCA/Thermal Workshop 6/ Previous Intel microprocessors 1 Thermal monitor mechanism A two-point mechanism using voltage scaling (for battery life) 1 Information on Intel microprocessors is based on Efraim Rotem’s presentation in the TACS workshop 06/2004
Lev Finkelstein ISCA/Thermal Workshop 6/ Thermal monitor Based on clock throttling Full operational mode: maximal frequency Minimal operation mode: clocks are stalled for a part of the duty cycle Activation options: –By OS (e.g., ACPI) –By a special hardware
Lev Finkelstein ISCA/Thermal Workshop 6/ Static voltage scaling (for battery life) Performance mode –Maximal frequency & Vcc –AC outlet or set by user Power saving mode –Low frequency & Vcc –Upon request or while the user changed the usage mode
Lev Finkelstein ISCA/Thermal Workshop 6/ The Intel Pentium® M Processor Targets the mobile market Improved power efficiency Advanced ACPI interface Enhanced SpeedStep architecture
Lev Finkelstein ISCA/Thermal Workshop 6/ DVS in the Pentium M Processor Changes both voltage and frequency at the runtime Efficiently switches between different DVS control points
Lev Finkelstein ISCA/Thermal Workshop 6/ Thermal sensors Two thermal sensors Maximal temperature reached throttling Critical shutdown point reached shutdown
Lev Finkelstein ISCA/Thermal Workshop 6/ Operation modes Software control mechanism (e.g., ACPI) –Track the junction temperature –Initiate the appropriate policy Self throttle –Digital temperature detector initiates one of the power control cycles –Used as a fail-safe mechanism since it is much faster than the software
Lev Finkelstein ISCA/Thermal Workshop 6/ Enhanced Intel SpeedStep technology Implements DVS Upon a thermal trigger or SW request, CPU halts execution and locks PLL at a new frequency (a few sec) Once finished, the Vcc starts changing to the new value (order of 1mV/ sec) Transition up is done in the reverse order
Lev Finkelstein ISCA/Thermal Workshop 6/ DVS cycle
Lev Finkelstein ISCA/Thermal Workshop 6/ DVS transitions Frequency transition is fast enough to allow non-interrupted application execution DVS transitions can be utilized for energy and thermal control during the normal operation flow The target frequency and voltage are programmable by BIOS or OS Support for multiple voltage/ frequency points
Lev Finkelstein ISCA/Thermal Workshop 6/ Adaptive policy (for battery life) Uninterrupted power state transition User selectable policy Increases frequency on demand, and decreases power and frequency while idle for a long time
Lev Finkelstein ISCA/Thermal Workshop 6/ Info More specific information on Pentium M will be available at Efraim Rotem’s presentation in the TACS workshop 06/2004
Lev Finkelstein ISCA/Thermal Workshop 6/ ACPI and OSPM 1 ACPI = Advanced Configuration and Power Interface (an open industry specification) OSPM = Operating System-directed configuration and Power Management Cooling decisions are based on the application load and the thermal heuristics of the system 1 Source: The ACPI specification 2.0, see
Lev Finkelstein ISCA/Thermal Workshop 6/ Cooling policies Active cooling – a direct action by OSPM (e.g., turning on a fan) Passive cooling – reducing the power consumption (e.g., throttling) Critical trip points – shutdown
Lev Finkelstein ISCA/Thermal Workshop 6/ Example of SW-based clock throttling P[%] = _TC1 * (T n – T n-1 ) + _TC2 * (T n –T t ) T n – current temperature T t – target temperature P n = P n-1 + HW[- P] Pn is in % The coefficients are set by the OEM