A Framework for Dynamic Energy Efficiency and Temperature Management (DEETM) Michael Huang, Jose Renau, Seung-Moon Yoo, Josep Torrellas University of Illinois.

Slides:



Advertisements
Similar presentations
Virtual Hierarchies to Support Server Consolidation Michael Marty and Mark Hill University of Wisconsin - Madison.
Advertisements

Energy-Efficient Congestion Control Opportunistically reduce link capacity to save energy Lingwen Gan 1, Anwar Walid 2, Steven Low 1 1 Caltech, 2 Bell.
Eliminating Squashes Through Learning Cross-Thread Violations in Speculative Parallelization for Multiprocessors Marcelo Cintra and Josep Torrellas University.
IMPACT Second Generation EPIC Architecture Wen-mei Hwu IMPACT Second Generation EPIC Architecture Wen-mei Hwu Department of Electrical and Computer Engineering.
LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian.
L1 Data Cache Decomposition for Energy Efficiency Michael Huang, Joe Renau, Seung-Moon Yoo, Josep Torrellas University of Illinois at Urbana-Champaign.
Leakage Energy Management in Cache Hierarchies L. Li, I. Kadayif, Y-F. Tsai, N. Vijaykrishnan, M. Kandemir, M. J. Irwin, and A. Sivasubramaniam Penn State.
Daniel Schall, Volker Höfner, Prof. Dr. Theo Härder TU Kaiserslautern.
University of Michigan Electrical Engineering and Computer Science 1 A Distributed Control Path Architecture for VLIW Processors Hongtao Zhong, Kevin Fan,
Power Reduction Techniques For Microprocessor Systems
Evaluating an Adaptive Framework For Energy Management in Processor- In-Memory Chips Michael Huang, Jose Renau, Seung-Moon Yoo, Josep Torrellas.
1 MemScale: Active Low-Power Modes for Main Memory Qingyuan Deng, David Meisner*, Luiz Ramos, Thomas F. Wenisch*, and Ricardo Bianchini Rutgers University.
Techniques for Multicore Thermal Management Field Cady, Bin Fu and Kai Ren.
Low-Power and Temperature-Aware Compilation for Embedded Processors José L. Ayala Politecnica University of Madrid
1 Soft Timers: Efficient Microsecond Software Timer Support For Network Processing Mohit Aron and Peter Druschel Rice University Presented By Jonathan.
Hot-and-Cold: Using Criticality in the Design of Energy-Efficient Caches Rajeev Balasubramonian, University of Utah Viji Srinivasan, IBM T.J. Watson Sandhya.
Techniques for Efficient Processing in Runahead Execution Engines Onur Mutlu Hyesoon Kim Yale N. Patt.
HW/SW Co-Synthesis of Dynamically Reconfigurable Embedded Systems HW/SW Partitioning and Scheduling Algorithms.
ECE 510 Brendan Crowley Paper Review October 31, 2006.
Chapter 18 Multicore Computers
Secure Embedded Processing through Hardware-assisted Run-time Monitoring Zubin Kumar.
Low Power Techniques in Processor Design
Low-Power Wireless Sensor Networks
1 Overview 1.Motivation (Kevin) 1.5 hrs 2.Thermal issues (Kevin) 3.Power modeling (David) Thermal management (David) hrs 5.Optimal DTM (Lev).5 hrs.
A RISC ARCHITECTURE EXTENDED BY AN EFFICIENT TIGHTLY COUPLED RECONFIGURABLE UNIT Nikolaos Vassiliadis N. Kavvadias, G. Theodoridis, S. Nikolaidis Section.
1 Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction Rakesh Kumar, Keith I. Farkas, Norman P. Jouppi, Parthasarathy.
Ramazan Bitirgen, Engin Ipek and Jose F.Martinez MICRO’08 Presented by PAK,EUNJI Coordinated Management of Multiple Interacting Resources in Chip Multiprocessors.
Towards reducing total energy consumption while constraining core temperatures Osman Sarood and Laxmikant Kale Parallel Programming Lab (PPL) University.
Temperature Aware Load Balancing For Parallel Applications Osman Sarood Parallel Programming Lab (PPL) University of Illinois Urbana Champaign.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Energy-Effective Issue Logic Hasan Hüseyin Yılmaz.
Energy Management in Virtualized Environments Gaurav Dhiman, Giacomo Marchetti, Raid Ayoub, Tajana Simunic Rosing (CSE-UCSD) Inside Xen Hypervisor Online.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Lev Finkelstein ISCA/Thermal Workshop 6/ Overview 1.Motivation (Kevin) 2.Thermal issues (Kevin) 3.Power modeling (David) 4.Thermal management (David)
Improving Energy Efficiency of Configurable Caches via Temperature-Aware Configuration Selection Hamid Noori †, Maziar Goudarzi ‡, Koji Inoue ‡, and Kazuaki.
Hardware Architectures for Power and Energy Adaptation Phillip Stanley-Marbell.
Computer Science and Engineering Power-Performance Considerations of Parallel Computing on Chip Multiprocessors Jian Li and Jose F. Martinez ACM Transactions.
Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.
1 Lecture 2: Memory Energy Topics: energy breakdowns, handling overfetch, LPDRAM, row buffer management, channel energy, refresh energy.
1 CMP-MSI.07 CARES/SNU A Reusability-Aware Cache Memory Sharing Technique for High Performance CMPs with Private Caches Sungjune Youn, Hyunhee Kim and.
Workload Clustering for Increasing Energy Savings on Embedded MPSoCs S. H. K. Narayanan, O. Ozturk, M. Kandemir, M. Karakoy.
Lecture 27 Multiprocessor Scheduling. Last lecture: VMM Two old problems: CPU virtualization and memory virtualization I/O virtualization Today Issues.
CML Path Selection based Branching for CGRAs ShriHari RajendranRadhika Thesis Committee : Prof. Aviral Shrivastava (Chair) Prof. Jennifer Blain Christen.
Learning A Better Compiler Predicting Unroll Factors using Supervised Classification And Integrating CPU and L2 Cache Voltage Scaling using Machine Learning.
Soft Timers : Efficient Microsecond Software Timer Support for Network Processing - Mohit Aron & Peter Druschel CS533 Winter 2007.
Embedded Software Design Week III Processor Basics Raspberry Pi -> Blinking LEDs & pushing buttons.
Dynamic and On-Line Design Space Exploration for Reconfigurable Architecture Fakhreddine Ghaffari, Michael Auguin, Mohamed Abid Nice Sophia Antipolis University.
Overview Motivation (Kevin) Thermal issues (Kevin)
PPEP: online Performance, power, and energy prediction framework
Memory Segmentation to Exploit Sleep Mode Operation
Gift Nyikayaramba 30 September 2014
Ultra-Low-Power Sensor Nodes Featuring a Virtual Runtime Environment
Decoupled Access-Execute Pioneering Compilation for Energy Efficiency
Green Software Engineering Prof
Babak Sorkhpour, Prof. Roman Obermaisser, Ayman Murshed
A Novel Framework for Software Defined Wireless Body Area Network
Fine-Grain CAM-Tag Cache Resizing Using Miss Tags
Computer Architecture Lecture 4 17th May, 2006
Characteristics of Reconfigurable Hardware
Tapestry: Reducing Interference on Manycore Processors for IaaS Clouds
A High Performance SoC: PkunityTM
Mengjia Yan† , Jiho Choi† , Dimitrios Skarlatos,
Fine-grained vs Coarse-grained multithreading
The University of Adelaide, School of Computer Science
Aravindh Anantaraman*, Kiran Seth†, Eric Rotenberg*, Frank Mueller‡
Chih-Hsun Chou Daniel Wong Laxmi N. Bhuyan
8 – Simultaneous Multithreading
The University of Adelaide, School of Computer Science
ITAP: Idle-Time-Aware Power Management for GPU Execution Units
Phase based adaptive Branch predictor: Seeing the forest for the trees
Presentation transcript:

A Framework for Dynamic Energy Efficiency and Temperature Management (DEETM) Michael Huang, Jose Renau, Seung-Moon Yoo, Josep Torrellas University of Illinois at Urbana-Champaign

Micro’33, Dec Motivation  Increasingly high power consumption –High temperature –Inefficient use of energy  Limitations of existing approaches –Static optimizations –Coarse grain dynamic management (DPM) –Inefficient temperature control (sleep) –Independent targets: temp, energy efficiency

Micro’33, Dec Goal  Temperature: enforce limit while minimizing slowdown  Energy efficiency: maximize energy saving while exploiting performance slack Unified framework for Dynamic Energy Efficiency and Temperature Management (DEETM)

Micro’33, Dec Contribution: DTEEM Framework  Existing limitations: –static application –coarse grain dynamic –inefficient techniques –independent targets:  temperature control  energy efficiency  DEETM approach: –multiple techniques –dynamic –fine grain –order techniques for maximum efficiency –unified target

Micro’33, Dec DEETM Framework  Monitors temperature & execution slack  Runs decision algorithm periodically: {Thermal, Slack} components  Activates low-power techniques –dynamically –incrementally –in prioritized order

Micro’33, Dec Techniques  Instruction filter cache  Data cache subbanking  Voltage scaling  Voltage scaling: DRAM only  Light sleep

Micro’33, Dec Instruction Filter Cache Filter Cache Instruction Memory Processor Filter Cache L1 Cache Processor High power mode: Time: 1 Energy: E Filter Cache Instruction Memory Processor High power mode: Time: 1 Energy: E Filter Cache Instruction Memory Processor Low power mode: Time: 1 + mr*1 Energy: e + mr*E

Micro’33, Dec Techniques  Instruction filter cache  Data cache subbanking  Voltage scaling  Voltage scaling: DRAM only  Light sleep

Micro’33, Dec Chip Environment  Processor-in-memory  64 lean processors –2-issue static  Optimized memory hierarchy  Integrated thermal sensors and instruction counter Processor D-Cache I-Cache DRAM Bank Instruction Memory Processor DRAM Bank Interconnect

Micro’33, Dec Individual Techniques-E*D IFilter SubBank VoltFreq 0.7 Normalized Energy-Delay Product Normalized Average Power MemVolt Sleep

Micro’33, Dec Combinations - E*D Normalized Average Power Normalized Energy-Delay Product VoltFreq IFilter Gradual

Micro’33, Dec Summary  Techniques ordered by efficiency (same order apply to both targets)  System applies techniques in order, dynamically and incrementally

Micro’33, Dec Thermal Algorithm Macrocycle

Micro’33, Dec Temperature Control - Limit

Micro’33, Dec Temperature Control - Slowdown

Micro’33, Dec Slack Algorithm Macrocycle Effective IPC*: frequency-adjusted Microcycle Every Macrocycle (HW/OS) The First Few Microcycles (HW)

Micro’33, Dec Slack - Energy Consumption

Micro’33, Dec Slack Misprediction

Micro’33, Dec Related Issues  Algorithm interaction  Selecting macrocycle  Handling Thermal Crisis situation  Reducing technique activation overhead  Flexible technique ordering  Hardware vs. software implementation

Micro’33, Dec Conclusions  Effective & efficient temperature control –very few macrocycles still over limit –27% longer execution vs. 98% (by sleeping)  Efficient & accurate fine-grain exploitation of execution slack –5% slack  27% energy saved –small slack misprediction