Slide 1 U.Va. Department of Computer Science LAVA Architecture-Level Power Modeling N. Kim, T. Austin, T. Mudge, and D. Grunwald. “Challenges for Architectural.

Slides:



Advertisements
Similar presentations
Performance, Energy and Thermal Considerations of SMT and CMP architectures Yingmin Li, David Brooks, Zhigang Hu, Kevin Skadron Dept. of Computer Science,
Advertisements

Combining Statistical and Symbolic Simulation Mark Oskin Fred Chong and Matthew Farrens Dept. of Computer Science University of California at Davis.
Power Reduction Techniques For Microprocessor Systems
Thermal-Scheduling For Ultra Low Power Mobile Microprocessor May, Thermal-Scheduling For Ultra Low Power Mobile Microprocessor George Cai 1 Chee.
Sim-alpha: A Validated, Execution-Driven Alpha Simulator Rajagopalan Desikan, Doug Burger, Stephen Keckler, Todd Austin.
Adaptive Techniques for Leakage Power Management in L2 Cache Peripheral Circuits Houman Homayoun Alex Veidenbaum and Jean-Luc Gaudiot Dept. of Computer.
Low power Design Strategies Daniele Folegnani. Talk outline Why Low Power is Important Power Consumption in CMOS Circuits New Trends for Future Microprocessors.
June 20 th 2004University of Utah1 Microarchitectural Techniques to Reduce Interconnect Power in Clustered Processors Karthik Ramani Naveen Muralimanohar.
Yefu Wang and Kai Ma. Project Goals and Assumptions Control power consumption of multi-core CPU by CPU frequency scaling Assumptions: Each core can be.
Register Packing Exploiting Narrow-Width Operands for Reducing Register File Pressure Oguz Ergin*, Deniz Balkan, Kanad Ghose, Dmitry Ponomarev Department.
Mehdi Amirijoo1 Power estimation n General power dissipation in CMOS n High-level power estimation metrics n Power estimation of the HW part.
Techniques for Efficient Processing in Runahead Execution Engines Onur Mutlu Hyesoon Kim Yale N. Patt.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Lecture 5 – Power Prof. Luke Theogarajan
CS 7810 Lecture 13 Pipeline Gating: Speculation Control For Energy Reduction S. Manne, A. Klauser, D. Grunwald Proceedings of ISCA-25 June 1998.
Temperature-Aware Design Presented by Mehul Shah 4/29/04.
Lecture 7: Power.
Power-Aware Computing 101 CS 771 – Optimizing Compilers Fall 2005 – Lecture 22.
Author: D. Brooks, V.Tiwari and M. Martonosi Reviewer: Junxia Ma
Computation Energy Randy Huang Sep 29, Outline n Why do we care about energy/power n Components of power consumption n Measurements of power consumption.
Feb 14 th 2005University of Utah1 Microarchitectural Wire Management for Performance and Power in Partitioned Architectures Rajeev Balasubramonian Naveen.
Low power architecture and HDL coding practices for on-board hardware applications Kaushal D. Buch ASIC Engineer, eInfochips Ltd., Ahmedabad, India
Requirements: General, simple, and fast, and must model heating at the granularity of architectural objects  Must be able to dynamically calculate temperatures.
CS 423 – Operating Systems Design Lecture 22 – Power Management Klara Nahrstedt and Raoul Rivas Spring 2013 CS Spring 2013.
6.893: Advanced VLSI Computer Architecture, September 28, 2000, Lecture 4, Slide 1. © Krste Asanovic Krste Asanovic
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
Erkan Çetiner. Outline Introduction Related Works Modeling Methodology Baseline Results DTM Techniques Conclusions.
17 Sep 2002Embedded Seminar2 Outline The Big Picture Who’s got the Power? What’s in the bag of tricks?
Low Power Techniques in Processor Design
Chalmers University of Technology FlexSoC Seminar Series – Page 1 Power Estimation FlexSoc Seminar Series – Daniel Eckerbert
1 System-level Power Estimation and Optimization Chong-Min Kyung KAIST.
1 VLSI Design SMD154 LOW-POWER DESIGN Magnus Eriksson & Simon Olsson.
Low-Power Wireless Sensor Networks
1 Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction Rakesh Kumar, Keith I. Farkas, Norman P. Jouppi, Parthasarathy.
Drowsy Caches: Simple Techniques for Reducing Leakage Power Authors: ARM Ltd Krisztián Flautner, Advanced Computer Architecture Lab, The University of.
Speculative Software Management of Datapath-width for Energy Optimization G. Pokam, O. Rochecouste, A. Seznec, and F. Bodin IRISA, Campus de Beaulieu
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Exploiting Program Hotspots and Code Sequentiality for Instruction Cache Leakage Management J. S. Hu, A. Nadgir, N. Vijaykrishnan, M. J. Irwin, M. Kandemir.
1 Computer Architecture Research Overview Rajeev Balasubramonian School of Computing, University of Utah
MS108 Computer System I Lecture 2 Metrics Prof. Xiaoyao Liang 2014/2/28 1.
University of Michigan Electrical Engineering and Computer Science Composite Cores: Pushing Heterogeneity into a Core Andrew Lukefahr, Shruti Padmanabha,
1 Some Limits of Power Delivery in the Multicore Era Runjie Zhang, Brett H. Meyer, Wei Huang, Kevin Skadron and Mircea R. Stan University of Virginia,
3 Computing System Fundamentals
Predictive Design Space Exploration Using Genetically Programmed Response Surfaces Henry Cook Department of Electrical Engineering and Computer Science.
EEE2243 Digital System Design Chapter 7: Advanced Design Considerations by Muhazam Mustapha, extracted from Intel Training Slides, April 2012.
3 rd Nov CSV881: Low Power Design1 Power Estimation and Modeling M. Balakrishnan.
1 Estimating the Worst-Case Energy Consumption of Embedded Software Ramkumar Jayaseelan Tulika Mitra Xianfeng Li School of Computing National University.
1 An Improved Block-Based Thermal Model in HotSpot 4.0 with Granularity Considerations Wei Huang 1, Karthik Sankaranarayanan 1, Robert Ribando 3, Mircea.
Houman Homayoun, Sudeep Pasricha, Mohammad Makhzan, Alex Veidenbaum Center for Embedded Computer Systems, University of California, Irvine,
Power Estimation and Optimization for SoC Design
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
System-level power analysis and estimation September 20, 2006 Chong-Min Kyung.
Computer Structure & Architecture 7b - CPU & Buses.
Feb 14 th 2005University of Utah1 Microarchitectural Wire Management for Performance and Power in Partitioned Architectures Rajeev Balasubramonian Naveen.
1 Power estimation in the algorithmic and register-transfer level September 25, 2006 Chong-Min Kyung.
Basics of Energy & Power Dissipation
© Digital Integrated Circuits 2nd Inverter Digital Integrated Circuits A Design Perspective The Inverter Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
Addressing Instruction Fetch Bottlenecks by Using an Instruction Register File Stephen Hines, Gary Tyson, and David Whalley Computer Science Dept. Florida.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
Sunpyo Hong, Hyesoon Kim
Application Domains for Fixed-Length Block Structured Architectures ACSAC-2001 Gold Coast, January 30, 2001 ACSAC-2001 Gold Coast, January 30, 2001.
CS203 – Advanced Computer Architecture
PipeliningPipelining Computer Architecture (Fall 2006)
CS203 – Advanced Computer Architecture
Hot Chips, Slow Wires, Leaky Transistors
Figure 8.1 Architecture of a Simple Computer System.
A High Performance SoC: PkunityTM
Chapter 1 Introduction.
The Processor Lecture 3.1: Introduction & Logic Design Conventions
Presentation transcript:

Slide 1 U.Va. Department of Computer Science LAVA Architecture-Level Power Modeling N. Kim, T. Austin, T. Mudge, and D. Grunwald. “Challenges for Architectural Level Power Modeling.” In Power Aware Computing, (R. Melhem and R. Graybill eds.), Kluwer, Kevin Skadron Mircea Stan

Slide 2 U.Va. Department of Computer Science LAVA Who Cares?  Power is now a first-level design constraint for both embedded/mobile and high-performance/general-purpose processors  Battery life (eg, laptops)  Heat removal and package cost  Degradation and lifetime Architects don’t have good tools to model this

Slide 3 U.Va. Department of Computer Science LAVA Modeling  What architects normally do: model behavior/performance at the cycle level (eg, SimpleScalar)  Many abstractions and simplifications –Examples: I-cache, memory buses  Faster than a more detailed model; still good enough  Power and heat, however, require more implementation detail  Current power-performance simulators try to omit the extra detail by using abstractions or analytic models

Slide 4 U.Va. Department of Computer Science LAVA Current Arch.-Level Power Simulators  Wattch (Brooks et al.)  Doesn’t model anything outside the core (eg, external bus)  CACTI-based models for large structures (cache, branch predictor, register file, instruction window, etc.) –Tuned using Intel data  SimplePower (Vijaykrishnan et al.)  Adds bus and memory modeling, also I/O pads  Look-up-tables (LUTs)  Tempest (Cai & Lim)  Power density  Chip-level thermal modeling  No one: data sensitivity, clock tree, global interconnect

Slide 5 U.Va. Department of Computer Science LAVA Typical Power-Performance Modeling Technology parameters Micro-arch. config Static power model Cycle- accurate performance model Dynamic power estimation activity factors cycle-by-cycle statistics

Slide 6 U.Va. Department of Computer Science LAVA Power Basics P = ½ACV 2 f +  AVI short + VI leak  A = activity factor  C = capacitance  V = dynamic voltage  f = frequency  I short = short-circuit current during switching  I leak = leakage current

Slide 7 U.Va. Department of Computer Science LAVA Power Basics P = ½ACV 2 f +  AVI short + VI leak P = ACV DD V swing f +  AVI short + VI leak  A = activity factor  C = capacitance  V = dynamic voltage  f = frequency  I short = short-circuit current during switching   = duration of short-circuit current  I leak = leakage current  Why averages don’t work: Bursty behavior, dI/dt, peak current, temperature

Slide 8 U.Va. Department of Computer Science LAVA Better Simulation Method P = ACV DD V swing f +  AVI short + Vi leak sum over all blocks

Slide 9 U.Va. Department of Computer Science LAVA Typical Power-Performance Modeling Technology parameters Micro-arch. config Static power model Cycle- accurate performance model Dynamic power estimation activity factors cycle-by-cycle statistics

Slide 10 U.Va. Department of Computer Science LAVA Metrics  What metrics do we care about?  Execution time [sec or IPC]  Energy (battery life)[W]  Energy-delay product[Ws]  Energy-delay 2 product?[Ws 2 ]  Power density (temperature)[W / mm 2 ]  Temperature[K or °C]  DI/dt  Peak power dissipation  Might also care about these at finer granularities: micro-arch. blocks, decoders, circuits, etc.

Slide 11 U.Va. Department of Computer Science LAVA Basic Techniques for Power Efficiency  Leakage: turn things off (but you lose the data)  Resize structures  Turn off idle structures  Turn off entries, eg cache decay  4T RAM cells?  Dynamic: reduce activity factors  Clock gating  Resize structures  “Utility” predictor  Throttle processor width (eg, fetch width)  Filter caches  Datapath resizing  etc. P = ½ACV 2 f +  AVI short + VI leak

Slide 12 U.Va. Department of Computer Science LAVA Simulating Power for Greater Accuracy P = ½ACV 2 f +  AVI short + VI leak In all these cases, we want to find relevant power-related parameters (esp. effective switching capacitance C) -- performance model provides A  More detailed block-specific power information (circuit design style, etc.)  Switching activity (Hamming distance)  Interconnect (floorplanning, approx. area)  Clock tree (H-tree vs. balanced H-tree)  Random logic (empirical models)  Busses, transactions, durations (pull-up/pull-down/hi-Z, read vs. write, etc.)

Slide 13 U.Va. Department of Computer Science LAVA Simulation Challenges  Need leakage models - f(T)  Need temperature models  Eventually want to integrate all these into a fast simulator  This is a research challenge in its own right