EECS Department, Northwestern University, Evanston Thermal-Induced Leakage Power Optimization by Redundant Resource Allocation Min Ni and Seda Ogrenci.

Slides:



Advertisements
Similar presentations
A R EVISIT TO THE P RIMAL -D UAL B ASED C LOCK S KEW S CHEDULING A LGORITHM Min Ni and Seda Ogrenci Memik EECS Department, Northwestern University.
Advertisements

Fast Algorithms For Hierarchical Range Histogram Constructions
Support vector machine
1 Regression Models & Loss Reserve Variability Prakash Narayan Ph.D., ACAS 2001 Casualty Loss Reserve Seminar.
3D-STAF: Scalable Temperature and Leakage Aware Floorplanning for Three-Dimensional Integrated Circuits Pingqiang Zhou, Yuchun Ma, Zhouyuan Li, Robert.
The structure and evolution of stars
Venkataramanan Balakrishnan Purdue University Applications of Convex Optimization in Systems and Control.
Optimization of thermal processes2007/2008 Optimization of thermal processes Maciej Marek Czestochowa University of Technology Institute of Thermal Machinery.
DETERMINATION OF UNSTEADY CONTAINER TEMPERATURES DURING FREEZING OF THREE-DIMENSIONAL ORGANS WITH CONSTRAINED THERMAL STRESSES Brian Dennis Aerospace Engineering.
1 Chapter 4 Interpolation and Approximation Lagrange Interpolation The basic interpolation problem can be posed in one of two ways: The basic interpolation.
Analysis of Simple Cases in Heat Transfer P M V Subbarao Professor Mechanical Engineering Department I I T Delhi Gaining Experience !!!
1 Closed-Loop Modeling of Power and Temperature Profiles of FPGAs Kanupriya Gulati Sunil P. Khatri Peng Li Department of ECE, Texas A&M University, College.
1cs542g-term Notes  Added required reading to web (numerical disasters)
Adaptive Data Collection Strategies for Lifetime-Constrained Wireless Sensor Networks Xueyan Tang Jianliang Xu Sch. of Comput. Eng., Nanyang Technol. Univ.,
GG 313 Geological Data Analysis # 18 On Kilo Moana at sea October 25, 2005 Orthogonal Regression: Major axis and RMA Regression.
Power-Aware Placement
 Based on the resource constraints a lower bound on the iteration interval is estimated  Synthesis targeting reconfigurable logic (e.g. FPGA) faces the.
Curve-Fitting Polynomial Interpolation
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 Temperature-Aware Resource Allocation and Binding in High Level Synthesis Authors: Rajarshi Mukherjee, Seda Ogrenci Memik, and Gokhan Memik Presented.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2009 Architecture Synthesis (Provisioning, Allocation)
D Nagesh Kumar, IIScOptimization Methods: M1L4 1 Introduction and Basic Concepts Classical and Advanced Techniques for Optimization.
July 3, Department of Computer and Information Science (IDA) Linköpings universitet, Sweden Minimal sufficient statistic.
Integration of Equations
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2009 Architecture Synthesis (Provisioning, Allocation)
A Methodology for Interconnect Dimension Determination By: Jeff Cobb Rajesh Garg Sunil P Khatri Department of Electrical and Computer Engineering, Texas.
CpE- 310B Engineering Computation and Simulation Dr. Manal Al-Bzoor
Frame by Frame Bit Allocation for Motion-Compensated Video Michael Ringenburg May 9, 2003.
Ashley Brinker Karen Joseph Mehdi Kabir ECE 6332 – VLSI Fall 2010.
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 10: February 18, 2015 Architecture Synthesis (Provisioning, Allocation)
Chin-Yu Huang Department of Computer Science National Tsing Hua University Hsinchu, Taiwan Optimal Allocation of Testing-Resource Considering Cost, Reliability,
Approximate Dynamic Programming Methods for Resource Constrained Sensor Management John W. Fisher III, Jason L. Williams and Alan S. Willsky MIT CSAIL.
Silesian University of Technology in Gliwice Inverse approach for identification of the shrinkage gap thermal resistance in continuous casting of metals.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
System-level power analysis and estimation September 20, 2006 Chong-Min Kyung.
Thermal-aware Phase-based Tuning of Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable Computing This work was supported.
Yanjmaa Jutmaan  Department of Applied mathematics Some mathematical models related to physics and biology.
FPGA-Based System Design: Chapter 2 Copyright  2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
INTRO TO OPTIMIZATION MATH-415 Numerical Analysis 1.
Sunpyo Hong, Hyesoon Kim
A Two-Phase Linear programming Approach for Redundancy Problems by Yi-Chih HSIEH Department of Industrial Management National Huwei Institute of Technology.
-1- UC San Diego / VLSI CAD Laboratory Optimization of Overdrive Signoff Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li and Siddhartha Nath Tuck-Boon Chan,
1 Chapter 4 Interpolation and Approximation Lagrange Interpolation The basic interpolation problem can be posed in one of two ways: The basic interpolation.
Searching a Linear Subspace Lecture VI. Deriving Subspaces There are several ways to derive the nullspace matrix (or kernel matrix). ◦ The methodology.
1 Chapter 2 Program Performance. 2 Concepts Memory and time complexity of a program Measuring the time complexity using the operation count and step count.
Numerical Analysis – Data Fitting Hanyang University Jong-Il Park.
Characterizing Processors for Energy and Performance Management Harshit Goyal and Vishwani D. Agrawal Department of Electrical and Computer Engineering,
1 Hardware Reliability Margining for the Dark Silicon Era Liangzhen Lai and Puneet Gupta Department of Electrical Engineering University of California,
Smruti R. Sarangi IIT Delhi
Memory Segmentation to Exploit Sleep Mode Operation
Evaluating Register File Size
On-Chip Power Network Optimization with Decoupling Capacitors and Controlled-ESRs Wanping Zhang1,2, Ling Zhang2, Amirali Shayan2, Wenjian Yu3, Xiang Hu2,
A paper on Join Synopses for Approximate Query Answering
On calibration of micro-crack model of thermally induced cracks through inverse analysis Dr Vladimir Buljak University of Belgrade, Faculty of Mechanical.
Inference for Geostatistical Data: Kriging for Spatial Interpolation
LECTURE 05: THRESHOLD DECODING
ELEC 6970: Low Power Design Class Project By: Sachin Dhingra
LECTURE 05: THRESHOLD DECODING
Circuit Design Techniques for Low Power DSPs
Step change in the boundary condition of conduction problems
LECTURE 07: BAYESIAN ESTIMATION
LECTURE 05: THRESHOLD DECODING
Achieving Design Closure Through Delay Relaxation Parameter
Communication Driven Remapping of Processing Element (PE) in Fault-tolerant NoC-based MPSoCs Chia-Ling Chen, Yen-Hao Chen and TingTing Hwang Department.
The structure and evolution of stars
Presentation transcript:

EECS Department, Northwestern University, Evanston Thermal-Induced Leakage Power Optimization by Redundant Resource Allocation Min Ni and Seda Ogrenci Memik November 6, 2006

Thermal Leakage Coupling Four main sources of leakage current –Reverse-biased junction leakage current (IREV) –Gate induced drain leakage (IGIDL) –Gate direct tunneling leakage (IG) –Subthreshold (weak inversion) leakage (Isub)

Thermal Leakage Coupling Power consumption as a function of temperature [Pedram06]:

Previous Work Low Power Resource Binding  [Chang, DAC95], [Chang, DAC96] Temperature-aware Resource Binding [Mukherjee, DAC05]  Given Resource Constraint  Given Peak Temperature Constraint

Motivation Question: how to decide the peak temperature constraint in high-level synthesis?  One possible metric is minimizing the total leakage power Concept: two-state low power design  Phase one: low leakage power resource allocation  Phase two: low dynamic power resource binding Modeling: relation between number of resources n, temperature T and total leakage power P leakage  Solution: find the number of resources, hence, temperature that minimizes the total leakage power

Outline Leakage estimation model  Curve fitting  Heat transfer for leakage estimation Redundant resource allocation  Resource dynamic power  Estimating the package properties  Steady state temperature Experimental results

Leakage Modeling Analytic Model Curve fitting  Exact Lagrange’s interpolation:

Leakage Modeling Benefit vs. Analytic:  Polynomial is better for analytical and numerical computation  Let HSpice take care of the physical details Benefit vs. non-exact fitting, e.g. least-square  Exact fitting over the range of interest

Heat Transfer Modeling The basic relation between power density, heat transfer coefficient and temperature [Im, IEDM00]  Temperature evaluation based on dynamic power, which assumes to be a constant value

Heat Transfer Modeling Actual power The situation becomes more complicated after adding the leakage power Leakage power scaling based on the area of resource  F = 250 for 16-bit multiplier with area =  F = 80 for 32-bit adder with area =

Optimal Resource Number The relation between the number of resources and total leakage power  If we set, we have, 

What’s Next Given the number of resources n, the subproblem becomes solving the following equation Here, we still have two unknown values  Dynamic power P d  Heat transfer coefficient h Our goal is to decide n, which minimizes the following  P leakage = n*L p (T x )

Resource Dynamic Power Estimation Assumptions and simplifications  Each resource consumes a typical average dynamic power for executing one operation  Ignore the dynamic power of extra dynamic power of MUX when sharing resource Dynamic power of one operation is Bench#stepsP d per AddP d per Multi Arf Ewf Fdct Fft

Estimating the Package Properties Tradeoff between heat transfer coefficient and cost  Thermal runaway  Maximum h (minimum cost) package Find the maximum h  Binary search  Two Initial points

Steady State Temperature Solve the following equation by secant method Secant method, no explicit derivative is needed Initial point

Complete flow of algorithm Incremental search  The solution space is small Near-optimal solution  The leakage benefit becomes small Optimize when more than one resource type is in the DFG  First add redundancy for the module with highest power density  The operations are assumed to be distributed evenly among all available resources

Experiment results Resources used in the experiments Scaling from 180nm down to 70nm by full-scale methodology Benchmarks are popular DSP and multimedia kernels [Mangione-Smith, Micro97], example “arf”, AreaAverage P d Delay Adder Multiplier

Experiment Results Leakage power vs. min-resource allocation(53.8% improvement) and temperature-aware allocation(35.7% improvement) [Mukherjee, DAC05]

Experiment Results Resource temperature of different allocation strategies  Adder and multiplier temperature

Conclusions The contribution of this paper includes:  A paradigm for two-stage low power resource allocation and binding methodology  A simple leakage estimation model in high-level synthesis design phase  A leakage optimizing algorithm trading off resource area with total leakage power Thank you