Memory Segmentation to Exploit Sleep Mode Operation

Slides:



Advertisements
Similar presentations
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Advertisements

Coverage by Directional Sensors Jing Ai and Alhussein A. Abouzeid Dept. of Electrical, Computer and Systems Engineering Rensselaer Polytechnic Institute.
4/22/ Clock Network Synthesis Prof. Shiyan Hu Office: EREC 731.
NTHU-CS VLSI/CAD LAB TH EDA De-Shiuan Chiou Da-Cheng Juan Yu-Ting Chen Shih-Chieh Chang Department of CS, National Tsing Hua University, Taiwan Fine-Grained.
Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US.
Power Reduction Techniques For Microprocessor Systems
Leakage and Dynamic Glitch Power Minimization Using MIP for V th Assignment and Path Balancing Yuanlin Lu and Vishwani D. Agrawal Auburn University ECE.
Introduction to CMOS VLSI Design Lecture 18: Design for Low Power David Harris Harvey Mudd College Spring 2004.
Multiobjective VLSI Cell Placement Using Distributed Simulated Evolution Algorithm Sadiq M. Sait, Mustafa I. Ali, Ali Zaidi.
On the Limits of Leakage Power Reduction in Caches Yan Meng, Tim Sherwood and Ryan Kastner UC, Santa Barbara HPCA-2005.
System-Wide Energy Minimization for Real-Time Tasks: Lower Bound and Approximation Xiliang Zhong and Cheng-Zhong Xu Dept. of Electrical & Computer Engg.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Lecture 5 – Power Prof. Luke Theogarajan
Changbo Long ECE Department, UW-Madison Lei He EDA Research Group EE Department, UCLA Distributed Sleep Transistor Network.
Lecture 7: Power.
Memory Access Scheduling and Binding Considering Energy Minimization in Multi- Bank Memory Systems Chun-Gi Lyuh, Taewhan Kim DAC 2004, June 7-11, 2004.
Architectural and Compiler Techniques for Energy Reduction in High-Performance Microprocessors Nikolaos Bellas, Ibrahim N. Hajj, Fellow, IEEE, Constantine.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
Low Power Techniques in Processor Design
Power Reduction for FPGA using Multiple Vdd/Vth
Low-Power Wireless Sensor Networks
CAD for Physical Design of VLSI Circuits
Washington State University
1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.
Power Management of Flash Memory for Portable Devices ELG 4135, Fall 2006 Faculty of Engineering, University of Ottawa November 1, 2006 Thayalan Selvam.
A Routing Approach to Reduce Glitches in Low Power FPGAs Quang Dinh, Deming Chen, Martin D. F. Wong Department of Electrical and Computer Engineering University.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
Week 10Complexity of Algorithms1 Hard Computational Problems Some computational problems are hard Despite a numerous attempts we do not know any efficient.
3 rd Nov CSV881: Low Power Design1 Power Estimation and Modeling M. Balakrishnan.
CUHK Learning-Based Power Management for Multi-Core Processors YE Rong Nov 15, 2011.
Computer Science and Engineering Power-Performance Considerations of Parallel Computing on Chip Multiprocessors Jian Li and Jose F. Martinez ACM Transactions.
Workload Clustering for Increasing Energy Savings on Embedded MPSoCs S. H. K. Narayanan, O. Ozturk, M. Kandemir, M. Karakoy.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
CS203 – Advanced Computer Architecture
LOW POWER DESIGN METHODS
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
Power-Optimal Pipelining in Deep Submicron Technology
Advanced Algorithms Analysis and Design
COMP541 Memories II: DRAMs
CS203 – Advanced Computer Architecture
Power Management in Embedded Systems
Combinational Logic Design&Analysis.
Green cloud computing 2 Cs 595 Lecture 15.
Evaluating Register File Size
The Analysis of Cyclic Circuits with Boolean Satisfiability
Warped Gates: Gating Aware Scheduling and Power Gating for GPGPUs
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
SECTIONS 1-7 By Astha Chawla
Department of Electrical & Computer Engineering
Microarchitectural Techniques for Power Gating of Execution Units
Chapter 4: Using NP-Completeness to Analyze Subproblems
Fine-Grain CAM-Tag Cache Resizing Using Miss Tags
Challenges in Nanoelectronics: Process Variability
Digital Processing Platform
Objective of This Course
Qingbo Zhu, Asim Shankar and Yuanyuan Zhou
A High Performance SoC: PkunityTM
Sungho Kang Yonsei University
Research Status of Equivalence Checking at Zhejiang University
Post-Silicon Calibration for Large-Volume Products
Algorithms for Budget-Constrained Survivable Topology Design
Lecture 7: Power.
Department of Electrical Engineering Joint work with Jiong Luo
VLSI CAD Flow: Logic Synthesis, Placement and Routing Lecture 5
NP-Completeness Reference: Computers and Intractability: A Guide to the Theory of NP-Completeness by Garey and Johnson, W.H. Freeman and Company, 1979.
Introduction to Algorithms: Dynamic Programming
Lecture 7: Power.
The University of Adelaide, School of Computer Science
Utsunomiya University
Presentation transcript:

Memory Segmentation to Exploit Sleep Mode Operation 32nd ACM/IEEE DAC, 1997 Amir H. Farrahi, et.al. CS554 (2006 Fall) Ju-Young Jung

Introduction Background Paper overview Methodologies to obtain the idle set Experimental results Recent researches for memory power saving Leakage power Considerations Conclusion

ABC over Memory Partitioning (MP) The principle of memory partitioning Sub-divide the address space into several smaller blocks, which consumes less energy Map such blocks to different physical memory banks Selectively activate or disable each bank What is the gains from MP? Performance-oriented solution Low-energy

ABC over MP (Cont’d) Pitfalls “Let’s consider only Power” Contemporary embedded system increasingly requires more computation power, and thus low power technique must have little or no effect on performance. “Finer partitioning, better performance” Arbitrary partitioning causes an excessively large # of small banks, and thus area inefficiency. Severe wiring overhead ( wire delay > gate delay) Offset the benefit from MP, and even degraded

Example of Memory Partitioning

Skimming Over This Paper Questions? How should we partition memory to exploit sleep mode? (define this problem and suggest algorithm) How to obtain the information on idle sets? Board-level memory optimization Target for large off-chip DRAMs What is idea? Do NOT refresh DRAM blocks without live data Save refresh energy

Circuit Partitioning for sleep mode

Methodologies to Obtain the Idle Set Idle Sets for Memory Element (ME) M = {m1, m2,…, mr}; the set of dynamic MEs Access pattern for each ME expressed by (ti, Ai) ,where ti : access time / Ai : access type (R or W) No-Refresh Rules 1st access to mi is W @ t  prior to t The last access to mi  thereafter access to mi @ t followed by W access @ t’  during the interval (t, t’)

Obtaining Idle Sets for ME

Methodologies to Obtain Idle Sets Idle Sets for Clocked Element Assumption : “FUs have registers at input” If FU not used, gate the clock signal to the register No dynamic power consumption Active only during FU assigned to a control step c Otherwise it is idle for this control step

Formulation ME m is idle Interval I1 and I2 are non-overlapping Idle set Nm of m I1 covers I2 Length L(I) of an interval I = (l, r) Intersection of I1 and I2 (I1∧I2) Intersection of N1 and N2 (N1∧N2)

Problem P1 Instance : Ordered quadruple (a,b,c,S) Objective : Determine whether there exists a b-balanced bi-partitioning (S1,S2) such that : G(S1,S2) >= c Theorem : P1 is NP-complete

General Strategies for NP-Complete Rule out the possibility of existence of a polynomial time algorithm for P1 Toward theoretical end Study the complexity of special sub-class of the general problem that potentially solvable in polynomial time Toward practical end Heuristic approaches developed to solve the problem sub-optimally but in polynomial time

Exact Algorithm

Polynomial-Time Sub-Classes Problem P2 Instance : same as P1, with each NIS containing a single interval. Objective : same as P1 Theorem : P2 solvable in polynomial time

Bounded Number of Switching Input parameter d Allowable # of switching to sleep mode Problem P3 Instance : Same as P1, plus integer d Objective : Same as P1 with sw1+sw2 ≤ d Theorem : P3 solvable in (pseudo) polynomial time

Experimental Results

Target Power to Save In the past, switching power was dominant source of power consumption, and thus major concerns are laid on here Under 65nm feature size, leakage current becomes major contributor to power consumption We need to consider both dynamic and static power consumption to further save it

Leakage Power New low-power design challenge Cause is technology scaling Device speed Chip density Two principal static power components Sub-threshold leakage : weak inversion current across device Gate leakage : tunneling current through insulator

Leakage Power Operating frequency and voltage f ∝ (V − Vth)α / V Overall power consumption P = ACV 2f + VIleak Ileak = Isub + Iox

Power Breakdown Trends (ITRS 2002)

Leakage Power Sources

Considerations Obtaining to memory access pattern Dynamic access profile: statically running target application on a given microprocessor Random weighted vector generation Optimal partition based on this information As long as the profiled information sustain, good! What if feedback control system? Task periods variable Memory access pattern might be changed

Recent researches Partitioning itself Not much, and mainly focus on cache memory Segmentation Sub-banking Physical as well as logical partitioning Another report Minor contribution compared to this paper Geometric heuristic algorithm integrated with GA (genetic algorithm)

Conclusion Power consumption become one of the first-class design constraints in embedded system Dynamic power saving technique Partitioning problem to elongate sleep mode Power consumption contributor change Feedback control system’s problem

Thank You !