VADA Lab.SungKyunKwan Univ. 1 Lower Power Voltage Scaling 1999. 8. 성균관대학교 조 준 동

Slides:



Advertisements
Similar presentations
Zhou Peng, Zuo Decheng, Zhou Haiying Harbin Institute of Technology 1.
Advertisements

VADA Lab.SungKyunKwan Univ. 1 L3: Lower Power Design Overview (2) 성균관대학교 조 준 동 교수
Power Reduction Techniques For Microprocessor Systems
Minimum Energy CMOS Design with Dual Subthrehold Supply and Multiple Logic-Level Gates Kyungseok Kim and Vishwani D. Agrawal ECE Dept. Auburn University.
Fall 06, Sep 19, 21 ELEC / Lecture 6 1 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic.
L27:Lower Power Algorithm for Multimedia Systems 성균관대학교 조 준 동
Polynomial-Time Algorithms for Designing Dual-Voltage Energy Efficient Circuits Master’s Thesis Defense Mridula Allani Advisor : Dr. Vishwani D. Agrawal.
Energy Source Lifetime Optimization for a Digital System through Power Management Department of Electrical and Computer Engineering Auburn University,
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
VADA Lab.SungKyunKwan Univ. 1 Dynamic Voltage Scaling.
Aleksandra Tešanović Low Power/Energy Scheduling for Real-Time Systems Aleksandra Tešanović Real-Time Systems Laboratory Department of Computer and Information.
Chia-Yen Hsieh Laboratory for Reliable Computing Microarchitecture-Level Power Management Iyer, A. Marculescu, D., Member, IEEE IEEE Transaction on VLSI.
Spring 07, Feb 20 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Reducing Power through Multicore Parallelism Vishwani.
Hardware/Software Integration in Portable Systems Trevor Pering University of California Berkeley.
Instruction Set Architecture (ISA) for Low Power Hillary Grimes III Department of Electrical and Computer Engineering Auburn University.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Spring 07, Feb 22 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Power Aware Microprocessors Vishwani D. Agrawal.
Processor Frequency Setting for Energy Minimization of Streaming Multimedia Application by A. Acquaviva, L. Benini, and B. Riccò, in Proc. 9th Internation.
Author: D. Brooks, V.Tiwari and M. Martonosi Reviewer: Junxia Ma
Low Power Design of Integrated Systems Assoc. Prof. Dimitrios Soudris
Low power architecture and HDL coding practices for on-board hardware applications Kaushal D. Buch ASIC Engineer, eInfochips Ltd., Ahmedabad, India
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
CS 423 – Operating Systems Design Lecture 22 – Power Management Klara Nahrstedt and Raoul Rivas Spring 2013 CS Spring 2013.
Computer performance.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Power-Aware SoC Test Optimization through Dynamic Voltage and Frequency Scaling Vijay Sheshadri, Vishwani D. Agrawal, Prathima Agrawal Dept. of Electrical.
Minimizing Response Time Implication in DVS Scheduling for Low Power Embedded Systems Sharvari Joshi Veronica Eyo.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
Power Saving at Architectural Level Xiao Xing March 7, 2005.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Determining the Optimal Process Technology for Performance- Constrained Circuits Michael Boyer & Sudeep Ghosh ECE 563: Introduction to VLSI December 5.
Computer Performance Computer Engineering Department.
2011/08/09 Sunwook Bae. Contents Paper Info Introduction Overall Architecture Resource Management Evaluation Conclusion References.
Low-Power Wireless Sensor Networks
Computer Science Department University of Pittsburgh 1 Evaluating a DVS Scheme for Real-Time Embedded Systems Ruibin Xu, Daniel Mossé and Rami Melhem.
Low Power Design for Real-Time Systems Low power (energy) consumption is a key design for embedded systems Battery’s life during operation Reliability.
An Efficient Algorithm for Dual-Voltage Design Without Need for Level-Conversion SSST 2012 Mridula Allani Intel Corporation, Austin, TX (Formerly.
1 EE5900 Advanced Embedded System For Smart Infrastructure Energy Efficient Scheduling.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Using Cycle Efficiency as a System Designer Metric to Characterize an Embedded DSP and Compare Hard Core vs. Soft Core Advisor Dr. Vishwani D. Agrawal.
3 rd Nov CSV881: Low Power Design1 Power Estimation and Modeling M. Balakrishnan.
Power Estimation and Optimization for SoC Design
Power and Control in Networked Sensors E. Jason Riedy and Robert Szewczyk Presenter: Fayun Luo.
Morgan Kaufmann Publishers
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
Preface.
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
Class Report 林常仁 Low Power Design: System and Algorithm Levels.
Class Report 何昭毅 : Voltage Scaling. Source of CMOS Power Consumption  Dynamic power consumption  Short circuit power consumption  Leakage power consumption.
Tae- Hyoung Kim, Hanyong Eom, John Keane Presented by Mandeep Singh
JouleTrack - A Web Based Tool for Software Energy Profiling Amit Sinha and Anantha Chandrakasan Massachusetts Institute of Technology June 19, 2001.
11/15/05ELEC / Lecture 191 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
LOW POWER DESIGN METHODS
M V Ganeswara Rao Associate Professor Dept. of ECE Shri Vishnu Engineering College for Women Bhimavaram Hardware Architecture of Low-Power ALU using Clock.
Adiabatic Technique for Energy Efficient Logic Circuits Design
Lecture 2: Performance Evaluation
CS161 – Design and Architecture of Computer Systems
Jacob R. Lorch Microsoft Research
Andrea Acquaviva, Luca Benini, Bruno Riccò
Green cloud computing 2 Cs 595 Lecture 15.
Evaluating Register File Size
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
SECTIONS 1-7 By Astha Chawla
Morgan Kaufmann Publishers
Dynamic Voltage Scaling
Circuit Design Techniques for Low Power DSPs
A High Performance SoC: PkunityTM
Nov. 12, 1997 Bob Brodersen ( CS 152 Computer Architecture and Engineering Introduction to Architectures for Digital.
Presentation transcript:

VADA Lab.SungKyunKwan Univ. 1 Lower Power Voltage Scaling 성균관대학교 조 준 동

VADA Lab.SungKyunKwan Univ. 2 Voltage Scaling Merely changing a processor clock frequency is not an effective technique for reducing energy consumption. Reducing the clock frequency will reduce the power consumed by a processor, however, it does not reduce the energy required to perform a given task. Lowering the voltage along with the clock actually alters the energy-per-operation of the microprocessor, reducing the energy required to perform a fixed amount of work.

VADA Lab.SungKyunKwan Univ. 3 Dynamic Voltage Scaling(DVS)

VADA Lab.SungKyunKwan Univ. 4 Processor Usage Model

VADA Lab.SungKyunKwan Univ. 5 OS: Voltage Scaling

VADA Lab.SungKyunKwan Univ. 6 Scale Supply Voltage with f CLK

VADA Lab.SungKyunKwan Univ. 7 Adaptive Power Supply Voltages

VADA Lab.SungKyunKwan Univ. 8 Variable Supply Voltage Block Diagram

VADA Lab.SungKyunKwan Univ. 9 Typical MPEG IDCT Histogram

VADA Lab.SungKyunKwan Univ. 10 Voltage scheduling under timing constraints –Energy consumption of a processor: 10nJ/cycle at 2.5V 25nJ/cycle at 4 V 40nJ/cycle at 5V –maximum clock frequencies: 50MHz at 5V, 40MHz at 4V, 25MHz at 2.5V –Given that an application needs 1000M cycles to finish and the timing constaint is 25sec.

VADA Lab.SungKyunKwan Univ. 11 Different Voltage Schedules Time(sec) Mcycles 50MHz 40J (A) Time(sec) Mcycles 50MHz 32.5J (B) Time(sec) Mcycles 40MHz 25J (C) Timing constraint Mcycles 25MHz Energy consumption (  V dd 2 )

VADA Lab.SungKyunKwan Univ. 12 Example of Variable Supply

VADA Lab.SungKyunKwan Univ. 13 DVS Implementation

VADA Lab.SungKyunKwan Univ. 14 Variable Supply Voltage Block Diagram Computational work varies with time. An approach to reduce the energy consumption of such systems beyond shut down involves the dynamic adjustment of supply voltage based on computational workload. The basic idea is to lower power supply when the a fixed supply for some fraction of time. The supply voltage and clock rate are increased during high workload period.

VADA Lab.SungKyunKwan Univ. 15 Data Driven Signal Processing The basic idea of averaging two samples are buffered and their work loads are averaged. The averaged workload is then used as the effective workload to drive the power supply. Using a pingpong buffering scheme, data samples I n +2, I n +3 are being buffered while I n, I n +1 are being processed.

VADA Lab.SungKyunKwan Univ. 16 Example of Buffering

VADA Lab.SungKyunKwan Univ. 17 Graphical Interpretation

VADA Lab.SungKyunKwan Univ. 18 Buffering Example: MPEG Decoder

VADA Lab.SungKyunKwan Univ. 19 DVS

VADA Lab.SungKyunKwan Univ. 20 DVS Scheduling Framework µProc. Speed Time StartDeadlineStartDeadline Idle time represents wasted energy Lower speed, Lower voltage, Lower energy Energy ~ Work Speed Work Use real-time framework to constrain task voltage scheduling

VADA Lab.SungKyunKwan Univ. 21 DVS Simulation Speed Time S1S1 S2S2 S3S3 D1D1 D3D3 D2D2 Task Variance Weather Interrupts User Input Cache Behavior Scheduling Overhead Intercom RealityTheory Implementation Simulate run-time scheduler to fully understand voltage-scaling behavior

VADA Lab.SungKyunKwan Univ. 22 Simulation Infrastructure GUI Run-time Scheduler Voltage Scheduler Application support libraries MPEG  Priority 80 GUI  Priority 23 MPEG  Priority 80 GUI  Priority 23 Speed  Priority { Frame_Start(deadline); Decode_MPEG_Frame(); Frame_Finish(); } { Frame_Start(deadline); Decode_MPEG_Frame(); Frame_Finish(); } Windowing Cryptography I/O Support lpARM MPEG Develop support environment to model complete software system

VADA Lab.SungKyunKwan Univ. 23 Run-Time Voltage Scaling Normalized to 3.3V fixed-voltage processor Combination of independent benchmarks Dynamic Voltage Scaling significantly reduces energy dissipation!

VADA Lab.SungKyunKwan Univ. 24 Run-Time Performance Analysis AudioMPEGGUI Software can automatically recognize and adjust for bi-modal GUI distribution 0 2x deadline Normalized to deadline at max processor speed Application characteristics strongly affect voltage scaling performance

VADA Lab.SungKyunKwan Univ. 25 Compute ASAP+ System Shutdown

VADA Lab.SungKyunKwan Univ. 26 Another Approach: Reduce Clock Frequency

VADA Lab.SungKyunKwan Univ. 27 Voltage Scheduling II

VADA Lab.SungKyunKwan Univ. 28 Evaluation: Algorithms

VADA Lab.SungKyunKwan Univ. 29 AVG Computes an exponentially moving average of the previous intervals. At each interval the run-percent from the previous interval is combined with the previous running average, forming a long-term prediction of system behavior. is the relative weighting of past intervals relative of the current interval (larger value means a great weight on the past) using the equation (weight X old + new)/(weight+1). 3 can be used.

VADA Lab.SungKyunKwan Univ. 30 OS: Voltage Scheduling

VADA Lab.SungKyunKwan Univ. 31 Run-Time Scheduling Dynamics µProc. Speed Time Thread accomplishing more than expected, reduce speed Deadline exceeded, increase speed Higher-priority task Run faster to make up lost time Initial speed estimate Optimal schedule E(work) Workload calculated to be average of previous frames Periodically re-evaluate schedule to adjust for unforeseen events

VADA Lab.SungKyunKwan Univ. 32 Vertical Layering

VADA Lab.SungKyunKwan Univ. 33 Optimal Scheduling For a region spanned by a given task specification, each point in time will either be scheduled at the minimum speed spanned by that task or else the task will not be scheduled to run at that point. Algorithm n tasks to schedule O(n) speed settings to consider for each task O(n) linked tasks requiring adjustment for each setting: Total complexity: O(n 3 ) time.

VADA Lab.SungKyunKwan Univ. 34 Scheduling step0

VADA Lab.SungKyunKwan Univ. 35 Scheduling step1

VADA Lab.SungKyunKwan Univ. 36 Scheduling step2

VADA Lab.SungKyunKwan Univ. 37 Scheduling step3

VADA Lab.SungKyunKwan Univ. 38 Scheduling step4

VADA Lab.SungKyunKwan Univ. 39 Scheduling step5

VADA Lab.SungKyunKwan Univ. 40 References [Lin97] Lin et al., "Scheduling Techniques for Variable Voltage Low Power Designs," ACM Transactions on Design Automation of Electronic Systems, vol. 2, no. 2, pp , [Govil95] - Extended simulation with practical algorithms on traces of UNIX workstations [Kuroda98] - Implementation of DVS processor to mitigate effects of process variation [Ishihara98] - Dynamic voltage scaling with non- constant capacitances S. Gary, et. al., "The PowerPC 603 Microprocessor: A Low-Power Design for Portable Applications," Proceedings of the Thirty-Ninth IEEE Computer Society International Conference, Mar. 1994, pp A. Chandrakasan, Low Power Digital CMOS Design, Boston: Kluwer Academic Publishers, C. Nagendra, et.al., "A Comparison of the Power-Delay Characteristics of CMOS Adders,” Proceedings of the International Workshop on Low Power Design, Apr. 1994, pp T. Callaway and E. Swartzlander, "Optimizing Arithmetic Elements for Signal Processing," VLSI Signal Processing, Vol. 5, New York: IEEE Special Publications, 1992, pp T. Biggs, et. al., "A 1 Watt Compatible Microprocessor," Proceedings of the IEEE Symposium on Low Power Electronics, Oct. 1994, pp J. Lorch, A Complete Picture of the Energy Consumption of a Portable Computer, M.S. Thesis, University of California, Berkeley, 1995

VADA Lab.SungKyunKwan Univ. 41 References S. Kunii, "Means of Realizing Long Battery Life in Portable PCs," Proceedings of the IEEE Symposium on Low Power Electronics, Oct. 1995, pp M. Culbert, "Low Power Hardware for a High Performance PDA," Proceedings of the Thirty-Ninth IEEE Computer Society International Conference, Mar. 1994, pp T. Ikeda, "ThinkPad Low-Power Evolution," Proceedings of the IEEE Symposium on Low Power Electronics, Oct. 1995, pp A. Chandrakasan, A. Burstein, and R.W. Brodersen, "A Low Power Chipset for Portable Multimedia Applications," IEEE Journal of Solid State Circuits, Vol. 29, Dec. 1994, pp M. Horowitz, T. Indermaur, and R. Gonzalez, "Low-Power Digital Design," Proceedings of the IEEE Symposium on Low Power Electronics, Oct. 1994, pp D. Lidsky and J. Rabaey, "Early Power Exploration - A World Wide Web Application," Proceedings of the Thirty-Third Design Automation Conference, June T. Burd, Low-Power CMOS Cell Library Design Methodology, M.S. Thesis, University of California, Berkeley, UCB/ERL M94/89, 1994.

VADA Lab.SungKyunKwan Univ. 42 A. Chandrakasan, S. Sheng, and R.W. Brodersen, "Low-Power CMOS Digital Design," IEEE Journal of Solid State Circuits, Apr. 1992, pp Advanced RISC Machines, Ltd., ARM710 Data Sheet, Technical Document, Dec Integrated Device Technology, Inc., Enhanced Orion 64-Bit RISC Microprocessor, Data Sheet, Sep Intel Corp., Embedded Ultra-Low Power Intel486TM GX Processor, SmartDieTM Product Specification, Dec A. Stratakos, S. Sanders, and R.W. Brodersen, "A Low-voltage CMOS DC-DC Converter for Portable Battery-operated Systems," Proceedings of the Twenty-Fifth IEEE Power Electronics Specialist Conference, June 1994, pp J. Bunda, et. al., "16-Bit vs. 32-Bit Instructions for Pipelined Architectures," Proceedings of the 20th International Symposium on Computer Architecture, May 1993, pp Advanced RISC Machines, Ltd., Introduction to Thumb, Developer Technical Document, Mar

VADA Lab.SungKyunKwan Univ. 43 J. Bunda, W.C. Athas, and D. Fussell, "Evaluating Power Implications of CMOS Microprocessor Design Decisions," Proceedings of the International Workshop on Low Power Design, Apr. 1994, pp P. Freet, "The SH Microprocessor: 16-Bit Fixed Length Instruction Set Provides Better Power and Die Size," Proceedings of the Thirty-Ninth IEEE Computer Society International Conference, Mar. 1994, pp T. Burd, B. Peters, A Power Analysis of a Microprocessor: A Study of an Implementation of the MIPS R3000 Architecture, ERL Technical Report, University of California, Berkeley, J. Montanaro, et. al., "A 160MHz 32b 0.5W CMOS RISC Microprocessor," Proceedings of the Thirty-Ninth IEEE International Solid-State Circuits Conference - Slide Supplement, Feb. 1996, pp J. Bunda, Instruction-Processing Optimization Techniques for VLSI Microprocessors, Ph.D. Thesis, The University of Texas at Austin, R. Gonzalez and M. Horowitz, "Energy Dissipation in General Purpose Processors," Proceedings of the IEEE Symposium on Low Power Electronics, Oct. 1995, pp