Critical Power Slope: Understanding the Runtime Effects of Frequency Scaling Akihiko Miyoshi †,Charles Lefurgy ‡, Eric Van Hensbergen ‡, Ram Rajamony ‡,

Slides:



Advertisements
Similar presentations
Power Reduction Techniques For Microprocessor Systems
Advertisements

1 MemScale: Active Low-Power Modes for Main Memory Qingyuan Deng, David Meisner*, Luiz Ramos, Thomas F. Wenisch*, and Ricardo Bianchini Rutgers University.
st International Conference on Parallel Processing (ICPP)
Energy in sensor nets. Where does the power go Components: –Battery -> DC-DC converter –CPU + Memory + Flash –Sensors + ADC –DAC + Audio speakers –Display.
Shimin Chen Big Data Reading Group Presented and modified by Randall Parabicoli.
CS 7810 Lecture 12 Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors D. Brooks et al. IEEE Micro, Nov/Dec.
Investigating the Effect of Voltage- Switching on Low-Energy Task Scheduling in Hard Real-Time Systems Paper review Presented by Chung-Fu Kao.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Performance and Energy Bounds for Multimedia Applications on Dual-processor Power-aware SoC Platforms Weng-Fai WONG 黄荣辉 Dept. of Computer Science National.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
Improving Proxy Cache Performance: Analysis of Three Replacement Policies John Dilley and Martin Arlitt IEEE internet computing volume3 Nov-Dec 1999 Chun-Fu.
Energy in sensor nets. Where does the power go Components: –Battery -> DC-DC converter –Sensors->ADC->MCU+Memory  Radio.
Chapter 4 Assessing and Understanding Performance
Optimization Of Power Consumption For An ARM7- BASED Multimedia Handheld Device Hoseok Chang; Wonchul Lee; Wonyong Sung Circuits and Systems, ISCAS.
Power-aware Computing n Dramatic increases in computer power consumption: » Some processors now draw more than 100 watts » Memory power consumption is.
Architectural and Compiler Techniques for Energy Reduction in High-Performance Microprocessors Nikolaos Bellas, Ibrahim N. Hajj, Fellow, IEEE, Constantine.
1 The Problem of Power Consumption in Servers L. Minas and B. Ellison Intel-Lab In Dr. Dobb’s Journal, May 2009 Prepared and presented by Yan Cai Fall.
Energy, Energy, Energy  Worldwide efforts to reduce energy consumption  People can conserve. Large percentage savings possible, but each individual has.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group.
CS 423 – Operating Systems Design Lecture 22 – Power Management Klara Nahrstedt and Raoul Rivas Spring 2013 CS Spring 2013.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
Software-Hardware Cooperative Power Management Technique for Main Memory So, today I’m going to be talking about a software-hardware cooperative power.
Authors: Tong Li, Dan Baumberger, David A. Koufaty, and Scott Hahn [Systems Technology Lab, Intel Corporation] Source: 2007 ACM/IEEE conference on Supercomputing.
Power-Aware SoC Test Optimization through Dynamic Voltage and Frequency Scaling Vijay Sheshadri, Vishwani D. Agrawal, Prathima Agrawal Dept. of Electrical.
Scaling and Packing on a Chip Multiprocessor Vincent W. Freeh Tyler K. Bletsch Freeman L. Rawson, III Austin Research Laboratory.
Energy Management Uichin Lee KAIST KSE. Mobile Processing Power – Changing the Mobile Device From
Folklore Confirmed: Compiling for Speed = Compiling for Energy Tomofumi Yuki INRIA, Rennes Sanjay Rajopadhye Colorado State University 1.
Minimizing Response Time Implication in DVS Scheduling for Low Power Embedded Systems Sharvari Joshi Veronica Eyo.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
Abhilash Thekkilakattil, Radu Dobrin, Sasikumar Punnekkat Mälardalen Real-time Research Center, Mälardalen University Västerås, Sweden Preemption Control.
Low Power Techniques in Processor Design
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
Multi Core Processor Submitted by: Lizolen Pradhan
Integrating Fine-Grained Application Adaptation with Global Adaptation for Saving Energy Vibhore Vardhan, Daniel G. Sachs, Wanghong Yuan, Albert F. Harris,
Cloud Computing Energy efficient cloud computing Keke Chen.
1 Overview 1.Motivation (Kevin) 1.5 hrs 2.Thermal issues (Kevin) 3.Power modeling (David) Thermal management (David) hrs 5.Optimal DTM (Lev).5 hrs.
1 Using Multiple Energy Gears in MPI Programs on a Power- Scalable Cluster Vincent W. Freeh, David K. Lowenthal, Feng Pan, and Nandani Kappiah Presented.
AUTHORS: STIJN POLFLIET ET. AL. BY: ALI NIKRAVESH Studying Hardware and Software Trade-Offs for a Real-Life Web 2.0 Workload.
Drowsy Caches: Simple Techniques for Reducing Leakage Power Authors: ARM Ltd Krisztián Flautner, Advanced Computer Architecture Lab, The University of.
1 EE5900 Advanced Embedded System For Smart Infrastructure Energy Efficient Scheduling.
Critical Power Slope Understanding the Runtime Effects of Frequency Scaling Akihiko Miyoshi, Charles Lefurgy, Eric Van Hensbergen Ram Rajamony Raj Rajkumar.
Energy Savings with DVFS Reduction in CPU power Extra system power.
Using Cycle Efficiency as a System Designer Metric to Characterize an Embedded DSP and Compare Hard Core vs. Soft Core Advisor Dr. Vishwani D. Agrawal.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Abhilash Thekkilakattil, Radu Dobrin, Sasikumar Punnekkat Mälardalen Real-time Research Center, Mälardalen University Västerås, Sweden Towards Preemption.
1 Tuning Garbage Collection in an Embedded Java Environment G. Chen, R. Shetty, M. Kandemir, N. Vijaykrishnan, M. J. Irwin Microsystems Design Lab The.
Srihari Makineni & Ravi Iyer Communications Technology Lab
1 Process Scheduling in Multiprocessor and Multithreaded Systems Matt Davis CS5354/7/2003.
Power Containers: An OS Facility for Fine-Grained Power and Energy Management on Multicore Servers Kai Shen, Arrvindh Shriraman, Sandhya Dwarkadas, Xiao.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Dynamic Voltage Frequency Scaling for Multi-tasking Systems Using Online Learning Gaurav DhimanTajana Simunic Rosing Department of Computer Science and.
Power and Control in Networked Sensors E. Jason Riedy and Robert Szewczyk Presenter: Fayun Luo.
NUS.SOC.CS5248 A Time Series-based Approach for Power Management in Mobile Processors and Disks X. Liu, P. Shenoy and W. Gong Presented by Dai Lu.
Improving Disk Throughput in Data-Intensive Servers Enrique V. Carrera and Ricardo Bianchini Department of Computer Science Rutgers University.
Lev Finkelstein ISCA/Thermal Workshop 6/ Overview 1.Motivation (Kevin) 2.Thermal issues (Kevin) 3.Power modeling (David) 4.Thermal management (David)
Multimedia Computing and Networking Jan Reduced Energy Decoding of MPEG Streams Malena Mesarina, HP Labs/UCLA CS Dept Yoshio Turner, HP Labs.
CprE 458/558: Real-Time Systems (G. Manimaran)1 Energy Aware Real Time Systems - Scheduling algorithms Acknowledgement: G. Sudha Anil Kumar Real Time Computing.
E-MOS: Efficient Energy Management Policies in Operating Systems
CSE 591: Energy-Efficient Computing Lecture 3 SPEED: processor Anshul Gandhi 347, CS building
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
Power Capping Via Forced Idleness ANSHUL GANDHI Carnegie Mellon Univ. 1.
Distributed Process Scheduling- Real Time Scheduling Csc8320(Fall 2013)
Overview Motivation (Kevin) Thermal issues (Kevin)
Crusoe Processor Seminar Guide: By: - Prof. H. S. Kulkarni Ashish.
Green cloud computing 2 Cs 595 Lecture 15.
Juan Rubio, Lizy K. John Charles Lefurgy
FAST: Frequency-Aware Static Timing Analysis
Presentation transcript:

Critical Power Slope: Understanding the Runtime Effects of Frequency Scaling Akihiko Miyoshi †,Charles Lefurgy ‡, Eric Van Hensbergen ‡, Ram Rajamony ‡, Raj Rajkumar † † Real-Time and Multimedia Systems Lab Dept. of Electrical and Computer Engineering Carnegie Mellon University ‡ Austin Research Laboratory IBM

The Question Operating Points –[600MHz,6V], [525MHz,4.2V],[450MHz,2.8V],[375MHz,2V],[300MHz, 1.7V], [225MHz,1.5V],[150MHz,1.45V] Where should I operate (for energy efficiency)? –Dynamic Voltage Scaling (DVS) algorithms –Lowest performance without sacrificing user/application requirement Why lowest performance is not always the best –Even for voltage scaling systems

Energy Efficiency... power time Watts Low frequencyHigh frequency

Majority of OS policies assume Not always the case! –When it is not the case? –How do we determine this? Assumption < Watts

Motivation – < : not always true –How do we choose which operating points to use? Measurement results Analytical model: Critical Power Slope Analysis on voltage scaling systems Conclusion Outline

Power Management Techniques Provides multiple operating points –[600MHz,6V],[450MHz,2.8V],[300MHz, 1.7V]…etc Three empirical data points –Frequency Scaling PowerPC 405GP –Clock Throttling Pentium with ACPI –Voltage Scaling Strong ARM SA-1100 Note: We are not making any statement on the benefits of these techniques! –These are merely samples which real systems use to manage power.

Basic Results Runtime and frequency –CPU intensive workload: inverse relationship Power and frequency –Frequency scaling, clock throttling processors CPU active: linear relationship CPU idle: constant m: slope CPU active CPU idle Power Frequency

Energy Consumption Compare energy consumption at different operating points –Same workload W –Same amount of time t power time

Energy consumption (Pentium L1 cache read hit) 2490J 2591J 174.3sec

Energy consumption (PPC L1 cache read hit) 136J 66.4sec 162J

Measurement Results Results consistent with different workloads –Register, L1 cache, memory, disk accesses –Web server (Pentium) Pentium –Highest frequency always energy efficient PowerPC –Lowest frequency always energy efficient Why? –What happens on voltage scaling systems?

Motivation –Which operating points should we consider? Measurement results –Pentium: highest performance better –PowerPC: lowest performance better Analytical model: Critical Power Slope Analysis on voltage scaling systems Conclusion Outline

CPU intensive workload W Frequency –Assume utilization of system = 1 – units of time to complete W –Energy consumed At frequency –Time to compute W: –Remaining extra idle time: Characterization

–Power increases linearly with frequency –m: slope Is energy efficient?? –True if –Depends on m Critical Power Slope

Use slope m to characterize system –Find hypothetical m for and call it Critical Power Slope (CPS) Critical Power Slope cont’d

What does it mean? Freq Power

If –Energy efficient to run at higher freq. –Pentium If –Energy efficient to run at lower freq. –PowerPC Implications of CPS <

J.Pouwelse, K.Langendoen, and H. Sips, “Dynamic Voltage Scaling on a Low-Power Microprocessor”, MOBICOM2001 Voltage Scaling Processors (Strong Arm SA-1100)

Look at every operating point at frequency If –Energy efficient at higher frequency than If –Energy efficient at lower frequency than CPS for voltage scaling system

Analysis on SA-1100 Above 74MHz At 74MHz Below 74MHz Energy Inefficient below 74MHz!

Summary Power Frequency Power Frequency Power Frequency PentiumPowerPC SA-1100 CPS: Characterizes the runtime trade-off of power management techniques

Conclusion Which operating points should we consider? –Traditional DVS algorithms attempt to go to lowest frequency –Not always the best choice Critical Power Slope Identifies energy inefficient operating points Can be used to inform OS (DVS algorithms) of operating points it should not consider