KnightShift: Scaling the Energy Proportionality Wall Through Server-Level Heterogeneity Daniel WongMurali Annavaram University of Southern California MICRO-2012.

Slides:



Advertisements
Similar presentations
Sabyasachi Ghosh Mark Redekopp Murali Annavaram Ming-Hsieh Department of EE USC KnightShift: Enhancing Energy Efficiency by.
Advertisements

Energy in Cloud Computing and Renewable Energy
Technology Drivers Traditional HPC application drivers – OS noise, resource monitoring and management, memory footprint – Complexity of resources to be.
Reducing Network Energy Consumption via Sleeping and Rate- Adaption Sergiu Nedevschi, Lucian Popa, Gianluca Iannaccone, Sylvia Ratnasamy, David Wetherall.
Daniel Schall, Volker Höfner, Prof. Dr. Theo Härder TU Kaiserslautern.
Computer Abstractions and Technology
Self-Adaptive, Energy-Conserving variant of Hadoop Distributed File System Kumar Sharshembiev.
1 MemScale: Active Low-Power Modes for Main Memory Qingyuan Deng, David Meisner*, Luiz Ramos, Thomas F. Wenisch*, and Ricardo Bianchini Rutgers University.
Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.
Thin Servers with Smart Pipes: Designing SoC Accelerators for Memcached Bohua Kou Jing gao.
Watchdog Confident Event Detection in Heterogeneous Sensor Networks Matthew Keally 1, Gang Zhou 1, Guoliang Xing 2 1 College of William and Mary, 2 Michigan.
Datacenter Power State-of-the-Art Randy H. Katz University of California, Berkeley LoCal 0 th Retreat “Energy permits things to exist; information, to.
Energy-Efficient Design Some design issues in each protocol layer Design options for each layer in the protocol stack.
Energy Efficient Web Server Cluster Andrew Krioukov, Sara Alspaugh, Laura Keys, David Culler, Randy Katz.
Techniques for Efficient Processing in Runahead Execution Engines Onur Mutlu Hyesoon Kim Yale N. Patt.
On the Energy Efficient Design of Wireless Sensor Networks Tariq M. Jadoon, PhD Department of Computer Science Lahore University of Management Sciences.
CS : Creating the Grid OS—A Computer Science Approach to Energy Problems David E. Culler, Randy H. Katz University of California, Berkeley August.
ECE 510 Brendan Crowley Paper Review October 31, 2006.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
Cutting the Electric Bill for Internet-Scale Systems Andreas Andreou Cambridge University, R02
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
SensEye: A Multi-Tier Camera Sensor Network by Purushottam Kulkarni, Deepak Ganesan, Prashant Shenoy, and Qifeng Lu Presenters: Yen-Chia Chen and Ivan.
Trust Management in Mobile Ad Hoc Networks Using a Scalable Maturity-Based Model Authors: Pedro B. Velloso, Rafael P. Laufer, Daniel de O. Cunha, Otto.
Cloud Data Center/Storage Power Efficiency Solutions Junyao Zhang 1.
Identifying and Using Energy Critical Paths Nedeljko Vasić with Dejan Novaković, Satyam Shekhar, Prateek Bhurat, Marco Canini, and Dejan Kostić EPFL, Switzerland.
Naixue GSU Slide 1 ICVCI’09 Oct. 22, 2009 A Multi-Cloud Computing Scheme for Sharing Computing Resources to Satisfy Local Cloud User Requirements.
Folklore Confirmed: Compiling for Speed = Compiling for Energy Tomofumi Yuki INRIA, Rennes Sanjay Rajopadhye Colorado State University 1.
PARAID: The Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, An-I Andy Wang – Florida State University Peter Reiher – University of California,
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
Power Save Mechanisms for Multi-Hop Wireless Networks Matthew J. Miller and Nitin H. Vaidya University of Illinois at Urbana-Champaign BROADNETS October.
Improving Network I/O Virtualization for Cloud Computing.
Last Time Performance Analysis It’s all relative
1 Using Multiple Energy Gears in MPI Programs on a Power- Scalable Cluster Vincent W. Freeh, David K. Lowenthal, Feng Pan, and Nandani Kappiah Presented.
Architectural Support for Fine-Grained Parallelism on Multi-core Architectures Sanjeev Kumar, Corporate Technology Group, Intel Corporation Christopher.
Optimal Client-Server Assignment for Internet Distributed Systems.
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
Critical Power Slope Understanding the Runtime Effects of Frequency Scaling Akihiko Miyoshi, Charles Lefurgy, Eric Van Hensbergen Ram Rajamony Raj Rajkumar.
Low-Latency Datacenters John Ousterhout Platform Lab Retreat May 29, 2015.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
Dana Butnariu Princeton University EDGE Lab June – September 2011 OPTIMAL SLEEPING IN DATACENTERS Joint work with Professor Mung Chiang, Ioannis Kamitsos,
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Efficient Energy Management Protocol for Target Tracking Sensor Networks X. Du, F. Lin Department of Computer Science North Dakota State University Fargo,
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
An Energy Efficient MAC Protocol for Wireless LANs, E.-S. Jung and N.H. Vaidya, INFOCOM 2002, June 2002 吳豐州.
Accounting for Load Variation in Energy-Efficient Data Centers
Link Layer Support for Unified Radio Power Management in Wireless Sensor Networks IPSN 2007 Kevin Klues, Guoliang Xing and Chenyang Lu Database Lab.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
An Integrated GPU Power and Performance Model (ISCA’10, June 19–23, 2010, Saint-Malo, France. International Symposium on Computer Architecture)
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
E-MOS: Efficient Energy Management Policies in Operating Systems
1 Chapter Overview Monitoring Access to Shared Folders Creating and Sharing Local and Remote Folders Monitoring Network Users Using Offline Folders and.
Jennifer Rexford Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks Energy.
CS203 – Advanced Computer Architecture Warehouse Scale Computing.
Power-Optimal Pipelining in Deep Submicron Technology
Power Management in Embedded Systems
Seth Pugsley, Jeffrey Jestes,
Green cloud computing 2 Cs 595 Lecture 15.
Warped Gates: Gating Aware Scheduling and Power Gating for GPGPUs
System Control based Renewable Energy Resources in Smart Grid Consumer
Green Software Engineering Prof
Chapter 6 Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism Topic 13 Using Energy Efficiently Inside the Server Prof. Zhang.
Digital Processing Platform
Design Unit 26 Design a small or home office network
TimeTrader: Exploiting Latency Tail to Save Datacenter Energy for Online Search Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, and T. N. Vijaykumar.
Cloud Computing Architecture
Chih-Hsun Chou Daniel Wong Laxmi N. Bhuyan
Presentation transcript:

KnightShift: Scaling the Energy Proportionality Wall Through Server-Level Heterogeneity Daniel WongMurali Annavaram University of Southern California MICRO-2012 Supported by NSF and DARPA

Overview Overview | 2  1. Measuring EP|2. EP Trends|3. KnightShift |4. Effect on EP|5. Evaluation

Measuring Energy Proportionality Measuring EP | 3  Energy Proportionality Curve  Actual – empirically measured power usage  Linear – extrapolated from peak to idle power usage  Ideal – utilization and power are perfectly proportional Server BServer A

 DR is a course first-order approximation of EP ❖ …but it is not accurate – only measures two extremes ❖ Ignores power consumption at intermediate utilizations  Assuming 100W peak and Google datacenter utilization [1] ❖ Server A = 68.6W, Server B = 64.6W Dynamic Range (DR) Measuring EP | 4 DR=60%DR=50% [1] L. Barroso and U. Holzle,“The Case For Energy-proportional Computing,” Computer, Dec How can we accurately quantify EP?

 EP is a better indicator of energy usage than DR  Why is DR not enough? ❖ EP = DR + how linear the energy proportionality curve Energy Proportionality (EP) [2] Measuring EP | 5 [2] F. Ryckbosch, S. Polfliet, and L. Eeckhout, “Trends in Server Energy Proportionality,” Computer,2011. EP=53%EP=57% ???

 Linearly Energy Proportional (LD=0)EP=DR  Superlinearly Energy Proportional (+LD)EP<DR  Sublinearly Energy Proportional (-LD) EP>DR  LD shows how far off the actual EP curve is from the linear EP curve Linear Deviation (LD) Measuring EP | 6 SuperlinearSublinear

 Proportionality Gap utilization x% Proportionality Gap (PG) Measuring EP | 7

 SPECpower_ssj2008 ❖ Measures performance and power at 10% utilization intervals  291 servers  November 2007 – December 2011 Energy Proportionality Trends Trends | 8

 ❖ DR improves from 50% to 80%  Since 2009 ❖ DR stalled at 80%  100% DR very difficult ❖ Power supplies, voltage converters, fans, chipsets, network, etc. Dynamic Range Trends Trends | 9

 EP also stalled around 80% ❖ Caused by DR  High EP servers are -LD Energy Proportionality Trends Trends | 10 Since DR growth stalled, the only way to improve EP is through lowering LD

 Large PG at low utilization regardless of EP  As EP improves, PG at high utilization near 0 Proportionality Gap Trends Trends | 11 Energy disproportionality at low utilization will be the main obstacle to achieving perfectly ideal EP

 Energy efficiency is defined as ssj_ops/watt  Energy efficiency at high load has grown dramatically  Energy efficiency at low load has grown slowly  Most datacenter workloads spent majority of time at low load Energy Efficiency Trends Trends | 12 Low utilization energy efficiency growth must be addressed to improve overall server energy efficiency

 EP stall primarily caused by stall in DR ❖ Main focus has been improving peak and idle power consumption  To improve EP in the future: ❖ Improve LD ❖ Target large proportionality gap at low utilizations  Previous server-level low power modes are inactive ❖ Exploits idle periods  DR improvements  There is now a need for server-level active low power modes ❖ Exploits low utilization periods  LD/PG improvements Overcoming the EP Wall Trends | 13

 Server-level active low power mode solution to exploit low utilization periods  Basic Idea -- fronts a high-power primary server with a low-power compute node, called the Knight  Knight capability = fraction of throughput compared to primary server  KnightShift consists of 3 components: ❖ KnightShift hardware ❖ System software ✒ Supports certain functionality (data sharing, networking, etc) ❖ KnightShift runtime ✒ Supports KnightShift functionality KnightShift Server Architecture KnightShift | 14

 Primary Server and Knight contains independent CPU/Memory/Chipset  Independent power domains ❖ Remote wakeup through wake-on-lan  Shared Disk (NFS)  Networking through simple router ❖ Communicate b/t both nodes ❖ Expose only Knight’s IP ❖ Requires Knight to stay on  Implementation Options: ❖ Ensemble-level (Commodity parts) ❖ Board-level (Motherboard Intg.) ❖ Server-level (Add-on board) Ensemble-level KnightShift KnightShift | 15

 Example KnightShift operation KnightShift Runtime KnightShift | 16 Sleep Wakeup awake sync Low High  Power Consumption  Primary: Flush memory state Primary: Send sleep message and enter low power state Knight: Begin processing request Knight: Sends wakeup message Primary: Wakes up and sends awake message Knight: Flush memory state. Sends sync message. Primary: Begin processing requests Primary Server Knight

 Monitors server utilization  Mode switching policy ❖ Aggressively switch into the Knight ❖ Conservatively switch out off the Knight ❖ More optimized policy will improve response time at cost of energy  Redirect requests (Using scheduler/web balancer) ❖ Forward incoming requests to active node  Coordinating mode switching ❖ Ensure data consistency KnightShift Runtime KnightShift | 17

 KnightShift-enhanced 291 SPECpower servers  Theoretically scale power of Knight ❖ Power Knight = C 1.7 × Power Primary, with Knight capability C Effect of KnightShift on EP KnightShift EP | 18

20% Knight 50% Knight Effect of KnightShift on PG KnightShift EP | 19 KnightShift effectively close the proportionality gap at low utilization

 KnightShift essentially shifted all servers to –LD  All servers now have EP>60% (from 20%)  Some servers with EP=1 ❖ KnightShift can achieve ideal EP! Effect of KnightShift on EP and LD KnightShift EP | 20

 Primary Server ❖ Dual 4-core Intel Xeon L5630 ❖ 500GB HD, 36GB DRAM ❖ 156W-205W ❖ Sleep/Wakeup time 5/20s  Knight ❖ Intel Atom D525 (15% capable) ❖ 500GB HD, 1GB DRAM ❖ 15W-16.7W  EP improved from 24% to 48% Prototype Evaluation Evaluation | 21

 Wikipedia-based benchmark (WikiBench) [3] ❖ Cloned Wikipedia database dump ❖ Request trace from actual Wikipedia traffic Prototype Evaluation Evaluation | 22 [3]Wikibench –

Prototype Results Evaluation | 23 High power usage during high utilization Knight saves significant power during low utilization  Queuing model simulation  Sensitivity Analysis ❖ Utilization patterns ❖ Knight capability ❖ Transition time

 EP growth stalled by DR  Large disproportionality at low utilization  Key to improving EP ❖ Improve LD ❖ Target low utilization proportionality gap ❖ Need for server-level active low power mode  KnightShift exploits low utilization periods using a Knight ❖ Enables high efficiency at low utilization ❖ Effectively improves DR, LD and closes PG gap at low util. ❖ In some cases, achieves ideal EP Conclusion Conclusion | 24

Thank you! Questions? Conclusion | 25