Multi-mode Energy Management for Multi-tier Server Clusters Tibor Horvath Kevin Skadron University of Virginia PACT 2008.

Slides:



Advertisements
Similar presentations
Ramya (UCSB), Parthasarathy et al (HP Labs). Overview Power delivery, consumption and cooling problems in a data center are being tackled currently by.
Advertisements

Reducing Network Energy Consumption via Sleeping and Rate- Adaption Sergiu Nedevschi, Lucian Popa, Gianluca Iannaccone, Sylvia Ratnasamy, David Wetherall.
Hadi Goudarzi and Massoud Pedram
A 2 -MAC: An Adaptive, Anycast MAC Protocol for Wireless Sensor Networks Hwee-Xian TAN and Mun Choon CHAN Department of Computer Science, School of Computing.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Power Aware Virtual Machine Placement Yefu Wang. 2 ECE Introduction Data centers are underutilized – Prepared for extreme workloads – Commonly.
Power Management in Cloud Computing using Green Algorithm -Kushal Mehta COP 6087 University of Central Florida.
ElasticTree: Saving Energy in Data Center Networks Brandon Heller, Srini Seetharaman, Priya Mahadevan, Yiannis Yiakoumis, Puneed Sharma, Sujata Banerjee,
CS 795 – Spring  “Software Systems are increasingly Situated in dynamic, mission critical settings ◦ Operational profile is dynamic, and depends.
Energy Conservation in Datacenters through Cluster Memory Management and Barely-Alive Memory Servers Vlasia Anagnostopoulou Susmit.
Power-aware Resource Allocation for Cpu- and Memory Intense Internet Services Vlasia Anagnostopoulou Susmit Biswas, Heba Saadeldeen,
Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
Mehdi Amirijoo1 Dynamic power management n Introduction n Implementation, levels of operation n Modeling n Power and performance issues regarding.
1 Prediction-based Strategies for Energy Saving in Object Tracking Sensor Networks Yingqi Xu, Wang-Chien Lee Proceedings of the 2004 IEEE International.
“ElasticTree: Saving energy in data center networks“ by Brandon Heller, Seetharaman, Mahadevan, Yiakoumis, Sharma, Banerjee, McKeown presented by Nicoara.
Energy Management and Adaptive Behavior Tarek Abdelzaher.
Energy Efficient Web Server Cluster Andrew Krioukov, Sara Alspaugh, Laura Keys, David Culler, Randy Katz.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2009 Architecture Synthesis (Provisioning, Allocation)
System-Wide Energy Minimization for Real-Time Tasks: Lower Bound and Approximation Xiliang Zhong and Cheng-Zhong Xu Dept. of Electrical & Computer Engg.
Techniques for Efficient Processing in Runahead Execution Engines Onur Mutlu Hyesoon Kim Yale N. Patt.
ECE 510 Brendan Crowley Paper Review October 31, 2006.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2009 Architecture Synthesis (Provisioning, Allocation)
Robert Deaver Sept. 24 th 2009 Energy Management for Servers and Clusters.
1 Efficient Management of Data Center Resources for Massively Multiplayer Online Games V. Nae, A. Iosup, S. Podlipnig, R. Prodan, D. Epema, T. Fahringer,
CoolAir Temperature- and Variation-Aware Management for Free-Cooled Datacenters Íñigo Goiri, Thu D. Nguyen, and Ricardo Bianchini 1.
Adaptive Server Farms for the Data Center Contact: Ron Sheen Fujitsu Siemens Computers, Inc Sever Blade Summit, Getting the.
Towards Autonomic Hosting of Multi-tier Internet Services Swaminathan Sivasubramanian, Guillaume Pierre and Maarten van Steen Vrije Universiteit, Amsterdam,
AUTONOMOUS RESOURCE PROVISIONING FOR MULTI-SERVICE WEB APPLICATIONS Jiang Dejun,Guillaume Pierre,Chi-Hung Chi WWW '10 Proceedings of the 19th international.
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
Low-Power Wireless Sensor Networks
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
Energy Usage in Cloud Part2 Salih Safa BACANLI. Cooling Virtualization Energy Proportional System Conclusion.
Profile Driven Component Placement for Cluster-based Online Services Christopher Stewart (University of Rochester) Kai Shen (University of Rochester) Sandhya.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Computational Sprinting on a Real System: Preliminary Results Arun Raghavan *, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas F. Wenisch +, Milo M. K.
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
Towards Dynamic Green-Sizing for Database Servers Mustafa Korkmaz, Alexey Karyakin, Martin Karsten, Kenneth Salem University of Waterloo.
Managing Server Energy and Operational Costs Chen, Das, Qin, Sivasubramaniam, Wang, Gautam (Penn State) Sigmetrics 2005.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
CUHK Learning-Based Power Management for Multi-Core Processors YE Rong Nov 15, 2011.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
Adaptive Sleep Scheduling for Energy-efficient Movement-predicted Wireless Communication David K. Y. Yau Purdue University Department of Computer Science.
1 Hidra: History Based Dynamic Resource Allocation For Server Clusters Jayanth Gummaraju 1 and Yoshio Turner 2 1 Stanford University, CA, USA 2 Hewlett-Packard.
Static Process Scheduling
Accounting for Load Variation in Energy-Efficient Data Centers
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
Dynamic Power Management Using Online Learning Gaurav Dhiman, Tajana Simunic Rosing (CSE-UCSD) Existing DPM policies do not adapt optimally with changing.
SizeCap: Efficiently Handling Power Surges for Fuel Cell Powered Data Centers Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Overview Motivation (Kevin) Thermal issues (Kevin)
Jacob R. Lorch Microsoft Research
Tao Zhu1,2, Chengchun Shu1, Haiyan Yu1
Green cloud computing 2 Cs 595 Lecture 15.
SECTIONS 1-7 By Astha Chawla
Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.
ElasticTree Michael Fruchtman.
PA an Coordinated Memory Caching for Parallel Jobs
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Comparison of the Three CPU Schedulers in Xen
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
ElasticTree: Saving Energy in Data Center Networks
TimeTrader: Exploiting Latency Tail to Save Datacenter Energy for Online Search Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, and T. N. Vijaykumar.
Chih-Hsun Chou Daniel Wong Laxmi N. Bhuyan
Presentation transcript:

Multi-mode Energy Management for Multi-tier Server Clusters Tibor Horvath Kevin Skadron University of Virginia PACT 2008

2 Introduction ● Well-known problem: data center energy use ● Energy is one of the largest expenses for site operators ● 1.2% of U.S. energy use attributed to servers (estimate for 2005)

3 Opportunities ● Power usage effectiveness (PUE) ● Overhead power for non-IT equipment ● EPA est. average 1.96; Google average 1.21 ● Energy proportionality of hardware ● Conversion losses, idle power, leakage ● Ideally, power should be proportional to load ● Compute capacity overprovisioning ● Static provisioning for peak expected load ● Ideally, capacity should track actual load

4 Opportunities ● Power usage effectiveness (PUE) ● Overhead power for non-IT equipment ● EPA est. average 1.96; Google average 1.21 ● Energy proportionality of hardware ● Conversion losses, idle power, leakage ● Ideally, power should be proportional to load ● Compute capacity overprovisioning ←Our focus ● Static provisioning for peak expected load ● Ideally, capacity should track actual load

5 Cluster Energy Management ● A run-time solution that dynamically: 1.estimates demand 2.reconfigures cluster to adjust capacity – activates sufficient number of nodes – may use dynamic voltage scaling (DVS) on activated nodes 3.shuts down inactive (idle) nodes – may leave some spare nodes turned on to absorb load bursts ● Reduces waste from overprovisioning

6 Multi-mode Energy Management ● Manages energy use of both the active and the inactive portion of the cluster ● Exploits multiple dynamic power management (DPM) techniques: ● DVS for the active nodes ● System sleep states for the inactive nodes – Various power levels and wakeup latencies ● Significant additional energy reduction possible ● Need to optimize capacity in each sleep state to maintain service responsiveness

7 System Model ● Cluster running a multi-tier service ● Each tier a different application ● Service level agreement: ● Average end-to-end latency ● Assumptions: ● Homogeneous tiers – Uniform CPU frequencies ● Perfect load balancing within tiers – Uniform utilization Tier 1 Tier 2 Tier s request response latency

8 Motivating Example latency target exceeded Spares ● An SLA violation is detected

9 Motivating Example ● An SLA violation is detected ● Optimal reaction? ● Activate spares? ● Reallocate among tiers? ● Increase CPU frequencies? latency target exceeded Spares ? ? ? ?

10 Motivating Example ● An SLA violation is detected ● Optimal reaction? ● Activate spares? ● Reallocate among tiers? ● Increase CPU frequencies? ● Predict system behavior for different feasible cluster configurations ● Machine power model ● Tier latency model latency target exceeded Spares ? ? ? ?

11 Power Model ● Mainly determined by CPU frequency ( f ) and utilization ( U ) ● Equation obtained from linear approximation: P(f,U) = a 3 f U + a 2 f + a 1 U + a 0 ● Parameters a i calculated using curve fitting ● Average estimation error: 1% (R 2 =0.99)

12 Multi-mode Energy Components ● Total cluster energy use: E total = E active + E sleep + E transition ● Active portion is dictated by load ● Sleeping portion can be managed separately ● Transition energy is insignificant for typical Web workloads ● Major fluctuations are long-term (daily, weekly, etc.) ● Short bursts can be filtered out

13 Active Energy Optimization ● Constrained optimization problem ● Minimize total machine power by setting – Tier sizes ( m i ) – Tier CPU frequencies ( f i ) ● Subject to – Sum of tier latencies ≤ Target latency from SLA – Sum of tier sizes ≤ Cluster size ● Solution: necessary condition For all i,j : G(m i, f i ) = G(m j, f j ) ● Can be used to limit search space

14 Sleep Energy Optimization ● Sleep states range from On to Soft-Off (0..n) ● Power draw: p i ≤ p i-1 ● Wakeup latency: ω i ≥ ω i-1 ● Control tradeoff between responsiveness and energy ● Maximum accommodated load increase rate (MALIR, σ ) ● Another constrained optimization problem ● Minimize spare machine power by selecting sleep states ● Subject to MALIR constraint – A feasible wakeup schedule must always exist to meet load increase

15 Sleep Energy Optimization ● Minimum feasible power occurs when feasible wakeup schedule is tight ● All sleeping nodes have to be woken up at t 0 ● Optimal number of spare servers for sleep state i : S i = σ(ω i+1 – ω i ) ∕ f max d(t) = d(t 0 ) + σ(t−t 0 ) demand t0t0 t0+ω1t0+ω1 t0+ω2t0+ω2 t0+ω3t0+ω3 t c(t 0 +ω i ) = c(t 0 +ω i-1 ) + S i f max capacity S 0 f max S 1 f max S 2 f max

16 Policy Design ● Active capacity policy ● Assumptions – Static machine power components dominate ● Heuristic search (each period): – Start with minimal tier sizes – Round 1: While predicted latency is too high, add one machine (at full frequency) to a selected tier – Round 2: While predicted latency is too low, lower the frequency of a selected tier – In both rounds, tier selection is guided by greedy minimization of StdDev G i

17 Policy Design ? ? G 1 =0.4 7 G 2 =0.2 3 G 3 =0.8 2

18 Policy Design G 1 =0.47 → 0.53 (StdDev=0.2409) G 2 =0.2 3 G 3 =0.8 2

19 Policy Design G 1 =0.4 7 G 2 =0.23 → 0.61 (StdDev=0.1438) G 3 =0.8 2

20 Policy Design G 1 =0.4 7 G 2 =0.2 3 G 3 =0.82 → 0.83 (StdDev=0.2466)

21 Policy Design G 1 =0.47 → 0.53 (StdDev=0.2409) G 2 =0.23 → 0.61 (StdDev=0.1438) G 3 =0.82 → 0.83 (StdDev=0.2466)

22 Policy Design ● Spare server policies ● “Optimal” Policy – Pre-computes S i and assigns spare machines to each state up to this limit, working from shallow to deep states. – This way, it attempts to ensure machines can be woken up by the time they are needed. ● Demotion Policy – Machines are put into progressively deeper states, as guided by carefully calculated fixed timeouts. – No attempt to ensure it is possible to wake up in time to meet demand. – The most widely used predictive policy

23 Feedback Control Design ● Load estimation is critical ● Utilization is not sufficient ● Adjust estimates based on performance monitoring

24 Experimental Setup ● Testbed HW: 12-node cluster ● 3 sleep states (ACPI S3, S4, S5) ● 4 CPU frequencies (1.0, 1.8, 2.0, 2.2 GHz) ● Total cluster AC power measurements ● SW: standard Web server infrastructure ● LVS (load balancer) + Apache + JBoss + MySQL ● Our system implemented in middleware ● Workload: trace-driven TPC-W ● Highly dynamic workload that emulates client behavior ● Load fluctuations simulated from real-life Web traces

25 Policies Evaluated ● Baseline: statically provisioned for peak load (not overprovisioned) ● DVS: uses our active capacity algorithm but without reconfiguration, allowing only DVS ● Optimal( states ): uses the “Optimal” spare server policy for reconfiguration ● Demotion( states ): uses the Demotion spare server policy for reconfiguration ● where states is: – Off: allow the Soft-Off (S5) state only (and DVS) – Multiple: allow all 3 sleep states (and DVS)

26 Performance and Energy Efficiency ● Average latency ● Baseline: 139 ms ● Target: 250 ms ● Active policy: 227 ms ● Overall energy (cost) savings ● DVS: 11% ● Optimal(Off): 34% ● Energy efficiency (EDP) ● Baseline: Js ● DVS: Js ● Optimal(Off): Js

27 Shape of Load Fluctuations ● Trace acceleration factor: 20x ● Exploiting multiple states yields 6–14% extra energy savings ● Optimal is less sensitive to trace differences ● Optimal outperforms Demotion for the majority of traces by up to 7% in energy savings

28 Peak Load Intensity ● Acceleration: 60x ● Multiple states yield additional energy savings of 7–23% ● Demotion(Off) is highly inefficient ● Worse energy than Optimal(Off) by up to 9% ● Fails to meet the target latency – Cause: does not leave spare capacity to absorb load while machines are being waken up

29 Time Scale of Fluctuations ● Acceleration factors: 5–60x ● System invariants cannot be accelerated – wakeup latencies, service startup times ● Major influence on energy savings and efficiency ● More dynamic fluctuations → more spare capacity is necessary → more important the policy ● High fluctuation rates + Demotion-like Off-only policy → performance degradation

30 Conclusions ● Separate concerns: active and spare capacity ● Minimal sleep energy subject to responsiveness constraints – A combination of spare capacities in multiple sleep states ● Exploiting multiple sleep states ● Extra energy savings up to 23% over Off-only version ● Negligible performance impact ● Benefits of the optimization-based spare policy ● Outperforms Demotion for the majority of traces ● Guarantees specified level of responsiveness ● No reliance on workload prediction