SizeCap: Efficiently Handling Power Surges for Fuel Cell Powered Data Centers Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric.

Slides:



Advertisements
Similar presentations
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Advertisements

Introduction Build and impact metric data provided by the SGIG recipients convey the type and extent of technology deployment, as well as its effect on.
Application-Aware Memory Channel Partitioning † Sai Prashanth Muralidhara § Lavanya Subramanian † † Onur Mutlu † Mahmut Kandemir § ‡ Thomas Moscibroda.
Building Cloud-ready Video Transcoding System for Content Delivery Networks(CDNs) Zhenyun Zhuang and Chun Guo Speaker: 饒展榕.
1 Storage-Aware Caching: Revisiting Caching for Heterogeneous Systems Brian Forney Andrea Arpaci-Dusseau Remzi Arpaci-Dusseau Wisconsin Network Disks University.
Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks By C. K. Toh.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
ElasticTree: Saving Energy in Data Center Networks Brandon Heller, Srini Seetharaman, Priya Mahadevan, Yiannis Yiakoumis, Puneed Sharma, Sujata Banerjee,
Peter X. Gao, Andrew R. Curtis, Bernard Wong, S. Keshav Cheriton School of Computer Science University of Waterloo August 15,
Xavier León PhD defense
GREEN CLOUD By Sphoorthy. LOGO WHAT IS CLOUD COMPUTING? Cloud computing is a model for enabling convenient, on- demand network access to a shared pool.
Chuanjun Zhang, UC Riverside 1 Low Static-Power Frequent-Value Data Caches Chuanjun Zhang*, Jun Yang, and Frank Vahid** *Dept. of Electrical Engineering.
Proteus: Power Proportional Memory Cache Cluster in Data Centers Shen Li, Shiguang Wang, Fan Yang, Shaohan Hu, Fatemeh Saremi, Tarek Abdelzaher.
Intelligent Placement of Datacenters for Internet Services Íñigo Goiri, Kien Le, Jordi Guitart, Jordi Torres, and Ricardo Bianchini 1.
Commonwealth of Massachusetts Statewide Strategic IT Consolidation (ITC) Initiative ITD Virtualization and Shared Services Executive Briefing Presentation.
Exploiting Virtualization for Delivering Cloud based IPTV Services Speaker : 吳靖緯 MA0G IEEE Conference on Computer Communications Workshops.
Buffer Management for Shared- Memory ATM Switches Written By: Mutlu Apraci John A.Copelan Georgia Institute of Technology Presented By: Yan Huang.
Discrete Event Simulation in Automotive Final Process System Vishvas Patel John Ma Throughput Analysis & Simulations General Motors 1999 Centerpoint Parkway.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
Liam Newcombe BCS Data Centre Specialist Group Secretary Modelling Data Centre Energy Efficiency and Cost.
Budget-based Control for Interactive Services with Partial Execution 1 Yuxiong He, Zihao Ye, Qiang Fu, Sameh Elnikety Microsoft Research.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
Design and Evaluation of Hierarchical Rings with Deflection Routing Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, ​ Kevin Chang, Greg Nazario, Reetuparna.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
An Energy-Efficient Hypervisor Scheduler for Asymmetric Multi- core 1 Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer.
CPU Scheduling CSCI 444/544 Operating Systems Fall 2008.
Neil Sanderson 24 October, Early days for virtualisation Virtualization Adoption x86 servers used for virtualization Virtualization adoption.
Row Buffer Locality Aware Caching Policies for Hybrid Memories HanBin Yoon Justin Meza Rachata Ausavarungnirun Rachael Harding Onur Mutlu.
A Row Buffer Locality-Aware Caching Policy for Hybrid Memories HanBin Yoon Justin Meza Rachata Ausavarungnirun Rachael Harding Onur Mutlu.
Software Architecture for Dynamic Thermal Management in Datacenters Tridib Mukherjee Graduate Research Assistant IMPACT Lab ( Department.
Thermal Aware Data Management in Cloud based Data Centers Ling Liu College of Computing Georgia Institute of Technology NSF SEEDM workshop, May 2-3, 2011.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Zeta: Scheduling Interactive Services with Partial Execution Yuxiong He, Sameh Elnikety, James Larus, Chenyu Yan Microsoft Research and Microsoft Bing.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
HAT: Heterogeneous Adaptive Throttling for On-Chip Networks Kevin Kai-Wei Chang Rachata Ausavarungnirun Chris Fallin Onur Mutlu.
Carnegie Mellon University, *Seagate Technology
1 Hardware Reliability Margining for the Dark Silicon Era Liangzhen Lai and Puneet Gupta Department of Electrical Engineering University of California,
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
World Energy and Environmental Outlook to 2030
GreenCloud: A Packet-level Simulator of Energy-aware Cloud Computing Data Centers Dzmitry Kliazovich, Pascal Bouvry, Yury Audzevich, and Samee Ullah Khan.
OPERATING SYSTEMS CS 3502 Fall 2017
UH-MEM: Utility-Based Hybrid Memory Management
Seth Pugsley, Jeffrey Jestes,
Threads vs. Events SEDA – An Event Model 5204 – Operating Systems.
Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance
Green cloud computing 2 Cs 595 Lecture 15.
Energy Sources and Sustainability
Maieutic Parataxis.
Dynamo: Amazon’s Highly Available Key-value Store
Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.
Ching-Chi Lin Institute of Information Science, Academia Sinica
Comparison of the Three CPU Schedulers in Xen
Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller
Row Buffer Locality Aware Caching Policies for Hybrid Memories
Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance
Achieving High Performance and Fairness at Low Cost
Towards Green Aware Computing at Indiana University
ElasticTree: Saving Energy in Data Center Networks
TimeTrader: Exploiting Latency Tail to Save Datacenter Energy for Online Search Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, and T. N. Vijaykumar.
Qingbo Zhu, Asim Shankar and Yuanyuan Zhou
Buffer Management for Shared-Memory ATM Switches
Patrick Akl and Andreas Moshovos AENAO Research Group
Modular Edge-connected data centers
Modular Edge-connected data centers
ITAP: Idle-Time-Aware Power Management for GPU Execution Units
Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S
Presentation transcript:

SizeCap: Efficiently Handling Power Surges for Fuel Cell Powered Data Centers Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, Onur Mutlu

Executive Summary Fuel cells: efficient power source for data centers Problem: limited load following capability  Fuel cells only gradually increase output power when load increases  Power surges may lead to a power shortfall  server shutdown or damage Existing Approaches  Power capping: hurts performance  Energy storage device (ESD): increases cost Our Approach: SizeCap  Our goal: low cost, still guarantee workload performance  Key Idea 1: Size the ESD to cover only typical-case power surges  Key Idea 2: Use smart power capping, which is aware of fuel cell and workload behavior, to handle remaining power surges SizeCap safely reduces ESD size by 50 – 85% 2

Outline Background Problem Existing Approaches Key Ideas Detailed Design Evaluation Conclusion 3

Fuel Cell Powered Data Centers Data center power consumption continues to grow  In USA alone:  We need more energy-efficient power sources Fuel cells  Convert fuel (e.g., hydrogen, natural gas) into electricity  Advantages: high energy efficiency, low CO 2 emission, highly reliable delivery infrastructure 4 91 billion  140 billion Server 1 Fuel Cell System Rack Server N

Outline Background Problem Existing Approaches Key Ideas Detailed Design Evaluation Conclusion 5

Problem: Limited Load Following Capability Fuel cell power output only gradually increases when power demand increases Can lead to server damage or shut down 6 Power Shortfall How can we efficiently handle power shortfalls? Power Shortfall

Outline Background Problem Existing Approaches Key Ideas Detailed Design Evaluation Conclusion 7

Existing Approaches to Handling Power Shortfalls Power capping  Cuts down the power demand  Performs DVFS or shuts down nodes  Low cost  Hurts performance Energy storage device (ESD)  Buffers energy  Supplies extra energy when needed  High performance  High cost: ESD is sized to handle worst-case power surges, even though they rarely occur Our goal: high performance, low cost 8 Cost Performance ESD Power Capping Our Goal

Outline Background Problem Existing Approaches Key Ideas Detailed Design Evaluation Conclusion 9

SizeCap: Key Ideas Key Idea 1: Size ESD based on typical-case power surges, not worst-case surges Key Idea 2: Use smart power capping to handle remaining power surges 10

Key Idea 1: Size ESD Based on Typical Case We study production data center traces from Microsoft Unavailable period: time that underprovisioned ESD cannot handle power surges Trace 1: reduce ESD size by 85%  only 0.4% unavailable period 11

Key Idea 1: Size ESD Based on Typical Case We study production data center traces from Microsoft Trace 2: reduce ESD size by 50%  6.2% unavailable period 12 Sizing ESD based on typical-case power surges does not hurt performance significantly Sizing ESD based on typical-case power surges does not hurt performance significantly

SizeCap: Key Ideas Key Idea 1: Size ESD based on typical-case power surges, not worst-case surges Key Idea 2: Use smart power capping to handle remaining power surges 13

Key Idea 2: Smart Power Capping Make power capping aware of fuel cell load following behavior  Fuel cells respond differently to different power surges  With fuel cell load following model, we can know how fuel cell power responds to rack power demand  Control the rack power such that it never exceeds sum of fuel cell power and ESD output Make power capping aware of workload behavior  Workload performance is dependent on how power is allocated over time  Allocate power over time to maximize workload performance 14 Smart power capping uses fuel cell, workload behavior to deliver higher benefits Smart power capping uses fuel cell, workload behavior to deliver higher benefits

SizeCap A framework to reduce ESD capacity by employing smart power capping policies At design time  Select best power capping policy implementable in system  Find minimum ESD size that still meets service level agreement (SLA) under the selected policy At runtime  Period-based power control  Every period: use power capping policy to determine power used by each server in next period 15

Outline Background Problem Existing Approaches Key Ideas Detailed Design Evaluation Conclusion 16

Design Time: Policy Selection & ESD Sizing 17 ESD Capacity + Power Capping Policy ESD Sizing Engine Power Capping Policy Pool Representative Workload/Trace Service Level Agreement (SLA) Power Capping Constraints SizeCap Best Capping Policy

Runtime: Execute Power Capping Policy Power Budget Planner: Plan total rack power budget for next period Power Budget Assigner: Distribute rack power among the servers for next period Controller can be centralized or decentralized 18 Fuel Cell System Info Power Budget Assigner ESD Fuel Cell System Power Budget Planner Server Power Budget(s) Server Info(s) Server(s) Rack Power Budget Power Capping Controller ESD Energy

Power Capping Policy Taxonomy 19 Power Capping Policy C FCA WA C FCA WA D FC A WA D FC A WA C FCA WU C FCA WU D FCA WU D FCA WU C FCU WA C FCU WA D FCU WA D FCU WA C FCU WU C FCU WU D FC U WU D FC U WU FCA WA FCA WA FCA WU FCA WU FCU WA FCU WA FCU WU FCU WU FCA FCU Fuel Cell Model Aware vs. Unaware Workload Aware vs. Unaware Centralized vs. Decentralized

Power Capping Policy Taxonomy 20 Power Capping Policy C FCA WA C FCA WA C FCA WU C FCA WU D FCA WU D FCA WU C FCU WU C FCU WU D FC U WU D FC U WU FCA WA FCA WA FCA WU FCA WU FCU WU FCU WU FCA FCU Fuel Cell Model Aware vs. Unaware Workload Aware vs. Unaware Centralized vs. Decentralized Fuel Cell Unaware Workload Unaware

Fuel Cell and Workload Unaware Policies Goal: No power shortfalls, optimize performance in next period Power Budget Planner Use ESD first When ESD is used up  Ramp up rack power with conservative but safe rate  Static rate that guarantees no shortfalls in entire fuel cell operating range Power Budget Assigner Assign power to each server proportional to each server’s workload intensity or current power consumption 21

Power Capping Policy Taxonomy 22 Power Capping Policy C FCA WA C FCA WA C FCA WU C FCA WU D FCA WU D FCA WU C FCU WU C FCU WU D FC U WU D FC U WU FCA WA FCA WA FCA WU FCA WU FCU WU FCU WU FCA FCU Fuel Cell Model Aware vs. Unaware Workload Aware vs. Unaware Centralized vs. Decentralized Fuel Cell Aware Workload Unaware

Fuel Cell Aware, Workload Unaware Policies Goal: No power shortfalls, optimize performance in next period Power Budget Planner Use ESD first When ESD is used up  Ramp up rack power with maximum safe ramp rate  Dynamically adapted rate to guarantee no shortfalls only under current conditions, derived from fuel cell model Power Budget Assigner Same as fuel cell and workload unaware policies 23

Power Capping Policy Taxonomy 24 Power Capping Policy C FCA WA C FCA WA C FCA WU C FCA WU D FCA WU D FCA WU C FCU WU C FCU WU D FC U WU D FC U WU FCA WA FCA WA FCA WU FCA WU FCU WU FCU WU FCA FCU Fuel Cell Model Aware vs. Unaware Workload Aware vs. Unaware Centralized vs. Decentralized Fuel Cell Aware Workload Aware

Fuel Cell and Workload Aware Policy Goal: No power shortfalls, optimize performance over multiple periods Spend max ESD power now, cap aggressively in later periods Save some power now and cap more, use power in later periods Power Budget Planner Use fuel cell model to find all safe power capping settings Use workload behavior to assign power  Look at how workload performs over next several periods under different power allocations  Pick power capping setting that maximizes long-term performance Power Budget Assigner Similar to previous policies 25

Outline Background Problem Existing Approaches Key Ideas Detailed Design Evaluation Conclusion 26

Evaluation Methodology Simulation Configuration  Rack with 45 production servers  Each server runs power capping driver developed in-house Traces  Production traces collected from Microsoft data centers  WebSearch workload Metrics  Success rate: Percentage of requests completed within the maximum allowable service time  Average latency: Average service latency of all requests  P95 latency: 95 th percentile (tail) latency 27

Key Evaluation Results SLA: Assume margins of 0.1% success rate, 3% average latency, and 10% P95 latency under fully-provisioned ESD D-FCA-WU: Safely reduces ESD size by 85% for Trace 1, by 50% for Trace 2, and meets SLA Policies with awareness of fuel cell and/or workload behavior reduce ESD size 10–20% more than unaware policies 28

Conclusion Fuel cells: efficient power source for data centers Problem: limited load following capability  Fuel cells only gradually increase output power when load increases  Power surges may lead to a power shortfall  server shutdown or damage Existing Approaches  Power capping: hurts performance  Energy storage device (ESD): increases cost Our Approach: SizeCap  Our goal: low cost, still guarantee workload performance  Key Idea 1: Size the ESD to cover only typical-case power surges  Key Idea 2: Use smart power capping, which is aware of fuel cell and workload behavior, to handle remaining power surges SizeCap safely reduces ESD size by 50 – 85% 29

SizeCap: Efficiently Handling Power Surges for Fuel Cell Powered Data Centers Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, Onur Mutlu

Impact of ESD Cost on TCO ESD cost  Supercapacitor: $5.6 per kJ [McCawley, Fung Institute 2014] ESD sizes for our traces  Trace 1 Fully-provisioned: kJ per rack  $ After SizeCap: 16.9 kJ per rack  $94.50  Trace 2 Fully-provisioned: 68.0 kJ per rack  $ After SizeCap: 34.0 kJ per rack  $