Automated Cost-Aware Data Center Management (part 4)

Slides:



Advertisements
Similar presentations
Ramya (UCSB), Parthasarathy et al (HP Labs). Overview Power delivery, consumption and cooling problems in a data center are being tackled currently by.
Advertisements

Matt Warner Future Facilities Proactive Airflow Management in Data Centre Operation - using CFD simulation to improve resilience, energy efficiency and.
Seyedehmehrnaz Mireslami, Mohammad Moshirpour, Behrouz H. Far Department of Electrical and Computer Engineering University of Calgary, Canada {smiresla,
Orchestrated Scheduling and Prefetching for GPGPUs Adwait Jog, Onur Kayiran, Asit Mishra, Mahmut Kandemir, Onur Mutlu, Ravi Iyer, Chita Das.
Computer Room Provision in Atlas and R89 Graham Robinson.
A Cyber-Physical Systems Approach to Energy Management in Data Centers Presented by Chen He Adopted form the paper authors.
© Future Facilities Ltd By Stuart Crump The Value of Blanking Plates Server equipment will suck the air supply it requires from wherever it can within.
ASPLOS, October 2002 © 2002, Carla Ellis stems & Architecture ECOSystem: Managing Energy as a First Class Operating System Resource Heng Zeng, Carla Ellis,
Utility-Function-Driven Energy- Efficient Cooling in Data Centers Authors: Rajarshi Das, Jeffrey Kephart, Jonathan Lenchner, Hendrik Hamamn IBM Thomas.
Layer Assignment Algorithm for RLC Crosstalk Minimization Bin Liu, Yici Cai, Qiang Zhou, Xianlong Hong Tsinghua University.
YZ X Rack Hot air Cold air Rack nod e. PCR Node i.Temp(t) Temp(x,y,z,t)
Temperature-Aware Design Presented by Mehul Shah 4/29/04.
Effect of Rack Server Population on Temperatures in Data Centers CEETHERM Data Center Laboratory G.W. Woodruff School of Mechanical Engineering Georgia.
Dynamic Reduced-order Model for the Air Temperature Field Inside a Data Center G.W. Woodruff School of Mechanical Engineering Georgia Institute of Technology.
CoolAir Temperature- and Variation-Aware Management for Free-Cooled Datacenters Íñigo Goiri, Thu D. Nguyen, and Ricardo Bianchini 1.
University of Karlsruhe, System Architecture Group Balancing Power Consumption in Multiprocessor Systems Andreas Merkel Frank Bellosa System Architecture.
Green IT and Data Centers Darshan R. Kapadia Gregor von Laszewski 1.
RESEARCH A systematic quest for undiscovered truth A way of thinking
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
Google File System Simulator Pratima Kolan Vinod Ramachandran.
Thermal Aware Server Provisioning (TASP) and Workload Distribution (TAWD) for Internet Data Centers (IDCs) Zahra Abbasi, Georgios Varsamopoulos and Sandeep.
Sensor-Based Fast Thermal Evaluation Model For Energy Efficient High-Performance Datacenters Q. Tang, T. Mukherjee, Sandeep K. S. Gupta Department of Computer.
Cloud Resource Scheduling for Online and Batch Applications Kick-off meeting.
Summer Report Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY
ConSil Jeff Chase Duke University. Collaborators Justin Moore –received PhD in April, en route to Google. Did this research. Wrote this paper. Named the.
Thermal-aware Issues in Computers IMPACT Lab. Part A Overview of Thermal-related Technologies.
The Major Steps of a Public Health Evaluation 1. Engage Stakeholders 2. Describe the program 3. Focus on the evaluation design 4. Gather credible evidence.
Software Architecture for Dynamic Thermal Management in Datacenters Tridib Mukherjee Graduate Research Assistant IMPACT Lab ( Department.
Thermal Aware Data Management in Cloud based Data Centers Ling Liu College of Computing Georgia Institute of Technology NSF SEEDM workshop, May 2-3, 2011.
Green Computing Metrics: Power, Temperature, CO2, … Computing system: Many-cores, Clusters, Grids and Clouds Algorithm and model: task scheduling, CFD.
1 Thermal Management of Datacenter Qinghui Tang. 2 Preliminaries What is data center What is thermal management Why does Intel Care Why Computer Science.
Data Center Energy-Efficient Network-Aware Scheduling
Wajid Minhass, Paul Pop, Jan Madsen Technical University of Denmark
Increasing DC Efficiency by 4x Berkeley RAD Lab
Reducing Data Center Energy Consumption via Coordinated Cooling and Load Management By: Luca Parolini, Bruno Sinopoli, Bruce H. Krogh from CMU Presentation:
Thermal Management in Datacenters Ayan Banerjee. Thermal Management using task placement Tasks: Requires a certain number of servers (cores) for a specified.
-1- UC San Diego / VLSI CAD Laboratory Optimal Reliability-Constrained Overdrive Frequency Selection in Multicore Systems Andrew B. Kahng and Siddhartha.
Interviews, Questionnaires, and control flowcharts Chapter 19.
Adaptable Approach to Estimating Thermal Effects in a Data Center Environment Corby Ziesman IMPACT Lab Arizona State University.
Date of download: 5/27/2016 Copyright © ASME. All rights reserved. From: Experimentally Validated Computational Fluid Dynamics Model for a Data Center.
1 ITM 1.2 How IT Decisions Impact Data Center Facilities: The Importance of Collaboration Lars Strong P.E. Upsite Technologies, Inc.
1 PCE 2.1: The Co-Relationship of Containment and CFDs Gordon Johnson Senior CFD Manager at Subzero Engineering CDCDP (Certified Data Center Design Professional)
Automated Cost-Aware Data Center Management (part 1) Justin Moore Advisor: Jeff Chase Committee Parthasarathy Ranganathan, Carla Ellis, Alvin Lebeck, Jun.
Ruihong Lin 1, Yuhui Deng 1,2, Liyao Yang 1 1 Department of Computer Science, Jinan University, Guangzhou, , China 2 State Key Laboratory of Computer.
Containment Solutions
Energy Aware Network Operations
Air Distribution Control
Thermal-aware Task Placement in Data Centers (part 4)
Air Cooling Design for Machine Components
Green cloud computing 2 Cs 595 Lecture 15.
Operational Viability of a Renewable-Nuclear Mix:
Date of download: 11/2/2017 Copyright © ASME. All rights reserved.
Cooling-Aware and Thermal-Aware Workload Placement for Green HPC Data Centers Sandeep K. S. Gupta (co-authors: Ayan Banerjee, Tridib Mukherjee, George.
ElasticTree Michael Fruchtman.
User Interference Effect on Routing of Cognitive Radio Ad-Hoc Networks
Bank-aware Dynamic Cache Partitioning for Multicore Architectures
Composing Time- and Event-driven Distributed Real-time Systems
Maestro: Orchestrating Lifetime Reliability in Chip Multiprocessors
Nlyte for Colocation Providers
ElasticTree: Saving Energy in Data Center Networks
Multi-hop Coflow Routing and Scheduling in Data Centers
Where Does the Power go in DCs & How to get it Back
UAV Electronics Cooling System
Adaptive Single-Chip Multiprocessing
Measurement and Prediction of Dynamic Density
Enabling Prediction of Performance
Gustavo Rau de Almeida Callou
Narander Kumar and Shalini Agarwal
Using State and Local Data to Improve Results
Anand Bhat*, Soheil Samii†, Raj Rajkumar* *Carnegie Mellon University
Presentation transcript:

Automated Cost-Aware Data Center Management (part 4) Justin Moore Advisor: Jeff Chase Committee Parthasarathy Ranganathan, Carla Ellis, Alvin Lebeck, Jun Yang

Heat Recirculation Rebalance power, reduce cooling costs δQ CRAC Q Rebalance power, reduce cooling costs Hot exhaust air mixes with cold incoming air Cause: inefficient data center layout, air flow [Sharma2003]

TARP: Results Less heat recirculation  lower cooling power cost “Hot spots” can be OK and beneficial, provided heat is exiting Workload inversely proportional to recirculation Avoid until the last minute servers whose exhaust recirculates Moore05Usenix

Cost-Aware Management Identify primary sources of costs Fixed: Acquisition Dynamic: Power and Cooling Cumulative: “Wear and tear” Combine thermal costs with workload costs Utility functions, SLAs, penalties, etc

- Reliability, utility, profitability Ongoing Work: Models Candidate Settings Policy Workload Weatherman Cost Model - Reliability, utility, profitability

Ongoing Work: Costs Per Day

Ongoing Work: Costs Per Day Max inlet of 25C: even = 12.25C ($3259.00 / day) ; wman = 15.75C ($3140.15 / day) Diff = $118.85 / day ; $43,380.25 / year 0 wear-and-tear: even = 7.25C ($3541.91 / day) ; wman = 10.75C ($3260.28 / day) Diff = $281.63 / day ; $102,794.95 / year

Conclusions Need for comprehensive management Address new challenges Power and thermal considerations are significant Unified cost-aware management architecture Extend MAPE architecture Construct accurate models for instrumentation Model and predict data-center-wide conditions Formulate cost-aware management policies

Planning: Management Policies Ad-hoc Independent and limited in scope Priority-based scheduling, thermal kill switches, … Heuristic Qualitative decisions based on “proxy” variables i.e., Reduce power to reduce cooling costs Informed Model and predict system behavior Prevent, Detect, Recover

TARP: Evaluation Methodology 7 racks/row 40 servers/rack 150W idle 285W at 100% Redline @ 25C 4 CRAC units 90 KW/CRAC Flovent – CFD Simulator Validated in [Sharma03] Gives Tin and Tout for each object Tsup = 15 + (Tred – max(Tin))

Minimize Heat Recirculation 2-phase calibration creates power budgets Baseline workload measures Qb and δQb I.e., all servers on but idle Bin servers into non-overlapping pods Utilize each pod 100%; re-measure Q and δQ Heat Recirculation Factor (HRF) HRFj = (Qj – Qb) / (δQj – δQb)