CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Data Centers.

Slides:



Advertisements
Similar presentations
Fred Chong 290N Green Computing
Advertisements

Energy in Cloud Computing and Renewable Energy
©2009 HP Confidential template rev Ed Turkel Manager, WorldWide HPC Marketing 4/7/2011 BUILDING THE GREENEST PRODUCTION SUPERCOMPUTER IN THE.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Servers.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Data Centers.
CIT  Describe servers that offer substantial savings in electrical usage over traditional servers  Explain methods for greening the workplace.
CIT 470: Advanced Network and System Administration
Cooling Product Positioning
Cloud Computing Data Centers Dr. Sanjay P. Ahuja, Ph.D FIS Distinguished Professor of Computer Science School of Computing, UNF.
Where Does the Power go in DCs & How to get it Back Foo Camp James Hamilton web: blog:
Datacenter Power State-of-the-Art Randy H. Katz University of California, Berkeley LoCal 0 th Retreat “Energy permits things to exist; information, to.
A Computer Scientist Looks at the Energy Problem Randy H. Katz University of California, Berkeley EECS BEARS Symposium February 12, 2009 “Energy permits.
CS : Creating the Grid OS—A Computer Science Approach to Energy Problems David E. Culler, Randy H. Katz University of California, Berkeley August.
Thermal Management Solutions from APW President Systems
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Data Centers.
 Site  Requirements  Local Resources  Initial layout ideas  Brief material selection  Supply options.
T HE M ETHOD AND T OOL OF C OST A NALYSIS FOR C LOUD C OMPUTING X INHUI L I Y ING L I T IANCHENG L IU J IE Q IU F ENGCHUN W ANG Presented By Sneha Arvind.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
CoolAir Temperature- and Variation-Aware Management for Free-Cooled Datacenters Íñigo Goiri, Thu D. Nguyen, and Ricardo Bianchini 1.
September 18, 2009 Critical Facilities Round Table 1 Introducing the Heat Wheel to the Data Center Robert (Dr. Bob) Sullivan, Ph.D. Data Center Infrastructure.
Cutting the Green IT Hype: Fact vs. Fiction Kenneth G. Brill, Executive Director Uptime Institute Inc
CHAPTER OVERVIEW SECTION 5.1 – MIS INFRASTRUCTURE
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Data Center Consolidation & Energy Efficiency in Federal Facilities
Green IT and Data Centers Darshan R. Kapadia Gregor von Laszewski 1.
1 Copyright © 2011, Elsevier Inc. All rights Reserved. Chapter 6 Authors: John Hennessy & David Patterson.
Information Technology Division Executive Office for Administration and Finance Springfield Data Center Program Energy Strategy Meeting 9/29/2009.
CHAPTER FIVE INFRASTRUCTURES: SUSTAINABLE TECHNOLOGIES
Optimising Data Centre Power Planning and Managing Change in Data Centres - 28th November Cirencester.
Thermodynamic Feasibility 1 Anna Haywood, Jon Sherbeck, Patrick Phelan, Georgios Varsamopoulos, Sandeep K. S. Gupta.
Copyright 2009 Fujitsu America, Inc. 0 Fujitsu PRIMERGY Servers “Next Generation HPC and Cloud Architecture” PRIMERGY CX1000 Tom Donnelly April
1 Lecture 20: WSC, Datacenters Topics: warehouse-scale computing and datacenters (Sections ) – the basics followed by a look at the future.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
Virtualization is only half the battle for efficiency in Data Centers Eduard Bodor System Engineer Romania & Bulgaria.
Energy Usage in Cloud Part2 Salih Safa BACANLI. Cooling Virtualization Energy Proportional System Conclusion.
Smart Grid: Opportunities in the Data Center January 22, 2010 Charles O’Donnell, Vice President, Engineering Liebert AC Power Emerson Network Power.
Strata IT Training Chapter 16 Green Networking. Green Servers Power Management –Power off monitor settings –Automatic server shutdown policy Rack mounted.
1 Drivers for Building Life Cycle Integrations Jim Sinopoli.
Thermal-aware Issues in Computers IMPACT Lab. Part A Overview of Thermal-related Technologies.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Data Centers.
50th HPC User Forum Emerging Trends in HPC September 9-11, 2013
 The End to the Means › (According to IBM ) › 03.ibm.com/innovation/us/thesmartercity/in dex_flash.html?cmp=blank&cm=v&csr=chap ter_edu&cr=youtube&ct=usbrv111&cn=agus.
Virtualization Supplemental Material beyond the textbook.
Authors: William Tschudi, Lawrence Berkeley National Lab Stephen Fok, Pacific Gas and Electric Company Stephen Fok, Pacific Gas and Electric Company Presented.
#watitis2015 TOWARD A GREENER HORIZON: PROPOSED ENERGY SAVING CHANGES TO MFCF DATA CENTERS Naji Alamrony
Optimizing Power and Data Center Resources Jim Sweeney Enterprise Solutions Consultant, GTSI.
CERN - IT Department CH-1211 Genève 23 Switzerland t Power and Cooling Challenges at CERN IHEPCCC Meeting April 24 th 2007 Tony Cass.
All content in this presentation is protected – © 2008 American Power Conversion Corporation Row Cooling.
PIC port d’informació científica First operational experience from a compact, highly energy efficient data center module V. Acín, R. Cruz, M. Delfino,
© 2010 VMware Inc. All rights reserved Datacenter 101 Ravi Soundararajan, VMware, Inc.
Insights into Google's PUE Results Spring Published Google PUE Results.
1 ITM 1.2 How IT Decisions Impact Data Center Facilities: The Importance of Collaboration Lars Strong P.E. Upsite Technologies, Inc.
By Harshal Ghule Guided by Mrs. Anita Mahajan G.H.Raisoni Institute Of Engineering And Technology.
CS203 – Advanced Computer Architecture Warehouse Scale Computing.
The Genome Sequencing Center's Data Center Plans Gary Stiehr
Dell EMC Modular Data Centers
Extreme Scale Infrastructure
CIT 668: System Architecture
CANOVATE MOBILE (CONTAINER) DATA CENTER SOLUTIONS
Chapter 1: Introduction to the Personal Computer
Unit 2: Chapter 2 Cooling.
The Data Center Challenge
Lecture 20: WSC, Datacenters
Green cloud computing 2 Cs 595 Lecture 15.
Instructor Materials Chapter 1: Introduction to the Personal Computer
CIT 668: System Architecture
Cloud Computing Data Centers
Where Does the Power go in DCs & How to get it Back
Cloud Computing Data Centers
The Greening of IT November 1, 2007.
Presentation transcript:

CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Data Centers

Topics Data Center: A facility for housing a large amount of computer or communications equipment. 1.Racks 2.Power 3.PUE 4.Cooling 5.Containers 6.Economics

Google DC in The Dalles Located near 3.1GW hydroelectric power station on Columbia River

Google DC in The Dalles

Inside a Data Center

Inside a Container Data Center

Data Center is composed of: A physically safe and secure space Racks that hold computer, network, and storage devices Electric power sufficient to operate the installed devices Cooling to keep the devices within their operating temperature ranges Network connectivity throughout the data center and to places beyond

Data Center Components

Data Center Tiers See for more details about tiers.

Racks: The Skeleton of the DC 19” rack standard –EIA-310D –Other standard numbers. NEBS 21” racks –Telecom equipment. 2-post or 4-post Air circulation (fans) Cable management Doors or open

Rack Units

Rack Sizes

Rack Purposes Organize equipment –Increase density with vertical stacking. Cooling –Internal airflow in rack cools servers. –Data center airflow determined by arrangement of racks. Wiring Organization –Cable guides keep cables within racks.

Rack Power Infrastructure Different power sockets can be on different circuits. Individual outlet control (power cycle.) Current monitoring and alarms. Network managed (web or SNMP.)

Rack-Mount Servers 4U 1U

Blade Servers

Buying a Rack Buy the right size –Space for servers. –power, patch panels, etc. Be sure it fits your servers. –Appropriate mounting rails. –Shelves for non-rack servers. Environment options –Locking front and back doors –Sufficient power and cooling. –Power/environment monitors. –Console if needed.

Space Aisles Wide enough to move equipment. Separate hot and cold aisles. Hot spots Result from poor air flow. Servers can overheat when average room temperature is too low. Work space A place for SAs to work on servers. Desk space, tools, etc. Capacity Room to grow.

Data Center Power Distribution power.htm

UPS (Uninterruptible Power Supply) Provides emergency power when utility fails –Most use batteries to store power Conditions power, removing voltage spikes

Standby UPS Power will be briefly interrupted during switch Computers may lockup/reboot during interruption No power conditioning Short battery life Very inexpensive

Online UPS AC -> DC -> AC conversion design True uninterrupted power without switching Extremely good power conditioning Longer battery life Higher price

Power Distribution Unit (PDU) Takes high voltage feed and divides into many 110/120 V circuits that feed servers. –Similar to breaker panel in a house.

Estimating Per-Rack Power

The Power Problem 4-year power cost = server purchase price. Upgrades may have to wait for electricity. Power is a major data center cost –$5.8 billion for server power in –$3.5 billion for server cooling in –$20.5 billion for purchasing hardware in 2005.

Measuring Power Efficiency PUE is ratio of total building power to IT power; efficiency of datacenter building infrastructure SPUE is ratio of total server input to its useful power, where useful power is power consumed by CPU, DRAM, disk, motherboard, etc. Excludes losses due to power supplies, fans, etc. Computation efficiency depends on software and workload and measures useful work done per watt

Power Usage Effectiveness (PUE) PUE = Data center power / Computer power –PUE=2 indicates that for each watt of power used to power IT equipment, one watt used for HVAC, power distribution, etc. –Decreases towards 1 as DC is more efficient. PUE variation –Industry average > 2 –Microsoft = 1.22 –Google = 1.19

Data Center Energy Usage

Sources of Efficiency Losses UPS –88-94% efficiency –Less if lightly loaded PDU voltage transformation –.5% or less Cables from PDU to racks –1-3% depending on distance and cable type Computer Room Air Conditioning (CRAC) –Delivery of cool air over long distances uses fan power and increases air temperature

Cooling a Data Center Keep temperatures within ◦ C Cooling equipment rated in BTUs –1 Watt = 3412 BTUH –BTUH = British Thermal Unit / Hour Keep humidity between 30-55% – High = condensation – Low = static shock Avoid hot/cold spots – Can produce condensation

Computer Room Air Conditioning Large scale, highly reliable air conditioning units from companies like Liebert. Cooling capacity measured in tons.

Waterworks for Data Center

Estimating Heat Load

Hot-Cold Aisle Architecture Server air intake from cold aisles Server air exhaust into hot aisles Improve efficiency by reducing mixture of hot/cold

Free Cooling Cooling towers dissipate heat by evaporating water, reducing or eliminating need to run chillers Google Belgium DC uses 100% free cooling

Improving Cooling Efficiency Air flow handling: Hot air exhausted by servers does not mix with cold air, and path to cooling coil is very short so little energy spent moving Elevated cold aisle temperatures: Cold aisle of containers kept at 27 ◦ C rather than ◦ C. Use of free cooling: In moderate climates, cooling towers can eliminate majority of chiller runtime.

Server PUE (SPUE) Primary sources of inefficiency –Power Supply Unit (PSU) (70-75% efficiency) –Voltage Regulator Modules (VRMs) Can lose more than 30% power in conversion losses –Cooling fans Software can reduce fan RPM when not needed SPUE ratios of are common today

Power Supply Unit Efficiency 80 PLUS initiative to promote PSU efficiency –80+% efficiency at 20%, 50%, 100% of rated load –Can be less than 80% efficient at idle power load First 80 PLUS PSU shipped in 2005

Server Useful Power Consumption DevicePower Usage Intel Xeon W GHz Quad Core130 W Intel Xeon E GHz Quad Core80W Intel Xeon E GHz Dual Core80W 7200RPM Hard Drive7W 10,000RPM Hard Drive14W 15,000RPM Hard Drive20W DDR2 DIMM1.65W Video Card20-120W The best method to determine power usage is to measure it

Server Utilization ~10-50% Figure 1. Average CPU utilization of more than 5,000 servers during a six-month period. Servers are rarely completely idle and seldom operate near their maximum utilization, instead operating most of the time at between 10 and 50 percent of their maximum It is surprisingly hard to achieve high levels of utilization of typical servers (and your home PC is even worse) “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer, December 2007

Server Power Usage Range: % Figure 2. Server power usage and energy efficiency at varying utilization levels, from idle to peak performance. Even an energy-efficient server still consumes about half its full power when doing virtually no work. Energy efficiency = Utilization/Power “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer, December 2007

Server Utilization vs. Latency Utilization 100% Latency

Improving Power Efficiency

Application consolidation –Reduce the number of applications by eliminating old applications in favor of new ones that can server the purpose of multiple old ones. –Allows elimination of old app servers. Server consolidation –Use single DB for multiple applications. –Move light services like NTP onto shared boxes. Use SAN storage –Local disks typically highly underused –Use SAN so servers share single storage pool

Improving Power Efficiency Virtualization –Host services on VMs instead of on physical servers –Host multiple virtual servers on single physical svr Only-as-needed Servers –Power down servers when not in use –Works best with cloud computing Granular capacity planning –Measure computing needs carefully –Buy minimal CPU, RAM, disk configuration based on your capacity measurements and forecasts

Containers Data Center in a shipping container. –4-10X normal data center density. –1000s of servers. –100s of kW of power. Advantages –Efficient cooling –High server density –Rapid deployment –Scalability Vendor offerings:

Microsoft Chicago Data Center

Google Container Patents Containers docked at central power spline Container air flow diagram, with a center cold aisle and hot air return behind servers Vertical stack of containers

Data Center Failure Events

Hardware Isn’t Reliable Enough If servers are 99% reliable, then a system with 10 servers is ≈ 90% reliable a system with 100 servers is ≈ 37% reliable

Fault-Tolerant Architecture Must use fault-tolerant software architecture –Hardware must detect faults –Hardware must notify software in timely fashion Fault-tolerant architecture reduces costs –Choose hardware reliability level that maximizes cost efficiency, not just reliability Fault-tolerant architecture can improve perf –Spreading processing and storage across many servers improves bandwidth and CPU capacity

Causes of Service Disruptions

Total Cost of Ownership (TCO) TCO = Data Center Depreciation + Data Center Operating Expenses (Opex) + Server Depreciation + Server Operating Expenses (Opex) Depreciation is the process of allocating cost of assets across period during which assets are used. Example: server cost = $10,000, $0 residual value annual depreciation over 4 years = $2500

Cost to Build Data Center Primary components (power, cooling, space) scale roughly linearly with space. 80% of total construction cost goes to power + cooling Typical depreciation periods of years

Operational Costs Operational costs include –Electricity –Salaries for personnel –Server maintenance contracts –Software licenses Larger data centers are cheaper –Smaller number of sysadmins per server –Fixed number of security guards For multi-MW data center, $0.02-$0.08/month

Case Study Tier 3 multi-MW data center –Dell 2950 III EnergySmart servers (300W, $6000) –Cost of electricity is 6.2/kW –Servers financed with 3-year 12% –Cost of DC construction is $15/W, 12-yr lifetime –DC opex is 4/month –PUE = 2.0 –Server lifetime is 3 years –Server maintenance is 5% of capex –Server avg power = 75% peak

Key Points Data center components –Physically secure space –Racks, the DC skeleton –Power, including UPS and PDU –Cooling –Networking Power efficiency (server cost = 4 years power on avg) –PUE = Data center power / IT equipment power –Most power in traditional DC goes to cooling, UPS –SPUE = Server PUE; inefficiencies from PSU, VRM, fans Cooling –Heat load estimation –Air flow control (hot/cold aisle architecture or containers) –Higher cold air temperatures (27C vs. 20C) –Free cooling (cooling towers) TCO = DC depr + DC opex + Svr depr + Svr opex

References 1.Luiz Andre Barroso and Urs Holzle, The Case for Energy- Proportional Computing, IEEE Computer, Vol 40, Issue 12, December Luiz Andre Barroso and Urs Holzle, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, 1st edition, Morgan and Claypool Publishers 3.Xiaobo Fan, Wolf-Dietrich Weber, Luiz Andre Barroso, Power provisioning for a warehouse-sized computer, ISCA '07: Proceedings of the 34th annual international symposium on Computer architecture 4.Thomas A. Limoncelli, Christina J. Hogan, and Strata R. Chalup, The Practice of System and Network Administration, Second Edition, Addison-Wesley Professional, Evi Nemeth, Garth Snyder, Trent R. Hein, Ben Whaley, UNIX and Linux System Administration Handbook, 4 th edition, Prentice Hall, 2010.