Data Center Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101

Slides:



Advertisements
Similar presentations
Programmable Measurement Architecture for Data Centers Minlan Yu University of Southern California 1.
Advertisements

Mobility Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
The Case for Enterprise Ready Virtual Private Clouds Timothy Wood, Alexandre Gerber *, K.K. Ramakrishnan *, Jacobus van der Merwe *, and Prashant Shenoy.
Profiling Network Performance in Multi-tier Datacenter Applications Minlan Yu Princeton University 1 Joint work with Albert Greenberg,
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
Identifying Performance Bottlenecks in CDNs through TCP-Level Monitoring Peng Sun Minlan Yu, Michael J. Freedman, Jennifer Rexford Princeton University.
Profiling Network Performance in Multi-tier Datacenter Applications
NoHype: Virtualized Cloud Infrastructure without the Virtualization Eric Keller, Jakub Szefer, Jennifer Rexford, Ruby Lee ISCA 2010 Princeton University.
1 In VINI Veritas: Realistic and Controlled Network Experimentation Jennifer Rexford with Andy Bavier, Nick Feamster, Mark Huang, and Larry Peterson
Profiling Network Performance in Multi-tier Datacenter Applications Jori Hardman Carly Ho Paper by Minlan Yu, Albert Greenberg, Dave Maltz, Jennifer Rexford,
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
Computer Networks Transport Layer. Topics F Introduction  F Connection Issues F TCP.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
COS 461: Computer Networks
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
Jennifer Rexford Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks Data.
Datacenter Networks Mike Freedman COS 461: Computer Networks
Jennifer Rexford Princeton University MW 11:00am-12:20pm SDN Software Stack COS 597E: Software Defined Networking.
Scalable Management of Enterprise and Data Center Networks Minlan Yu Princeton University 1.
Microsoft Virtual Academy Module 4 Creating and Configuring Virtual Machine Networks.
Performance Diagnosis and Improvement in Data Center Networks
CLOUD COMPUTING & COST MANAGEMENT S. Gurubalasubramaniyan, MSc IT, MTech Presented by.
Enabling Innovation Inside the Network Jennifer Rexford Princeton University
Characterizing the Existing Internetwork PART 1
Hosting Virtual Networks on Commodity Hardware VINI Summer Camp.
Software-Defined Networks Jennifer Rexford Princeton University.
Networking the Cloud Presenter: b 電機三 姜慧如.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Lecture 8: Design of Parallel Programs Part III Lecturer: Simon Winberg.
Introduction to Cloud Computing
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Seamless Access to Services for Mobile Users Jennifer Rexford Princeton University Joint work with Matvey Ayre, Mike.
MDC417 Follow me on Working as Practice Manager for Insight, he is a subject matter expert in cloud, virtualization and management.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
LAN Switching and Wireless – Chapter 1
Cloud Scale Performance & Diagnosability Comprehensive SDN Core Infrastructure Enhancements vRSS Remote Live Monitoring NIC Teaming Hyper-V Network.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Introducing Network Design Concepts Designing and Supporting Computer Networks.
Vytautas Valancius, Nick Feamster, Akihiro Nakao, and Jennifer Rexford.
Networking Fundamentals. Basics Network – collection of nodes and links that cooperate for communication Nodes – computer systems –Internal (routers,
Visual Studio Windows Azure Portal Rest APIs / PS Cmdlets US-North Central Region FC TOR PDU Servers TOR PDU Servers TOR PDU Servers TOR PDU.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco PublicITE I Chapter 6 1 Introducing Network Design Concepts Designing and Supporting Computer Networks.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
VMware vSphere Configuration and Management v6
Advanced Computer Networks Lecturer: E EE Eng. Ahmed Hemaid Office: I 114.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Web Technologies Lecture 13 Introduction to cloud computing.
| Basel Fabric Management with Virtual Machine Manager Philipp Witschi – Cloud Architect & Microsoft vTSP Thomas Maurer – Cloud Architect & Microsoft MVP.
Jennifer Rexford Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks Energy.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
SEMINAR ON.  OVERVIEW -  What is Cloud Computing???  Amazon Elastic Cloud Computing (Amazon EC2)  Amazon EC2 Core Concept  How to use Amazon EC2.
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Chapter 6: Securing the Cloud
CIS 700-5: The Design and Implementation of Cloud Networks
Lecture 2: Cloud Computing
TCP Performance Monitoring
What is Cloud Computing - How cloud computing help your Business?
TCP Congestion Control at the Network Edge
NTHU CS5421 Cloud Computing
TCP Congestion Control at the Network Edge
Internet and Web Simple client-server model
Specialized Cloud Architectures
Cloud-Enabling Technology
Lecture 8, Computer Networks (198:552)
Chapter 8 – Data switching and routing
Data Center Traffic Engineering
Presentation transcript:

Data Center Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101

Networking Case Studies 2 Data Center Backbone Enterprise Cellular Wireless

Cloud Computing 3

Elastic resources – Expand and contract resources – Pay-per-use – Infrastructure on demand Multi-tenancy – Multiple independent users – Security and resource isolation – Amortize the cost of the (shared) infrastructure Flexible service management 4

Cloud Service Models Software as a Service – Provider licenses applications to users as a service – E.g., customer relationship management, ,.. – Avoid costs of installation, maintenance, patches, … Platform as a Service – Provider offers platform for building applications – E.g., Google’s App-Engine – Avoid worrying about scalability of platform 5

Cloud Service Models Infrastructure as a Service – Provider offers raw computing, storage, and network – E.g., Amazon’s Elastic Computing Cloud (EC2) – Avoid buying servers and estimating resource needs 6

Enabling Technology: Virtualization Multiple virtual machines on one physical machine Applications run unmodified as on real machine VM can migrate from one computer to another 7

Multi-Tier Applications Applications consist of tasks – Many separate components – Running on different machines Commodity computers – Many general-purpose computers – Not one big mainframe – Easier scaling

Multi-Tier Applications 9 Front end Server Aggregator … Aggregator Worker … …

Data Center Network 10

Virtual Switch in Server 11

Top-of-Rack Architecture Rack of servers – Commodity servers – And top-of-rack switch Modular design – Preconfigured racks – Power, network, and storage cabling 12

Aggregate to the Next Level 13

Modularity, Modularity, Modularity Containers Many containers 14

Data Center Network Topology 15 CR AR... S S S S Internet S S S S A A A A A A … S S S S A A A A A A …... Key CR = Core Router AR = Access Router S = Ethernet Switch A = Rack of app. servers ~ 1,000 servers/pod

Capacity Mismatch 16 CR AR S S S S S S S S A A A A A A … S S S S A A A A A A …... S S S S S S S S A A A A A A … S S S S A A A A A A … ~ 5:1 ~ 40:1 ~ 200:1

Data-Center Routing 17 CR AR... S S S S DC-Layer 3 Internet S S S S A A A A A A … S S S S A A A A A A …... DC-Layer 2 Key CR = Core Router (L3) AR = Access Router (L3) S = Ethernet Switch (L2) A = Rack of app. servers ~ 1,000 servers/pod == IP subnet SSSS SS

Reminder: Layer 2 vs. Layer 3 Ethernet switching (layer 2) – Cheaper switch equipment – Fixed addresses and auto-configuration – Seamless mobility, migration, and failover IP routing (layer 3) – Scalability through hierarchical addressing – Efficiency through shortest-path routing – Multipath routing through equal-cost multipath So, like in enterprises… – Connect layer-2 islands by IP routers 18

Case Study: Performance Diagnosis in Data Centers anyu/writeup/nsdi11.pdf 19

Applications Inside Data Centers 20 Front end Server Aggregator Workers ….

Challenges of Datacenter Diagnosis Multi-tier applications – Hundreds of application components – Tens of thousands of servers Evolving applications – Add new features, fix bugs – Change components while app is still in operation Human factors – Developers may not understand network well – Nagle’s algorithm, delayed ACK, etc. 21

Diagnosing in Today’s Data Center 22 Host App OS Packet sniffer App logs: #Reqs/sec Response time 1% req. >200ms delay Switch logs: #bytes/pkts per minute Packet trace: Filter out trace for long delay req. SNAP: Diagnose net-app interactions SNAP: Diagnose net-app interactions

Problems of Different Logs 23 Host App OS Packet sniffer App logs: Application-specific Switch logs: Too coarse-grained Packet trace: Too expensive SNAP: Generic, fine-grained, and lightweight SNAP: Generic, fine-grained, and lightweight Runs everywhere, all the time

TCP Statistics Instantaneous snapshots – #Bytes in the send buffer – Congestion window size, receiver window size – Snapshots based on random sampling Cumulative counters – #FastRetrans, #Timeout – RTT estimation: #SampleRTT, #SumRTT – RwinLimitTime – Calculate difference between two polls 24

Identifying Performance Problems 25 – Not any other problems – Send buffer is almost full – #Fast retransmission – #Timeout – RwinLimitTime – Delayed ACK diff(SumRTT)/diff(SampleRTT) > MaxDelay Sender App Send Buffer Receiver Network Direct measure Sampling Inference

SNAP Architecture 26 At each host for every connection Collect data Direct access to OS -Polling per-connection statistics: Snapshots (#bytes in send buffer) Cumulative counters (#FastRestrans) -Adaptive tuning of polling rate

SNAP Architecture 27 At each host for every connection Collect data Performance Classifier Classifying based on the life of data transfer -Algorithms for detecting performance problems -Based on direct measurement in the OS

SNAP Architecture 28 At each host for every connection Collect data Performance Classifier Cross- connection correlation Direct access to data center configurations -Input Topology, routing information Mapping from connections to processes/apps -Correlate problems across connections Sharing the same switch/link, app code

SNAP Deployment Production data center – 8K machines, 700 applications – Ran SNAP for a week, collected petabytes of data Identified 15 major performance problems – Operators: Characterize key problems in data center – Developers: Quickly pinpoint problems in app software, network stack, and their interactions 29

Characterizing Perf. Limitations – Bottlenecked by CPU, disk, etc. – Slow due to app design (small writes) 30 Sender App Send Buffer Receiver Networ k #Apps that are limited for > 50% of the time 551 Apps 1 App 6 Apps 8 Apps 144 Apps – Send buffer not large enough – Fast retransmission – Timeout – Not reading fast enough (CPU, disk, etc.) – Not ACKing fast enough (Delayed ACK)

Delayed ACK Delayed ACK caused significant problems – Delayed ACK was used to reduce bandwidth usage and server interruption 31 Data Data+ACK Data A B ACK B has data to send B doesn’t have data to send 200 ms …. Delayed ACK should be disabled in data centers

Diagnosing Delayed ACK with SNAP Monitor at the right place – Scalable, low overhead data collection at all hosts Algorithms to identify performance problems – Identify delayed ACK with OS information Correlate problems across connections – Identify the apps with significant delayed ACK issues Fix the problem with operators and developers – Disable delayed ACK in data centers 32

Conclusion Cloud computing – Major trend in IT industry – Today’s equivalent of factories Data center networking – Regular topologies interconnecting VMs – Mix of Ethernet and IP networking Modular, multi-tier applications – New ways of building applications – New performance challenges 33

Load Balancing 34

Load Balancers Spread load over server replicas – Present a single public address (VIP) for a service – Direct each request to a server replica 35 Virtual IP (VIP)

Wide-Area Network 36 Router DNS Server DNS-based site selection Servers Internet Clients Data Centers

Wide-Area Network: Ingress Proxies 37 Router Data Centers Servers Clients Proxy