Datacenter Networks Mike Freedman COS 461: Computer Networks

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

Distributed Data Processing
Mobility Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
The Case for Enterprise Ready Virtual Private Clouds Timothy Wood, Alexandre Gerber *, K.K. Ramakrishnan *, Jacobus van der Merwe *, and Prashant Shenoy.
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
Take your CMS to the cloud to lighten the load Brett Pollak Campus Web Office UC San Diego.
Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,
Module CSY3021 Network Planning and Programming RD-CSY /09 1.
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
Virtual Layer 2: A Scalable and Flexible Data-Center Network Work with Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Parantap Lahiri,
Portland: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Offense Kai Chen Shih-Chi Chen.
Datacenters CS 168, Fall 2014 Sylvia Ratnasamy
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
ProActive Routing In Scalable Data Centers with PARIS Joint work with Dushyant Arora + and Jennifer Rexford* + Arista Networks *Princeton University Theophilus.
COS 461: Computer Networks
Cloud Computing (101).
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
Jennifer Rexford Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks Data.
A Scalable, Commodity Data Center Network Architecture.
Spanning Tree and Datacenters EE 122, Fall 2013 Sylvia Ratnasamy Material thanks to Mike Freedman, Scott Shenker,
Presented by Sujit Tilak. Evolution of Client/Server Architecture Clients & Server on different computer systems Local Area Network for Server and Client.
Data Center Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
Plan Introduction What is Cloud Computing?
Performance Diagnosis and Improvement in Data Center Networks
Active Network Applications Tom Anderson University of Washington.
Enabling Innovation Inside the Network Jennifer Rexford Princeton University
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Serval: Software Defined Service-Centric Networking Jen Rexford Erik Nordstrom, David Shue, Prem Gopalan, Rob Kiefer, Mat Arye, Steven Ko, Mike Freedman.
Software-Defined Networks Jennifer Rexford Princeton University.
Networking the Cloud Presenter: b 電機三 姜慧如.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
VL2 – A Scalable & Flexible Data Center Network Authors: Greenberg et al Presenter: Syed M Irteza – LUMS CS678: 2 April 2013.
Software Architecture
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Presented By: Vasantha Lakshmi Gutha Graduate student (CS) Course: CENG 5931 University of Houston-Clear Lake Spring 2011.
Seamless Access to Services for Mobile Users Jennifer Rexford Princeton University Joint work with Matvey Ayre, Mike.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
LAN Switching and Wireless – Chapter 1
Cloud Scale Performance & Diagnosability Comprehensive SDN Core Infrastructure Enhancements vRSS Remote Live Monitoring NIC Teaming Hyper-V Network.
Floodless in SEATTLE : A Scalable Ethernet ArchiTecTure for Large Enterprises. Changhoon Kim, Matthew Caesar and Jenifer Rexford. Princeton University.
Review: –Ethernet What is the MAC protocol in Ethernet? –CSMA/CD –Binary exponential backoff Is there any relationship between the minimum frame size and.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Introducing Network Design Concepts Designing and Supporting Computer Networks.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Visual Studio Windows Azure Portal Rest APIs / PS Cmdlets US-North Central Region FC TOR PDU Servers TOR PDU Servers TOR PDU Servers TOR PDU.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco PublicITE I Chapter 6 1 Introducing Network Design Concepts Designing and Supporting Computer Networks.
Advanced Computer Networks Lecturer: E EE Eng. Ahmed Hemaid Office: I 114.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Chapter 8 – Cloud Computing
Lent Term M/W/F 11-midday
Web Technologies Lecture 13 Introduction to cloud computing.
Data Centers and Cloud Computing 1. 2 Data Centers 3.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Data Center Architectures
Chapter 6: Securing the Cloud
CIS 700-5: The Design and Implementation of Cloud Networks
Lecture 2: Cloud Computing
Improving Datacenter Performance and Robustness with Multipath TCP
Chapter 4 Data Link Layer Switching
Improving Datacenter Performance and Robustness with Multipath TCP
GGF15 – Grids and Network Virtualization
IS3120 Network Communications Infrastructure
NTHU CS5421 Cloud Computing
VL2: A Scalable and Flexible Data Center Network
Internet and Web Simple client-server model
Lecture 8, Computer Networks (198:552)
Data Center Traffic Engineering
Presentation transcript:

Datacenter Networks Mike Freedman COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101 http://www.cs.princeton.edu/courses/archive/spr13/cos461/

Networking Case Studies Datacenter Backbone Enterprise Cellular Wireless

Cloud Computing 3

Cloud Computing Elastic resources Multi-tenancy Expand and contract resources Pay-per-use Infrastructure on demand Multi-tenancy Multiple independent users Security and resource isolation Amortize the cost of the (shared) infrastructure Flexible service management 4

Cloud Service Models Software as a Service Platform as a Service Provider licenses applications to users as a service E.g., customer relationship management, e-mail, .. Avoid costs of installation, maintenance, patches, … Platform as a Service Provider offers platform for building applications E.g., Google’s App-Engine, Amazon S3 storage Avoid worrying about scalability of platform 5

Cloud Service Models Infrastructure as a Service Provider offers raw computing, storage, and network E.g., Amazon’s Elastic Computing Cloud (EC2) Avoid buying servers and estimating resource needs 6

Enabling Technology: Virtualization Multiple virtual machines on one physical machine Applications run unmodified as on real machine VM can migrate from one computer to another 7

Multi-Tier Applications Applications consist of tasks Many separate components Running on different machines Commodity computers Many general-purpose computers Not one big mainframe Easier scaling

Componentization leads to different types of network traffic “North-South traffic” Traffic to/from external clients (outside of datacenter) Handled by front-end (web) servers, mid-tier application servers, and back-end databases Traffic patterns fairly stable, though diurnal variations “East-West traffic” Traffic within data-parallel computations within datacenter (e.g. “Partition/Aggregate” programs like Map Reduce) Data in distributed storage, partitions transferred to compute nodes, results joined at aggregation points, stored back into FS Traffic may shift on small timescales (e.g., minutes)

North-South Traffic Router Front-End Proxy Web Server Web Server Data Cache Database

East-West Traffic Distributed Storage Map Tasks Reduce Tasks

Datacenter Network

Virtual Switch in Server 13

Top-of-Rack Architecture Rack of servers Commodity servers And top-of-rack switch Modular design Preconfigured racks Power, network, and storage cabling 14

Aggregate to the Next Level 15

Modularity, Modularity, Modularity Containers Many containers 16

Datacenter Network Topology Internet CR CR . . . AR AR AR AR S S . . . S S S S Key CR = Core Router AR = Access Router S = Ethernet Switch A = Rack of app. servers A A … A A A … A ~ 1,000 servers/pod 17

Capacity Mismatch? 1 2 3 . . . “Oversubscription”: Demand/Supply AR AR AR AR 2 S S S S 3 . . . S S S S S S S S A A … A A A … A A A … A A A … A “Oversubscription”: Demand/Supply 1 > 2 > 3 1 < 2 < 3 1 = 2 = 3 18

Particularly bad for east-west traffic Capacity Mismatch! CR CR ~ 200:1 AR AR AR AR S S S S ~ 40:1 . . . S S S S S S S S ~ 5:1 A A … A A A … A A A … A A A … A Particularly bad for east-west traffic 19

Layer 2 vs. Layer 3? Ethernet switching (layer 2) IP routing (layer 3) Cheaper switch equipment Fixed addresses and auto-configuration Seamless mobility, migration, and failover IP routing (layer 3) Scalability through hierarchical addressing Efficiency through shortest-path routing Multipath routing through equal-cost multipath 20

~ 1,000 servers/pod == IP subnet Datacenter Routing Internet CR CR DC-Layer 3 . . . AR AR AR AR DC-Layer 2 S S S S S S S S . . . S S S S Key CR = Core Router (L3) AR = Access Router (L3) S = Ethernet Switch (L2) A = Rack of app. servers A A … A A A … A ~ 1,000 servers/pod == IP subnet 21

Outstanding datacenter networking problems remains…

Network Incast Incast arises from synchronized parallel requests Web Server Data Cache Incast arises from synchronized parallel requests Web server sends out parallel request (“which friends of Johnny are online?” Nodes reply at same time, cause traffic burst Replies potential exceed switch’s buffer, causing drops

Network Incast Solutions mitigating network incast Web Server Data Cache Solutions mitigating network incast Reduce TCP’s min RTO (often use 200ms >> DC RTT) Increase buffer size Add small randomized delay at node before reply Use ECN with instantaneous queue size All of above

Full Bisection Bandwidth Eliminate oversubscription? Enter FatTrees Provide static capacity But link capacity doesn’t “scale-up”. Scale out? Build multi-stage FatTree out of k–port switches k/2 ports up, k/2 down Supports k3/4 hosts: 48 ports, 27,648 hosts

Full Bisection Bandwidth Not Sufficient Must choose good paths for full bisectional throughput Load-agnostic routing Use ECMP across multiple potential paths Can collide, but ephemeral? Not if long-lived, large elephants Load-aware routing Centralized flow scheduling, end-host congestion feedback, switch local algorithms

Conclusion Cloud computing Datacenter networking Major trend in IT industry Today’s equivalent of factories Datacenter networking Regular topologies interconnecting VMs Mix of Ethernet and IP networking Modular, multi-tier applications New ways of building applications New performance challenges

Load Balancing

Load Balancers Spread load over server replicas Present a single public address (VIP) for a service Direct each request to a server replica 10.10.10.1 Virtual IP (VIP) 192.121.10.1 10.10.10.2 10.10.10.3

Wide-Area Network Internet Datacenters Servers DNS Server Clients Router DNS Server DNS-based site selection Servers Internet Clients Datacenters

Wide-Area Network: Ingress Proxies Router Datacenters Servers Clients Proxy