6.888 Lecture 6: Network Performance Isolation Mohammad Alizadeh Spring 2016 1.

Slides:



Advertisements
Similar presentations
Elastic Provisioning In Virtual Private Clouds
Advertisements

Virtual Switching Without a Hypervisor for a More Secure Cloud Xin Jin Princeton University Joint work with Eric Keller(UPenn) and Jennifer Rexford(Princeton)
Towards Predictable Datacenter Networks
EyeQ: (An engineer’s approach to) Taming network performance unpredictability in the Cloud Vimal Mohammad Alizadeh Balaji Prabhakar David Mazières Changhoon.
Sharing Cloud Networks Lucian Popa, Gautam Kumar, Mosharaf Chowdhury Arvind Krishnamurthy, Sylvia Ratnasamy, Ion Stoica UC Berkeley.
Deconstructing Datacenter Packet Transport Mohammad Alizadeh, Shuang Yang, Sachin Katti, Nick McKeown, Balaji Prabhakar, Scott Shenker Stanford University.
System Center 2012 R2 Overview
The Case for Enterprise Ready Virtual Private Clouds Timothy Wood, Alexandre Gerber *, K.K. Ramakrishnan *, Jacobus van der Merwe *, and Prashant Shenoy.
Stratos: A Network-Aware Orchestration Layer for Middleboxes in the Cloud Aditya Akella, Aaron Gember, Anand Krishnamurthy, Saul St. John University of.
Virtualization and Cloud Computing Virtualization David Bednárek, Jakub Yaghob, Filip Zavoral.
FairCloud: Sharing the Network in Cloud Computing Computer Communication Review(2012) Arthur : Lucian Popa Arvind Krishnamurthy Sylvia Ratnasamy Ion Stoica.
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds School of Computer Engineering Nanyang Technological University,
My Point of View about Bandwidth Sharing Bin Wang.
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.
Course Name- CSc 8320 Advanced Operating Systems Instructor- Dr. Yanqing Zhang Presented By- Sunny Shakya Latest AOS techniques, applications and future.
Software Defined Networking COMS , Fall 2014 Instructor: Li Erran Li SDNFall2014/
Virtualization Abdullah Aldhamin June 3, Outline  Virtualization: what and why?  Server Virtualization Offerings  Shortcoming and Challenges.
Alan Shieh Cornell University Srikanth Kandula Albert Greenberg Changhoon Kim Microsoft Research Seawall: Performance Isolation for Cloud Datacenter Networks.
NoHype: Virtualized Cloud Infrastructure without the Virtualization Eric Keller, Jakub Szefer, Jennifer Rexford, Ruby Lee ISCA 2010 Princeton University.
Automatic Run-time Adaptation in Virtual Execution Environments Ananth I. Sundararaj Advisor: Peter A. Dinda Prescience Lab Department of Computer Science.
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
Jerry Chou and Bill Lin University of California, San Diego
© 2008 AT&T Intellectual Property. All rights reserved. CloudNet: Where VPNs Meet Cloud Computing Flexibly and Dynamically Timothy Wood Kobus van der Merwe,
An Optimization Problem in Adaptive Virtual Environments Ananth I. Sundararaj Manan Sanghi Jack R. Lange Peter A. Dinda Prescience Lab Department of Computer.
Service Management API Management Portal Service Provider Foundation SPF Web Server Stamp2 SPF DB VMM Server 2 Stamp1 VMM Server 1 Stamp scale.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
Transport SDN: Key Drivers & Elements
Microsoft Virtual Academy Module 4 Creating and Configuring Virtual Machine Networks.
Data Center Network Redesign using SDN
Network Sharing Issues Lecture 15 Aditya Akella. Is this the biggest problem in cloud resource allocation? Why? Why not? How does the problem differ wrt.
Sharing the Data Center Network Alan Shieh, Srikanth Kandula, Albert Greenberg, Changhoon Kim, Bikas Saha Microsoft Research, Cornell University, Windows.
Introduction To Windows Azure Cloud
Department of Computer Science Engineering SRM University
Algorithms for Provisioning Virtual Private Networks in the Hose Model Source: Sigcomm 2001, to appear in IEEE/ACM Transactions on Networking Author: Amit.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jennifer Rexford Princeton University With Jiayue He, Rui Zhang-Shen, Ying Li,
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
End-to-end resource management in DiffServ Networks –DiffServ focuses on singal domain –Users want end-to-end services –No consensus at this time –Two.
Virtualization Infrastructure Administration Network Jakub Yaghob.
Adaptive software in cloud computing Marin Litoiu York University Canada.
Towards Wide-Area Network Virtualization Panagiotis Papadimitriou Leibniz Universität Hannover With acknowledgments to Zdravko Bozakov, David Dietrich,
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
Microsoft Virtual Academy.
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jiayue He, Rui Zhang-Shen, Ying Li, Cheng-Yen Lee, Jennifer Rexford, and Mung.
Resource Allocation in Network Virtualization Jie Wu Computer and Information Sciences Temple University.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Towards Predictable Data Centers Why Johnny can’t use the cloud and what we can do about it? Hitesh Ballani, Paolo Costa, Thomas Karagiannis, Greg O’Shea.
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
Shuihai Hu, Wei Bai, Kai Chen, Chen Tian (NJU), Ying Zhang (HP Labs), Haitao Wu (Microsoft) Sing Hong Kong University of Science and Technology.
R2C2: A Network Stack for Rack-scale Computers Paolo Costa, Hitesh Ballani, Kaveh Razavi, Ian Kash Microsoft Research Cambridge EECS 582 – W161.
Software Defined Datacenter – from Vision to Solution
Chen Qian, Xin Li University of Kentucky
Yiting Xia, T. S. Eugene Ng Rice University
CIS 700-5: The Design and Implementation of Cloud Networks
Data Center Network Architectures
Data Centers: Network Architecture
Improving Datacenter Performance and Robustness with Multipath TCP
SQL DB Elastic Pools for SaaS
Network Layer Goals: Overview:
Elastic Provisioning In Virtual Private Clouds
CloudMirror: Application-Driven Bandwidth Guarantees in Datacenters
VL2: A Scalable and Flexible Data Center Network
Centralized Arbitration for Data Centers
Lecture 17, Computer Networks (198:552)
Loom: Flexible and Efficient NIC Packet Scheduling
An Optimization Problem in Adaptive Virtual Environments
Towards Predictable Datacenter Networks
Data Center Traffic Engineering
Presentation transcript:

6.888 Lecture 6: Network Performance Isolation Mohammad Alizadeh Spring

Multi-tenant Cloud Data Centers Shared infrastructure between multiple tenants/apps

Lack of Performance Predictability Unpredictable performance, esp. at the tail GAE memcache read 100 values 3

Congestion Kills Predictability 4 Apr 20134NSDI 2013

5 ?

Sharing the Network 6 Alice’s Switch VM1 VM2 VMn VM3 … Bob’s Switch VM1 VM2 VMi VM3 … Customer specifies capacity of the virtual NIC. No traffic matrix. … … Hose Model (Duffield et al., SIGCOMM’99) 2Ghz VCPU 15GB memory 1Gb/s network

Sharing the Network Tenant selects bandwidth guarantees. Models: Hose, VOC, TAG Place VMs, ensuring all guarantees can be met Enforce bandwidth guarantees & Provide work-conservation VM setup Runtime Oktopus [SIGCOMM’10] Hadrian [NSDI’13] CloudMirror [SIGCOMM’14] Seawall [NSDI’10] FairCloud [SIGCOMM’12] EyeQ [NSDI’13] ElasticSwitch [SIGCOMM’13] ….  Adapted from slide by Lucian Popa

Example Runtime System: EyeQ (NSDI’13) 8

Shim Distributed Rate Allocation VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s Shim 10Gb/s pipe (min) Rate Guarantees EyeQ Shim Layer In the trusted Domain (Hypervisor/NIC)

Distributed Rate Allocation VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s 5Gb/s 10Gb/s pipe (min) Rate Guarantees

RX Module Distributed Rate Allocation VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s 5Gb/s

Distributed Rate Allocation VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s 1Gb/s 8Gb/s

VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s 1Gb/s 8Gb/s 5Gb/s Distributed Rate Allocation

RX Module Distributed Rate Allocation VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s 1Gb/s 5Gb/s Spare capacity

Distributed Rate Allocation VM 2Gb/s 8Gb/s 2Gb/s 8Gb/s 2.5Gb/s 5Gb/s

Transmit/Receive Modules VM 2Gb/s 8Gb/s 1Gb/s Congestion detectors Rate limit. RCP: Rate feedback (R) every 10kB (no per-source state needed) Per-destination rate limiters: only if dest. is congested… bypass otherwise Feedback pkt Rate: 1Gb/s 2Gb/s 8Gb/s

Sharing the Network Tenant selects bandwidth guarantees. Models: Hose, VOC, TAG Place VMs, ensuring all guarantees can be met Enforce bandwidth guarantees & Provide work-conservation VM setup Runtime  Adapted from slide by Lucian Popa Cloud Mirror Uses ElasticSwitch [SIGCOMM’13]

Cloud Mirror 18  Slides based on presentation by JK Lee (HP)

Motivation Cloud applications are diverse & complex Bandwidth models like pipe and hose not a good fit 19 [Bing.com traffic pattern, Sigcomm’12] web DB cache web logic

Hose model is unfit Hose aggregates BW towards different components – Too coarse-grained – Prevents accurate and efficient guarantees on infrastructure intra-component (self-edge) inter-component

Hose is too coarse-grained web logicDB Web … … Logic DB TCP-like fair allocation would yield 300:200 3-tier web example Hose model congestion

ww 2B web (N) B … LL … DD … logic (N) DB (N) 2B Hose over-provisions physical link bandwidth Hose model reservation at L 2 : 2B · N N: # VMs in each tier B: per-VM per-edge bandwidth Physical deployment example 2X overprovision by Hose Model 2 B N logic - DB demand = B · N web (N) logic (N) DB (N) BB B web + logic DB L1L1 L2L2 ww … LL … DD …

Contributions 1.Tenant Application Graph (TAG) - Accurate for complex apps - Flexible to elastic scaling - Intuitive 2.VM Placement Algorithm - Guarantee bandwidth and high availability - Efficient for network and compute resources 23

Tenant Application Graph (TAG) 1. Aggregate pipes (like Hose) - Model simplicity - Multiplexing gain 2. Preserve inter-component structure (like Pipe) - Accurately capture application demands - Efficiently utilize network resources DB mem web logic DB mem web logic DB mem web logic Component-level graph

Tenant Application Graph (TAG) web (N w ) DB (N D ) B snd B rcv B in web DB B snd B rcv TAG model B snd = per-VM sending bandwidth (VM-to-component aggregation) B rcv = per-VM receiving bandwidth (component-to-VM aggregation) What do self-edges mean?

Abstract models in TAG Self-edge  Hose Directional edge  directional Hose, Virtual Trunk Total guarantee of virtual trunk = min(B snd ·N w, B rcv ·N D ) B rcv web(N w ) B sn d … … DB(N D ) B in Virtual Switch Virtual Trunk web (N w ) DB (N D ) B snd B rcv B in TAG model

Questions How are TAGs constructed? How to predict bandwidth demands? What is missing for the TAG model?

CloudMirror operation VM placement BW reservation Admission control VM placement BW reservation Admission control TAG spec Network topology & BW reservation state Available VM slots host1 10 host2 50 host3 25 We b (N) DB (N) B

Discussion 29

Next Time: Centralized Arbitration 30

31