A Network-State Management Service

Slides:



Advertisements
Similar presentations
Jennifer Rexford Princeton University MW 11:00am-12:20pm Logically-Centralized Control COS 597E: Software Defined Networking.
Advertisements

Windows® Deployment Services
On the Management of Virtual Networks Group 3 Yang Wenzhi Wang Qian Issam Alkindi Zhu Guangjin Zhang Haifeng Sanjeev.
SDN Controller Challenges
Traffic Engineering with Forward Fault Correction (FFC)
Software-defined networking: Change is hard Ratul Mahajan with Chi-Yao Hong, Rohan Gandhi, Xin Jin, Harry Liu, Vijay Gill, Srikanth Kandula, Mohan Nanduri,
Dynamic Scheduling of Network Updates Xin Jin Hongqiang Harry Liu, Rohan Gandhi, Srikanth Kandula, Ratul Mahajan, Ming Zhang, Jennifer Rexford, Roger Wattenhofer.
Dynamic Scheduling of Network Updates Based on the slides by Xin Jin Hongqiang Harry Liu, Rohan Gandhi, Srikanth Kandula, Ratul Mahajan, Ming Zhang, Jennifer.
A Network-State Management Service Peng Sun Ratul Mahajan, Jennifer Rexford, Lihua Yuan, Ming Zhang, Ahsan Arefin Princeton & Microsoft.
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
Inside Windows Azure Virtual Machines Vijay Rajagopalan Microsoft Corporation.
Deep Dive into Windows Azure Virtual Machines – From Cloud Vendor and Enterprise Perspective Vijay Rajagopalan Principal Lead Program Manager Microsoft.
Build a SharePoint App with Microsoft Access. About me.
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
Virtualized FPGA accelerators in Cloud Computing Systems
Windows ® Deployment Services Infrastructure Planning and Design Published: February 2008 Updated: January 2012.
Copyright © 2011 EMC Corporation. All Rights Reserved. MODULE – 6 VIRTUALIZED DATA CENTER – DESKTOP AND APPLICATION 1.
Basic Concepts Of CITRIX XENAPP.
NetPilot: Automating Datacenter Network Failure Mitigation Xin Wu, Daniel Turner, Chao-Chih Chen, David A. Maltz, Xiaowei Yang, Lihua Yuan, Ming Zhang.
OS Services And Networking Support Juan Wang Qi Pan Department of Computer Science Southeastern University August 1999.
Visual Studio Windows Azure Portal Rest APIs / PS Cmdlets US-North Central Region FC TOR PDU Servers TOR PDU Servers TOR PDU Servers TOR PDU.
Virtual Machines Created within the Virtualization layer, such as a hypervisor Shares the physical computer's CPU, hard disk, memory, and network interfaces.
SDN Management Layer DESIGN REQUIREMENTS AND FUTURE DIRECTION NO OF SLIDES : 26 1.
Operating Systems Security
CellSDN: Software-Defined Cellular Core networks Xin Jin Princeton University Joint work with Li Erran Li, Laurent Vanbever, and Jennifer Rexford.
REST By: Vishwanath Vineet.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
CSci8211: SDN Controller Design 1 Overview of SDN Controller Design  SDN Re-cap  SDN Controller Design: Case Studies  NOX Next Week:  ONIX  ONOS 
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
Windows Azure Overview for IT Pros Anton Boyko. Intro to Cloud Computing Intro to Windows Azure Cloud Services Web Sites Virtual Machines Workload Options.
Maslow’s hierarchy of network programming and the unmet needs Ratul Mahajan Microsoft Research.
SDN controllers App Network elements has two components: OpenFlow client, forwarding hardware with flow tables. The SDN controller must implement the network.
Chen Qian, Xin Li University of Kentucky
Building ARM IaaS Application Environment
CIS 700-5: The Design and Implementation of Cloud Networks
Lecture 2: Cloud Computing
RHEV Platform at LHCb Red Hat at CERN 17-18/1/17
Operating Systems & System Software
TrueSight Operations Management 11.0 Architecture
Distributed Cache Technology in Cloud Computing and its Application in the GIS Software Wang Qi Zhu Yitong Peng Cheng
Heitor Moraes, Marcos Vieira, Italo Cunha, Dorgival Guedes
Network Operating System Lab
Direct Attached Storage and Introduction to SCSI
MCTS Guide to Microsoft Windows 7
Operating Systems (CS 340 D)
NOX: Towards an Operating System for Networks
The Client/Server Database Environment
Overview of SDN Controller Design
LQCD Computing Operations
Oracle Solaris Zones Study Purpose Only
Dynamic Scheduling of Network Updates
HP Operations Orchestration
Migration Strategies – Business Desktop Deployment (BDD) Overview
Parallel and Multiprocessor Architectures – Shared Memory
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Virtualization Techniques
Edge computing (1) Content Distribution Networks
Database System Architecture
Cloud computing mechanisms
Language Processors Application Domain – ideas concerning the behavior of a software. Execution Domain – Ideas implemented in Computer System. Semantic.
Internet and Web Simple client-server model
Chapter 4 Network Layer Computer Networking: A Top Down Approach 5th edition. Jim Kurose, Keith Ross Addison-Wesley, April Network Layer.
Cloud-Enabling Technology
IP Control Gateway (IPCG)
Myrinet 2Gbps Networks (
Azure Container Service
Features Overview.
In-network computation
STATEL an easy way to transfer data
Presentation transcript:

A Network-State Management Service Paper 1 A Network-State Management Service Peng Sun(Princeton), Ratul Mahajan, Jennifer Rexford, (Princeton), Lihua Yuan, Ming Zhang, Ahsan Arefin( Microsoft) ( in the Proc. of ACM SIGCOMM 2014) Presented by: Suman Maroju EECS,Northwestern University

Paper Overview A network-state management service called Statesman for DCNs is presented. It allows multiple management applications to operate independently while ensuring performance and network- wide invariants. View based architecture. Tested on Microsoft Azure data center for 7 months. 3 applications tested.

DCN A modern data center is home to tens of thousands of hosts, each consisting of one or more processors, memory, network interface, and local high-speed I/O (disk or flash). Compute resources are packaged into racks and allocated as clusters consisting of thousands of hosts that are tightly connected with a high-bandwidth network.

Problem 1-Application conflict Which ever happens first takes the control

Problem 2-Safety violation Joint actions disconnects the ToR (top-of-rack)

(Corybantic)

Three views of the network state: Statement uses three views of the proposed state: Observed Proposed Target Design inspired by version control system git Each application corresponds to different git user. Observed-pull;Proposed-pushed;Target-merged.

Dependency Model of State Variable Prior work considered independent variable-value pairs. Does not contain enough semantic knowledge about how various state variables are related. Dependency model can capture the domain-specific cross variable dependencies among the state variable.

Detailed Architecture:

Input and Output in Statesman

Network State Variables & Controllability Firmware-upgrade DeviceFirmwareVersion DeviceFirmwareVersionls- Controllable Switch configuration DeviceConfiglsControllable LinkAdminPowerlsControllable

Checking Network State Resolving Conflicts 1.TS-OS OpenFlow agent-DeviceAgent-BootStatus=Down. So TS cannot be applied. 2.PS-OS LinkEndAddress=Down PS cannot be applied. 3.PS-TS Upgrading a switch. Read controllability values from OS, set uncontrollability values at PS or TS and use SkipUpdate to resolve TS-OS conflicts or partial rejection. For PS-TS conflicts, last-write wins, priority based locking.

Statesman System Design and Implementation Storage: 50000 lines of C# and C++ code. RESTful web service. Paxos rings (Smaller): Storage instance multiple locations. Smaller rings. Proxy layer for uniform access. Updator: Command Template (OpenFlow,BGP etc) Monitor: (SNMP,OpenFlow)

Read-write APIs of Statesman Implemented as a HTTP web service with RESTful APIs. Freshness parameter included (Staleness). Link failure mitigation DeviceFirmwareVersion

Application experiences: 1.Switch upgrade DeviceFirmwareVersion 2.Failure mitigation Frame-Check-Sequence(FCS) error rates, LinkAdminPower-shutdown and generate repair ticket. 3.Inter-DC TE Bandwidth demands from bandwidth broker Tunnel status and flow matching rules. 99% of the ToR pairs in the DC should have atleast 50% of their baseline capacity

Conflict resolution in Statesman

Handling Operational Failures Switch-upgrade application on 250 switches. A. Straggling switch takes 4 hours to upgrade. Cannot download new firmware image. B. Unstable switches. C. Failure case( human intervention)

System Performance Latency

Checker performance

Read-write performance

Related Work Most of the previous works enable centralized control of traffic flow by directly forwarding states of switches. Similar to Statesman, Onix and Hercules provide a shared network-state platform for all applications but not designed to resolve conflicts. Pyretic, PANE and Maple are recent proposals to deal with multiple applications but focus only on traffic management. Corybantic used explicit resolution by evaluation other applications proposals leading to complexity. Other approaches include partitioning the network into multiple isolated virtual slices.

Thanks! Question?