Jon Crowcroft, Data Center Networks for the Application.

Slides:



Advertisements
Similar presentations
Big Data + SDN SDN Abstractions. The Story Thus Far Different types of traffic in clusters Background Traffic – Bulk transfers – Control messages Active.
Advertisements

Software-defined networking: Change is hard Ratul Mahajan with Chi-Yao Hong, Rohan Gandhi, Xin Jin, Harry Liu, Vijay Gill, Srikanth Kandula, Mohan Nanduri,
Lecture 18: Congestion Control in Data Center Networks 1.
Networking Issues in LAN Telephony Brian Yang
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
16-Jun-15 Recursion. 2 Definitions I A recursive definition is a definition in which the thing being defined occurs as part of its own definition Example:
Design Patterns for Efficient Graph Algorithms in MapReduce Jimmy Lin and Michael Schatz University of Maryland Tuesday, June 29, 2010 This work is licensed.
Introduction to Boosting Aristotelis Tsirigos SCLT seminar - NYU Computer Science.
Homework 2 In the docs folder of your Berkeley DB, have a careful look at documentation on how to configure BDB in main memory. In the docs folder of your.
28-Jun-15 Recursion. 2 Definitions I A recursive definition is a definition in which the thing being defined occurs as part of its own definition Example:
Information-Agnostic Flow Scheduling for Commodity Data Centers
7/14/2015EECS 584, Fall MapReduce: Simplied Data Processing on Large Clusters Yunxing Dai, Huan Feng.
Recursion. Definitions I A recursive definition is a definition in which the thing being defined occurs as part of its own definition Example: A list.
SM3121 Software Technology Mark Green School of Creative Media.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
FIREWALL TECHNOLOGIES Tahani al jehani. Firewall benefits  A firewall functions as a choke point – all traffic in and out must pass through this single.
Practical TDMA for Datacenter Ethernet
TitleEfficient Timing Channel Protection for On-Chip Networks Yao Wang and G. Edward Suh Cornell University.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Cloud Computing April 19, Agenda Introduction What is cloud computing? What are the benefits? What are the concerns? What’s next?
Assumptions Hypothesis Hopes RAMCloud Mendel Rosenblum Stanford University.
Jon Turner (and a cast of thousands) Washington University Design of a High Performance Active Router Active Nets PI Meeting - 12/01.
1 Performance Evaluation of Computer Networks: Part II Objectives r Simulation Modeling r Classification of Simulation Modeling r Discrete-Event Simulation.
Advanced Topics in Distributed Systems Fall 2011 Instructor: Costin Raiciu.
David G. Andersen CMU Guohui Wang, T. S. Eugene Ng Rice Michael Kaminsky, Dina Papagiannaki, Michael A. Kozuch, Michael Ryan Intel Labs Pittsburgh 1 c-Through:
CS 395 Last Lecture Summary, Anti-summary, and Final Thoughts.
Data Structures & Algorithms and The Internet: A different way of thinking.
CPU S CHEDULING Lecture: Operating System Concepts Lecturer: Pooja Sharma Computer Science Department, Punjabi University, Patiala.
On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1.
Analysis of Algorithms
Probabilistic Consistency and Durability in RAINS: Redundant Array of Independent, Non-Durable Stores Andy Huang and Armando Fox Stanford University.
1 Flow Identification Assume you want to guarantee some type of quality of service (minimum bandwidth, maximum end-to-end delay) to a user Before you do.
MapReduce Kristof Bamps Wouter Deroey. Outline Problem overview MapReduce o overview o implementation o refinements o conclusion.
Distributed Galois Andrew Lenharth 2/27/2015. Goals An implementation of the operator formulation for distributed memory – Ideally forward-compatible.
Wei Bai with Li Chen, Kai Chen, Dongsu Han, Chen Tian, Hao Wang SING HKUST Information-Agnostic Flow Scheduling for Commodity Data Centers 1 SJTU,
Virtual Machines Created within the Virtualization layer, such as a hypervisor Shares the physical computer's CPU, hard disk, memory, and network interfaces.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
MapReduce: Simplified Data Processing on Large Clusters By Dinesh Dharme.
Presenter : Weerawardhana J.L.M.N. Department of Computer Engineering, University of Peradeniya.
Recursion. Definitions I A recursive definition is a definition in which the thing being defined occurs as part of its own definition Example: A list.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Data Centers and Cloud Computing 1. 2 Data Centers 3.
Distributed databases A brief introduction with emphasis on NoSQL databases Distributed databases1.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
CPU scheduling.  Single Process  one process at a time  Maximum CPU utilization obtained with multiprogramming  CPU idle :waiting time is wasted 2.
Three topics – 1. Data Centers, 2. IoT, 3. Networks for Dev/Disasters Jon Crowcroft,
The Post Windows Operating System
Lab A: Planning an Installation
Data Center Networks for the Application
Cloud Legal Project - Update
Software Architecture in Practice
Srinivas Narayana MIT CSAIL October 7, 2016
Hamed Rezaei, Mojtaba Malekpourshahraki, Balajee Vamanan
Transport Protocols Relates to Lab 5. An overview of the transport protocols of the TCP/IP protocol suite. Also, a short discussion of UDP.
The Stanford Clean Slate Program
湖南大学-信息科学与工程学院-计算机与科学系
James B. Orlin Presented by Tal Kaminker
CPU Scheduling Basic Concepts Scheduling Criteria
Congestion Control in Software Define Data Center Network
Introduction There are many situations in which we might use replicated data Let’s look at another, different one And design a system to work well in that.
Big-O, Ω, ϴ Trevor Brown CSC263 Tutorial 1 Big-O, Ω, ϴ Trevor Brown
Network Processors for a 1 MHz Trigger-DAQ System
Centralized Arbitration for Data Centers
Introduction to MapReduce
CS 6290 Many-core & Interconnect
CIS679: Two Planes and Int-Serv Model
Presentation transcript:

Jon Crowcroft, Data Center Networks for the Application

Data Centers don’t just go fast They need to serve applications 1. Latency, not just throughput 2. Availability & Failure Detectors 3. Application code within network Work based on Matt Grosvenor’s PhD And EPSRC funded NaaS project…

1. Deterministic latency bounding Learned what I was teaching wrong! I used to say: Integrated Service too complex Admission&scheduling hard Priority Queue can’t do it PGPS computation for latency? I present Qjump scheme, which Uses intserv (PGPS) style admission ctl Uses priority queues for service levels

Data Center Latency Problem Tail of the distribution, due to long/bursty flows interfering Need to separate classes of flow Low latency are usually short flows (or RPCs) Bulk transfers aren’t so latency/jitter sensitiv

Data Center Qjump Solution In Data Center, not general Internet! can exploit topology & traffic matrix & source behaviour knowledge Regular, and simpler topology key But also largely “cooperative” world…

5 port virtual output queued

Latency of ping v. iperf

Hadoop perturbs time synch

Hadoop perturbs memcached

Hadoop perturbs Naiad

Qjump – self admit/policer (per hypervisor, at least)

Qjump latency thru 1 switch

Memcached latency redux w/ QJ

QJ naiad barrier synch latency redux

Experimental topology

PTPd/memcached along, with Hadoop, with Hadoop+QJump

2PC throughput with and without QJump

QJump: variance compared with other schemes…

Topology for flow completion experiment

Web search FCT100Kb ave

Web search FCT 99 th centile

Web search FCT 10Mb ave

Data-mining 100k FCT ave

Data-mining 100k FCT 99 th cent

Data-mining 10M FCT ave

Memcached latency

Qjump scale-up emulation

Latency bound validation topo

Big Picture Comparison – Related work…

2 Failure Detectors 2PC & CAP theorem Recall CAP (Brewer’s Hypothesis) Consistency, Availability, Partitions Strong& weak versions! If have net&node deterministic failure detector, isn’t necessarily so! What can we use CAP-able system for?

Consistent, partition tolerant app? Software Defined Net update! Distributed controllers have distributed rules Rules change from time to time Need to update, consistently Need update to work in presence of partitions By definition! So Qjump may let us do this too!

3. Application code -> Network Last piece of data center working for application Switch and Host NICs have a lot of smarts Network processors, like GPUs or (net)FPGAs Can they help applications? In particular, avoid pathological traffic patterns (e.g. TCP incast)

Application code E.g. shuffle phase in map/reduce Does a bunch of aggregation (min, max, ave) on a row of results And is cause of traffic “implosion” So do work in stages in the switches in the net (like merge sort!) Code very simple Cross-compile into switch NIC cpus

Other application examples Are many … Arose in Active Network research Transcoding Encryption Compression Index/Search Etc etc

Need language to express these Finite iteration (not Turing-complete language) So design python– with strong types! Work in progress in NaaS project at Imperial and Cambridge…

Conclusions/Discussion Data Center is a special case! Its important enough to tackle We can hard bound latency easily We can detect failures and therefore solve some nice distributed consensus problems We can optimise applicatiosn pathological traffic patterns Plenty more to do…