2005/02/231/27 Scalability and Accuracy in a Large- Scale Network Emulator Amin Vahdat, Ken Yocum, Kevin Walsh, Priya Mahadevan, Dejan Kostić, Jeff Chase,

Slides:



Advertisements
Similar presentations
11 Modelnet Emulation environment for wide-area systems
Advertisements

Emulatore di Protocolli di Routing per reti Ad-hoc Alessandra Giovanardi DI – Università di Ferrara Pattern Project Area 3: Problematiche di instradamento.
Router Architecture : Building high-performance routers Ian Pratt
1 Modeling and Emulation of Internet Paths Pramod Sanaga, Jonathon Duerig, Robert Ricci, Jay Lepreau University of Utah.
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
OSMOSIS Final Presentation. Introduction Osmosis System Scalable, distributed system. Many-to-many publisher-subscriber real time sensor data streams,
CS335 Networking & Network Administration Tuesday, May 18, 2010.
Scalability and Accuracy in a Large- Scale Network Emulator Amin Vahdat, Ken Yocum, Kevin Walsh, Priya Mahadevan, Dejan Kostic, Jeff Chase, and David Becker.
1 Sonia Fahmy Ness Shroff Students: Roman Chertov Rupak Sanjel Center for Education and Research in Information Assurance and Security (CERIAS) Purdue.
CS335 Networking & Network Administration Tuesday, April 20, 2010.
Tesseract A 4D Network Control Plane
Xen and the Art of Virtualization. Introduction  Challenges to build virtual machines Performance isolation  Scheduling priority  Memory demand  Network.
Switching, routing, and flow control in interconnection networks.
Practical TDMA for Datacenter Ethernet
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
T. S. Eugene Ngeugeneng at cs.rice.edu Rice University1 COMP/ELEC 429 Introduction to Computer Networks Lecture 8: Bridging Slides used with permissions.
Chapter 4: Managing LAN Traffic
Ad Hoc Networking via Named Data Michael Meisel, Vasileios Pappas, and Lixia Zhang UCLA, IBM Research MobiArch’10, September 24, Shinhaeng.
Hosting Virtual Networks on Commodity Hardware VINI Summer Camp.
Wave Relay System and General Project Details. Wave Relay System Provides seamless multi-hop connectivity Operates at layer 2 of networking stack Seamless.
NetworkProtocols. Objectives Identify characteristics of TCP/IP, IPX/SPX, NetBIOS, and AppleTalk Understand position of network protocols in OSI Model.
1 Pertemuan 20 Teknik Routing Matakuliah: H0174/Jaringan Komputer Tahun: 2006 Versi: 1/0.
Eric Keller, Evan Green Princeton University PRESTO /22/08 Virtualizing the Data Plane Through Source Code Merging.
CSE 6590 Fall 2010 Routing Metrics for Wireless Mesh Networks 1 4 October, 2015.
Objectives: Chapter 5: Network/Internet Layer  How Networks are connected Network/Internet Layer Routed Protocols Routing Protocols Autonomous Systems.
CCNA 1 Module 10 Routing Fundamentals and Subnets.
1 Heterogeneity in Multi-Hop Wireless Networks Nitin H. Vaidya University of Illinois at Urbana-Champaign © 2003 Vaidya.
A novel approach of gateway selection and placement in cellular Wi-Fi system Presented By Rajesh Prasad.
Sharing Information across Congestion Windows CSE222A Project Presentation March 15, 2005 Apurva Sharma.
Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.
CS 4700 / CS 5700 Network Fundamentals Lecture 17: Network Modeling (Not Everyone has a Datacenter)
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
Group 3 Sandeep Chinni Arif Khan Venkat Rajiv. Delay Tolerant Networks Path from source to destination is not present at any single point in time. Combining.
High-speed TCP  FAST TCP: motivation, architecture, algorithms, performance (by Cheng Jin, David X. Wei and Steven H. Low)  Modifying TCP's Congestion.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
CSE 6590 Fall 2009 Routing Metrics for Wireless Mesh Networks 1 12 November, 2015.
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
Utility Computing Amin Vahdat Internet Systems and Storage Group
6 December On Selfish Routing in Internet-like Environments paper by Lili Qiu, Yang Richard Yang, Yin Zhang, Scott Shenker presentation by Ed Spitznagel.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Based upon slides from Jay Lepreau, Utah Emulab Introduction Shiv Kalyanaraman
1 Wide Area Network Emulation on the Millennium Bhaskaran Raman Yan Chen Weidong Cui Randy Katz {bhaskar, yanchen, wdc, Millennium.
Symbiotic Routing in Future Data Centers Hussam Abu-Libdeh Paolo Costa Antony Rowstron Greg O’Shea Austin Donnelly MICROSOFT RESEARCH Presented By Deng.
Teknik Routing Pertemuan 10 Matakuliah: H0524/Jaringan Komputer Tahun: 2009.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Routing and Routing Protocols PJC CCNA Semester 2 Ver. 3.0 by William Kelly.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
1 IEX8175 RF Electronics Avo Ots telekommunikatsiooni õppetool, TTÜ raadio- ja sidetehnika inst.
1 11 Distributed Channel Assignment in Multi-Radio Mesh Networks Bong-Jun Ko, Vishal Misra, Jitendra Padhye and Dan Rubenstein Columbia University.
L Subramanian*, I Stoica*, H Balakrishnan +, R Katz* *UC Berkeley, MIT + USENIX NSDI’04, 2004 Presented by Alok Rakkhit, Ionut Trestian.
Routing Algorithms Lecture Static/ Dynamic, Direct/ Indirect, Shortest Path Routing, Flooding, Distance Vector Routing, Link State Routing, Hierarchical.
1 Switching and Forwarding Sections Connecting More Than Two Hosts Multi-access link: Ethernet, wireless –Single physical link, shared by multiple.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
1 LAN switching and Bridges Relates to Lab Outline Interconnection devices Bridges/LAN switches vs. Routers Bridges Learning Bridges Transparent.
Routing Semester 2, Chapter 11. Routing Routing Basics Distance Vector Routing Link-State Routing Comparisons of Routing Protocols.
MicroGrid Update & A Synthetic Grid Resource Generator Xin Liu, Yang-suk Kee, Andrew Chien Department of Computer Science and Engineering Center for Networked.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
Architecture and Algorithms for an IEEE 802
The Underlying Technologies
Chapter 4: Routing Concepts
CS 4700 / CS 5700 Network Fundamentals
Pong: Diagnosing Spatio-Temporal Internet Congestion Properties
File Transfer Issues with TCP Acceleration with FileCatalyst
Development & Evaluation of Network Test-beds
CS 4700 / CS 5700 Network Fundamentals
ModelNet: A Large-Scale Network Emulator for Wireless Networks Priya Mahadevan, Ken Yocum, and Amin Vahdat Duke University, Goal:
Distributed Systems CS
Distributed Systems CS
Presentation transcript:

2005/02/231/27 Scalability and Accuracy in a Large- Scale Network Emulator Amin Vahdat, Ken Yocum, Kevin Walsh, Priya Mahadevan, Dejan Kostić, Jeff Chase, and David Becker Duke University Proceedings of 5th Symposium on Operating Systems Design and Implementation (OSDI 2002)

2005/02/232/27 Introduction ● Evaluate Internet-scale distributed systems ● E.g. peer-to-peer, overlay, wide-area replication ● Realistic scenarios: real world ● Difficult to deploy and administer ● Results not reproducible or not necessarily representative of future behaviour ● Simulations: e.g. NS ● More control ● May miss important system interactions ● Emulation ● Run unmodified code on target platforms ● More control: can subject system traffic to constraints (bandwidth, latency, loss rate, topology,…) ● Thus far limited to small and static systems  ModelNet

2005/02/233/27 Goal of ModelNet ● Environment should support: ● Unmodified applications ● Reproducible results ● Experimentation under broad range of network topologies and dynamically changing network characteristics ● Large-scale experiments with large number of nodes and high traffic

2005/02/234/27 ModelNet Architecture ● Scalable Internet emulation environment ● Based on dummynet, extended to improve accuracy and include multi-hop and multi-core emulation ● Edge nodes running user-specified OS and applications ● Each instance is a virtual edge node (VN) with unique IP in emulated topology ● Route traffic through core routers ● Core nodes emulate behaviour of configured target network ● Captures effects of congestion and cross-traffic ● Uses emulated links or pipes

2005/02/235/27 ModelNet Phases ● CREATE ● Generate network topology  GML graph (*) ● Can use Internet traces, BGP dumps, synthetic topology generators ● User can annotate graph to specify packet loss rates, failure distribution, etc. (*) GML – graph modeling language

2005/02/236/27 ModelNet Phases ● DISTILL ● Transform GML graph to pipe topology to model target network ● Simplify network ● Trade accuracy for reduced emulation cost

2005/02/237/27 ModelNet Phases ● ASSIGN ● Map distilled topology to core nodes, load balancing ● Ideal assignment NP- complete problem ● Mapping pipes to cores depends on routing, link properties and traffic load ● Use simple greedy k-clusters assignment ● Randomly pick one node in the topology for each core node, then cores greedily select from connected nodes in round-robin

2005/02/238/27 ModelNet Phases ● BIND ● Assign VNs to edge nodes ● Can have multiple VNs per physical edge node ● Bind each physical node to a single core ● Install sets of pipes in distilled topology and routing tables with shortest-path between VN pairs ● Configure edge nodes with IP addresses for each VN

2005/02/239/27 ModelNet Phases ● RUN ● Execute target applications on edge nodes

2005/02/2310/27 The Core ● Principal tasks (in steady state) ● Receive packets from network interface ● Move packets ● Pipe to pipe ● Pipe to final destination ● Moving packets is strictly higher priority than receiving packets ● Preferentially emulate packets already in core  core CPU saturation results in dropped packets at physical level rather than emulation

2005/02/2311/27 The Core ● Traffic routing ● Emulate links as pipes ● Pre-computed shortest- path for all VN pairs  requires O(n 2 ) space ● Route is ordered list of pipes ● Move packets through pipes by reference (packet descriptor)

2005/02/2312/27 The Core ● Packet scheduling ● Heap of pipes sorted by earliest deadline (exit time for first packet in queue) ● Scheduler executes once per clock tick (10KHz), runs at kernel’s highest priority ● Finds heaps with deadline later than current time ● Move packets to next destination (tail of next pipe or VN) ● Calculate new deadlines and reinsert pipes into heap

2005/02/2313/27 The Core ● Multi-core configuration ● Next pipe may be on different core node ● Transfer packet descriptor to next node ● Packet contents buffered at entry core node and forwarded to destination upon delivery of packet

2005/02/2314/27 Scalability Issues ● Bandwidth limitation ● Traffic through ModelNet core is limited to cluster’s physical internal bandwidth ● Memory requirement ● ModelNet must buffer up to full bandwidth-delay product of target network ● Routing protocol ● Assumes perfect routing protocol: shortest path between all pairs of host ● Instantaneous discovery of new shortest path upon node or link failure

2005/02/2315/27 Setup for Experiments ● Core routers: ● 1.4 GHz Pentium-IIIs w/ 1 GB memory ● FreeBSD-4.5-STABLE ● Connected via 1GB switch ● Edge nodes: ● 1 GHz Pentium-IIIs w/ 256 MB memory ● Linux ● Connected via 100Mb/s Ethernet

2005/02/2316/27 Baseline Accuracy ● Accurately emulate target packet characteristics on hop-by-hop basis ● Use kernel logging to track performance and accuracy ● Run ModelNet scheduler at highest kernel priority ● Results: ● Each hop accurately emulated to granularity of hardware timer (100μs) ● Maintains accuracy up to 100% CPU utilization ● Future improvement: ● in subsequent hops use packet dept handling to correct for emulation errors

2005/02/2317/27 Capacity ● Quantify as function of load and # of hops ● Single core ● 1 Gb/s link ● 1-5 edge nodes ● Each with up to 24 netperf senders (24 VNs) and 24 receivers ● 1 Gb/s Ethernet connection ● For 1 hop: ● At 120 flows CPU is 50% used ● Network link is bottleneck ● >4 hops ● CPU is bottleneck

2005/02/2318/27 Additional Cores ● Deliver higher throughput ● increasing probability of packet’s path crossing node boundary  cross-core traffic ● Introduces communication overhead ● Ability to scale depends on ● Application communication characteristics ● Partitioning of topology (minimize cross-core traffic)

2005/02/2319/27 VN Multiplexing ● Mapping of VNs to physical edge nodes ● Enables larger-scale emulations ● Affects emulation accuracy and scalability ● Context switch overhead ● Scheduling behaviour ● Resource contention at edge nodes

2005/02/2320/27 Tradeoff: Accuracy vs. Scalability ● Impractical to model every packet and link for large portion of Internet ● Create controlled Internet-like execution context for applications ● Reduce overhead by making approximations that minimally impact application behaviour ● Ideally automate tradeoff to satisfy resource conditions and report degree of inaccuracy to user

2005/02/2321/27 Distillation ● Hop-by-hop emulation ● Distilled topology isomorphic to target network ● Accurate but highest per packet cost ● End-to-end emulation ● Collapse each path to single pipe  full mesh ● Lowest overhead ● Can capture raw network latency, bandwidth and loss rate ● Cannot emulate link contention among competing flows

2005/02/2322/27 Distillation ● Walk-in ● Preserve first walk-in links, replace interior by full mesh ● Breadth-first traversal to find successive frontier sets (first frontier set is set of all VNs) ● Each packet traverses at most (2*walk-in)+1 pipes ● Cannot model contention in interior ● Walk-out ● Model under-provisioned core ● Extend walk-in algorithm to preserve inner core ● Find “topological center” by generating successive frontiers until one of size one or zero is found ● Collapse paths between walk-in and walk-out

2005/02/2323/27 Distillation ● Ring topology ● 20 routers ● Interconnected at 20 Mb/s ● 20 VNs connected to each router by 2 Mb/s links ● VNs partitioned into generator and receiver sets ● Each generator sends to random receiver ● Hop by hop: 419 pipes ● End to end: 79,800 pipes ● Last-mile only: 400 edge links and 190 interior links

2005/02/2324/27 Changing Network Characteristics ● Evaluation of adaptive Internet systems ● User can ● directly incorporate generators for competing traffic ● accurate for emulation of “background” cross traffic ● consumes resources at edge nodes and bandwidth at core ● modify pipe parameters during emulation ● inject cross traffic by dynamically low overhead, scales independently of traffic rate ● does not capture all details of Internet packet dynamics (e.g. slow start, bursty traffic) ● not responsive to congestion  emulation error grows with link utilization level ● Fault injection

2005/02/2325/27 Case Studies ● Network of Gnutella clients ● 10,000 nodes (100 VNs for each of the 100 edge nodes) ● Support for emulation of ad hoc wireless environments ● Implemented but not presented in this paper ● CFS (1) ● Able to reproduce results from CFS implementation running on RON (2) testbed (published by another group) ● Replicated web services ● Replay of trace to IBM’s main website ● Able to show that one additional replica improves latency, third replica only marginally beneficial ● Ability to emulate contention on interior links crucial for obtaining these results ● Adaptive overlays ● ACDC: overlay that adapts to changing network conditions ● Similar experiment results obtained by ModelNet and ns2 (1) CFS - Cooperative File System (2) RON - Resilient Overlay Network (MIT)

2005/02/2326/27 Related Work ● Many other efforts on emulation ● Mostly focus on specific, static and small-scale systems ● Netbed (Emulab) ● Similar to ModelNet, except that ModelNet focuses on scalable emulation of large-scale networks ● Will integrate ModelNet efforts into Netbed ● Competing research by WASP (1) project ● Emulate network characteristics at end host ● Requires emulation software on all edge nodes ● Cannot capture congestion of multiple flows on single pipe (1) WASP – Wide Area Server Performance (J.-Y. Pan, H. Bhanoo, E. Nahum, M. Rosu, C, Faloutsos, and S. Seshan)

2005/02/2327/27 Summary ● ModelNet designed to support ● Unmodified applications ● Reproducible results ● Broad range of network topologies and dynamically changing characteristics ● Large-scale experiments ● Provided means of balancing accuracy and cost ● Presented case studies to show generality of approach