An Architecture for Optimal and Robust Composition of Services across the Wide-Area Internet Bhaskaran Raman Qualifying Examination Proposal Feb 12, 2001.

Slides:

Advertisements

Similar presentations

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.

Advertisements

Service Encapsulation in ICEBERG Bhaskaran Raman ICEBERG, EECS, U.C.Berkeley Presentation at Ericsson, Sweden, June 2001.

Availability and Performance in Wide-Area Service Composition Bhaskaran Raman EECS, U.C.Berkeley July 2002.

Packet Switching COM1337/3501 Textbook: Computer Networks: A Systems Approach, L. Peterson, B. Davie, Morgan Kaufmann Chapter 3.

Lecture 6 Overlay Networks CPE 401/601 Computer Network Systems slides are modified from Jennifer Rexford.

Wide-Area Service Composition: Evaluation of Availability and Scalability Bhaskaran Raman SAHARA, EECS, U.C.Berkeley Provider Q Texttoaudio Provider R.

Configurable restoration in overlay networks Matthew Caesar, Takashi Suzuki.

Efficient Streaming for Delay-tolerant Multimedia Applications Saraswathi Krithivasan Advisor: Prof. Sridhar Iyer.

June 3, A New Multipath Routing Protocol for Ad Hoc Wireless Networks Amit Gupta and Amit Vyas.

1 Switching and Forwarding Bridges and Extended LANs.

Department of Computer Engineering University of California at Santa Cruz Networking Systems (1) Hai Tao.

An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.

Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.

U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Informed Detour Selection Helps Reliability Boulat A. Bash.

New Routing Architectures Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.

Application Layer Multicast

Wide-Area Service Composition: Availability, Performance, and Scalability Bhaskaran Raman SAHARA, EECS, U.C.Berkeley SAHARA Retreat, Jan 2002.

Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.

CS 268: Project Suggestions Ion Stoica February 6, 2003.

Configurable restoration in overlay networks Matthew Caesar, Takashi Suzuki.

1 End-to-End Detection of Shared Bottlenecks Sridhar Machiraju and Weidong Cui Sahara Winter Retreat 2003.

1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.

1 Fault-tolerant Paths ISRG Retreat Z. Morley Mao 1/11/2000 Services Paths.

University Of Maryland1 A Study Of Cyclone Technology.

Internet-Scale Research at Universities Panel Session SAHARA Retreat, Jan 2002 Prof. Randy H. Katz, Bhaskaran Raman, Z. Morley Mao, Yan Chen.

Study of Distance Vector Routing Protocols for Mobile Ad Hoc Networks Yi Lu, Weichao Wang, Bharat Bhargava CERIAS and Department of Computer Sciences Purdue.

RRAPID: Real-time Recovery based on Active Probing, Introspection, and Decentralization Takashi Suzuki Matthew Caesar.

Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.

CS :: Fall 2003 Layered Coding and Networking Ketan Mayer-Patel.

Problem Definition Data path –Created by the Automatic Path Creation (APC) component –Service: program with well-defined interface –Operator: stateless.

Internet Indirection Infrastructure (i3) Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh Surana UC Berkeley SIGCOMM 2002.

Availability in Wide-Area Service Composition Bhaskaran Raman and Randy H. Katz SAHARA, EECS, U.C.Berkeley.

Scalable Construction of Resilient Overlays using Topology Information Mukund Seshadri Dr. Randy Katz.

Receiver-driven Layered Multicast Paper by- Steven McCanne, Van Jacobson and Martin Vetterli – ACM SIGCOMM 1996 Presented By – Manoj Sivakumar.

1 Meeyoung Cha, Sue Moon, Chong-Dae Park Aman Shaikh Placing Relay Nodes for Intra-Domain Path Diversity To appear in IEEE INFOCOM 2006.

1 Pertemuan 20 Teknik Routing Matakuliah: H0174/Jaringan Komputer Tahun: 2006 Versi: 1/0.

CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.

“Intra-Network Routing Scheme using Mobile Agents” by Ajay L. Thakur.

A Framework for Highly-Available Cascaded Real-Time Internet Services Bhaskaran Raman Qualifying Examination Proposal Feb 12, 2001 Examination Committee:

IDRM: Inter-Domain Routing Protocol for Mobile Ad Hoc Networks C.-K. Chau, J. Crowcroft, K.-W. Lee, S. H.Y. Wong.

CCNA 1 Module 10 Routing Fundamentals and Subnets.

A Framework for Highly-Available Session-Oriented Internet Services Bhaskaran Raman, Prof. Randy H. Katz {bhaskar, The ICEBERG Project.

A Transport Framework for Distributed Brokering Systems Shrideep Pallickara, Geoffrey Fox, John Yin, Gurhan Gunduz, Hongbin Liu, Ahmet Uyar, Mustafa Varank.

A Framework for Highly-Available Real-Time Internet Services Bhaskaran Raman, EECS, U.C.Berkeley.

A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.

Measurement and Modeling of Packet Loss in the Internet Maya Yajnik.

Performance and Availability in Wide-Area Service Composition Bhaskaran Raman ICEBERG, EECS, U.C.Berkeley Presentation at Siemens, June 2001.

Resilient Overlay Networks By David Andersen, Hari Balakrishnan, Frans Kaashoek, and Robert Morris MIT RON Paper from ACM Oct Advanced Operating.

Web Cache Redirection using a Layer-4 switch: Architecture, issues, tradeoffs, and trends Shirish Sathaye Vice-President of Engineering.

Load-Balancing Routing in Multichannel Hybrid Wireless Networks With Single Network Interface So, J.; Vaidya, N. H.; Vehicular Technology, IEEE Transactions.

A comparison of overlay routing and multihoming route control Hayoung OH

Wide-Area Service Composition: Performance, Availability and Scalability Bhaskaran Raman SAHARA, EECS, U.C.Berkeley Presentation at Ericsson, Jan 2002.

Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.

A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside

Improving TCP Performance over Wireless Networks

1 Wide Area Network Emulation on the Millennium Bhaskaran Raman Yan Chen Weidong Cui Randy Katz {bhaskar, yanchen, wdc, Millennium.

COS 420 Day 15. Agenda Finish Individualized Project Presentations on Thrusday Have Grading sheets to me by Friday Group Project Discussion Goals & Timelines.

CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.

Service Composition: Breakout Session Summary Randy Katz David Culler Summary: Bhaskaran Raman.

Network Computing Laboratory Load Balancing and Stability Issues in Algorithms for Service Composition Bhaskaran Raman & Randy H.Katz U.C Berkeley INFOCOM.

A Framework for Composing Services Across Independent Providers in the Wide-Area Internet Bhaskaran Raman Qualifying Examination Proposal Feb 12, 2001.

1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.

Placing Relay Nodes for Intra-Domain Path Diversity Meeyoung Cha Sue Moon Chong-Dae Park Aman Shaikh Proc. of IEEE INFOCOM 2006 Speaker 游鎮鴻.

OverQos: An Overlay based Architecture for Enhancing Internet Qos L Subramanian*, I Stoica*, H Balakrishnan +, R Katz* *UC Berkeley, MIT + USENIX NSDI’04,

1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.

Towards an integrated multimedia service hosting overlay Dongyan Xu Xuxian Jiang Proceedings of the 12th annual ACM international conference on Multimedia.

Fall, 2001CS 6401 Switching and Routing Outline Routing overview Store-and-Forward switches Virtual circuits vs. Datagram switching.

Accelerating Peer-to-Peer Networks for Video Streaming

Research Issues in Middleware (Bhaskar)

Presentation transcript:

An Architecture for Optimal and Robust Composition of Services across the Wide-Area Internet Bhaskaran Raman Qualifying Examination Proposal Feb 12, 2001 Examination Committee: Prof. Anthony D. Joseph (Chair) Prof. Randy H. Katz Prof. Ion Stoica Prof. David Brillinger

Technological Trend "Service and content providers play an increasing role in the value chain. The dominant part of the revenues moves from the network operator to the content provider. It is expected that value-added data services and content provisioning will create the main growth." Subscriber user Service broker Service mgt. Access network operator Core network operator Value added service providers Value added service providers Value added service providers Content providers Content providers Content providers Access Networks Cellular systems Cordless (DECT) Bluetooth DECT data Wireless LAN Wireless local loop SatelliteCableDSL

Service Composition Provider Q Texttospeech Provider R Cellular Phone repository Provider A Video-on-demand server Provider B Thin Client Provider A Provider B Replicated instances Transcoder

Service Composition Operational model: –Service providers deploy different services at various network locations –Next generation portals compose services Quickly enable new functionality on new devices Possibly through SLAs –Code is NOT mobile (mutually untrusting service providers) Composition across –Service providers –Wide-area Notion of service-level path

Problem Statement Optimal service-level path creation –Based on network and service performance Robust service-level paths –Detect and recover from failures

Challenges Optimal service-level path –When there are multiple instances of each intermediate service –How to learn the performance of each “leg” of the service-level path? Robustness –Detect and recover from failures –Possibly across the wide-area Internet –Important for long-lived sessions Several minutes/hours –Quick recovery required for real-time applications –How to provide appropriate backup?

Overall Architecture Composed services Hardware platform Peering relations, Overlay network Service clusters Logical platform Application plane Handling failures Service-level path creation Service location Network performance Detection Recovery

Problem Scope Services have no “hard” state –Sessions can be transferred from one service instance to another –This is assumed while handling failures –Assumption valid for a large set of applications Content streaming Transformation agents Logical operations: personalized redirection

Research Contributions Construction of optimal service-level path –Choice of instances of intermediate services based on network performance High availability for service-level paths –Mechanisms for detecting different kinds of failures –Creation of appropriate backup service-level path to recover from failures

Outline Related work Feasibility of failure detection over the wide-area Design of the framework Evaluation Research methodology and timeline Summary

Related work: Service Composition TACC (A. Fox, Berkeley) –Fault-tolerance within a single service-provider cluster for composed services –Based on cluster-manager/front-end based monitoring Simja (Berkeley), COTS (Stanford), Future Computing Environments (G. Tech) –Semantic issues addressed – which services can be composed –Based on service interface definitions, typing None address wide-area network performance or failure issues for long-lived composed sessions

Related work: Performance and Robustness Cluster-based approaches: TACC, AS1, LARD –AS1 (E. Amir, Berkeley): soft-state model for maintenance of long-lived sessions –LARD (Rice Univ): Web-server load balancing within a cluster –Fault management and load balancing within a cluster –Wide-area performance and failure issues not addressed Wide-area server selection: SPAND (M. Stemm, Berkeley), Harvest (Colorado), Probing mechanisms –Network and/or server performance discovery for selecting optimal replica –For composed services, require multi-leg measurement –For long-lived sessions, need recovery during session Routing around failures: Tapestry/CAN (Berkeley), RON (MIT) –Use redundancy in overlay networks –Recovery of composed service-level paths not addressed

Related work: summary TACC COTS, Future Comp. Env. WA server selection Tapestry, CAN RON Our System Composed Services Yes No Yes WA perf. adaptation No Yes?? Routing around failures No Yes

Outline Related work Feasibility of failure detection over the wide-area Design of the framework Evaluation Research methodology and timeline Summary

Failure detection in the wide-area: Analysis Provider A Video-on-demand server Provider B Thin Client Transcoder Peering relations, Overlay network Handling failures Service-level path creation Service location Network performance Detection Recovery

Failure detection in the wide-area: Analysis What are we doing? –Keeping track of the liveness of the WA Internet path Why is it important? –10% of Internet paths have 95% availability [Labovitz’99] –BGP could take several minutes to converge [Labovitz’00] –These could significantly affect real-time sessions based on service-level paths Why is it challenging? –Is there a notion of “failure”? –Given Internet cross-traffic and congestion? –What if losses could last for any duration with equal probability?

Failure detection: the trade-off Monitoring for liveness of path using keep-alive heartbeat Time Failure: detected by timeout Timeout period Time False-positive: failure detected incorrectly  unnecessary overhead Timeout period There’s a trade-off between time-to-detection and rate of false-positives

UDP-based keep-alive stream Geographically distributed hosts: –Berkeley, Stanford, UIUC, TU-Berlin, UNSW –Some trans-oceanic links, some within the US UDP heart-beat every 300ms between pairs Measure gaps between receipt of successive heart-beats

UDP-based keep-alive stream gaps above 900ms 6/11 False-positive rate: 6/11

UDP Experiments: What do we conclude? Significant number of outages > 30 seconds –Of the order of once a day But, 1.8 second outage  30 second outage with 50% prob. –If we react to 1.8 second outages by transferring a session can have much better availability than what’s possible today Provider A Video-on-demand server Provider B Thin Client Transcoder

UDP Experiments: What do we conclude? 1.8 seconds good enough for non-interactive applications –On-demand video/audio usually have 5-10 second buffers anyway 1.8 seconds not good for interactive/live applications –But definitely better than having the entire session cut- off

UDP Experiments: Validity of conclusions Results similar for other host-pairs: –Berkeley  Stanford, UIUC  Stanford, Berkeley  UNSW, TUBerlin  UNSW Results in parallel with other independent studies: –RTT spikes are isolated; undone in a couple of seconds [Acharya’96] –Correlation of packet losses does not persist beyond 1000ms [Yajnik’99]

Outline Related work Feasibility of failure detection over the wide-area Design of the framework Evaluation Research methodology and timeline Summary

Design of the Framework Question: how do we construct optimal, robust composed services? Need to: –Monitor for liveness –Monitor for performance –Compose services

Design Alternative: Hop-by-hop composition Active monitoring Service 1 Alternate service 1 Service 2 Service 1 Aggregated active monitoring Alternate service 1 Service 2

Design Alternative: Hop-by-hop composition No special support for composition Problems: –Sub-optimal service-level path –Alternative path may not be active –What if both ends are fixed? Service 1 Aggregated active monitoring Alternate service 1 Service 2

Architecture Internet Service cluster: compute cluster capable of running services Peering: monitoring & cascading Destination Source Composed services Hardware platform Peering relations, Overlay network Service clusters Logical platform Application plane

Architecture: Advantages Overlay nodes are clusters –Compute platform for services –Hierarchical monitoring Within cluster – for process/machine failures Across clusters – for network path failures –Aggregated monitoring Amortized overhead

The Overlay Network Peering relations, Overlay network Handling failures Service-level path creation The overlay network provides the context for service-level path creation and failure handling

Service Level Path Creation Peering relations, Overlay network Handling failures Service-level path creation Service location Network performance Detection Recovery Finding entry/exit

Finding entry and exit Independent of other mechanisms We do not address this directly We assume: –Entry or exit point can be rather static By appropriate placement of overlay nodes Pre-configuration possible at end-host –Or, can learn entry or exit point through another level of indirection

Service-Level Path Creation Connection-oriented network –Explicit session setup stage –There’s “switching” state at the intermediate nodes Need a connection-less protocol for connection setup Need to keep track of three things: –Network path liveness –Metric information (latency/bandwidth) for optimality decisions –Where services are located

Service-Level Path Creation Three levels of information exchange –Network path liveness Low overhead, but very frequent –Metric information: latency/bandwidth Higher overhead, not so frequent Bandwidth changes only once in several minutes [Balakrishnan’97] Latency changes appreciably only once in about an hour [Acharya’96] –Information about location of services in clusters Bulky, but does not change very often (once in a few weeks, or months) Could also use independent service location mechanism

Service Level Path Creation Link-state algorithm to exchange information –Lesser overhead of individual measurement  finer time-scale of measurement –Service-level path created at entry node –Link-state because it allows all-pair-shortest-path calculation in the graph

Service Level Path Creation Two ideas: –Path caching Remember what previous clients used Another use of clusters –Dynamic path optimization Since session-transfer is a first-order feature First path created need not be optimal

Session Recovery: Design Tradeoffs End-to-end vs. local-link End-to-end: –Pre-establishment possible –But, failure information has to propagate –And, performance of alternate path could have changed Local-link: –No need for information to propagate –But, additional overhead Overlay n/w Handling failures Service-level path creation Service location Network performance Detection Recovery Finding entry/exit

The Overlay Topology: Design Factors How many nodes? –Large number of nodes  lesser latency overhead –But scaling concerns Where to place nodes? –Close to edges so that hosts have points of entry and exit close to them –Close to backbone to take advantage of good connectivity Who to peer with? –Nature of connectivity –Least sharing of physical links among overlay links

Outline Related work Feasibility of failure detection over the wide-area Design of the framework Evaluation Research methodology and timeline Summary

Evaluation Important concerns: –Overhead of routing over the overlay network –Number of overlay nodes required Network modeling –AS-Jan2000, MBone, TIERS, Transit-Stub –Between nodes –Each node represents an Address Prefix [Jamin’00] –In comparison, the Internet has ~100,000 globally visible APs [Jamin’00]

Evaluation Overlay nodes –50, 100, 200: those with max. degree (backbone placement) –Peering between “nearby” overlay nodes –Physical links are not shared 1000 random pairs of hosts in original network –Overhead of routing over overlay network –No intermediate services used – for isolating the raw latency overhead

Evaluation: Effect of Size of Overlay 2.1% have over 5% overhead 2.2% have over 40% overhead 2.6% have over 60% overhead

Evaluation: Routing Overhead 2.1% of the end-host pairs have over 5% overhead 13.3% have over 50% overhead

Evaluation: What can we conclude? Number of overlay nodes required for low latency quite low: –200 for 5000 nodes –How many for 100,000 nodes? (number of APs on the Internet) –For linear growth, 4000 overlay nodes (in comparison, there are ~10,000 ASs on the Internet) –Note: growth has to be sub-linear Latency overhead of using overlay network quite low –Can get away with < 5% overhead in most nodes in some cases

Outline Related work Feasibility of failure detection over the wide-area Design of the framework Evaluation Research methodology and timeline Summary

Research Methodology Connection-oriented overlay network of clusters Session-transfer on failure Aggregation – amortization of overhead Simulation –Routing overhead –Effect of size of overlay Implementation –MP3 music for GSM cellular-phones –Codec service for IP- telephony Wide-area monitoring tradeoffs –How quickly can failures be detected? –Rate of false- positives Evaluation Analysis Design

Research Methodology: Metrics Feasibility: –Overhead End-to-end latency; bandwidth for information exchange –Scalability To a large number of client sessions –Stability Of optimality and session-transfer decisions Usefulness: –Latency to recovery Measure of effectiveness –Use of composability For building application functionality

Research Methodology: Approach Simulations, Trace-collection, Real implementation Simulation –For initial estimation of overhead Simulation + Traces –Bandwidth usage estimation, Stability study Real implementation –Scalability studies –Real services for use of composability Testbed –Collaborating with UNSW, TUBerlin

Research Plan: Phase I (0-6 months) Detailed analysis of –Latency and bandwidth overhead –Latency to recovery Use traces of latency/bandwidth over wide-area Develop real implementation in parallel –This is already in progress –Will give feedback for the analysis above

Research Plan: Phase II (6-12 months) Use implementation from Phase I –Deploy real services on the wide-area testbed –Analyze end-to-end effects of session-recovery –Examine scalability Use traces from Phase I to analyze stability of optimality decisions –Collect more traces of latency/bandwidth

Research Plan: Phase III (12-18 months) Use feedback from deployment of real services to refine architecture Analyze placement strategies –Use wide-area measurements and traces from phases I and II Write, graduate… Appropriate conferences and workshops: NOSSDAV, ACM Multimedia, SOSP, INFOCOM, SIGMETRICS, SIGCOMM

Summary Logical overlay network of service clusters –Middleware platform for service deployment –Optimal service-level path creation –Failure detection and recovery Failures can be detected in O(1sec) over the wide- area –Useful for many applications Number of overlay nodes required seems reasonable –O(1000s) for minimal latency overhead Several interesting issues to look at: –Overhead, Scalability, Stability

References [Labovitz’99] C. Labovitz, A. Ahuja, and F. Jahanian, “Experimental Study of Internet Stability and Wide-Area Network Failures”, Proc. Of FTCS’99 [Labovitz’00] C. Labovitz, A. Ahuja, A. Bose, and F. Jahanian, “Delayed Internet Routing Convergence”, Proc. SIGCOMM’00 [Acharya’96] A. Acharya and J. Saltz, “A Study of Internet Round- Trip Delay”, Technical Report CS-TR-3736, U. of Maryland [Yajnik’99] M. Yajnik, S. Moon, J. Kurose, and D. Towsley, “Measurement and Modeling of the Temporal Dependence in Packet Loss”, Proc. INFOCOM’99 [Balakrishnan’97] H. Balakrishnan, S. Seshan, M. Stemm, and R. H. Katz, “Analyzing Stability in Wide-Area Network Performance”, Proc. SIGMETRICS’97

Related work: Routing around Failures Tapestry, CAN (Berkeley) –Locate replicated objects in the wide-area using an overlay network –Overlay nodes named such that routing table is small –Redundancy in the overlay network helps in availability in the presence of failures Resilient Overlay Networks (MIT) –Small number (~50) of nodes on the Internet form a redundant overlay network –Application level routing metrics, and quick recovery from failures Recovery of composed service-level paths not addressed

UDP-based keep-alive stream 11, 8 5, 5 6, 3 85 gaps above 900ms 6/113/8 False-positive rate: 6/11, 3/8

Open Issues Using application level information for dynamic path construction –Some transformations may not be good Service interfaces for automated composition and correctness checking Soft-state based management of application level state

UDP-based keep-alive stream HB destinationHB sourceTotal timeNum. False positives Num. Failures BerkeleyUNSW130:48: UNSWBerkeley130:51:4598 BerkeleyTU-Berlin130:49:46278 TU-BerlinBerkeley130:50: TU-BerlinUNSW130:48: UNSWTU-Berlin130:46:38245 BerkeleyStanford124:21: StanfordBerkeley124:21:1926 StanfordUIUC89:53:1741 UIUCStanford76:39:10741 BerkeleyUIUC89:54:1165 UIUCBerkeley76:39:4035

Example services that can be composed Content streaming –Audio/Video Transcoding –Format translation –Text to speech Rate adaptation –Lossy encoding –Reduction of frame size Adding FEC

Example services that can be composed Unicast to multicast and vice-versa Personalized redirection –Between multiple user devices –Service handoff Adding semantic content –E.g., song title, or classification –Ads 

Some more composed services MP3  PCM  GSM MP3  Reduce quality  Add FEC for wireless link Video  Redirector (Handheld/Desktop)

Link State vs. Distance Vector Link State: –Quicker reaction to failures –Failure information need not propagate –Multiple metrics possible (app level) –Important reason: need distances from intermediate nodes for composition Distance Vector requires lesser storage –Not true with path vector Why not Path Vector? –Convergence could take a long time [Labovitz’00]