Peter R Pietzuch and Sumeer Bhola IBM T J Watson Research Center Congestion Control in a Reliable Scalable.

Slides:



Advertisements
Similar presentations
A feedback–based scheme for improving TCP performance in Ad Hoc Wireless Networks Group : Manish Mehta Aditya Barve.
Advertisements

William Stallings Data and Computer Communications 7 th Edition Chapter 13 Congestion in Data Networks.
Congestion Control Reasons: - too many packets in the network and not enough buffer space S = rate at which packets are generated R = rate at which receivers.
Traffic Shaping Why traffic shaping? Isochronous shaping
Receiver-driven Layered Multicast S. McCanne, V. Jacobsen and M. Vetterli University of Calif, Berkeley and Lawrence Berkeley National Laboratory SIGCOMM.
Improving TCP Performance over Mobile Ad Hoc Networks by Exploiting Cross- Layer Information Awareness Xin Yu Department Of Computer Science New York University,
Congestion Control Created by M Bateman, A Ruddle & C Allison As part of the TCP View project.
TCP Congestion Control Dina Katabi & Sam Madden nms.csail.mit.edu/~dina 6.033, Spring 2014.
CS 408 Computer Networks Congestion Control (from Chapter 05)
L-21 Multicast. L -15; © Srinivasan Seshan, Overview What/Why Multicast IP Multicast Service Basics Multicast Routing Basics DVMRP Overlay.
Congestion Control An Overview -Jyothi Guntaka. Congestion  What is congestion ?  The aggregate demand for network resources exceeds the available capacity.
Receiver-driven Layered Multicast S. McCanne, V. Jacobsen and M. Vetterli SIGCOMM 1996.
1/14 Ad Hoc Networking, Eli M. Gafni and Dimitri P. Bertsekas Distributed Algorithm for Generating Loop-free Routes in Networks With Frequently.
1-1 CMPE 259 Sensor Networks Katia Obraczka Winter 2005 Transport Protocols.
A loss detection Service for Active Reliable Multicast Protocols Moufida MAIMOUR & C. D. PHAM INRIA-RESO RESAM UCB-Lyon – ENS Lyon INC’02, Plymouth Tuesday,
1 Congestion Control. Transport Layer3-2 Principles of Congestion Control Congestion: r informally: “too many sources sending too much data too fast for.
DTNLite: Reliable Data Delivery in Sensornets Rabin Patra and Sergiu Nedevschi UCB Nest Retreat 2004.
Dissemination protocols for large sensor networks Fan Ye, Haiyun Luo, Songwu Lu and Lixia Zhang Department of Computer Science UCLA Chien Kang Wu.
High-performance bulk data transfers with TCP Matei Ripeanu University of Chicago.
1 TCP Transport Control Protocol Reliable In-order delivery Flow control Responds to congestion “Nice” Protocol.
Stability of Hop by Hop Congestion Control Stephan Bohacek Dept. of Mathematics University of Southern California.
1 Emulating AQM from End Hosts Presenters: Syed Zaidi Ivor Rodrigues.
ACN: Congestion Control1 Congestion Control and Resource Allocation.
Hermes: A Distributed Event- Based Middleware Architecture Peter Pietzuch and Jean Bacon 1st DEBS Workshop, Vienna,
Reliable Transport Layers in Wireless Networks Mark Perillo Electrical and Computer Engineering.
L13: Sharing in network systems Dina Katabi Spring Some slides are from lectures by Nick Mckeown, Ion Stoica, Frans.
Distributed Publish/Subscribe Network Presented by: Yu-Ling Chang.
Receiver-driven Layered Multicast Paper by- Steven McCanne, Van Jacobson and Martin Vetterli – ACM SIGCOMM 1996 Presented By – Manoj Sivakumar.
Messaging Technologies Group: Yuzhou Xia Yi Tan Jianxiao Zhai.
Data Communications and Networking
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
1 Robust Transport Protocol for Dynamic High-Speed Networks: enhancing XCP approach Dino M. Lopez Pacheco INRIA RESO/LIP, ENS of Lyon, France Congduc Pham.
Publisher Mobility in Distributed Publish/Subscribe Systems Vinod Muthusamy, Milenko Petrovic, Dapeng Gao, Hans-Arno Jacobsen University of Toronto June.
Chapter 2 – X.25, Frame Relay & ATM. Switched Network Stations are not connected together necessarily by a single link Stations are typically far apart.
CONGESTION CONTROL and RESOURCE ALLOCATION. Definition Resource Allocation : Process by which network elements try to meet the competing demands that.
Content-Based Routing in Mobile Ad Hoc Networks Milenko Petrovic, Vinod Muthusamy, Hans-Arno Jacobsen University of Toronto July 18, 2005 MobiQuitous 2005.
1 Impact of transmission errors on TCP performance (Nitin Vaidya)
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Total Order in Content-based Publish/Subscribe Systems Joint work with: Vinod Muthusamy, Hans-Arno Jacobsen.
CprE 545 project proposal Long.  Introduction  Random linear code  LT-code  Application  Future work.
TCP Trunking: Design, Implementation and Performance H.T. Kung and S. Y. Wang.
1 Congestion Control Computer Networks. 2 Where are we?
MIDDLEWARE SYSTEMS RESEARCH GROUP Adaptive Content-based Routing In General Overlay Topologies Guoli Li, Vinod Muthusamy Hans-Arno Jacobsen Middleware.
1 Analysis of a window-based flow control mechanism based on TCP Vegas in heterogeneous network environment Hiroyuki Ohsaki Cybermedia Center, Osaka University,
Information-Centric Networks10b-1 Week 10 / Paper 2 Hermes: a distributed event-based middleware architecture –P.R. Pietzuch, J.M. Bacon –ICDCS 2002 Workshops.
AMQP, Message Broker Babu Ram Dawadi. overview Why MOM architecture? Messaging broker like RabbitMQ in brief RabbitMQ AMQP – What is it ?
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
CS440 Computer Networks 1 Packet Switching Neil Tang 10/6/2008.
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Distributed Ranked Data Dissemination in Social Networks Joint work with: Mo Sadoghi Vinod Muthusamy Hans-Arno.
Challenges in the Next Generation Internet Xin Yuan Department of Computer Science Florida State University
By Nitin Bahadur Gokul Nadathur Department of Computer Sciences University of Wisconsin-Madison Spring 2000.
CS640: Introduction to Computer Networks Aditya Akella Lecture 15 TCP Congestion Control.
Directed Diffusion: A Scalable and Robust Communication Paradigm for Sensor Networks Presented by Barath Raghavan.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
CS 268: Lecture 6 Scott Shenker and Ion Stoica
UNIT-V Transport Layer protocols for Ad Hoc Wireless Networks
Congestion Control (from Chapter 05)
Distributed Publish/Subscribe Network
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Congestion Control Reasons:
TCP Congestion Control
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Towards Predictable Datacenter Networks
Impact of transmission errors on TCP performance
Presentation transcript:

Peter R Pietzuch and Sumeer Bhola IBM T J Watson Research Center Congestion Control in a Reliable Scalable Message-Oriented Middleware Middleware’03, Rio de Janeiro, Brazil, June 2003

1 Message-Oriented Middleware Scalability –Asynchronous communication and loose synchronisation –Publish/Subscribe communication with filtering –Overlay network of message brokers Reliability –Guaranteed delivery semantics for messages –Resend messages lost due to failure Congestion –Publication rate may be too high  not enough capacity –Must guarantee stable behaviour of the system –Usually done with over-provisioning of the system B B B B B  Congestion Control for Overlay Networks

2 The Congestion Control Problem Characteristics of a MOM –Large message buffers at brokers –Burstiness due to application-level routing –TCP CC only deals with inter-broker connections B   B  B       Message Brokers App-level Queues Network Causes of Congestion –Under-provisioned system Network bandwidth (congestion at output queues) Broker processing capacity (congestion at input queues) –Additional resource requirement due to recovery

3 Outline Message-Oriented Middleware The Congestion Control Problem Gryphon –Congestion in Gryphon Congestion Control Protocols –Publisher-Driven Congestion Control –Subscriber-Driven Congestion Control Evaluation –Experimental Results Conclusion

4 The Gryphon MOM IBM’s MOM with publish/subscribe –Supports guaranteed in-order, exactly-once delivery IB SHB PHB SHB P P P P P P P S S S S S S S S S S S Brokers can be –Publisher-Hosting (PHB) –Subscriber-Hosting (SHB) –Intermediate (IB) Clients connect to brokers Publishers are aggregated to publishing endpoints (pubends) –Ordered stream of messages; maintained in persistent storage –NACKs for lost messages –IB’s cache stream data and satisfy NACKs

5 Congestion in Gryphon Congestion due to recovery after link failure –System never recovers from unstable state PHBPHB SHB1 SHB2 IB link failure msgs (kb/s) failure Requirements of CC in MOM –Independent from particular MOM implementation –No/little involvement of intermediate brokers –Detect congestion before queue overflow occurs –Ensure that recovering SHBs will eventually catch up

6 Congestion Control Protocols PHB-Driven CC Protocol (PDCC) –Feedback loop between pubends and downstream SHBs to monitor congestion –Limit publication rate of new messages to prevent congestion SHB-Driven CC Protocol (SDCC) –Monitor rate of progress at a recovering SHB –Limit rate of NACKs during recovery PHB SHB 1.Detect congestion in the system –Change in throughput used as a congestion metric –Reduction in throughput  queue build-up 2.Limit message rates to obtain stable behaviour

7 PHB-Driven Congestion Control Downstream Congestion Query Msgs (DCQ) –Trigger the congestion control mechanism –Periodically sent down the dissemination tree by pubend Upstream Congestion Alert Msgs (UCA) –Indicate congestion in the system –SHBs observe their message throughput and respond with a UCA msg when congested –Cause pubend to reduce its publication rate Properties –DCQ/UCA msgs treated as high-priority by brokers –Frequency of DCQ msg controls responsiveness of PDCC –No UCA msgs flow in an uncongested system –Similar to ATM ABR flow control

8 Processing of DCQ/UCA Msgs Publisher-Hosting Brokers –Hybrid additive/multiplicative increase/decrease scheme to change publication rate –Attempt to find optimal operating point Subscriber-Hosting Brokers –Non-recovering brokers should receive msgs at the publication rate –Recovering brokers should receives msgs at a higher rate Intermediate Brokers –Aggregate UCA msgs to prevent feedback explosion Pass up UCA msg from worst-congested SHB –Short-circuit first UCA msg for fast congestion notification PHB SHB IB

9 SHB-Driven Congestion Control Important to restrict NACK rate –Small NACK msg can trigger many large data msgs –Mechanism to control degree of resources spent on resent messages during recovery (recovery time) No support from other brokers necessary SHBs maintain NACK window –Decide which parts of the message stream to NACK –Observe recovery rate –Open/close NACK window additively depending on rate change –Similar to CC in TCP Vegas

10 Implementation in Gryphon Gryphon’s message stream is subdivided into ticks –Discrete time interval that can hold a single message –4 states: –Doubt Horizon: position in stream of first Q tick Rate of progress of the DH as a congestion metric –Independent from filtering and actual publication rate (D)ataMsg published (S)ilenceNo msg published (F)inalTick was garbage collected (Q)uestionUnknown (send NACK) doubt horizon time

11 Experimental Evaluation Network of dedicated broker machines –Simple topology (4 brokers) –Complex topology (9 brokers; asymmetric paths) –Hundreds of publishing and subscribing clients –Large queue sizes to maximize throughput (5-25 Mb) Congestion was created by –restricting bandwidth on inter-broker links –failing inter-broker links PHBPHB SHB1 SHB2 IB

12 Experiments I Congestion due to recovery after link failure –PDCC reduces publication rate –SDCC keeps recovery rate steady msgs (kb/s) link failure PHB SHB1 SHB2 recovery

13 Experiments II Congestion due to dynamic b/w limits of IB-SHB1 link –Publication rate follows link bottleneck –UCA msgs are received at pubend PHB SHB1 SHB low b/w med b/wlow b/w UCA msg msgs (kb/s) throughput ratio

14 Conclusions Reliable, content-based pub/sub needs congestion control –Characteristics different from traditional network cc Publisher-driven and subscriber-driven congestion control –Distinguish between recovering and non-recovering brokers –Hybrid additive and multiplicative adjustment –Normalised rate regardless of publication rate –NACK window for controlled recovery Future work –Fairness between many pubends in the same system –Dynamic adjustment of the DCQ rate

15 Thank you Any Questions?

16 Related Work TCP Congestion Control –Point-to-point congestion control only –Throughput-based congestion metric Reliable Multicast –Scalable feedback processing –Sender-based and receiver-based schemes –Feedback loops Multicast ABR ATM –Forward and Backward Resource Management Cells –BRM cell consolidation at ATM switches Overlay Networks –Little work done so far