Ph.D. thesis defense. August 21, 2009. Ýmir Vigfússon Joint work with: Hussam Abu-Libdeh, Mahesh Balakrishnan, Ken Birman, Gregory Chockler, Qi Huang,

Slides:



Advertisements
Similar presentations
Sliver: A fast distributed slicing algorithm Ymir Vigfusson Cornell University Ymir Vigfusson Cornell University Vincent Gramoli EPFL & UniNE Switzerland.
Advertisements

PLATO: Predictive Latency- Aware Total Ordering Mahesh Balakrishnan Ken Birman Amar Phanishayee.
Dr. Multicast for Data Center Communication Scalability Ymir Vigfusson Hussam Abu-Libdeh Mahesh Balakrishnan Ken Birman Cornell University Yoav Tock IBM.
Internet Indirection Infrastructure (i3 ) Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh Surana UC Berkeley SIGCOMM 2002 Presented by:
P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
JetStream: Achieving Predictable Gossip Dissemination by Leveraging Social Network Principles Jay A. Patel 1, Indranil Gupta 1, and Noshir Contractor 2.
20/10/2006ALPAGE1 Ordered slicing of very large scale overlay networks Mark Jelasity University of Bologna, Italy Anne-Marie Kermarrec INRIA Rennes/IRISA,
Cognitive Publish/Subscribe for Heterogeneous Clouds Šarūnas Girdzijauskas, Swedish Institute of Computer Science (SICS) Joint work with:
VDR: Proactive element Conclusions VDR reaches 3.5% more nodes than VDR-R and 9% more nodes than our modified random walk routing strategy (RWR) VDR shows.
EL9331 Meridian: A Lightweight Network Location Service without Virtual Coordinates Bernard Wong, Aleksandrs Slivkins, Emin Gun Sirer SIGCOMM’05 ( Slides.
Ýmir Vigfússon IBM Research Haifa Labs Ken Birman Cornell University Qi Huang Cornell University Deepak Nataraj Cornell University.
Lecture 6 Overlay Networks CPE 401/601 Computer Network Systems slides are modified from Jennifer Rexford.
Farnoush Banaei-Kashani and Cyrus Shahabi Criticality-based Analysis and Design of Unstructured P2P Networks as “ Complex Systems ” Mohammad Al-Rifai.
Distributed Systems 2006 Group Communication I * *With material adapted from Ken Birman.
An Engineering Approach to Computer Networking
Ken Birman Cornell University. CS5410 Fall
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
ZIGZAG A Peer-to-Peer Architecture for Media Streaming By Duc A. Tran, Kien A. Hua and Tai T. Do Appear on “Journal On Selected Areas in Communications,
10/31/2007cs6221 Internet Indirection Infrastructure ( i3 ) Paper By Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh Sharma Sonesh Sharma.
Pastry: Scalable, decentralized object location and routing for large-scale peer-to-peer systems Antony Rowstron and Peter Druschel Proc. of the 18th IFIP/ACM.
Internet Indirection Infrastructure Ion Stoica UC Berkeley.
CS 268: Project Suggestions Ion Stoica February 6, 2003.
Probabilistic Data Aggregation Ling Huang, Ben Zhao, Anthony Joseph Sahara Retreat January, 2004.
A Real-Time Video Multicast Architecture for Assured Forwarding Services Ashraf Matrawy, Ioannis Lambadaris IEEE TRANSACTIONS ON MULTIMEDIA, AUGUST 2005.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Random Early Detection Gateways for Congestion Avoidance
Indirection Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm Slides.
An Evaluation of Scalable Application-level Multicast Using Peer-to-peer Overlays Miguel Castro, Michael B. Jones, Anne-Marie Kermarrec, Antony Rowstron,
Dr. Multicast for Data Center Communication Scalability Ymir Vigfusson Hussam Abu-Libdeh Mahesh Balakrishnan Ken Birman Cornell University Yoav Tock IBM.
Internet Indirection Infrastructure (i3) Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh Surana UC Berkeley SIGCOMM 2002.
 Structured peer to peer overlay networks are resilient – but not secure.  Even a small fraction of malicious nodes may result in failure of correct.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Receiver-driven Layered Multicast Paper by- Steven McCanne, Van Jacobson and Martin Vetterli – ACM SIGCOMM 1996 Presented By – Manoj Sivakumar.
Qi Huang Ýmir Vigfússon Ken Birman Haoyuan Li Cornell University IBM Research Haifa Labs Cornell University.
Building a massively scalable serverless VPN using Any Source Multicast Athanasios Douitsis Dimitrios Kalogeras National Technical University of Athens.
Study of the Relationship between Peer to Peer Systems and IP Multicasting From IEEE Communication Magazine January 2003 學號 :M 姓名 : 邱 秀 純.
1 Internet Protocol: Forwarding IP Datagrams Chapter 7.
Communication (II) Chapter 4
Lecture 2 TCP/IP Protocol Suite Reference: TCP/IP Protocol Suite, 4 th Edition (chapter 2) 1.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Information-Centric Networks07a-1 Week 7 / Paper 1 Internet Indirection Infrastructure –Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh.
Higashino Lab. Maximizing User Gain in Multi-flow Multicast Streaming on Overlay Networks Y.Nakamura, H.Yamaguchi and T.Higashino Graduate School of Information.
Secure Group Communication: Key Management by Robert Chirwa.
ACN: RED paper1 Random Early Detection Gateways for Congestion Avoidance Sally Floyd and Van Jacobson, IEEE Transactions on Networking, Vol.1, No. 4, (Aug.
Dynamic Source Routing in ad hoc wireless networks Alexander Stojanovic IST Lisabon 1.
TOMA: A Viable Solution for Large- Scale Multicast Service Support Li Lao, Jun-Hong Cui, and Mario Gerla UCLA and University of Connecticut Networking.
Implicit group messaging in peer-to-peer networks Daniel Cutting, 28th April 2006 Advanced Networks Research Group.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
CSC 600 Internetworking with TCP/IP Unit 7: IPv6 (ch. 33) Dr. Cheer-Sun Yang Spring 2001.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
DHT-based unicast for mobile ad hoc networks Thomas Zahn, Jochen Schiller Institute of Computer Science Freie Universitat Berlin 報告 : 羅世豪.
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
Information-Centric Networks10b-1 Week 10 / Paper 2 Hermes: a distributed event-based middleware architecture –P.R. Pietzuch, J.M. Bacon –ICDCS 2002 Workshops.
Cooperative Location- Sensing for Wireless Networks Authors : Haris Fretzagias Maria Papadopouli Presented by cychen IEEE International Conference on Pervasive.
Client Assignment in Content Dissemination Networks for Dynamic Data Shetal Shah Krithi Ramamritham Indian Institute of Technology Bombay Chinya Ravishankar.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
Cisco Confidential © 2013 Cisco and/or its affiliates. All rights reserved. 1 Cisco Networking Training (CCENT/CCT/CCNA R&S) Rick Rowe Ron Giannetti.
Multicast Scaling Laws with Hierarchical Cooperation Chenhui Hu, Xinbing Wang, Ding Nie, Jun Zhao Shanghai Jiao Tong University, China.
Optimization Problems in Wireless Coding Networks Alex Sprintson Computer Engineering Group Department of Electrical and Computer Engineering.
Distributed Caching and Adaptive Search in Multilayer P2P Networks Chen Wang, Li Xiao, Yunhao Liu, Pei Zheng The 24th International Conference on Distributed.
1 Group Communications: Host Group and IGMP Dr. Rocky K. C. Chang 19 March, 2002.
Internet Indirection Infrastructure (i3)
Multicast Outline Multicast Introduction and Motivation DVRMP.
Network Address Translation
Bridges and Extended LANs
REED : Robust, Efficient Filtering and Event Detection
An Engineering Approach to Computer Networking
Elmo Muhammad Shahbaz Lalith Suresh, Jennifer Rexford, Nick Feamster,
Presentation transcript:

Ph.D. thesis defense. August 21, Ýmir Vigfússon Joint work with: Hussam Abu-Libdeh, Mahesh Balakrishnan, Ken Birman, Gregory Chockler, Qi Huang, Jure Leskovec, Deepak Nataraj and Yoav Tock.

Most network traffic is unicast communication (one-to-one). But a lot of content is identical: – Audio streams, video broadcasts, system updates, etc. To minimize redundancy, would be nice to multicast communication (one-to-many). Group communication

Multicast by Unicast

IP Multicast

Gossip

Group communication MechanismDeliv. Speed Redun dancy Scalable in # users? Scalable in # groups? Point-to-point unicast SlowHighNoYes * IP Multicast (IPMC) FastNoneYesNo GossipSlowLowYesNo

Talk Outline Dr. Multicast (MCMD) – Group scalability in IP Multicast. Gossip Objects (GO) platform – Group scalability in gossip. Affinity – GO+MCMD optimizations based on group overlaps – Explore the properties of overlaps in data sets Conclusion

IP Multicast in Data Centers Smaller scale – well defined hierarchy Single administrative domain Firewalled – can ignore malicious behavior

Useful, but rarely used. Various problems: – Security – Stability – Scalability IP Multicast in Data Centers

Useful, but rarely used. Various problems: – Security – Stability – Scalability Bottom line: Administrators have no control over IPMC. – Thus they choose to disable it. IP Multicast in Data Centers

Policy: Enable control of IPMC. Transparency: Should be backward compatible with hardware and software. Scalability: Needs to scale in number of groups. Robustness: Solution should not bring in new problems. Wishlist

Acceptable Use Policy Assume a higher-level network management tool compiles policy into primitives. Explicitly allow a process (user) to use IPMC groups. allow-join(process ID, logical group ID) allow-send(process ID, logical group ID) Multicast by point-to-point unicast always permitted. Additional restraints. max-groups(process ID, limit) force-unicast(process ID, logical group ID)

Dr. Multicast (MCMD) Translates logical IPMC groups into either physical IPMC groups or multicast by unicast. Optimizes resource use.

Network Overhead Gossip Layer uses constant background bandwidth on average 2.1 kb/s

Application Overhead Insignificant overhead when mapping logical IPMC group to physical IPMC group.

Optimization questions BLACK Multicast Users Groups

Optimization Questions o Assign IPMC and unicast addresses s.t.  Min. receiver filtering  Min. network traffic  Min. # IPMC addresses  … yet have all messages delivered to interested parties

Optimization Questions o Assign IPMC and unicast addresses s.t.  % receiver filtering (hard)  Min. network traffic  # IPMC addresses (hard) Prefers sender load over receiver load. Control knobs part of administrative policy. (1)

MCMD Heuristic Groups in `user- interest’ space G RAD S TUDENTS F REE F OOD (1,1,1,1,1,0,1,0,1,0,1,1)(0,1,1,1,1,1,1,0,0,1,1,1)

MCMD Heuristic Groups in `user- interest’ space

MCMD Heuristic Groups in `user- interest’ space Filtering cost: MAX Sending cost:

MCMD Heuristic Groups in `user- interest’ space Filtering cost: MAX Sending cost: Unicast

MCMD Heuristic Groups in `user- interest’ space Unicast

Policy: Permits data center operators to selectively enable and control IPMC. Transparency: Standard IPMC interface to user, standard IGMP interface to network. Scalability: Uses IPMC when possible, otherwise point-to-point unicast. Robustness: Distributed, fault-tolerant service. Dr. Multicast

Talk Outline Dr. Multicast (MCMD) – Group scalability in IP Multicast. Gossip Objects (GO) platform – Group scalability in gossip. Affinity – GO+MCMD optimizations based on group overlaps – Explore the properties of overlaps in data sets Conclusion

Def: Exchange information with a random node once per round. Has appealing properties: – Bounded network traffic. – Scalable in group size. – Robust against failures. – Simple to code. When # of groups scales up, lose Gossip

GO Platform

Recipient selection: – Pick node d uniformly at random. Content selection: – Pick a rumor r uniformly at random. Random gossip

Gossip rumors usually small: – Incremental updates. – Few bytes hash of actual information. Packet size below MTU irrelevant. – Stack rumors in a packet. – But which ones? Rumors can be delivered indirectly. – Uninterested node might forward to an interested one. Observations

Recipient selection: – Pick node d uniformly at random. Content selection: – Fill packet with rumors picked uniformly at random. Random gossip w. stacking

Recipient selection: – Pick node d biased towards higher group traffic. Content selection: – Compute the utility of including rumor r Probability of r infecting an uninfected host when it reaches the target group. – Pick rumors to fill packet with probability proportional to utility. GO Heuristic

Recipient selection: – Pick node d biased towards higher group traffic. Content selection: – Compute the utility of including rumor r Probability of r infecting an uninfected host when it reaches the target group. – Pick rumors to fill packet with probability proportional to utility. GO Heuristic Include r ? Target group of r

IBM Websphere trace (1364 groups) Evaluation

IBM Websphere trace (1364 groups) Evaluation

IBM Websphere trace (1364 groups) Evaluation

Talk Outline Dr. Multicast (MCMD) – Group scalability in IP Multicast. Gossip Objects (GO) platform – Group scalability in gossip. Affinity – GO+MCMD optimizations based on group overlaps. – Explore the properties of overlaps in data sets. Conclusion

Both MCMD and GO have optimizations that depend on pairwise group overlaps (affinity). What degree of affinity should we expect to arise in the real-world? Affinity

What’s in a ``group’’ ? Social: – Yahoo! Groups – Amazon Recommendations – Wikipedia Edits – LiveJournal Communities – Mutual Interest Model Systems: – IBM Websphere – Hierarchy Model Data sets/models Users Groups

User and group degree distributions appear to follow power-laws. Power-law degree distributions often modeled by preferential attachment. Mutual Interest model: – Preferential attachment for bipartite graphs. Social data sets Groups Users

IBM Websphere has remarkable structure! Typical for real-world systems? – Only one data point. Systems Data Set

Distributed systems tend to be hierarchically structured. Hierarchy model – Motivated by Live Objects. Systems Data Set Thm: Expect a pair of users to overlap in groups.

Social: – Yahoo! Groups – Amazon Recommendations – Wikipedia Edits – LiveJournal Communities – Mutual Interest Model Systems: – IBM Websphere – Hierarchy Model Data sets/models Users Groups

Group similarity Def: Similarity of groups j,j’ is WikipediaLiveJournal

Group similarity Def: Similarity of groups j,j’ is Mutual Interest Model

Group similarity Def: Similarity of groups j,j’ is IBM WebsphereHierarchy model

Is the similarity we see a real effect? Consider a random graph with the same degree distributions as a baseline. Spokes model: Baseline overlap

Plot difference between data and Spokes At most 50 samples per group size pair. Data set/modelAvg. Δ value Wikipedia Amazon Yahoo! Groups Mutual Interest Model IBM Websphere Hierarchy Model Looking pretty random

Group communication important, but group scalability is lacking. Dr. Multicast harnesses IPMC in data centers. – Impact: HotNets paper + NSDI Best Poster award. – Solution being adopted by CISCO and IBM.

GO provides group scalability for gossip. – Impact: LADIS paper + Invited to the P2P Conference. – Platform will run under the Live Objects framework. Characterizing and exploiting group affinity in systems is exciting current and future work.

GO: Platform Support For Gossip Applications. With Ken Birman, Qi Huang, Deepak Nataraj. LADIS ‘09. Invited to P2P '09. Adaptively Parallelizing Distributed Range Queries. With Adam Silberstein, Brian Cooper, Rodrigo Fonseca. VLDB ’09. Slicing Distributed Systems. With Vincent Gramoli, Ken Birman, Anne-Marie Kermarrec, Robbert van Renesse. PODC ’08 (short). In IEEE Transactions on Computers Dr. Multicast: Rx for Data Center Communication Scalability. With Hussam Abu-Libdeh, Mahesh Balakrishnan, Ken Birman, Yoav Tock. Hotnets ‘08. LADIS ‘08. NSDI ‘08 (Best Poster). Hyperspaces for Object Clustering and Approximate Matching in P2P Overlays. With Bernard Wong, Emin Gun Sirer. HotOS ‘07.

Baseline overlap Plot difference between data and Spokes Cell: Avg. Δ over particular group sizes. Wikipedia

Baseline overlap Plot difference between data and Spokes Cell: Avg. Δ over particular group sizes. Websphere

Social affinity pretty random. Websphere has substantial overlaps. MCMD Heuristic does well in all cases: Affinity results

Group communication important, but group scalability is lacking. Dr. Multicast harnesses IPMC in data centers. – Impact: HotNets paper + NSDI Best Poster award. – Solution being adopted by CISCO and IBM. GO provides group scalability for gossip. – Impact: LADIS paper + Invited to the P2P Conference. – Platform will run under the Live Objects framework. Characterizing and exploiting group affinity in systems is exciting current and future work.

GO: Platform Support For Gossip Applications. With Ken Birman, Qi Huang, Deepak Nataraj. LADIS ‘09. Invited to P2P '09. Adaptively Parallelizing Distributed Range Queries. With Adam Silberstein, Brian Cooper, Rodrigo Fonseca. VLDB ‘09. Slicing Distributed Systems. With Vincent Gramoli, Ken Birman, Anne-Marie Kermarrec, Robbert van Renesse. PODC ‘08. In IEEE Transactions on Computers Dr. Multicast: Rx for Datacenter Communication Scalability. With Hussam Abu-Libdeh, Mahesh Balakrishnan, Ken Birman, Yoav Tock. Hotnets ‘08. LADIS ‘08. NSDI ‘08 (Best Poster). Hyperspaces for Object Clustering and Approximate Matching in P2P Overlays. With Bernard Wong, Emin Gun Sirer. HotOS ‘07.