Locality Aware Mechanisms for Large-scale Networks Ben Y. Zhao Anthony D. Joseph John D. Kubiatowicz UC Berkeley Future Directions in Distributed Computing.

Slides:



Advertisements
Similar presentations
Brocade: Landmark Routing on Peer to Peer Networks Ben Y. Zhao Yitao Duan, Ling Huang, Anthony Joseph, John Kubiatowicz IPTPS, March 2002.
Advertisements

A Construction of Locality-Aware Overlay Network: mOverlay and Its Performance Found in: IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 22, NO.
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Supporting Rapid Mobility via Locality in an Overlay Network Ben Y. Zhao Anthony D. Joseph John D. Kubiatowicz Sahara / OceanStore Joint Session June 10,
Perspective on Overlay Networks Panel: Challenges of Computing on a Massive Scale Ben Y. Zhao FuDiCo 2002.
Tapestry: Scalable and Fault-tolerant Routing and Location Stanford Networking Seminar October 2001 Ben Y. Zhao
Tapestry: Decentralized Routing and Location SPAM Summer 2001 Ben Y. Zhao CS Division, U. C. Berkeley.
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Multi-Layer Switching Layers 1, 2, and 3. Cisco Hierarchical Model Access Layer –Workgroup –Access layer aggregation and L3/L4 services Distribution Layer.
*Towards A Common API for Structured Peer-to-Peer Overlays Frank Dabek, Ben Y. Zhao, Peter Druschel, John Kubiatowicz, Ion Stoica MIT, U. C. Berkeley,
1 Accessing nearby copies of replicated objects Greg Plaxton, Rajmohan Rajaraman, Andrea Richa SPAA 1997.
Rapid Mobility via Type Indirection Ben Y. Zhao, Ling Huang, Anthony D. Joseph, John D. Kubiatowicz Computer Science Division, UC Berkeley IPTPS 2004.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Outline for today Structured overlay as infrastructures Survey of design solutions Analysis of designs.
The Oceanstore Regenerative Wide-area Location Mechanism Ben Zhao John Kubiatowicz Anthony Joseph Endeavor Retreat, June 2000.
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
Scribe: A Large-Scale and Decentralized Application-Level Multicast Infrastructure Miguel Castro, Peter Druschel, Anne-Marie Kermarrec, and Antony L. T.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Pastry: Scalable, decentralized object location and routing for large-scale peer-to-peer systems Antony Rowstron and Peter Druschel Proc. of the 18th IFIP/ACM.
Internet Indirection Infrastructure Ion Stoica UC Berkeley.
Brocade Landmark Routing on Structured P2P Overlays Ben Zhao, Yitao Duan, Ling Huang Anthony Joseph and John Kubiatowicz (IPTPS 2002) Goals Improve routing.
Supporting Rapid Mobility via Locality in an Overlay Network Ben Y. Zhao Anthony D. Joseph John D. Kubiatowicz Sahara / OceanStore Joint Session June 10,
Scalable Adaptive Data Dissemination Under Heterogeneous Environment Yan Chen, John Kubiatowicz and Ben Zhao UC Berkeley.
Each mesh represents a single hop on the route to a given root. Sibling nodes maintain pointers to each other. Each referrer has pointers to the desired.
Tapestry: Wide-area Location and Routing Ben Y. Zhao John Kubiatowicz Anthony D. Joseph U. C. Berkeley.
OSD Metadata Management
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Object Naming & Content based Object Search 2/3/2003.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
Weaving a Tapestry Distributed Algorithms for Secure Node Integration, Routing and Fault Handling Ben Y. Zhao (John Kubiatowicz, Anthony Joseph) Fault-tolerant.
Tapestry on PlanetLab Deployment Experiences and Applications Ben Zhao, Ling Huang, Anthony Joseph, John Kubiatowicz.
Decentralized Location Services CS273 Guest Lecture April 24, 2001 Ben Y. Zhao.
OceanStore/Tapestry Toward Global-Scale, Self-Repairing, Secure and Persistent Storage Anthony D. Joseph John Kubiatowicz Sahara Retreat, January 2003.
Or, Providing High Availability and Adaptability in a Decentralized System Tapestry: Fault-resilient Wide-area Location and Routing Issues Facing Wide-area.
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
Tapestry: A Resilient Global-scale Overlay for Service Deployment Ben Y. Zhao, Ling Huang, Jeremy Stribling, Sean C. Rhea, Anthony D. Joseph, and John.
Tapestry: Decentralized Routing and Location System Seminar S ‘01 Ben Y. Zhao CS Division, U. C. Berkeley.
Tapestry GTK Devaroy (07CS1012) Kintali Bala Kishan (07CS1024) G Rahul (07CS3009)
Arnold N. Pears, CoRE Group Uppsala University 3 rd Swedish Networking Workshop Marholmen, September Why Tapestry is not Pastry Presenter.
Peer-to-Peer Overlay Networks. Outline Overview of P2P overlay networks Applications of overlay networks Classification of overlay networks – Structured.
Distributed Systems Concepts and Design Chapter 10: Peer-to-Peer Systems Bruce Hammer, Steve Wallis, Raymond Ho.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
Brocade Landmark Routing on P2P Networks Gisik Kwon April 9, 2002.
Information-Centric Networks07a-1 Week 7 / Paper 1 Internet Indirection Infrastructure –Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh.
Vincent Matossian September 21st 2001 ECE 579 An Overview of Decentralized Discovery mechanisms.
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
1 More on Plaxton routing There are n nodes, and log B n digits in the id, where B = 2 b The neighbor table of each node consists of - primary neighbors.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Peer to Peer Network Design Discovery and Routing algorithms
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Tapestry : An Infrastructure for Fault-tolerant Wide-area Location and Routing Presenter : Lee Youn Do Oct 5, 2005 Ben Y.Zhao, John Kubiatowicz, and Anthony.
Bruce Hammer, Steve Wallis, Raymond Ho
Self-stabilizing energy-efficient multicast for MANETs.
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Large Scale Sharing Marco F. Duarte COMP 520: Distributed Systems September 19, 2004.
1 Plaxton Routing. 2 History Greg Plaxton, Rajmohan Rajaraman, Andrea Richa. Accessing nearby copies of replicated objects, SPAA 1997 Used in several.
Fabián E. Bustamante, Fall 2005 A brief introduction to Pastry Based on: A. Rowstron and P. Druschel, Pastry: Scalable, decentralized object location and.
Accessing nearby copies of replicated objects
John D. Kubiatowicz UC Berkeley
Rapid Mobility via Type Indirection
Dynamic Replica Placement for Scalable Content Delivery
Tapestry: Scalable and Fault-tolerant Routing and Location
Exploiting Routing Redundancy via Structured Peer-to-Peer Overlays
Brocade: Landmark Routing on Peer to Peer Networks
Presentation transcript:

Locality Aware Mechanisms for Large-scale Networks Ben Y. Zhao Anthony D. Joseph John D. Kubiatowicz UC Berkeley Future Directions in Distributed Computing June 2002

ZXL FuDiCo, June 2002 Global Scale Applications Clear demand for global scale applications – Exploiting collective resources – File sharing, data dissemination, shared computation Wide-area issues – Scalability, fault-handling, adaptability, manageability Decentralized Object Location and Routing (DOLR) – Provides scalable message routing to object location – Spawns numerous apps: file systems, multicast, web caches, mobility systems Can we make them (& their apps) more scalable?

ZXL FuDiCo, June 2002 Outline Decentralized location and scalability Locality-awareness in design and policy – Proximity metrics – Data replication – Service replication – State maintenance The Gaia network model

ZXL FuDiCo, June 2002 Decentralized Location Review Goal: route message to object given unique OID Key Properties: – Scalable overlay routing mesh (metric: RDP) – (sub-)Logarithmic storage per node – (sub-)Logarithmic overlay hops per route – Scalable algorithmic core One perspective on differentiation – Proximity routing versus random route choice – Where and how object location info is stored

ZXL FuDiCo, June 2002 Wide-area Scalability Large number of 2 nd generation P2P apps – Multicast: Bayeux, CAN multicast, Scribe – FS: OceanStore, PAST, Mnemosyne, CFS – Others, I3, Mobile Tapestry, Squirrel, etc… Apps differ in scalability properties – Question: Are they truly scalable? Or … Why is a wide-area application or networking substrate more scalable than another?

ZXL FuDiCo, June 2002 Locality Awareness Optimizing infrastructure load for scalability “The ability to exploit local resources over remote ones whenever possible” “-Centric” approach – Client-centric, server-centric, data source-centric Intuition: raindrops rippling in a pond – Client requests result in load on overall network – Dampen / confine the impact of network operations: storage / communication costs, faults

ZXL FuDiCo, June 2002 Another Perspective More transparent layering beneficial – DiffServ, resource allocation Overlays / IP  inefficient interaction – Expose IP level network scale – Attempt to correlate interlayer behavior Old concept, new application (WANs)

ZXL FuDiCo, June 2002 Outline Decentralized location and scalability Locality-awareness in design and policy – Proximity metrics – Data replication – Service replication – State maintenance The Gaia network model

ZXL FuDiCo, June 2002 Routing Proximity Building a routing table w/ constraints Proximity neighbor selection (Tapestry / Pastry) – Integrate proximity metrics into routing construction – Distributed algorithms approx. global knowledge – “Locally optimal” routing tables maintained Proximity routing (CAN / Chord) – Local heuristics determine preference among routes – Performance gain for long-running stable networks Locality awareness – Minimize wide-area traffic, bandwidth utilization, congestion, and sensitivity to wide-area faults

ZXL FuDiCo, June 2002 Data Replication Storage and consistency vs. performance Providing local resolution of client requests – Caching: Akamai – Streaming media: multicast – Process and file migration In context of object location systems – # of replicates (metadata or data) – Placement of replicates

ZXL FuDiCo, June 2002 Replication and Placement Largely independent of actual network substrate CAN: – Replicated data propagates back to hotspot source – Long term performance gain for immutable data Pastry / Chord / Kademlia: – Distribute replicates of immutable objects randomly Tapestry / PRR (Plaxton, Rajamaran, Richa) – Replicate Log(N) object pointers – More replicates around location of object Application level: OceanStore, SCAN,...

ZXL FuDiCo, June 2002 Implications Static approaches vs. run-time optimizations – Overlays increasingly dynamic in membership – Run-time schemes too slow to be effective – Approach must consider underlying network scale node-to-node object replicate propagation is high in storage cost and slow to stabilize Locality awareness of replication schemes – Object replication vs. object pointer replication – Indirection allows mutability / flexibility

ZXL FuDiCo, June 2002 Service Replication Distribute server processing close to clients Overlay multicast example: Scribe and Bayeux Scribe: membership handled by any node – New members find nearest node to attach to via Pastry location – Localize membership traffic, reduce server load Bayeux: explicitly replicated root nodes – Members find nearest root, then attach to MC tree – Self-organizing trees Other examples: dynamic transcoding

ZXL FuDiCo, June 2002 Service Replication … Scribe Bayeux C R C R R Join/leave Msgs

ZXL FuDiCo, June 2002 State Maintenance Maintenance of link conditions, routing state Example: soft-state beacons measure link conditions between neighbors (Tapestry, Scribe)  Locality awareness – Naïve: messages sent across overlay hops at same rate regardless of actual network distance – Traffic can scale with length of overlay hop, possibly causing congestion Alternatives: – Scale soft-state frequency w/ length of overlay hop – External fault-detection / measurement platform

ZXL FuDiCo, June 2002 Outline Decentralized location and scalability Locality-awareness in design and policy – Proximity metrics – Data replication – Service replication – State maintenance The Gaia network model

ZXL FuDiCo, June 2002 Scalable Network Core for UbiComp My (infrastructure-based) perspective: – Highly available (scalable) backbone infrastructure – Service large # of heterogeneous clients Solving some problems – Devices decreasing in size – Devices Increasing in power (Moore’s law) – Network connectivity expanding Still need: management (availability/performance) – Shorter MTBF, changing network, security / attacks – Sheer # of devices makes normal solutions infeasible

ZXL FuDiCo, June 2002 Layered Management Proactive redundancy, exploit plentiful resources – Fault-detection not enough Precomputation of redundancy + rapid adaptation E.g. more replication for slow-changing data, active duplication of messages – Continuous probing for alternatives – Minimal degradation of availability during faults Making maintenance scalable – Stacked layers of maintenance domains – Each layer uses DOLRs for self-organization / comm. – Aggregate data at each level (GRID, SDS)

ZXL FuDiCo, June 2002 Layered Management

ZXL FuDiCo, June 2002 Scalable Protection Availability mechanisms need protection – Redundancy generation – Detection and triggering of faults Intelligent exception triggering – Pattern-based heuristics, tracebacks detect attack flows – Group agreement protocols make authoritative decisions – Aggregate warnings propagate up notification hierarchy Active countermeasures – Isolate and remove malicious nodes – Switch to redundant path / plane (GUID aliasing in Tapestry, realities in CAN)

ZXL FuDiCo, June 2002 The Living Network Model Gaia: a living network – James Lovelock [1979] Large scale self-management – Locally constrained interactions  scalability, performance – Layered control structure – Upward propagation of aggregate data Survival via active redundancy & self-repair – Catastrophic failures handled by top level control (human interaction)

ZXL FuDiCo, June 2002 For More Information CAN / CAN multicast – Chord / CFS – Pastry / Past / Scribe / Squirrel – Tapestry / OceanStore / Bayeux / Brocade – – Other relevant material: –