John D. Kubiatowicz UC Berkeley

Slides:



Advertisements
Similar presentations
Brocade: Landmark Routing on Peer to Peer Networks Ben Y. Zhao Yitao Duan, Ling Huang, Anthony Joseph, John Kubiatowicz IPTPS, March 2002.
Advertisements

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Perspective on Overlay Networks Panel: Challenges of Computing on a Massive Scale Ben Y. Zhao FuDiCo 2002.
Tapestry: Decentralized Routing and Location SPAM Summer 2001 Ben Y. Zhao CS Division, U. C. Berkeley.
Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK
Implementation and Deployment of a Large-scale Network Infrastructure Ben Y. Zhao L. Huang, S. Rhea, J. Stribling, A. D. Joseph, J. D. Kubiatowicz EECS,
1 Accessing nearby copies of replicated objects Greg Plaxton, Rajmohan Rajaraman, Andrea Richa SPAA 1997.
Rapid Mobility via Type Indirection Ben Y. Zhao, Ling Huang, Anthony D. Joseph, John D. Kubiatowicz Computer Science Division, UC Berkeley IPTPS 2004.
The Oceanstore Regenerative Wide-area Location Mechanism Ben Zhao John Kubiatowicz Anthony Joseph Endeavor Retreat, June 2000.
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
Scribe: A Large-Scale and Decentralized Application-Level Multicast Infrastructure Miguel Castro, Peter Druschel, Anne-Marie Kermarrec, and Antony L. T.
Scalable Adaptive Data Dissemination Under Heterogeneous Environment Yan Chen, John Kubiatowicz and Ben Zhao UC Berkeley.
Each mesh represents a single hop on the route to a given root. Sibling nodes maintain pointers to each other. Each referrer has pointers to the desired.
Tapestry: Wide-area Location and Routing Ben Y. Zhao John Kubiatowicz Anthony D. Joseph U. C. Berkeley.
Distributed Object Location in a Dynamic Network Kirsten Hildrum, John D. Kubiatowicz, Satish Rao and Ben Y. Zhao.
Implementation of a Tapestry Node: The main components: The core router, utilizes the routing and object reference tables to handle messages, The node.
OceanStore: Data Security in an Insecure world John Kubiatowicz.
OceanStore Theoretical Issues and Open Problems John Kubiatowicz University of California at Berkeley.
Weaving a Tapestry Distributed Algorithms for Secure Node Integration, Routing and Fault Handling Ben Y. Zhao (John Kubiatowicz, Anthony Joseph) Fault-tolerant.
Opportunities for Continuous Tuning in a Global Scale File System John Kubiatowicz University of California at Berkeley.
Tapestry on PlanetLab Deployment Experiences and Applications Ben Zhao, Ling Huang, Anthony Joseph, John Kubiatowicz.
CITRIS Poster Supporting Wide-area Applications Complexities of global deployment  Network unreliability.
OceanStore/Tapestry Toward Global-Scale, Self-Repairing, Secure and Persistent Storage Anthony D. Joseph John Kubiatowicz Sahara Retreat, January 2003.
Or, Providing High Availability and Adaptability in a Decentralized System Tapestry: Fault-resilient Wide-area Location and Routing Issues Facing Wide-area.
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
OceanStore An Architecture for Global-Scale Persistent Storage Motivation Feature Application Specific Components - Secure Naming - Update - Access Control-
Tapestry: A Resilient Global-scale Overlay for Service Deployment Ben Y. Zhao, Ling Huang, Jeremy Stribling, Sean C. Rhea, Anthony D. Joseph, and John.
Long Term Durability with Seagull Hakim Weatherspoon (Joint work with Jeremy Stribling and OceanStore group) University of California, Berkeley ROC/Sahara/OceanStore.
Locality Aware Mechanisms for Large-scale Networks Ben Y. Zhao Anthony D. Joseph John D. Kubiatowicz UC Berkeley Future Directions in Distributed Computing.
Tapestry GTK Devaroy (07CS1012) Kintali Bala Kishan (07CS1024) G Rahul (07CS3009)
1 Plaxton Routing. 2 Introduction Plaxton routing is a scalable mechanism for accessing nearby copies of objects. Plaxton mesh is a data structure that.
Arnold N. Pears, CoRE Group Uppsala University 3 rd Swedish Networking Workshop Marholmen, September Why Tapestry is not Pastry Presenter.
Peer-to-Peer Overlay Networks. Outline Overview of P2P overlay networks Applications of overlay networks Classification of overlay networks – Structured.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
Application-Layer Multicast -presented by William Wong.
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
1 More on Plaxton routing There are n nodes, and log B n digits in the id, where B = 2 b The neighbor table of each node consists of - primary neighbors.
Tapestry: A Resilient Global-scale Overlay for Service Deployment 1 Ben Y. Zhao, Ling Huang, Jeremy Stribling, Sean C. Rhea, Anthony D. Joseph, and John.
Peer to Peer Network Design Discovery and Routing algorithms
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Tapestry : An Infrastructure for Fault-tolerant Wide-area Location and Routing Presenter : Lee Youn Do Oct 5, 2005 Ben Y.Zhao, John Kubiatowicz, and Anthony.
Self-stabilizing energy-efficient multicast for MANETs.
1 Plaxton Routing. 2 History Greg Plaxton, Rajmohan Rajaraman, Andrea Richa. Accessing nearby copies of replicated objects, SPAA 1997 Used in several.
CS791Aravind Elango Maintenance-Free Global Data Storage Sean Rhea, Chris Wells, Patrick Eaten, Dennis Geels, Ben Zhao, Hakim Weatherspoon and John Kubiatowicz.
OceanStore : An Architecture for Global-Scale Persistent Storage Jaewoo Kim, Youngho Yi, Minsik Cho.
Architecture and Algorithms for an IEEE 802
Internet Indirection Infrastructure (i3)
Option 2: The Oceanic Data Utility: Global-Scale Persistent Storage
Advanced Computer Networks
Introduction to Wireless Sensor Networks
Controlling the Cost of Reliability in Peer-to-Peer Overlays
IS3120 Network Communications Infrastructure
Introduction to Internet Routing
Intra-Domain Routing Jacob Strauss September 14, 2006.
Accessing nearby copies of replicated objects
Routing.
Infrastructure-based Resilient Routing
OceanStore: Data Security in an Insecure world
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
Locality Optimizations in Tapestry Sahara/OceanStore Winter Retreat
Rapid Mobility via Type Indirection
Dynamic Replica Placement for Scalable Content Delivery
BGP Interactions Jennifer Rexford
Replica Placement Heuristics of Application-level Multicast
Tapestry: Scalable and Fault-tolerant Routing and Location
Exploiting Routing Redundancy via Structured Peer-to-Peer Overlays
Routing.
Brocade: Landmark Routing on Peer to Peer Networks
Simultaneous Insertions in Tapestry
Presentation transcript:

John D. Kubiatowicz UC Berkeley Locality Aware Mechanisms for Large-scale Networks: The Tapestry Infrastructure John D. Kubiatowicz UC Berkeley Confine load on infrastructure Prospective on differences Heuristics vs. primitives more outline, flow control through talk more details on content, pictures diffusing impact, ripples are load placed on the network keeponfrogin@msn.com

Global Scale Applications Clear demand for global scale applications Exploiting collective resources File sharing, data dissemination, shared computation Wide-area issues Scalability, fault-handling, adaptability, manageability Decentralized Object Location and Routing (DOLR) Provides scalable message routing to objects Tahoe Retreat 06/02 John Kubiatowicz

Utility-based Storage: OceanStore Pac Bell Sprint IBM AT&T Canadian OceanStore Tahoe Retreat 06/02 John Kubiatowicz

Locality, Locality, Locality “The ability to exploit local resources over remote ones whenever possible” “-Centric” approach Client-centric, server-centric, data source-centric Requirements: Find data quickly, wherever it might reside Locate nearby object without global communication Permit rapid object migration Verifiable: can’t be sidetracked Data name cryptographically related to data Infrastructure level How to optimize load on infrastructure Not application specific Tahoe Retreat 06/02 John Kubiatowicz

Basic Tapestry Mesh 4 2 3 1 NodeID 0xEF34 NodeID 0xEF34 0xEF31 0xEFBA 0xFF37 0xE555 0xE530 0xEF44 0x0999 0x099F 0xE399 0xEF40 Tahoe Retreat 06/02 John Kubiatowicz

Randomization and Locality Tahoe Retreat 06/02 John Kubiatowicz

Parallel Insert Algorithms (SPAA ’02) Massive parallel insert is important We now have algorithms that handle “arbitrary simultaneous inserts” Construction of nearest-neighbor mesh links Log2 n message complexityfully operational routing mesh Objects kept available during this process Incremental movement of pointers Interesting Issue: Introduction service How does a new node find a gateway into the Tapestry? Tahoe Retreat 06/02 John Kubiatowicz

Highly Dynamic Systems Instability is the common case….! Small half-life for P2P apps (1 hour????) Congestion, flash crowds, misconfiguration, faults BGP convergence 3-30 mins! Mobile clients in semi-connected mode Must Use Overlay under instability! Must be somehow: Insensitive to faults and denial of service attacks Route around bad servers and ignore bad data Repairable infrastructure Easy to reconstruct routing and location information Without care, worst case: sub-optimal paths, network partitions, broken invariants  loss of availability Anthony D. Joseph: Loss from routing reconfiguration delays of minutes or more: BGP convergence time of 3 to 15 minutes Stress model is now semi-connected versus semi-disconnected (ie, bigger islands of connectivity Tahoe Retreat 06/02 John Kubiatowicz

Continuous Self-Repair All state is Soft State Continuous probing, selection of neighbors Periodic restoration of state Stability through statistics: Redundancy at many levels: Neighbor links, Object Roots, etc. Dynamic Stabilization: Integrate/remove themselves automatically Pointer state routed around faulty node Markov Models: What is a misbehaving router? Communication link? What level of redundancy necessary? Are we under attack? Tahoe Retreat 06/02 John Kubiatowicz

State Maintenance Maintenance of link conditions, routing state Example: soft-state beacons measure link conditions between neighbors (Tapestry, Scribe) Locality awareness Naïve: messages sent across overlay hops at same rate regardless of actual network distance Traffic can scale with length of overlay hop, possibly causing congestion Alternatives: Scale soft-state frequency w/ length of overlay hop External fault-detection / measurement platform Soft-stae is the mechanism, and possibly not a good one Tahoe Retreat 06/02 John Kubiatowicz

Locality-Based Heartbeats Tahoe Retreat 06/02 John Kubiatowicz

Redundant Routing Keep multiple routes for every link Pre-compute alternatives Quick adaptation (using up pre-computing) Destination-rooted spanning tree For each node N, reachable by all other nodes All paths union to form spanning tree rooted at N Convergence # of candidates for the next hop decreases by b every hop, where b = base of Tapestry ID Nodes routing to same ID converge as a function of the network density between them Tahoe Retreat 06/02 John Kubiatowicz

Convergence Tahoe Retreat 06/02 John Kubiatowicz

First Reachable Link Selection (FRLS) Tahoe Retreat 06/02 John Kubiatowicz

FRLS Reachability Tahoe Retreat 06/02 John Kubiatowicz

Proactive Multicast Tahoe Retreat 06/02 John Kubiatowicz

Self-Organizing Soft-State Replication Simple algorithms for placing replicas on nodes in the interior Intuition: locality properties of Tapestry help select positions for replicas Tapestry helps associate parents and children to build multicast tree Preliminary results show that this is effective Tahoe Retreat 06/02 John Kubiatowicz

The Living Network Model Gaia: a living network James Lovelock [1979] Large scale self-management Locally constrained interactions  scalability, performance Layered control structure Upward propagation of aggregate data Survival via active redundancy & self-repair Catastrophic failures handled by top level control (human interaction) Tahoe Retreat 06/02 John Kubiatowicz

Conclusion Decentralized Object Location and Routing (DOLR) Important to be able to route to objects … With Locality Use of local resources whenever possible! Continuous adaptation, repair System never quite fully stable Continuous convergence Keep objects as available as possible Tahoe Retreat 06/02 John Kubiatowicz