Dynamic Replica Placement for Scalable Content Delivery

Slides:



Advertisements
Similar presentations
Brocade: Landmark Routing on Peer to Peer Networks Ben Y. Zhao Yitao Duan, Ling Huang, Anthony Joseph, John Kubiatowicz IPTPS, March 2002.
Advertisements

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Scalable Content-Addressable Network Lintao Liu
1 Efficient and Robust Streaming Provisioning in VPNs Z. Morley Mao David Johnson Oliver Spatscheck Kobus van der Merwe Jia Wang.
1 Accessing nearby copies of replicated objects Greg Plaxton, Rajmohan Rajaraman, Andrea Richa SPAA 1997.
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
Scribe: A Large-Scale and Decentralized Application-Level Multicast Infrastructure Miguel Castro, Peter Druschel, Anne-Marie Kermarrec, and Antony L. T.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Prefix Caching assisted Periodic Broadcast for Streaming Popular Videos Yang Guo, Subhabrata Sen, and Don Towsley.
1 Caching/storage problems and solutions in wireless sensor network Bin Tang CSE 658 Seminar on Wireless and Mobile Networking.
Scalable Adaptive Data Dissemination Under Heterogeneous Environment Yan Chen, John Kubiatowicz and Ben Zhao UC Berkeley.
Application Layer Multicast
Distributed Lookup Systems
Collaborative Web Caching Based on Proxy Affinities Jiong Yang, Wei Wang in T. J.Watson Research Center Richard Muntz in Computer Science Department of.
1 An Overlay Scheme for Streaming Media Distribution Using Minimum Spanning Tree Properties Journal of Internet Technology Volume 5(2004) No.4 Reporter.
Object Naming & Content based Object Search 2/3/2003.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
CS218 – Final Project A “Small-Scale” Application- Level Multicast Tree Protocol Jason Lee, Lih Chen & Prabash Nanayakkara Tutor: Li Lao.
Dynamic Multicast Tree Construction in OceanStore Puneet Mehra and Satrajit Chatterjee Advanced Topics in Computer Systems Final Project EECS Department,
Tapestry on PlanetLab Deployment Experiences and Applications Ben Zhao, Ling Huang, Anthony Joseph, John Kubiatowicz.
1 Seminar: Information Management in the Web Gnutella, Freenet and more: an overview of file sharing architectures Thomas Zahn.
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
ICDE A Peer-to-peer Framework for Caching Range Queries Ozgur D. Sahin Abhishek Gupta Divyakant Agrawal Amr El Abbadi Department of Computer Science.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
Barracuda Load Balancer Server Availability and Scalability.
SCAN: a Scalable, Adaptive, Secure and Network-aware Content Distribution Network Yan Chen CS Department Northwestern University.
Network Aware Resource Allocation in Distributed Clouds.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
Higashino Lab. Maximizing User Gain in Multi-flow Multicast Streaming on Overlay Networks Y.Nakamura, H.Yamaguchi and T.Higashino Graduate School of Information.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
De-Nian Young Ming-Syan Chen IEEE Transactions on Mobile Computing Slide content thanks in part to Yu-Hsun Chen, University of Taiwan.
TOMA: A Viable Solution for Large- Scale Multicast Service Support Li Lao, Jun-Hong Cui, and Mario Gerla UCLA and University of Connecticut Networking.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
1 An Adaptive File Distribution Algorithm for Wide Area Network Takashi Hoshino, Kenjiro Taura, Takashi Chikayama University of Tokyo.
KAIS T On the problem of placing Mobility Anchor Points in Wireless Mesh Networks Lei Wu & Bjorn Lanfeldt, Wireless Mesh Community Networks Workshop, 2006.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Peer to Peer Network Design Discovery and Routing algorithms
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
An overlay for latency gradated multicasting Anwitaman Datta SCE, NTU Singapore Ion Stoica, Mike Franklin EECS, UC Berkeley
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
1 Plaxton Routing. 2 History Greg Plaxton, Rajmohan Rajaraman, Andrea Richa. Accessing nearby copies of replicated objects, SPAA 1997 Used in several.
Brocade: Landmark Routing on Overlay Networks
Internet Indirection Infrastructure (i3)
A Study of Group-Tree Matching in Large Scale Group Communications
Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks Sushma Maramreddy.
Plethora: Infrastructure and System Design
Accessing nearby copies of replicated objects
Turning Heterogeneity into an Advantage in Overlay Routing
Host Multicast: A Framework for Delivering Multicast to End Users
Early Measurements of a Cluster-based Architecture for P2P Systems
SCOPE: Scalable Consistency in Structured P2P Systems
EE 122: Peer-to-Peer (P2P) Networks
Overlay Networking Overview.
Edge computing (1) Content Distribution Networks
John D. Kubiatowicz UC Berkeley
Locality Optimizations in Tapestry Sahara/OceanStore Winter Retreat
Rapid Mobility via Type Indirection
Replica Placement Heuristics of Application-level Multicast
Deterministic and Semantically Organized Network Topology
EE 122: Lecture 22 (Overlay Networks)
Exploiting Routing Redundancy via Structured Peer-to-Peer Overlays
Brocade: Landmark Routing on Peer to Peer Networks
Presentation transcript:

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, kubitron}@CS.Berkeley.EDU EECS Department UC Berkeley

Motivation Scenario data source replica data plane cache always update adaptive coherence always update replica data plane client CDN server Web content server network plane

Goal and Challenges Dynamic choice of number and location of replicas Provide content distribution to clients with good Quality of Service (QoS) while retaining efficient and balanced resource consumption of the underlying infrastructure Dynamic choice of number and location of replicas Clients’ QoS constraints Servers’ capacity constraints Efficient update dissemination Delay Bandwidth consumption Scalability: millions of objects, clients and servers No global network topology knowledge We are going to compare several approaches, dynamic replica placement… And how they are self-organized into d-tree and how they are maintained through soft-state mgt.

Previous Work (Replica Placement) Focused on static replica placement Clients’ distributions and access patterns known in advance Assume global IP network topology Data Location via DNS-redirection Highly inefficient (this is a “hack”) Centralized CDN name server cannot record replica locations

Previous Work (Info Dissemination) No inter-domain IP multicast Application-level multicast (ALM) unscalable Root maintains states for all children (Narada, Overcast, ALMI, RMX) Root handles all “join” requests (Bayeux) Root split is common solution, but suffers consistency overhead

Solutions for Dissemination Tree Peer-to-Peer Overlay Location Services with Good Scalability & Locality Simultaneous Replica Placement and Tree Construction

Peer-to-peer Routing and Location Services Properties Needed by Tree Building Algorithms Distributed, scalable location with guaranteed success Search with locality P2P Routing and Location Services: Tapestry CAN, Chord, Pastry insufficient locality or flexibility to place objects Http://www.cs.berkeley.edu/~ravenben/tapestry

Simultaneous Replica Placement and Tree Construction Static Replica Placement + IP Multicast Modeled as a global optimization problem Design a greedy algorithm with logN approximation Optimal case for comparison Dynamic Replica Placement + Application-level Multicast Search for qualified local replicas first Place new replicas on Tapestry overlay path Two approaches: naïve and smart Soft-state Tree Maintenance Each node only maintains states for its parent and direct children We are going to compare several approaches, dynamic replica placement… And how they are self-organized into d-tree and how they are maintained through soft-state mgt. Talk about facility location problem: Each CDN server is a facility and each client is a location, to put a replica incurs an uniform cost 1. Each CDN server can cover a certain subset of clients and has a capacity constraint on the # of clients to cover. To minimize the total cost while covering all clients.

Dynamic Replica Placement: naïve data plane parent candidate Tapestry overlay path s proxy c network plane Tapestry mesh

Dynamic Replica Placement: naïve data plane parent candidate s proxy Piggyback the overlay dist and load of the servers on the Tapestry path, thus it will choose the one as close to c as possible c first placement choice network plane Tapestry mesh Tapestry overlay path

Dynamic Replica Placement: smart parent candidates Aggressive search Greedy load distribution data plane client child Tapestry overlay path Again, client c submit request to s through Tapestry S have a larger set, including …, we call it aggressive search… Load-qualified candidates will send its load to c, c will ping them to get the distance and choose the one both satisfy the latency constraint and has the we choose the one with the lightest load as parent, we call it greedy load dist Then it will place a cache on the client and tie it into the d-tree If none of them satisfy, c will send a another lazy placement msg to s through Tapestry, piggyback the load of the servers on the Tapestry path. S will use the overlay distance for estimation of IP distance, and choose the one as far to c as possible s proxy parent sibling c server child network plane

Dynamic Replica Placement: smart Aggressive search Lazy placement Greedy load distribution data plane parent candidates client child Tapestry overlay path s proxy parent first placement choice sibling c server child network plane

Evaluation Methodology Network Topology 5000-node network with GT-ITM transit-stub model 500 d-tree server nodes, 4500 clients join in random order Dissemination Tree Server Deployment Random d-tree Backbone d-tree (choose backbone routers and subnet gateways first) Constraints 50 ms latency bound and 200 clients/server load bound

Four Approaches for Comparison Overlay Dynamic Naïve Placement (dynamic_naïve) Overlay Dynamic Smart Placement (dynamic_smart) Static Placement on Overlay Network (overlay_static) Static Placement on IP Network (IP_static)

Number of Replicas Deployed and Load Distribution Overlay_smart uses much less replicas than overlay_naïve and very close to IP_static Overlay_smart has better load distribution than od_naïve, overlay_static and very close to IP_static

Multicast Performance 85% of overlay_smart Relative Delay Penalty (RDP) less than 4 Bandwidth consumed by overlay_smart is very close to IP_static and much less than overlay_naive With fixed replicas, how good multicast performance d-tree has

Tree Construction Traffic Including “join” requests, “ping” messages, replica placement and parent/child registration Overlay_smart consumes three to four times of traffic than overlay_naïve, and the traffic of overlay_naïve is quite close to IP_static Far less frequent event than update dissemination

Conclusions Peer-to-peer networks can be used to construct CDNs Dissemination Tree: dynamic Content Distribution Network with good QoS, efficiency and load balancing P2P location service to improve scalability and locality Simultaneous dynamic replica placement and tree construction In particular Use Tapestry to contact nearby region of tree to select parent Lazy placement of new replicas on Tapestry overlay path Close to optimal number of replicas, good load distribution, low multicast delay and bandwidth penalty at the price of reasonable construction traffic

Future Work Evaluate with more diverse topologies and real workload Dynamic replica deletion/migration to adapt to the shift of users’ interests Implementation for OceanStore, a global-scale persistent data storage system