Autonomous Replication for High Availability in Unstructured P2P Systems Francisco Matias Cuenca-Acuna, Richard P. Martin, Thu D. Nguyen

Slides:



Advertisements
Similar presentations
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Advertisements

Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen Scott Shenker This is a modified version of the original presentation by the authors.
On Large-Scale Peer-to-Peer Streaming Systems with Network Coding Chen Feng, Baochun Li Dept. of Electrical and Computer Engineering University of Toronto.
On the Economics of P2P Systems Speaker Coby Fernandess.
PlanetP: Using Gossiping to Build Content Addressable Peer-to-Peer Information Sharing Communities F. M. Cuenca-Acuna, C. Peery, R. P. Martin, and T. D.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
A Dependable Auction System: Architecture and an Implementation Framework
A S URVEY OF TRUST MANAGEMENT AND ITS APPLICATIONS S UPERVISED BY : D R. Y AN W ANG Ravendra Singh Student-id:
The Organic Grid: Self- Organizing Computation on a Peer-to-Peer Network Presented by : Xuan Lin.
Availability in Global Peer-to-Peer Systems Qin (Chris) Xin, Ethan L. Miller Storage Systems Research Center University of California, Santa Cruz Thomas.
Autonomous Replication for High Availability in Unstructured P2P Systems Francisco Matias Cuenca-Acuna, Richard P. Martin, Thu D. Nguyen Department of.
FRIENDS: File Retrieval In a dEcentralized Network Distribution System Steven Huang, Kevin Li Computer Science and Engineering University of California,
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Cooperative Caching Middleware for Cluster-Based Servers Francisco Matias Cuenca-Acuna Thu D. Nguyen Panic Lab Department of Computer Science Rutgers University.
A probabilistic approach to building large scale federated systems Francisco Matias Cuenca-Acuna
Rutgers PANIC Laboratory The State University of New Jersey Self-Managing Federated Services Francisco Matias Cuenca-Acuna and Thu D. Nguyen Department.
Adaptive Content Management in Structured P2P Communities Jussi Kangasharju Keith W. Ross David A. Turner.
Francisco Matias Cuenca-Acuna Christopher Peery Thu D. Nguyen Usando algoritmos probabilísticos para construir sistemas.
A. Frank 1 Internet Resources Discovery (IRD) Peer-to-Peer (P2P) Technology (1) Thanks to Carmit Valit and Olga Gamayunov.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Erasure Coding vs. Replication: A Quantiative Comparison
Using Gossiping to Build Content Addressable Peer-to-Peer Information Sharing Communities Francisco Matias Cuenca-Acuna, Christopher Peery, Richard P.
Comparing Hybrid Peer-to-Peer Systems Beverly Yang and Hector Garcia-Molina Presented by Marco Barreno November 3, 2003 CS 294-4: Peer-to-peer systems.
Navigating and Sharing in a Decentralized World Francisco Matias Cuenca-Acuna
Text-Based Content Search and Retrieval in ad hoc P2P Communities Francisco Matias Cuenca-Acuna Thu D. Nguyen
Searching in Unstructured Networks Joining Theory with P-P2P.
Wide-area cooperative storage with CFS
What Can Databases Do for Peer-to-Peer Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu Presented by: Ryan Huebsch CS294-4 P2P Systems.
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
Peer-to-peer archival data trading Brian Cooper and Hector Garcia-Molina Stanford University.
Improving Data Access in P2P Systems Karl Aberer and Magdalena Punceva Swiss Federal Institute of Technology Manfred Hauswirth and Roman Schmidt Technical.
Correctness of Gossip-Based Membership under Message Loss Maxim Gurevich, Idit Keidar Technion.
Internet Quarantine: Requirements for Containing Self-Propagating Code David Moore et. al. University of California, San Diego.
Algorithms for Self-Organization and Adaptive Service Placement in Dynamic Distributed Systems Artur Andrzejak, Sven Graupner,Vadim Kotov, Holger Trinks.
C LIENT /S ERVER AND P EER TO P EER National 4/5 Computing Science.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
Network Aware Resource Allocation in Distributed Clouds.
Software Reliability SEG3202 N. El Kadri.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
Colin J. MacDougall.  Class of Systems and Applications  “Employ distributed resources to perform a critical function in a decentralized manner”  Distributed.
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
Tsunami: Maintaining High Bandwidth Under Dynamic Network Conditions Dejan Kostić, Ryan Braud, Charles Killian, Eric Vandekieft, James W. Anderson, Alex.
A Dynamic Data Grid Replication Strategy to Minimize the Data Missed Ming Lei, Susan Vrbsky, Xiaoyan Hong University of Alabama.
Autonomous Replication for High Availability in Unstructured P2P Systems (Paper by Francisco Matias Cuenca-Acuna, Richard P. Martin, Thu D. Nguyen) Hristo.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.
Research Direction Advisor: Frank,Yeong-Sung Lin Presented by Jia-Ling Pan 2010/10/211NTUIM OPLAB.
Minimizing Churn in Distributed Systems P. Brighten Godfrey, Scott Shenker, and Ion Stoica UC Berkeley SIGCOMM’06.
Peer to Peer Network Design Discovery and Routing algorithms
Aug 22, 2002Sigcomm 2002 Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen AT&T Labs-research Scott Shenker ICIR.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.
GPFS: A Shared-Disk File System for Large Computing Clusters Frank Schmuck & Roger Haskin IBM Almaden Research Center.
The Biologically Inspired Distributed File System: An Emergent Thinker Instantiation Presented by Dr. Ying Lu.
OceanStore : An Architecture for Global-Scale Persistent Storage Jaewoo Kim, Youngho Yi, Minsik Cho.
Internet Quarantine: Requirements for Containing Self-Propagating Code
A Replica Location Service
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT -Sumanth Kandagatla Instructor: Prof. Yanqing Zhang Advanced Operating Systems (CSC 8320)
Providing Secure Storage on the Internet
Assoc. Prof. Dr. Syed Abdul-Rahman Al-Haddad
Distributed computing deals with hardware
Introduction To Distributed Systems
Client/Server and Peer to Peer
Presentation transcript:

Autonomous Replication for High Availability in Unstructured P2P Systems Francisco Matias Cuenca-Acuna, Richard P. Martin, Thu D. Nguyen

Introduction Increasing connectivity is driving more dynamic and complex information sharing pattern −Web, file-sharing (e.g., Gnutella), collaborative computing (e.g., −Appearing in many other important computing domains −e.g., Grid and Web services Three emerging properties of these systems (P2P) −Large scale −Decentralized control −Computing takes place on a range of devices, not just servers New problems −Successful operation may depend on numerous resources spanning multiple administrative domains −Service and data availability is difficult to achieve

Introduction(contd.) Factors that affect availability −Lack of coordination in resource management −Wide range of peer behaviors −Expensive (impossible?) to create global views GOAL: provide predictable data availability in P2P systems −E.g., for file systems, we want to reason about minimum file availability Approach −Present explicit model of probabilistic availability to users −Allow users to specify the desired availability level to guide replication −Explicitly inform users when cannot achieve desired availability level −Use an autonomous replication algorithm to tolerate decentralized control −Use as little global state as possible Availability: % of time a piece of data is accessible

Pitfalls in P2P environments Wide range of node availability −Node MTTF no longer determined by hardware reliability but by users’ on-line behavior −Fixed number of replicas too wasteful −E.g., small number of replicas on highly available nodes equivalent to many replicas on low available nodes −Gnutella span from 0.1% to 100%, with an average of 24% −Also, we need to recreate replicas as nodes join and leave Long term dynamic membership −In fact, a fixed number doesn’t work at all because availability profile will likely change over time

Our Approach

Our approach   Use replication but −Vary number of replicas based on estimated file availability −Take advantage of nodes going offline as opposed to failing −Constantly monitor availability −Use erasure codes to minimize space requirements and spread file to more nodes

Internet The strategy Node A PlanetP Hoarded Set F1F1 F2F2 FiFi Excess Storage FjFj FkFk MembershipInfo. Gossiping Node B PlanetP Hoarded Set F3F3 F4F4 FxFx Excess Storage FyFy FzFz MembershipInfo. Gossiping PlanetP provides: - A Loosely synchronized community view - Propagates - Node av. Information - File placement as a Bloom Filter

Internet The strategy Node A PlanetP Hoarded Set F1F1 F2F2 FiFi Excess Storage FjFj FkFk MembershipInfo. Gossiping Node B PlanetP Hoarded Set F3F3 F4F4 FxFx Excess Storage FyFy FzFz MembershipInfo. Gossiping Advertise: - Availability 20% - Files F1, F2 - Fragments Fi, Fj, Fk

Internet The strategy Node A PlanetP Hoarded Set F1F1 F2F2 FiFi Excess Storage FjFj FkFk MembershipInfo. Gossiping Node B PlanetP Hoarded Set F3F3 F4F4 FxFx Excess Storage FyFy FzFz MembershipInfo. Gossiping Based on Node’s B view of F3: - Pick a random node - Create a new fragment for F3 - Push it  F3F3

Dealing with Decentralization Nodes replicate and evict autonomously −Reduce the use of global data and lack of central control All decisions are probabilistic −Weighted by availability estimates Target nodes control their own storage space −Protects system against greedy and faulty nodes Erasure codes plus −Use a modified version of Reed Solomon −Provide a large fragment space −Don’t re-create lost fragments −Prevents duplicates due to autonomous and misinformed decisions

Availability-based replacement Estimating file availability −Probability of finding an online copy or being able to reconstruct the file from the erasure coded fragments Evict fragments of files with “too much” availability −Note that “too much” is in comparison only to files in local excess storage (don’t have to know about all files in system) Why does it work? −Randomized placement decisions  local sample of file availabilities reflect global distribution −This approximation drives space allocation and allows files with insufficient availability to gain fragments

Evaluation

Test scenarios Evaluate three significantly different environments The file sharing environment −1000 nodes hosting a total of files −Node availability avg:24%, min:0.1%, 90th perc:75%, max:100% −Target 99.9% availability −10 minute refresh rate Sources −Saroiu et al. (Gnutella, Napster), DirectConnect at Rutgers OMNI −Centralized knowledge with no limitation on replica placement Base −What happens if you do not have availability estimates?

Availability Comparison

Effect of Av. Based Replacement

Conclusion Practical availability levels are achievable in spite of low node availability and decentralized environment −CO: 80% avg. av.  1X −FS: 24% avg. av.  6X −WG: 33% avg. av.  9X Having some global information is critical −But can do quite well with loosely synchronized data Our algorithm is competitive −Matches OMNI on avg. availability −Provides reasonable min av. −Validated under different environments −Resilient to malicious users −Allocates space fairly within limits

PlanetP Questions?