Replica Placement Strategy for Wide-Area Storage Systems
Byung-Gon Chun and Hakim Weatherspoon
RADS Final Presentation
December 9, 2004
Environment
- Store large quantities of data durably and with high availability
- Storage strategy
  – Redundancy: duplicate data to protect against data loss
  – Place data throughout the wide area for availability and durability; avoid correlated failures
  – Continuously repair lost redundancy as needed: detect permanent node failures and trigger data recovery
Assumptions
- Data is maintained on nodes in the wide area, hosted at well-maintained sites
- Sites contribute resources
  – Nodes (storage, CPU)
  – Network bandwidth
- Nodes collectively maintain data
  – Adaptive under constant change; self-organizing and self-maintaining
- Costs
  – Data recovery: the process of maintaining data availability
  – Limit the wide-area bandwidth used to maintain data
Challenge
- Avoid correlated failures/downtime with careful data placement
  – Minimize the cost of the resources used to maintain data (storage, bandwidth)
  – Maximize data availability
Outline
- Analysis of correlated failures
  – Show that correlated failures exist and are significant
- Effects of a common subnet (administrative area, geographic location, etc.)
  – Pick an availability threshold and an amount of extra redundancy
- Effects of extra redundancy
  – Vary the amount of extra redundancy
  – Compare random, random-with-constraint, and oracle placement
  – Show that the margin between oracle and random placement is small
Analysis of PlanetLab
- Trace characteristics
- Trace-driven simulation: model maintaining data on PlanetLab
- Create trace using all-pairs ping*
  – Collected from February 16, 2003 to October 6, 2004
- Measure
  – Correlated failures vs. time
  – Probability of k nodes down simultaneously
  – {5th percentile, median} number of available replicas vs. time
  – Cumulative number of triggered data recoveries vs. time
*All-pairs ping data: Jeremy Stribling
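As a rough illustration of the "probability of k nodes down simultaneously" metric, the sketch below tallies that distribution from a boolean availability matrix. The `up` layout (one row per ping time slot, one column per node) and the function name are assumptions for illustration, not the format of the actual trace.

```python
from collections import Counter

def simultaneous_down_distribution(up):
    """Estimate P(exactly k nodes down in a time slot).

    Assumed layout: `up` is a list of per-slot boolean lists, where
    up[t][i] is True when node i answered pings during slot t.
    """
    counts = Counter(sum(1 for alive in slot if not alive) for slot in up)
    return {k: c / len(up) for k, c in sorted(counts.items())}
```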
Analysis of PlanetLab II: Correlated failures (figure)
Analysis I - Node characteristics (figure)
Analysis II - Correlated Failures (figure)
Correlated Failures (figure)
Correlated Failures (machines with downtime <= 1000 slots) (figure)
Availability Trace (figure)
Replica Placement Strategies
- Random
- RandomSite
  – Avoid placing multiple replicas in the same site
  – A site in PlanetLab is identified by its 2-byte IP address prefix
- RandomBlacklist
  – Avoid machines on a blacklist: the top k machines with the longest downtime
- RandomSiteBlacklist
  – Combine RandomSite and RandomBlacklist
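A minimal sketch of the four strategies, assuming nodes are identified by dotted-quad IP strings; `site_of`, `place`, and `worst_k` are hypothetical names introduced here for illustration.

```python
import random

def site_of(ip):
    """Site identifier: the 2-byte IP address prefix, per this slide."""
    return tuple(ip.split('.')[:2])

def place(nodes, n, avoid_same_site=False, blacklist=frozenset()):
    """Pick n replica holders uniformly at random.

    The two flags give the four strategies on this slide:
      Random              place(nodes, n)
      RandomSite          place(nodes, n, avoid_same_site=True)
      RandomBlacklist     place(nodes, n, blacklist=worst_k)
      RandomSiteBlacklist place(nodes, n, True, worst_k)
    """
    candidates = [ip for ip in nodes if ip not in blacklist]
    random.shuffle(candidates)
    chosen, used_sites = [], set()
    for ip in candidates:
        if avoid_same_site and site_of(ip) in used_sites:
            continue  # site already holds a replica
        chosen.append(ip)
        used_sites.add(site_of(ip))
        if len(chosen) == n:
            break
    return chosen
```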
Comparison of simple strategies (m = 1, th = 9, n = 14, |blacklist| = 35)

Strategy              # of repairs    Improvement (%)
Random
RandomSite
RandomBlacklist
RandomSiteBlacklist
Simulation Setup
- Placement algorithm
  – Random vs. oracle
  – Oracle strategies: Max-Lifetime-Availability; Min-Max-TTR, Min-Sum-TTR, Min-Mean-TTR
- Simulation parameters
  – Replication m = 1, threshold th = 9, total replicas n = 15
  – Initial repository size 2 TB
  – Write rate of 1 Kbps and 10 Kbps per node; 300 storage nodes
  – System grows at a rate of 3 TB and 30 TB per year, respectively
- Metrics
  – Number of available nodes
  – Number of data repairs
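A sketch of the repair trigger implied by these parameters: when too few replicas of an object are reachable, new copies are placed until the total returns to n. Whether the comparison against th is strict, and how replacement nodes are picked, are assumptions; `pick_new` stands in for a placement function such as the `place` sketch above.

```python
def maintain(object_replicas, available, th, n, pick_new):
    """One maintenance check for a single object.

    If fewer than th replicas are reachable (th = 9, n = 15 on this
    slide), repair back to n total replicas and count one triggered
    data recovery.
    """
    live = [node for node in object_replicas if node in available]
    if len(live) >= th:
        return live, 0                   # enough copies, no repair
    live += pick_new(n - len(live))      # re-place the missing copies
    return live, 1                       # one triggered recovery
```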
Comparison of simple strategies (m = 1, th = 9) (figure)
Results - Random Placement (1 Kbps) (figure)
Results - Oracle Max-Lifetime-Avail (1 Kbps) (figure)
Results - Breakdown of Random (1 Kbps) (figure)
Results - Random (10 Kbps) (figure)
Results - Breakdown of Random (10 Kbps) (figure)
Conclusion
- Correlated downtimes do exist
- Random placement is sufficient
  – A minimum data availability threshold plus extra redundancy is enough to absorb most of the correlation