Replication Degree Customization for High Availability

Slides:



Advertisements
Similar presentations
Cognitive Radio Communications and Networks: Principles and Practice By A. M. Wyglinski, M. Nekovee, Y. T. Hou (Elsevier, December 2009) 1 Chapter 17 Auction-based.
Advertisements

VCRIB: Virtual Cloud Rule Information Base Masoud Moshref, Minlan Yu, Abhishek Sharma, Ramesh Govindan HotCloud 2012.
PrincetonUniversity Performance Isolation and Fairness for Multi-Tenant Cloud Storage David Shue*, Michael Freedman*, and Anees Shaikh ✦ *Princeton ✦ IBM.
Autonomic Systems Justin Moles, Winter 2006 Enabling autonomic behavior in systems software with hot swapping Paper by: J. Appavoo, et al. Presentation.
Novasky: Cinematic-Quality VoD in a P2P Storage Cloud Speaker : 童耀民 MA1G Authors: Fangming Liu†, Shijun Shen§,Bo Li†, Baochun Li‡, Hao Yin§,
© 2005 Andreas Haeberlen, Rice University 1 Glacier: Highly durable, decentralized storage despite massive correlated failures Andreas Haeberlen Alan Mislove.
Availability in Globally Distributed Storage Systems
Linear Obfuscation to Combat Symbolic Execution Zhi Wang 1, Jiang Ming 2, Chunfu Jia 1 and Debin Gao 3 1 Nankai University 2 Pennsylvania State University.
1 Defragmenting DHT-based Distributed File Systems Jeffrey Pang, Srinivasan Seshan Carnegie Mellon University Phillip B. Gibbons, Michael Kaminsky Intel.
1 One Torus to Rule Them All: Multi-dimensional Queries in P2P Systems Prasanna Ganesan Beverly Yang Hector Garcia-Molina Stanford University.
Efficient replica maintenance for distributed storage systems Byung-Gon Chun, Frank Dabek, Andreas Haeberlen, Emil Sit, Hakim Weatherspoon, M. Frans Kaashoek,
Adaptive Content Management in Structured P2P Communities Jussi Kangasharju Keith W. Ross David A. Turner.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
Clustering of Web Content for Efficient Replication Yan Chen, Lili Qiu, Wei Chen, Luan Nguyen and Randy H. Katz {yanchen, wychen, luann,
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
Proposed Work 1. Client-Server Synchronization Proposed Work 2.
1 Experimental Evidence on Partitioning in Parallel Data Warehouses Pedro Furtado Prof. at Univ. of Coimbra & Researcher at CISUC DEI/CISUC-Universidade.
Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis [1] 4/24/2014 Presented by: Rakesh Kumar [1 ]
Profile Driven Component Placement for Cluster-based Online Services Christopher Stewart (University of Rochester) Kai Shen (University of Rochester) Sandhya.
RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008.
Selective Block Minimization for Faster Convergence of Limited Memory Large-scale Linear Models Kai-Wei Chang and Dan Roth Experiment Settings Block Minimization.
Autonomous Replication for High Availability in Unstructured P2P Systems Francisco Matias Cuenca-Acuna, Richard P. Martin, Thu D. Nguyen
임규찬. 1. Abstract 2. Introduction 3. Design Goals 4. Sample-Based Scheduling for Parallel Jobs 5. Implements.
A Formal Analysis of Conservative Update Based Approximate Counting Gil Einziger and Roy Freidman Technion, Haifa.
1 LD-Sketch: A Distributed Sketching Design for Accurate and Scalable Anomaly Detection in Network Data Streams Qun Huang and Patrick P. C. Lee The Chinese.
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Edouard et al. Madhura S Rama.
The virtue of dependent failures in multi-site systems Flavio Junqueira and Keith Marzullo University of California, San Diego Workshop on Hot Topics in.
Database replication policies for dynamic content applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto Presented by Ahmed.
TRUST Self-Organizing Systems Emin G ü n Sirer, Cornell University.
BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data Authored by Sameer Agarwal, et. al. Presented by Atul Sandur.
A Tale of Two Erasure Codes in HDFS
Talal H. Noor, Quan Z. Sheng, Lina Yao,
Data Driven Resource Allocation for Distributed Learning
A Dynamic Scheduling Framework for Emerging Heterogeneous Systems
Introduction to Load Balancing:
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
International Conference on Data Engineering (ICDE 2016)
Curator: Self-Managing Storage for Enterprise Clusters
Measurement-based Design
Scale and Performance in the CoBlitz Large-File Distribution Service
Outline Introduction Related Work
Vivaldi: A Decentralized Network Coordinate System
Xiaodong Wang, Shuang Chen, Jeff Setter,
Presented by Haoran Wang
Resource Elasticity for Large-Scale Machine Learning
The Variable-Increment Counting Bloom Filter
Lecture 13: Large Cache Design I
On the Scale and Performance of Cooperative Web Proxy Caching
Understanding and Exploiting Amazon EC2 Spot Instances
Department of Computer Science University of California, Santa Barbara
StreamApprox Approximate Stream Analytics in Apache Spark
Data Integration with Dependent Sources
Reference-Driven Performance Anomaly Identification
Edge computing (1) Content Distribution Networks
Xiaoyang Zhang1, Yuchong Hu1, Patrick P. C. Lee2, Pan Zhou1
CS-514 Final Project How circular arrays behave under successive rounds of uniform insertions and deletions Diogo Andrade Gábor Rudolf.
Coded Caching in Information-Centric Networks
Optimizing Data Popularity Conscious Bloom Filters
Is Dynamic Multi-Rate Worth the Effort?
Pinjia He, Jieming Zhu, Jianlong Xu, and
Dept. of Computer Science, Univ. of Rochester
Balancing the Tradeoffs between Data Accessibility and Query Delay
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Hardware Counter Driven On-the-Fly Request Signatures
Request Behavior Variations
REED : Robust, Efficient Filtering and Event Detection
Realizing Closed-loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges Islam S. Badreldin*, Ann Gordon-Ross*,
Department of Computer Science University of California, Santa Barbara
Tiresias A GPU Cluster Manager for Distributed Deep Learning
Presentation transcript:

Replication Degree Customization for High Availability Kai Shen 12/7/2018 Replication Degree Customization for High Availability Ming Zhong Kai Shen Joel Seiferas Google University of Rochester EuroSys 2008

Kai Shen 12/7/2018 Problem Context Replication degree tradeoff between availability and space Skewed data popularity distributions intuitive to highly replicate popular objects goal: improve availability under certain space constraint Should we worry about space constraint today? decentralized wide-area systems – high machine failure/inaccessibility rate demand high-degree replication 0.265 failure rate from a Planetlab machine accessibility trace centrally-managed local-area clusters – often requiring data in-memory and relatively high memory cost 12/7/2018 EuroSys 2008 EuroSys 2008

Related Work uniform-degree replication for availability or durability Kai Shen 12/7/2018 Related Work uniform-degree replication for availability or durability Farsite, CFS, GFS, Glacier, IrisStore, MOAT simple adaptive approach in some production systems higher replication for most popular objects; lower replication for the rest other utilizations of skewed data object popularity fast search/lookup – Cohen et al., Beehive other ways to improve data availability in distributed systems replica placement – Farsite erasure coding – Reed-Solomon 12/7/2018 EuroSys 2008 EuroSys 2008

Basic Analytical Result Kai Shen 12/7/2018 Basic Analytical Result Problem formulation: p is machine failure probability ⇒ unavailability of an object with k replicas is pow(p,k) a system with n objects: object i popularity is ri, size is si find object replication degrees k1, k2, …, kn to minimize expected unavailability ∑1≤i≤n ri ∙ pow(p,ki) subject to space constraint ∑1≤i≤n si ∙ ki ≤ K Result: Lagrangian function: ∑1≤i≤n ri ∙ pow(p,ki) + λ ∙ (∑ 1≤i≤n si ∙ ki – K) optimization is reached when the function’s partial derivatives on ki’s and λ are all zero therefore ki = C + log1/p(ri/si), C is a constant 12/7/2018 EuroSys 2008 EuroSys 2008

Challenges in Systems Context Kai Shen 12/7/2018 Challenges in Systems Context Basic analytical result: optimal object replication degree ki = C + log1/p(ri/si) p is machine failure probability; ri is object popularity; si is object size Systems issues: complex system models: multi-object operations, nonuniform machine failure rates realistic workload behaviors: skewness and stability of object popularity maintenance overhead: replica creation/deletion under dynamic system changes 12/7/2018 EuroSys 2008 EuroSys 2008

Kai Shen 12/7/2018 Complex System Models Basic analytical result can be adapted to handle complex system models. Multi-object operations multi-object op unavailability is approximately the sum of the unavailabilities of individual objects when xi‘s are small, 1-Π1≤i≤m (1-xi) ≈ ∑1≤i≤m xi Nonuniform machine failure rates redefine p as the average per-machine failure rate 12/7/2018 EuroSys 2008 EuroSys 2008

Object Popularity/Size Skewness Kai Shen 12/7/2018 Object Popularity/Size Skewness Popularity to size skewness correlation Trace-driven examination: Most popular objects are not necessarily subject to most replication 12/7/2018 EuroSys 2008 EuroSys 2008

Object Popularity Stability Kai Shen 12/7/2018 Object Popularity Stability Stable object popularity is important for learning the object popularity and for low adjustment overhead Illustration of stability across month-long trace segments: Not designed to handle flash crowds 12/7/2018 EuroSys 2008 EuroSys 2008

Dynamic Maintenance Overhead Kai Shen 12/7/2018 Dynamic Maintenance Overhead System adaptation may require dynamic maintenance object popularity may change over time Low maintenance overhead due to stable object popularities stable replica assignment in the analytical result object replication degree ki = C + log1/p(ri/si) coarse-grain adaptation e.g., support only two replication degrees – low/high 12/7/2018 EuroSys 2008 EuroSys 2008

Trace-driven Evaluation – Availability Improvement Kai Shen 12/7/2018 Trace-driven Evaluation – Availability Improvement Availability improvement on four real application traces and two machine failure patterns Compared to uniform replication under same space constraint: 12/7/2018 EuroSys 2008 EuroSys 2008

Trace-driven Evaluation – Changing Space Constraint Kai Shen 12/7/2018 Trace-driven Evaluation – Changing Space Constraint Results on the Planetlab machine failure pattern Availability improvement is independent of space limit High replication needed for some decentralized wide-area systems 12/7/2018 EuroSys 2008 EuroSys 2008

Trace-driven Evaluation – Dynamic Maintenance Overhead Kai Shen 12/7/2018 Trace-driven Evaluation – Dynamic Maintenance Overhead Overhead of weekly changes on replication degree number of replica creations/deletions size of replica creations 12/7/2018 EuroSys 2008 EuroSys 2008

Kai Shen 12/7/2018 Conclusion Results analytical result: optimal replication when object replication degree is linear to log(popularity/size) address systems issues in complex system models, realistic workload behaviors, and maintenance overhead Big picture: skewed & stable data distributions motivate per-object adaptation in distributed system management adapt replication degree for high availability [this paper] adapt Bloom filter hash number for low false-positive rate [other result] adapt co-placement of correlated data objects for fast multi-object operations [other result] 12/7/2018 EuroSys 2008 EuroSys 2008