Replication Libby Rasnick Christopher Newport University CPSC 550 Spring 2003.

Libby Rasnick, CPSC 550 Replication Case Study, Christopher Newport University

Table of Contents: Definition, History, Goals, Features, Structure, How to Use, Applications, Benefits and Issues, Cost of Replication, Significant Points, Summary, References

Defining replication
“the maintenance of copies of data at multiple computers as a technique for automatically maintaining the availability of data despite server failures”
“a key to the effectiveness of distributed systems in that it provides enhanced performance, high availability and fault tolerance”

History
Initially, stored data back-ups (tape)
With distributed systems, partial or entire copies of a node’s data are stored by another node
Recent development: peer-to-peer resource trading (community-based redundancy system), similar to Napster

Goals
“to improve a service’s performance, to increase its availability, or to make it fault-tolerant.”
Enhances distributed systems by restricting or eliminating downtime
Fundamental trade-off in fault tolerance: consistency vs. availability (and performance)

Features
Passive (primary-backup): a master replication manager handles each request and passes the resulting updates to the slave (backup) managers
Active: requests are multicast to all replication managers; each processes the request and replies independently
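The difference between the two models can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the `ReplicaManager` class, the in-memory key-value store, and the function names are hypothetical and not part of the slides, and failures and networking are ignored.

```python
class ReplicaManager:
    """Hypothetical in-memory replica manager (RM) holding a key-value store."""

    def __init__(self, name):
        self.name = name
        self.store = {}

    def apply(self, key, value):
        # Execute a write locally and reply with this RM's name and the value.
        self.store[key] = value
        return (self.name, value)


def passive_write(primary, backups, key, value):
    # Passive: only the primary executes the client's request; it then
    # pushes the resulting update to each backup before replying.
    reply = primary.apply(key, value)
    for backup in backups:
        backup.store[key] = value
    return reply


def active_write(replicas, key, value):
    # Active: the request is multicast to every RM; each executes it and
    # replies independently, so the caller receives one reply per RM.
    return [rm.apply(key, value) for rm in replicas]
```

In the passive case the caller sees a single reply from the primary; in the active case it sees one reply per replication manager and has to decide how to combine them.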

Structure
Replication system transaction steps [1] (step, name, description):
1. request: the FE (front end) issues a request for service to one or more RMs (replication managers)
2. coordination: the RMs communicate with one another to deliver consistent results. An ordering scheme must be applied to incoming requests; common schemes are FIFO, causal, and total.
3. execution: the RM executes the request in a reversible manner
4. agreement: the RMs agree on the effects of the request
5. response: the RM(s) send the response to the FE, which must consolidate the replies and forward the result to the client
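As a rough illustration of how the five steps fit together, here is a single-process Python sketch. Everything in it is an assumption made for the example: the `ReplicaManager` class, the shared counter standing in for a total-ordering scheme, and the all-replies-equal agreement rule are hypothetical and much simpler than a real protocol.

```python
from itertools import count


class ReplicaManager:
    """Hypothetical RM that executes writes tentatively so they can be undone."""

    def __init__(self):
        self.store = {}
        self.undo = None  # (key, had_key, old_value) for the tentative write

    def execute(self, key, value):
        # Step 3 (execution): apply the write but remember how to reverse it.
        self.undo = (key, key in self.store, self.store.get(key))
        self.store[key] = value
        return value

    def commit(self):
        self.undo = None

    def rollback(self):
        key, had_key, old_value = self.undo
        if had_key:
            self.store[key] = old_value
        else:
            del self.store[key]
        self.undo = None


def handle_request(replicas, sequencer, key, value):
    # Step 1 (request): the front end passes the request to every RM
    # (modelled here as a plain function call).
    # Step 2 (coordination): one shared counter gives each request a
    # position in a single total order.
    seq = next(sequencer)
    # Step 3 (execution): every RM executes tentatively.
    results = [rm.execute(key, value) for rm in replicas]
    # Step 4 (agreement): commit only if all RMs produced the same result.
    agreed = len(set(results)) == 1
    for rm in replicas:
        if agreed:
            rm.commit()
        else:
            rm.rollback()
    # Step 5 (response): the front end consolidates the replies into one answer.
    answer = results[0] if agreed else None
    return seq, answer
```

A caller would create the sequencer once, for example `count(1)`, and share it across requests so that sequence numbers increase with each call.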

How to Use (server/RM)
Determine the number of servers and the availability of data
Number of RMs (servers): with n servers, each with probability p of server access failure, availability = 1 - probability(all servers fail) = 1 - p^n
Data availability (objects): if f of f+1 servers crash, then 1 server is still operating; if up to f servers can exhibit Byzantine failures, then 2f+1 servers would permit full function in most worst-case scenarios
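The sizing rules above translate directly into arithmetic. The following Python sketch assumes that server failures are independent; the function names and the sample values (n = 2, p = 0.05) are chosen only for illustration.

```python
def availability(n, p):
    # Probability that at least one of n servers is reachable, assuming each
    # server fails independently with probability p: 1 - p^n.
    return 1 - p ** n


def servers_needed(f, byzantine=False):
    # Per the slide: f + 1 servers survive f crash failures, and 2f + 1
    # servers are used when up to f servers may exhibit Byzantine failures.
    return 2 * f + 1 if byzantine else f + 1


print(availability(2, 0.05))
print(servers_needed(1), servers_needed(1, byzantine=True))
```

With two servers that each fail 5% of the time, availability comes out at about 0.9975, compared with 0.95 for a single server.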

How to Use (server/RM)
Add RMs (as calculated)
Ensure communication between RMs
Install FEs to accommodate the number of clients
Follow the five transaction steps: request, coordination, execution, agreement, and response

Applications
DNS servers: replication eases request bottlenecks
Community-based redundancy systems: propagated data alleviates pressure from server failures
gossip, Bayou, Coda

Benefits and Issues
Benefits:
“selecting a nearby replica to serve an access request results in a shorter service time” [2]
reduced or eliminated down-time due to server failure
Issues:
location dependent replica updating
cost of replication

Cost of Replication
What to charge for and when
Calculated per transaction
Cost modeling
Cost Model Analysis:
V – set of sites in a network
S – set in question
R – resident set R = S  V of copy-bearing sites

Significant Points
Proportional cost based on number of accesses
Fee paid by the server for every access past a set maximum allotment of accesses
Rewards servers that perform good file maintenance (requesting a replica of a frequently accessed file)
More of the cost of the replication system is absorbed by those servers that do not use it properly
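One possible reading of the per-access fee, sketched in Python. The function name, the flat per-access rate, and the sample numbers are assumptions made for this example; the slides do not give a concrete formula.

```python
def excess_access_fee(accesses, allotment, fee_per_access):
    # Hypothetical rule: a server pays nothing up to its allotment and a
    # flat fee for every access beyond it.
    excess = max(0, accesses - allotment)
    return excess * fee_per_access


print(excess_access_fee(120, 100, 2))
print(excess_access_fee(80, 100, 2))
```

Under this reading, a server that stays within its allotment pays nothing, so the cost falls on servers that keep accessing a file remotely past the allotment.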

Summary
Definition: maintaining copies of data in fault-independent locations
Goals: accessibility, fault-tolerance, consistency
Features: passive and active models
Structure: clients, front ends, replication managers
How to Use: request, coordination, execution, agreement, response
Applications: DNS servers, Napster, gossip, Bayou, Coda
Benefits: increased data accessibility and fault-tolerance
Issues: data consistency and transparency, cost of replication

References
[1] Coulouris, George, Dollimore, Jean, and Kindberg, Tim. Distributed Systems: Concepts and Design. Third Ed. New York: Addison Wesley
[2] Levy, Eliezer and Silberschatz, Abraham. Distributed File Systems: Concepts and Examples. ACM Computing Surveys, 22(4): , December
[3] Cooper, Brian F. and Garcia-Molina, Hector. Peer-to-peer resource trading in a reliable distributed system. 1st International Workshop on Peer-to-Peer Systems, 11 November
[4] Goldweber, Michael and Johnson, Donald B. Minimizing Access Cost in Replicated Distributed Systems. Symposium on Principles of Distributed Computing: 56,
[5] Comer, Douglas E. Computer Networks and Internets. Second Ed. Upper Saddle River: Prentice Hall