CS60002: Distributed Systems

Slides:



Advertisements
Similar presentations
Impossibility of Distributed Consensus with One Faulty Process
Advertisements

A General Characterization of Indulgence R. Guerraoui EPFL joint work with N. Lynch (MIT)
Lecture 8: Asynchronous Network Algorithms
Teaser - Introduction to Distributed Computing
Failure Detection The ping-ack failure detector in a synchronous system satisfies – A: completeness – B: accuracy – C: neither – D: both.
Leader Election Let G = (V,E) define the network topology. Each process i has a variable L(i) that defines the leader.  i,j  V  i,j are non-faulty.
Jaringan Informasi Pengantar Sistem Terdistribusi oleh Ir. Risanuri Hidayat, M.Sc.
Università degli Studi dell’Aquila Academic Year 2009/2010 Course: Algorithms for Distributed Systems Instructor: Prof. Guido Proietti Time: Monday:
Distributed Systems Dinesh Bhat - Advanced Systems (Some slides from 2009 class) CS 6410 – Fall 2010 Time Clocks and Ordering of events Distributed Snapshots.
CPSC 668Set 1: Introduction1 CPSC 668 Distributed Algorithms and Systems Fall 2006 Prof. Jennifer Welch.
CPSC 668Set 9: Fault Tolerant Consensus1 CPSC 668 Distributed Algorithms and Systems Spring 2008 Prof. Jennifer Welch.
Distributed systems Module 2 -Distributed algorithms Teaching unit 1 – Basic techniques Ernesto Damiani University of Bozen Lesson 2 – Distributed Systems.
Vinay Kumar Madhadi 09/23/2009 CSC Outline  Part 1 : Leader Election?  Part 2 : Different Design Topologies Algorithms Used-Bully, Ring, Invitation.
Distributed Consensus Reaching agreement is a fundamental problem in distributed computing. Some examples are Leader election / Mutual Exclusion Commit.
Distributed Consensus Reaching agreement is a fundamental problem in distributed computing. Some examples are Leader election / Mutual Exclusion Commit.
DISTRIBUTED ALGORITHMS Luc Onana Seif Haridi. DISTRIBUTED SYSTEMS Collection of autonomous computers, processes, or processors (nodes) interconnected.
Dionicio D. Gante, Genevev G. Reyes & Vanylive T. Galima DDistributed Operating Systems.
Lecture #12 Distributed Algorithms (I) CS492 Special Topics in Computer Science: Distributed Algorithms and Systems.
Lecture 0 Anish Arora CSE 6333 Introduction to Distributed Computing.
Parallel Distributed Programming Introduction.
Consensus and Its Impossibility in Asynchronous Systems.
Review for Exam 2. Topics included Deadlock detection Resource and communication deadlock Graph algorithms: Routing, spanning tree, MST, leader election.
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS Spring 2014 Prof. Jennifer Welch CSCE 668 Set 1: Introduction 1.
L EADER E LECTION Yu Meng O UTLINE Basic knowledge Overview of Leader Election Complete Topology Logical Ring Topology Three Topology Latest.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Wrap-up Steve Ko Computer Sciences and Engineering University at Buffalo.
Advanced Principles of Operating Systems (CE-403).
Lecture 4: Sun: 23/4/1435 Distributed Operating Systems Lecturer/ Kawther Abas CS- 492 : Distributed system & Parallel Processing.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 February 10, 2005 Session 9.
Teacher: Chun-Yuan Lin
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Wrap-up Steve Ko Computer Sciences and Engineering University at Buffalo.
Chap 15. Agreement. Problem Processes need to agree on a single bit No link failures A process can fail by crashing (no malicious behavior) Messages take.
Several sets of slides by Prof. Jennifer Welch will be used in this course. The slides are mostly identical to her slides, with some minor changes. Set.
Agreement in Distributed Systems n definition of agreement problems n impossibility of consensus with a single crash n solvable problems u consensus with.
1 Fault tolerance in distributed systems n Motivation n robust and stabilizing algorithms n failure models n robust algorithms u decision problems u impossibility.
DISTRIBUTED ALGORITHMS Spring 2014 Prof. Jennifer Welch Set 9: Fault Tolerant Consensus 1.
Fault tolerance and related issues in distributed computing Shmuel Zaks GSSI - Feb
CIS 825 Review session. P1: Assume that processes are arranged in a ring topology. Consider the following modification of the Lamport’s mutual exclusion.
Distributed Mutual Exclusion Synchronization in Distributed Systems Synchronization in distributed systems are often more difficult compared to synchronization.
1 AGREEMENT PROTOCOLS. 2 Introduction Processes/Sites in distributed systems often compete as well as cooperate to achieve a common goal. Mutual Trust/agreement.
Exercises for Chapter 11: COORDINATION AND AGREEMENT
Coordination and Agreement
CSE 486/586 Distributed Systems Wrap-up
CS/CE/TE 6378 Advanced Operating Systems
CSE 486/586 Distributed Systems Wrap-up
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS
CSE 486/586 Distributed Systems Failure Detectors
CSE 486/586 Distributed Systems Failure Detectors
Lecture 9: Asynchronous Network Algorithms
Outline Distributed Mutual Exclusion Distributed Deadlock Detection
Lecture 7: Introduction to Distributed Computing.
Leader Election (if we ignore the failure detection part)
Alternating Bit Protocol
Distributed Consensus
Agreement Protocols CS60002: Distributed Systems
Distributed Consensus
Introduction to locality sensitive approach to distributed systems
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS
Distributed Systems CS
Logical Clocks and Casual Ordering
Non-Distributed Excercises
Distributed Computing:
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS
Lecture 10: Coordination and Agreement
Abstraction.
Lecture 11: Coordination and Agreement
Distributed Systems (15-440)
Leader Election Bernard Chen.
CS60002: Distributed Systems
CSE 486/586 Distributed Systems Failure Detectors
Presentation transcript:

CS60002: Distributed Systems Introduction CS60002: Distributed Systems Pallab Dasgupta Professor, Dept. of Computer Sc. & Engg., Indian Institute of Technology Kharagpur INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Books Advanced Concepts in Operating Systems Mukesh Singhal and Niranjan G. Shivaratri McGraw Hill International Edition Introduction to Distributed Algorithms Gerard Tel Cambridge University Press Available in the CSE Dept Library (Acc No: I-455) INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

What is a distributed system? A very broad definition: A set of autonomous processes communicating among themselves to perform a task Issues: Un-reliability of communication Lack of global knowledge Lack of synchronization and causal ordering Concurrency control Failure and recovery INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Advantages Resource Sharing Higher Performance Fault Tolerance Scalability INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Examples of problems Reliable communication – Theoretically impossible? Muddy forehead and related problems Concurrency problems INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Example: Automotive Control Source: Leen and Hefferman, IEEE Computer, Jan 2002 INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Why is it hard to design them? The usual problem of concurrent systems: Arbitrary interleaving of actions makes the system hard to verify + No globally shared memory (therefore hard to collect global state) No global clock Unpredictable communication delays INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Models for Distributed Algorithms Topology: Completely connected, Ring, Tree etc. Communication: Shared memory / Message passing (reliable? Delay? FIFO/Causal? Broadcast/multicast?) Synchronous/asynchronous Failure models: Fail stop, Crash, Omission, Byzantine… An algorithm needs to specify the model on which it is supposed to work INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Complexity Measures Message complexity: no. of messages Communication complexity / Bit Complexity: no. of bits Time complexity: For synchronous systems, no. of rounds For asynchronous systems, different definitions are there. INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

Some Fundamental Problems Ordering events in the absence of a global clock Capturing the global state Mutual exclusion Leader election Clock synchronization Termination detection Constructing spanning trees Agreement protocols INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR