Distributed (Operating) Systems -Communication in Distributed Systems- Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015
Communication Communication is at the heart of all distributed systems Examining the ways that processes on different machines can exchange information. IPC – Inter Process Communication Communication through message passing is harder than using primitives based on shared memory Communication in distributed systems is always based on low-level message passing as offered by underlying network
Outline of Next 3 Lessons Underlying network layer communication protocols – OSI Model (ISO/OSI) – Internet Protocol Suit TCP/IP Widely used models for communication, also called middleware communication services : – Remote Procedure Call (RPC) – Message Oriented Middleware (MOM) – RTP: Streams for transferring real-time data, such as needed for multimedia applications
Messages in Communication Communication messaging rules are defined in protocols Due to the absence of shared memory, all communication in distributed systems is based on sending and receiving (low-level) messages Many different agreements – IBM’s EBCDIC and ASCII – How many volts should be used to signal 0 – How does the receiver know which is the last bit – How can the receiver detect if a message has been damaged
OSI Model-Layered Protocol ISO/OSI is never widely used and are essentially dead In the OSI model, communication is divided up into 7 layers OSI layers were never popular. In contrast, protocols developed for the internet, such as TCP and IP, are mostly used
Communication Protocols There are some rules that communicating processes must adhere to – known as protocols Protocols are agreements/rules on communication Protocols could be connection-oriented or connectionless
Layered Protocols A typical message as it appears on the network.
Physical and Data-Link Layers Physical layer – Transmitting 0s and 1s – How many bits per sec can be sent – Can transmission take place in both direction – Standardizing the electrical, mechanical and signaling interfaces. Data Link Layer – Group the bits into units, frames, and see if each frame is correctly received. – Puts special bit pattern on the start and end of each frame – Computing checksum
Network Layer Routing It is important in WAN Shortest path is not always the best route – Depends on amount of delay, on a given route, amount of traffic and the number of messages queued up. Some routing algorithms. Most widely used network protocol is IP An IP packet can be sent without any setup. I.E. each IP packet is routed to its destination independent of all others.
Transport Layer Turns the underlying network into something that an application developer can use Messages from the application is broken into small pieces-packets TCP packed header concerns - which packets have been sent, which have been received, which should be retransmitted Connection-oriented: messages arrive in the same order (as in TCP) - reliable Connectionless: message can arrive in different order (as in UDP) – unreliable (just IP with some additions) The combination of TCP/IP is now used as a de facto standard for network communication
Client-Server TCP 3-way TCP hand-shake
Session and Presentation Layers Session Layer – Enhanced version of transport layer – Keeps track of which party is currently talking and it provides a synchronization – Puts checking points – in case of crash going to the last check point, Instead of going to the starting point Presentation Layer – Concerned with the meaning of the bits – Structure information such as people’s names, addresses etc.
OSI Model summary Missing in this model: A clear distinction between applications, application-specific protocols and general purpose protocols. Application-specific protocol – FTP – protocol for transferring files between client and server – HTTP – protocol is implemented by applications such as web browsers and web servers. – SMTP - Mail delivery system General-purpose protocols – Useful to many applications but cannot be qualified as transport protocols. These protocols fall into the category of middleware protocols.
Middleware Protocols Middleware: Layer that resides between an OS and an application – May implement general-purpose protocols that warrant their own layers
Middleware Protocols 1 Some services are not tied to any specific applications, but instead can be integrated into a middleware system as a general service – Authentication protocols Proof of claimed identity – Atomicity protocols Distributed commit protocols Widely applied in transactions – Distributed locking protocols Resources can be protected against simultaneous accesses
Middleware Protocols 2 HIGH level middleware communication services: – Remote Procedure Call (RPC) – Message Oriented Middleware (MOM) – RTP: Streams for transferring real-time data, such as needed for multimedia applications
Communication in Distributed Systems Communication is done through message passing Expressing communication through message passing is harder than using primitives based on shared memory Remote Procedure Calls – Transparency but poor for passing references – Ideal for client server applications Message-oriented Communication Stream-oriented Communication – Continuous media
Communication Between Processes Unstructured communication – Use shared memory or shared data structures Structured communication – Use explicit messages (IPCs) Distributed Systems: Both need low-level communication support
Types of Communication -1 Transient Communication – A message is stored by the communication system only as long as the sending and the receiving application are executing – Typically all transport level communication is transient communication Persistent Communication – A message that has been submitted for transmission is kept by middleware as long as it takes to deliver it to the receiver – Receiving application don’t need to be executing when the message is submitted
Types of Communication -2 Asynchronous Communication – Sender continues immediately after it has submitted its message for transmission – It means, message is temporarily stored in middleware upon submission Synchronous Communication – Sender is blocked until its request is known to be accepted – Three points Received based Delivery based Response based
Persistence and Synchronicity in Communication: RPC 1 Received Based 2 Delivery Based3 Response Based
Types of Communication -3 Connection-oriented (telephone) – With connection-oriented protocols, before exchanging data, the sender and receiver first explicitly establish a connection – TCP Connectionless (mailbox – dropping a letter) – No setup in advance is needed – IP, UDP
Persistence Persistent communication – Messages are stored until the next receiver is ready to receive that message – Examples: , pony express – Network buffers are not persistent they are transient
Transient Communication Transient communication – Message is stored only so long as sending/receiving application are executing – Discard message if it can’t be delivered to next server/receiver – Example: Transport-level communication services offer transient communication – Example: Typical network router – Example: Chatting
Synchronicity Asynchronous communication – Sender continues immediately after it has submitted the message – Need a local buffer at the sending host – Also called nonblocking Synchronous communication – Sender blocks until message is stored in a local buffer at the receiving host or actually delivered to receiver – Variant: Block until receiver processes the message – Also called blocking
NOW we have 4 combinations Persistence vs. Transient Asynchronous vs. Synchronous
Persistence and Synchronicity Combinations a) Persistent asynchronous communication (e.g., ) b) Persistent synchronous communication
Persistence and Synchronicity Combinations c) Transient asynchronous communication (e.g., UDP) d) Receipt-based transient synchronous communication
Persistence and Synchronicity Combinations e) Delivery-based transient synchronous communication at message delivery (e.g., asynchronous RPC) f) Response-based transient synchronous communication (RPC)
Back Up
RTP Actually a transport protocol Real-Time Transport Protocol Support real time data transfers A framework protocol - it specifies packet formats for real-time data without providing the actual mechanisms for guaranteeing data delivery.