Chapter 7 Input/Output (Continued)
DMA Function DMA controller(s) takes over Bus supervision from CPU for I/O Additional Module(s) attached to bus to control DMA operation
Typical DMA Module Organization
DMA Operation CPU provides direction to DMA controller(s) —Read/Write —Device address (which controller) —Starting address of memory block for data —Amount of data to be transferred —Mode(s) of data transfer CPU carries on with other work DMA controller deals with transfer DMA controller sends interrupt when finished
DMA Transfer - Cycle Stealing (?) DMA controller takes over bus for a cycle Transfers one (limited) word(s) of data Not an interrupt —CPU does not switch context CPU suspended between bus cycles —i.e. before an operand or data fetch or a data write Slows down CPU but not as much as CPU doing transfer
DMA and Interrupt Breakpoints During an Instruction Cycle What could be wrong with this?
Several questions What effect does caching memory have on DMA? What effect does use of DRAMs have on DMA ?
DMA Configurations (1) Single Bus, Detached DMA controller Each transfer uses bus twice —I/O to DMA then DMA to memory CPU is suspended twice
DMA Configurations (2) Single Bus, Integrated DMA controller Controller may support >1 device Each transfer uses bus once —DMA to memory CPU may be suspended only once
DMA Configurations (3) Separate I/O Bus Bus supports all DMA enabled devices Each transfer uses bus once —DMA to memory CPU is suspended once
Other “Cycle Stealing” Options Stealing cycles when they aren’t going to be used anyway — when the cache is providing data — when a instruction is not going to use the bus — when a partition of memory is not likely to be accessed – implication?
Interfacing Options (many) Parallel - PCI - SCSI Serial - RS 232 Local Networks - Ethernet Some Newer technologies - FireWire - InfiniBand - USB Wireless - BlueTooth - WiFi Automation - CAN
I/O Channels I/O channels are processors dedicated to I/O e.g. 3D graphics cards CPU instructs I/O controller to do transfer and provides it mode I/O controller does entire transfer from one or many devices Makes transfers less visible to CPU Improves speed —Takes load off CPU
I/O Channel Architectures
Intel 82C55A Programmable Peripheral Interface
Keyboard/Display Interfaces to 82C55A
Protocols Asynchronous Synchronous Packet Switching
Serial - RS 232 UART (Universal Asynchronous Receiver & Transmitter) Serial interface on a chip Historically very significant After 30 years, still a standard
RS232 Character transmission (NRZ)
UART Block Diagram
Connector Wiring – Null Modem
UART Application
USB (Universal Serial Bus) Serial - To replace legacy serial and parallel PC interfaces —90 ohm terminated twisted pair with 5V power and gnd – For speed —NRZ, half duplex, differential - Robust Tiered Star Topology (Host down stream ports) — Up to 127 devices — Connector enforce topology — Does not support extension cables! — FCFS (can easily run out of bandwidth) Multiple speeds — 1.5 Mbits/sec (Version 1.0) — 12 Mbits/sec (Version 1.1) — 480 Mbits/sec (Version 2.0) — 4.8 Gbits/sec (Version 3.0) Plug and Play — Supports hot connection (no “installation” required) — Establishes initial connection at low speed - “Chirping” Potentially competing with Firewire
USB Topology
Ethernet CSMA/CD (Carrier Sense Multiple Access/Collision Detection) A local area network access method in which contention between two or more stations is resolved by collision detection. When two stations transmit at the same time, they both stop and signal a collision has occurred. Each then tries again after waiting a predetermined time period. To avoid another collision, the stations involved each choose a random time interval to schedule the retransmission of the collided frame. To make sure that the collision is recognized, Ethernet requires that a station must continue transmitting until the 50 microsecond period has ended. If the station has less than 64 bytes of data to send, then it must pad the data by adding zeros at the end.
Bob Metcalf’s Ethernet Concept
TDMA Communications Protocol TDMA (Time delay Multiple Access) Based upon a reservation system to allow multiaccess to a communication channel. - It has longer nominal delay, but operates more reliably at heavy load. It is the basis for GSM (Global System for Mobile telecommunication) - the protocol used by MOST of the world for cell phones. - In the US the dominant protocol is CDMA.
Layering – Example: OSI Network Layers The Physical Layer describes the physical properties of the various communications media, as well as the electrical properties and interpretation of the exchanged signals. Example: this layer defines the size of Ethernet coaxial cable, the type of BNC connector used, and the termination method. The Data Link Layer describes the logical organization of data bits transmitted on a particular medium. Example: this layer defines the framing, addressing and check-summing of Ethernet packets. The Network Layer describes how a series of exchanges over various data links can deliver data between any two nodes in a network. Example: this layer defines the addressing and routing structure of the Internet. The Transport Layer describes the quality and nature of the data delivery. Example: this layer defines if and how retransmissions will be used to ensure data delivery. The Session Layer describes the organization of data sequences larger than the packets handled by lower layers. Example: this layer describes how request and reply packets are paired in a remote procedure call. The Presentation Layer describes the syntax of data being transferred. Example: this layer describes how floating point numbers can be exchanged between hosts with different math formats. The Application Layer describes how real work actually gets done. Example: this layer would implement file system operations. International Standards Organization’s (ISO) Open Systems Interconnection (ISO) Model:
Simple Example OF 7 Layer OSI Model Application Layer: Set of C Instructions, Set of Data {I0 I1 I2 …. IN Do D1 D2 … Dm} Presentation Layer: ASCII Coding {ASC {I0 I1 I2 …. IN Do D1 D2 … Dm}} Session Layer: What process at computer x is communicating with what process at computer y {X4 Y6 {ASC {I0 I1 I2 …. IN Do D1 D2 … Dm}}} Transport Layer: Guaranteed Transmission, sequentially numbered packets of 4096 bytes {GT4 P34 {X4 Y6 {ASC {I0 I1 I2 …. IN Do D1 D2 … Dm}}} PCKSUM} Network Layer: Path through Network {N23 N3 N53 {GT4 P34 {X4 Y6 {ASC {I0 I1 I2 …. IN Do D1 D2 … Dm}}} PCKSUM}} Data Link Layer: Serial 256 bytes per frame {STRT T{N23 N3 N53 {GT4 P34 {X4 Y6 {ASC {I0 I1 I2 …. IN Do D1 D2 … Dm}}} PCKSUM}}CHKSM} Physical Layer: 9600Baud, Coax cable - {Start {….}Parity Stop Stop}
Network Reference model - Ethernet
Ethernet packet
Ethernet block diagram
IEEE 1394 FireWire (Competitor to USB) High performance serial bus Fast Low cost Easy to implement Also being used in digital cameras, VCRs and TV
FireWire Configuration Daisy chain Up to 63 devices on single port —Really 64 of which one is the interface itself Up to 1022 buses can be connected with bridges Automatic configuration No bus terminators May be tree structure
Simple FireWire Configuration
FireWire 3 Layer Stack Physical —Transmission medium, electrical and signaling characteristics Link —Transmission of data in packets Transaction —Request-response protocol
FireWire Protocol Stack
FireWire - Physical Layer Data rates from 25 to 400Mbps Two forms of arbitration —Based on tree structure —Root acts as arbiter —First come first served —Natural priority controls simultaneous requests –i.e. who is nearest to root —Fair arbitration —Urgent arbitration
FireWire - Link Layer Two transmission types —Asynchronous –Variable amount of data and several bytes of transaction data transferred as a packet –To explicit address –Acknowledgement returned —Isochronous –Variable amount of data in sequence of fixed size packets at regular intervals –Simplified addressing –No acknowledgement
FireWire Subactions
InfiniBand I/O specification aimed at high end servers —Merger of Future I/O (Cisco, HP, Compaq, IBM) and Next Generation I/O (Intel) Version 1 released early 2001 Architecture and spec. for data flow between processor and intelligent I/O devices Intended to replace PCI in servers Increased capacity, expandability, flexibility
InfiniBand Architecture Remote storage, networking and connection between servers Attach servers, remote storage, network devices to central fabric of switches and links Greater server density Scalable data centre Independent nodes added as required I/O distance from server up to —17m using copper —300m multimode fibre optic —10km single mode fibre Up to 30Gbps
InfiniBand Switch Fabric
InfiniBand Operation 16 logical channels (virtual lanes) per physical link One lane for management, rest for data Data in stream of packets Virtual lane dedicated temporarily to end to end transfer Switch maps traffic from incoming to outgoing lane
InfiniBand Protocol Stack