Communication Synthesis: Buses and Network-on-Chip (NOC) Dr. Eng. Amr T. Abdel-Hamid ELECT 1002 Spring 2008 System-On-a-Chip Design.

Slides:



Advertisements
Similar presentations
Dan Marconett Next-Generation Networking Systems Lab
Advertisements

Comparison Of Network On Chip Topologies Ahmet Salih BÜYÜKKAYHAN Fall.
Prof. Natalie Enright Jerger
A Novel 3D Layer-Multiplexed On-Chip Network
1 IK1500 Communication Systems IK1330 Lecture 3: Networking Anders Västberg
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
Flattened Butterfly Topology for On-Chip Networks John Kim, James Balfour, and William J. Dally Presented by Jun Pang.
What is Flow Control ? Flow Control determines how a network resources, such as channel bandwidth, buffer capacity and control state are allocated to packet.
Chapter 8 Hardware Conventional Computer Hardware Architecture.
Networks on Chip : a very quick introduction! Jeremy Chan 11 May 2005.
Addressing the System-on-a-Chip Interconnect Woes Through Communication-Based Design N. Vinay Krishnan EE249 Class Presentation.
1 Lecture 17: On-Chip Networks Today: background wrap-up and innovations.
Module R R RRR R RRRRR RR R R R R Efficient Link Capacity and QoS Design for Wormhole Network-on-Chip Zvika Guz, Isask ’ har Walter, Evgeny Bolotin, Israel.
High Performance Router Architectures for Network- based Computing By Dr. Timothy Mark Pinkston University of South California Computer Engineering Division.
1 Lecture 12: Interconnection Networks Topics: dimension/arity, routing, deadlock, flow control.
Network-on-Chip Examples System-on-Chip Group, CSE-IMM, DTU.
Issues in System-Level Direct Networks Jason D. Bakos.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control.
1 Lecture 25: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Review session,
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Storage area network and System area network (SAN)
Low-Latency Virtual-Channel Routers for On-Chip Networks Robert Mullins, Andrew West, Simon Moore Presented by Sailesh Kumar.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
High Performance Embedded Computing © 2007 Elsevier Lecture 16: Interconnection Networks Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte.
Networks-on-Chip. Seminar contents  The Premises  Homogenous and Heterogeneous Systems- on-Chip and their interconnection networks  The Network-on-Chip.
1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.
NETWORK-ON-CHIP (NOC): A New SoC Paradigm
Communication issues for NOC By Farhadur Arifin. Objective: Future system of NOC will have strong requirment on reusability and communication performance.
Interconnect Networks
On-Chip Networks and Testing
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
R OUTE P ACKETS, N OT W IRES : O N -C HIP I NTERCONNECTION N ETWORKS Veronica Eyo Sharvari Joshi.
Switched network.
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
Dynamic Interconnect Lecture 5. COEN Multistage Network--Omega Network Motivation: simulate crossbar network but with fewer links Components: –N.
Author : Jing Lin, Xiaola Lin, Liang Tang Publish Journal of parallel and Distributed Computing MAKING-A-STOP: A NEW BUFFERLESS ROUTING ALGORITHM FOR ON-CHIP.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
TELE202 Lecture 5 Packet switching in WAN 1 Lecturer Dr Z. Huang Overview ¥Last Lectures »C programming »Source: ¥This Lecture »Packet switching in Wide.
ECE 4450:427/527 - Computer Networks Spring 2015 Dr. Nghi Tran Department of Electrical & Computer Engineering Lecture 2: Overview of Computer Network.
Network-on-Chip Introduction Axel Jantsch / Ingo Sander
Network on Chip - Architectures and Design Methodology Natt Thepayasuwan Rohit Pai.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
Non-Minimal Routing Strategy for Application-Specific Networks-on-Chips Hiroki Matsutani Michihiro Koibuchi Yutaka Yamada Jouraku Akiya Hideharu Amano.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
BZUPAGES.COM Presentation On SWITCHING TECHNIQUE Presented To; Sir Taimoor Presented By; Beenish Jahangir 07_04 Uzma Noreen 07_08 Tayyaba Jahangir 07_33.
Performance, Cost, and Energy Evaluation of Fat H-Tree: A Cost-Efficient Tree-Based On-Chip Network Hiroki Matsutani (Keio Univ, JAPAN) Michihiro Koibuchi.
Soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Virtual-Channel Flow Control William J. Dally
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix F)
Spring EE 437 Lillevik 437s06-l22 University of Portland School of Engineering Advanced Computer Architecture Lecture 22 Distributed computer Interconnection.
Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
1. Layered Architecture of Communication Networks: Circuit Switching & Packet Switching.
Network-on-Chip Paradigm Erman Doğan. OUTLINE SoC Communication Basics  Bus Architecture  Pros, Cons and Alternatives NoC  Why NoC?  Components 
OSI Model OSI MODEL. Communication Architecture Strategy for connecting host computers and other communicating equipment. Defines necessary elements for.
OSI Model OSI MODEL.
Advanced Computer Networks
Lecture 23: Interconnection Networks
Lecturer, Department of Computer Application
Azeddien M. Sllame, Amani Hasan Abdelkader
Israel Cidon, Ran Ginosar and Avinoam Kolodny
ECE 4450:427/527 - Computer Networks Spring 2017
Multiprocessor network topologies
Networks-on-Chip.
Low-Latency Virtual-Channel Routers for On-Chip Networks Robert Mullins, Andrew West, Simon Moore Presented by Sailesh Kumar.
OSI Model OSI MODEL.
CS 6290 Many-core & Interconnect
Presentation transcript:

Communication Synthesis: Buses and Network-on-Chip (NOC) Dr. Eng. Amr T. Abdel-Hamid ELECT 1002 Spring 2008 System-On-a-Chip Design

Dr. Amr Talaat ELECT1002 SoC Design The SoC nightmare The architecture is tightly coupled Source: Prof Jan Rabaey CS UC Berkeley DMACPUDSP Mem Ctrl. Bridge MPEG Ioo The “Board-on-a-Chip” Approach C System Bus Peripheral Bus

Dr. Amr Talaat ELECT1002 SoC Design Very long wires 1 ns (1 GHz)0.1 ns (10 GHz) A B A B Year 2005Year 2010

Dr. Amr Talaat ELECT1002 SoC Design Why NoC?  Global wire delays  increase exponentially or linearly by inserting repeaters  The delay may exceed one clock cycle after repeater insertion  In ultra-deep submicron processes, 80% or more of the delay of cr itical paths will be due to interconnections  Communication structures need to be designed first and then followed by fun ctional blocks

Dr. Amr Talaat ELECT1002 SoC Design Homogeneous SoC (MP-SoC) CPU MEM CPU MEM CPU MEM CPU MEM CPU MEM CPU MEM CPU MEM CPU MEM Interconnection network (BUS, XBAR)

Dr. Amr Talaat ELECT1002 SoC Design Why not bus?  Shared medium arbitrated bus, the most frequently used on-chip interconnect architectures  Pros  Simple, low area cost, and extensibility  Cons  The intrinsic parasitic resistance and capacitance can be quite high fo r a long bus line  Every additional IP block adds to parasitic capacitance and causes inc reased propagation delay  The number of IP blocks that can be connected by the bus is limited

Dr. Amr Talaat ELECT1002 SoC Design On-Chip Communication  Bus based interconnect  Low cost  Easier to Implement  Flexible  Networks on Chip  Layered Approach  Buses replaced with Networked arc hitectures  Better electrical properties  Higher bandwidth  Energy efficiency  Scalable Irregular architectures Regular Architectures Bus-based architectures

Dr. Amr Talaat ELECT1002 SoC Design Network on Chip Software Transport Network Wiring Separation of concerns Software Transport Network Wiring Data Link Layer  Communication-based Design  Orthogonalizes function and communication  Builds on well-known models-of-computation and correct-by-constru ction synthesis flow  Parallels layered approach exploited by communications community

Dr. Amr Talaat ELECT1002 SoC Design NoC What is Network-on-Chip (NoC)? Leveraging existing computer networking principles to improve inter- component intra-chip communications for SoC. Each on chip component connected by switch to a particular comm wire(s) Improvement over standard bus based interconnections for SoC architectures in terms of throughput

Dr. Amr Talaat ELECT1002 SoC Design SOC Current Trend  Explicitly parallel SoC architectures  Integrating huge amounts of Memory in chip designs  Distributed Shared Memory Environments  Should allow Interconnection centric design flow and better predictab ility  Physical design Closure  Wire delay dominates gate delay

Dr. Amr Talaat ELECT1002 SoC Design Design goal of NoC  High throughput  Low latency  Less energy consumption  Small area requirements  Network-on-Chip Basics:  Architectures  Routing Strategies  Evaluation

Dr. Amr Talaat ELECT1002 SoC Design Routing: Circuit/Packet Switching Circuit Switching Dedicated path, or circuit, is established over which data packets will travel Naturally lends itself to time-sensitive guaranteed service due to resource allocation Reservation of bandwidth decreases overall throughput and increases average delays Packet Switching Intermediate routers are now responsible for the routing of individual packets through the network, rather than following a single path Provides for so-called best-effort services

Dr. Amr Talaat ELECT1002 SoC Design Routing: Wormhole/Virtual Cut Through Wormhole Switching Message is divided up into smaller, fixed length flow units called flits Only first flit contains routing information, subsequent flits follow Buffer size is significantly reduced due to the limitation on the number of flits needed to be buffered at any given time Virtual Cut Through Switching Much like Wormhole switching Header flit can travel ahead and undergo processing while remaining flits are still navigating the network Higher acceptance rates and lower latencies than Wormhole

Dr. Amr Talaat ELECT1002 SoC Design Wormhole Switching

Dr. Amr Talaat ELECT1002 SoC Design Routing: Contention Contention occurs when routers or IP blocks attempt to send data over the same link at the same time For Circuit switching, contention is resolved at the time of actual connection setup For packet switching, contention resolution is handled at a much finer level, by the router buffering and scheduling individual packets of information Better overall performance for packet switched networks at the cost of lack of service guarantee

Dr. Amr Talaat ELECT1002 SoC Design Architectures: SPIN SPIN: Scalable, Programmable, Integrated Network Every level has same number switches Network grows like (NlogN)/8 Trades area overhead and decreased power efficiency for higher throughput Illustrative of performance vs. power consumption

Dr. Amr Talaat ELECT1002 SoC Design Architectures: CLICHE CLICHÉ: Chip-Level Integration of Communicating Heterogeneous Elements Two-dimensional mesh network layout for NoC design All switches are connected to the four closest other switches and target resource block, except those switches on the edge of the layout Connections are two unidirectional links

Dr. Amr Talaat ELECT1002 SoC Design Architectures: Torus Similar to mesh based architectures Wires are wrapped around from the top component to the bottom and rightmost to leftmost Smaller hop count Higher bandwidth Decreased Contention Increased chip space usage

Dr. Amr Talaat ELECT1002 SoC Design Architectures: Folded Torus Similar to Torus Torus, the long end-around connections can yield excessive delays Avoided by folding the torus

Dr. Amr Talaat ELECT1002 SoC Design Architectures: Octagon Standard model: 8 components, 12 interconnects Design complexity increases linearly with number of nodes Largest packet travel distance is two hops High throughput Shortest path routing easy to implement

Dr. Amr Talaat ELECT1002 SoC Design Architectures: BFT BFT: Butterfly Fat Tree Each node in tree model has coordinates (level, position) where level is depth and position is from left to right Leaves are component blocks Interior nodes are switches Four child ports per switch and two parent ports LogN levels, ith level has n/(2^i+1) switches, n = leaves (blocks) Use traffic aggregation to reduce congestion

Dr. Amr Talaat ELECT1002 SoC Design Network interface  Open Core Protocol (OCP)  An interface standard between IP cores and the interconnection f abric

Dr. Amr Talaat ELECT1002 SoC Design Packet Format Type: Head, Data, Tail and Complete VCID: Virtual Channel Identifier Route: ‘N’ bit route field with last 2 bits specifying the Route to be used in the next controller 00 - Left 01 - Right 10 - Straight 11 - Extract Data: Actual Data field

Dr. Amr Talaat ELECT1002 SoC Design Routing Example

Dr. Amr Talaat ELECT1002 SoC Design Simulation A simulator is used to investigate various metrics: Each system consists of 256 functional IP blocks Wormhole routing is used User can choose uniform and localized traffic Support both Poisson and self-similar message injection distributions A flit is only one word (36 bits, 4 bits are for packet framing).

Dr. Amr Talaat ELECT1002 SoC Design Area comparison  SPIN and Octagon have a considerably higher silicon area overhead.

Dr. Amr Talaat ELECT1002 SoC Design Projected performance