MACS: A Minimal Adaptive Routing Circuit Switched Architecture for Scalable and Parametric NoCs Rohit Kumar Dr. Ann Gordon-Ross Introduction MACS: A.

Slides:



Advertisements
Similar presentations
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
Advertisements

2009 Midyear Workshop F4-09: Virtual Architecture and Design Automation for Partial Reconfiguration All Hands Meeting November 10th, 2009 Dr. Ann Gordon-Ross.
1 Agenda TMA2 Feedback TMA3 T821 Bock 2. 2 Packet Switching.
Chapter 9 Local Area Network Technology
OpenFlow-Based Server Load Balancing GoneWild
What is Flow Control ? Flow Control determines how a network resources, such as channel bandwidth, buffer capacity and control state are allocated to packet.
1/14 Ad Hoc Networking, Eli M. Gafni and Dimitri P. Bertsekas Distributed Algorithm for Generating Loop-free Routes in Networks With Frequently.
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter Final presentation part A Winter 2006.
High Performance Router Architectures for Network- based Computing By Dr. Timothy Mark Pinkston University of South California Computer Engineering Division.
1 Lecture 12: Interconnection Networks Topics: dimension/arity, routing, deadlock, flow control.
Network based System on Chip Final Presentation Part B Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
Switching When we have a lot of devices that want to communicate, how do we set up connections between them?
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control Final exam reminders:  Plan well – attempt every question.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control.
MAC Layer Protocols for Sensor Networks Leonardo Leiria Fernandes.
Layer-3 Routing Natawut Nupairoj, Ph.D. Department of Computer Engineering Chulalongkorn University.
Switching, routing, and flow control in interconnection networks.
1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.
On-Chip Networks and Testing
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
A Distributed Scheduling Algorithm for Real-time (D-SAR) Industrial Wireless Sensor and Actuator Networks By Kiana Karimpour.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
Improving Capacity and Flexibility of Wireless Mesh Networks by Interface Switching Yunxia Feng, Minglu Li and Min-You Wu Presented by: Yunxia Feng Dept.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
Circuit & Packet Switching. ► Two ways of achieving the same goal. ► The transfer of data across networks. ► Both methods have advantages and disadvantages.
George Michelogiannakis William J. Dally Stanford University Router Designs for Elastic- Buffer On-Chip Networks.
Chi-Cheng Lin, Winona State University CS 313 Introduction to Computer Networking & Telecommunication Chapter 5 Network Layer.
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
1 Module 4: Implementing OSPF. 2 Lessons OSPF OSPF Areas and Hierarchical Routing OSPF Operation OSPF Routing Tables Designing an OSPF Network.
Chapter 12 - Packet Switching Introduction Sharing Saves Money –Arranging for multiple devices to share a single transmission path lowers cost because.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
1 Lecture 15: Interconnection Routing Topics: deadlock, flow control.
A Configurable High-Throughput Linear Sorter System Jorge Ortiz Information and Telecommunication Technology Center 2335 Irving Hill Road Lawrence, KS.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
BZUPAGES.COM Presentation On SWITCHING TECHNIQUE Presented To; Sir Taimoor Presented By; Beenish Jahangir 07_04 Uzma Noreen 07_08 Tayyaba Jahangir 07_33.
Run-time Adaptive on-chip Communication Scheme 林孟諭 Dept. of Electrical Engineering National Cheng Kung University Tainan, Taiwan, R.O.C.
A Protocol for Packet Network Intercommunication Wei Zhang
SCORES: A Scalable and Parametric Streams-Based Communication Architecture for Modular Reconfigurable Systems Abelardo Jara-Berrocal, Ann Gordon-Ross NSF.
1 Switching and Forwarding Sections Connecting More Than Two Hosts Multi-access link: Ethernet, wireless –Single physical link, shared by multiple.
Effective bandwidth with link pipelining Pipeline the flight and transmission of packets over the links Overlap the sending overhead with the transport.
Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter.
1587: COMMUNICATION SYSTEMS 1 Wide Area Networks Dr. George Loukas University of Greenwich,
Power-aware NOC Reuse on the Testing of Core-based Systems* CSCE 932 Class Presentation by Xinwang Zhang April 26, 2007 * Erika Cota, et al., International.
1. Layered Architecture of Communication Networks: Circuit Switching & Packet Switching.
1 Lecture 22: Interconnection Networks Topics: Routing, deadlock, flow control, virtual channels.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Communication Costs in Parallel Machines Dr. Xiao Qin Auburn University
Mohamed Abdelfattah Vaughn Betz
Modeling and Evaluation of Fibre Channel Storage Area Networks
Overview Parallel Processing Pipelining
The Underlying Technologies
Advanced Computer Networks
Packet Switching Datagram Approach Virtual Circuit Approach
Switching and High-Speed Networks
LOAD SENSITIVE CONTENTION RESOLUTION MAC-PROTOCOL
IOS Network Model 2nd semester
SWITCHING Switched Network Circuit-Switched Network Datagram Networks
Switching, routing, and flow control in interconnection networks
Using Packet Information for Efficient Communication in NoCs
High Throughput Route Selection in Multi-Rate Ad Hoc Wireless Networks
Embedded Computer Architecture 5SAI0 Interconnection Networks
Lecture: Interconnection Networks
CS 6290 Many-core & Interconnect
Lecture 25: Interconnection Networks
Switching, routing, and flow control in interconnection networks
Circuit Switched Network
Chapter 1 Introduction Networking Architecture Overview.
Presentation transcript:

MACS: A Minimal Adaptive Routing Circuit Switched Architecture for Scalable and Parametric NoCs Rohit Kumar Dr. Ann Gordon-Ross Introduction MACS: A stream-based communication architecture for scalable NoCs Motivations and ideas Reduce data transfer latency between HW modules Avoid involving central processing unit e.g. MicroBlaze Quick channel formation Establish communication channel along shortest path between modules Reduce resource bottlenecks in communication channel formation Provide alternate communication paths Select best communication path Identify route cost metric for each path Intelligently select path that reduces contention and best distributes network load Overview Switches connected in mesh topology Each switch contains six ports with similar interfaces Four ports connect to four neighboring switches Two ports connect to two modules Each port contains lanes Communication channels are established on per lane basis Number of lanes per port is an architectural parameter (Ku, Kd etc.) Data width (W) per lane is constant across ports but is parameterized Routing control logic blocks at each port ensures correct functionality Features Impacts Mesh topology Suitable for FPGAs Minimal adaptive routing Choices of alternate paths Circuit switched architecture No packetization overhead, high Speed Highly parametric Application dependent sizing Innovative path resolution Contention free path, network load distribution Two modules per switch Instant ultra low latency channel formation for paired modules InSIF ExSIF down_data_in down_data_out down_req_in down_req_out down_gnt_in down_dny_in down_ful_in down_gnt_out down_dny_out down_ful_out r_loc_data_in Krl*w r_loc_data_out r_loc_req_in r_loc_req_out r_loc_gnt_in r_loc_dny_in r_loc_ful_in r_loc_gnt_out r_loc_dny_out r_loc_ful_out right_data_in right_data_out right_req_in right_req_out right_gnt_in right_dny_in right_ful_in right_gnt_out right_dny_out right_ful_out up_data_in up_data_out up_req_in up_req_out up_gnt_in up_dny_in up_ful_in up_gnt_out up_dny_out up_ful_out kr*w left_data_in kl*w left_data_out left_req_in left_req_out left_gnt_in left_dny_in left_ful_in left_gnt_out left_dny_out left_ful_out l_loc_data_in l_loc_data_out l_loc_req_in l_loc_req_out l_loc_gnt_in l_loc_dny_in l_loc_ful_in l_loc_gnt_out l_loc_dny_out l_loc_ful_out Routing Control Logic S N Routing Control Logic Blocks kl kr ku*w ku kd*w kd Krl Kll*w Kll Left Switch Port Right Switch Port Up Switch Port Right Local Port Left Local Port Down Switch Port Figure 1: A 3x3 MACS NoC. A single switch is enlarged to show all ports, lanes, architectural parameters associated with each port and routing control logic blocks Channel routing algorithm Implementation results Channel phases Channel is a unique communication path between two modules along one or more switches' lanes Four phases of channel life cycle Request service phase: All possible path(s) are determined according to minimal adaptive routing algorithm and request is forwarded to those path(s) Grant/deny phase: Unique path is determined according to route availability and/or route cost Data transfer phase Resource release phase: Switch lanes are released from channel Path resolution based on route cost In case of two grants, port having lower number of engaged lanes contains lower route cost path Further, if both grant ports has same number of channels, port with lower PID has lower route cost path Try route in all shortest path directions (at most 2: ‘C0’, ‘C1’) Reserve resources for OKs Forward request Wait for all grants/denies Port in ‘C0’ has lower no. of busy lanes, or if no. of lanes are same, port in ‘C1’ has lower PID Send grant of ‘C0’ to requesting switch. Release resources for ‘C1’ if any Data transfer Release all Send deny to requesting switch Idle One or both available Request All denies No available route Grant on ‘C0’, deny on other, if any All grants if both ‘C0’ and ‘C1’ are resent No more data Figure 3: Area usage in number of slices per module for data widths W = 8, 16, and 32 bits for a varying number of lanes per switch and local port. The x-axis in each graph varies the Kl, Kr, Kd, and Ku parameters from 1 to 3 lanes per switch port. Left to right, the graphs vary the Kll and Krl parameters from 1 to 3 lanes per local port. Figure 2: State diagram for a channel establishment phase actions and transitions. Figure 4: Maximum operating frequency for data widths W = 8, 16, and 32 bits for a varying number of lanes per switch and local port. The x-axis in each graph varies the Kl, Kr, Kd, and Ku parameters from 1 to 3 lanes per switch port. From left to right, the graphs vary the Kll and Krl parameters from 1 to 3 lanes per local port.