Download presentation
Presentation is loading. Please wait.
Published byErika McDaniel Modified over 9 years ago
1
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick Barrow-Williams
2
Introduction Semiconductor industry has grown rapidly for several decades Continued shrinking of device dimension introduces new design challenges Moving data around a chip can now be the limiting factor of performance Existing solutions do not scale well
3
Why do existing solutions not scale? Die size has been growing consistently Global connections are longer Wire depth increased to counter width decrease Parasitic capacitive effects increase and cause slow signal propagation
4
Why do existing solutions not scale? Existing system-level connection uses buses Buses increase resource efficiency and decrease wiring congestion Not suitable for a large number of modules A network based alternative would offer higher aggregate bandwidth
5
Why design for FPGA systems? FPGA market growth sustained for several years FPGA silicon area already dominated by wiring Global wires are limited in number Increasing gate count only increases wiring congestion
6
The Solution: Network-on-Chip Use technologies from network systems Replace inefficient global wiring with high-level interconnection network Create scalable systems to handle large numbers of modules Use high metal layers to avoid parasitic effects
7
Existing Solutions Most existing systems are for ASIC designs Stanford Interconnect RAW SCALE SPIN PNoC: An solution for FPGAs Complex High hardware cost Other simulated solutions exist but few are implemented
8
Proposal: Two network systems Existing solutions use either packet switching or circuit switching techniques Design, implement, test and synthesise one of each to compare performance and hardware cost Map solutions to an FPGA platform to evaluate hardware cost in current generation systems
9
Network Architecture Design Topology Simple Scalable Low wiring requirements Solution: 2D mesh Topology
10
Network Architecture Design Routing Algorithm Deterministic Data always follows same path through network Simple hardware Sensitive to congestion Adaptive Paths through network can change according to load Complex hardware Avoids congestion
11
Network Architecture Design When choosing routing algorithms must avoid: Deadlock: Livelock Solution: Use unidirectional wiring and allow each node to make two connection Solution: Use deterministic routing
12
Network Architecture Design Flow control methods Circuit switched Circuit request propagates through network Path reserved to destination Grant signal propagates back Data sent then circuit deallocated Packet switched Use header, body and tail Wormhole routing Forward header and body without waiting for tail Need buffers to store stalled packets
13
Router Design Each router contains a number of modules FIFOs (only present in packet switched router) Address to port-request decoder Arbiter Control finite state machines Crossbar
14
Router Design: Address decoder Takes addresses from each five input ports Outputs the direction to route the packet Addresses In Port Requests Out Router Address Registers Logic
15
Router Design: Control FSMs Each FSM has multiplexed inputs and outputs Reduces the size of the FSM considerably Example here is from circuit switched router FSM Requests In Grant Out Grant In Requests Out In PortOut Port
16
Router Design: Crossbar Each crossbar can make two connections to avoid deadlock Pipelined design to increase router throughput Data In Data Out In Port x 2 Out Port x 2
17
Circuit Switched Router Structure Request In Request Out Grant In Grant Out Data In Data Out Data In In & Out Ports CrossbarCrossbar FSMFSM ArbiterArbiter Address to Port Decoder
18
Packet Switched Router Structure Request From FIFOs Request In Write Out Full In Grant Out Data From FIFOs Data Out Data From FIFOs In & Out Ports CrossbarCrossbar ControlControl ArbiterArbiter Address to Port Decoder FIFO FSM Data In Full Write Grant Req FIFO FSM Data In Full Write Grant Req 5 x Queue Data
19
Router Implementation and Testing Both routers were coded using VHDL Simulation and testing used a combination of ModelSim and Xilinx ISE 9.1 Ad-hoc tests used for individual modules VHDL testbench used for system verification
20
Testbench Structure Mesh Network Read Input Read Input Input Tables Test Table Source Output Table Sink Compare TESTBENCH Command File Output File Clock Gen Reset Gen Cycle Count Success: ID: 1 Source : (0,3) Dest : (1,0) Hops : 4 Latency: 34 Success: ID: 2 Source : (0,2) Dest : (1,0) Hops : 3 Latency: 27 Success: ID: 3 Source : (3,2) Dest : (1,1) Hops : 3 Latency: 22 Success: ID: 4 Source : (1,3) Dest : (0,1) Hops : 3 Latency: 22 Success: ID: 5 Source : (3,0) Dest : (3,1) Hops : 1 Latency: 12 #STARTSOURCEDESTSIZEID # ------------------------------------------------------ 2 3 0 0 1 8 1 3 2 0 0 1 2 2 3 2 3 1 1 2 3 4 3 1 1 0 8 4 5 0 3 1 3 7 5
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.