Module R R RRR R RRRRR RR R R R R Efficient Link Capacity and QoS Design for Wormhole Network-on-Chip Zvika Guz, Isask ’ har Walter, Evgeny Bolotin, Israel.

Slides:



Advertisements
Similar presentations
Best of Both Worlds: A Bus-Enhanced Network on-Chip (BENoC) Ran Manevich, Isask har (Zigi) Walter, Israel Cidon, and Avinoam Kolodny Technion – Israel.
Advertisements

Interconnection Networks: Flow Control and Microarchitecture.
Prof. Natalie Enright Jerger
1 Routing Protocols I. 2 Routing Recall: There are two parts to routing IP packets: 1. How to pass a packet from an input interface to the output interface.
A Novel 3D Layer-Multiplexed On-Chip Network
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
REAL-TIME COMMUNICATION ANALYSIS FOR NOCS WITH WORMHOLE SWITCHING Presented by Sina Gholamian, 1 09/11/2011.
An Analytical Model for Worst-case Reorder Buffer Size of Multi-path Minimal Routing NoCs Gaoming Du 1, Miao Li 1, Zhonghai Lu 2, Minglun Gao 1, Chunhua.
1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
What is Flow Control ? Flow Control determines how a network resources, such as channel bandwidth, buffer capacity and control state are allocated to packet.
Technion – Israel Institute of Technology Qualcomm Corp. Research and Development, San Diego, California Leveraging Application-Level Requirements in the.
Module R R RRR R RRRRR RR R R R R Technion – Israel Institute of Technology The Era of Many-Module SoC: Revisiting the NoC Mapping Problem Isask’har (Zigi)
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter Final presentation part A Winter 2006.
1 Evgeny Bolotin – Efficient Routing, DATE 2007 Routing Table Minimization for Irregular Mesh NoCs Evgeny Bolotin, Israel Cidon, Ran Ginosar, Avinoam Kolodny.
MICRO-MODEM RELIABILITY SOLUTION FOR NOC COMMUNICATIONS Arkadiy Morgenshtein, Evgeny Bolotin, Israel Cidon, Avinoam Kolodny, Ran Ginosar Technion – Israel.
LOW-LEAKAGE REPEATERS FOR NETWORK-ON-CHIP INTERCONNECTS Arkadiy Morgenshtein, Israel Cidon, Avinoam Kolodny, Ran Ginosar Technion – Israel Institute of.
Network based System on Chip Part A Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
NoC: Network OR Chip? Israel Cidon Technion. Israel Cidon, Technion Technion’s NoC Research: PIs  Israel Cidon (networking)  Ran Ginosar (VLSI)  Idit.
GeNoLator – Generic Network Simulator Final Presentation Students: Gal Ben-Haim, Dan Blechner Supervisor: Isask'har Walter Winter 08/09 18/08/2009.
1 Link Division Multiplexing (LDM) for NoC Links IEEE 2006 LDM Link Division Multiplexing Arkadiy Morgenshtein, Avinoam Kolodny, Ran Ginosar Technion –
Adaptive Routing in (Q)NoC
Module R R RRR R RRRRR RR R R R R Quality of Service in Network on Chip Isask’har (Zigi) Walter Supervised by: Prof. Israel Cidon, Prof. Ran Ginosar and.
Network-on-Chip Examples System-on-Chip Group, CSE-IMM, DTU.
1 Evgeny Bolotin – ClubNet Nov 2003 Network on Chip (NoC) Evgeny Bolotin Supervisors: Israel Cidon, Ran Ginosar and Avinoam Kolodny ClubNet - November.
1 E. Bolotin – The Power of Priority, NoCs 2007 The Power of Priority : NoC based Distributed Cache Coherency Evgeny Bolotin, Zvika Guz, Israel Cidon,
1 1 Networks on Chips (NoC) – Keeping up with Rent’s Rule and Moore’s Law Avi Kolodny Technion – Israel Institute of Technology International Workshop.
1 Evgeny Bolotin – ICECS 2004 Automatic Hardware-Efficient SoC Integration by QoS Network on Chip Electrical Engineering Department, Technion, Haifa, Israel.
User-level Internet Path Diagnosis R. Mahajan, N. Spring, D. Wetherall and T. Anderson.
Architecture and Routing for NoC-based FPGA Israel Cidon* *joint work with Roman Gindin and Idit Keidar.
Issues in System-Level Direct Networks Jason D. Bakos.
Networks: Performance Measures1 Network Performance Measures.
1 Indirect Adaptive Routing on Large Scale Interconnection Networks Nan Jiang, William J. Dally Computer System Laboratory Stanford University John Kim.
QoS-Aware Path Protection in MPLS Networks Ashish Gupta Ashish Gupta Bijendra Jain Indian Institute of Technology Delhi Satish Tripathi University of California.
1 Near-Optimal Oblivious Routing for 3D-Mesh Networks ICCD 2008 Rohit Sunkam Ramanujam Bill Lin Electrical and Computer Engineering Department University.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Dragonfly Topology and Routing
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
High Performance Embedded Computing © 2007 Elsevier Lecture 16: Interconnection Networks Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte.
1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
Distributed Routing Algorithms. In a message passing distributed system, message passing is the only means of interprocessor communication. Unicast, Multicast,
Review of Networking Concepts Part 1: Switching Networks
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
June 21, 2007 Minimum Interference Channel Assignment in Multi-Radio Wireless Mesh Networks Anand Prabhu Subramanian, Himanshu Gupta.
Improving Capacity and Flexibility of Wireless Mesh Networks by Interface Switching Yunxia Feng, Minglu Li and Min-You Wu Presented by: Yunxia Feng Dept.
Author : Jing Lin, Xiaola Lin, Liang Tang Publish Journal of parallel and Distributed Computing MAKING-A-STOP: A NEW BUFFERLESS ROUTING ALGORITHM FOR ON-CHIP.
Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.
Network-on-Chip Introduction Axel Jantsch / Ingo Sander
Data and Computer Communications Chapter 11 – Asynchronous Transfer Mode.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
Worcester Polytechnic Insitute, Worcester, MA, USA1 Traffic Sensitive Active Queue Management for Improved Multimedia Streaming Authors: Vishal Phirke,
Non-Minimal Routing Strategy for Application-Specific Networks-on-Chips Hiroki Matsutani Michihiro Koibuchi Yutaka Yamada Jouraku Akiya Hideharu Amano.
Performance, Cost, and Energy Evaluation of Fat H-Tree: A Cost-Efficient Tree-Based On-Chip Network Hiroki Matsutani (Keio Univ, JAPAN) Michihiro Koibuchi.
Dynamic Traffic Distribution among Hierarchy Levels in Hierarchical Networks-on-Chip Ran Manevich, Israel Cidon, and Avinoam Kolodny Group Research QNoC.
Module R R RRR R RRRRR RR R R R R Access Regulation to Hot-Modules in Wormhole NoCs Isask’har (Zigi) Walter Supervised by: Israel Cidon, Ran Ginosar and.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Minimizing Delay in Shared Pipelines Ori Rottenstreich (Technion, Israel) Joint work with Isaac Keslassy (Technion, Israel) Yoram Revah, Aviran Kadosh.
Advanced Processor Group The School of Computer Science A Dynamic Link Allocation Router Wei Song, Doug Edwards Advanced Processor Group The University.
Virtual-Channel Flow Control William J. Dally
Technion – Israel Institute of Technology Faculty of Electrical Engineering NOC Seminar Error Handling in Wormhole Networks Author: Amit Berman Mentor:
Mohamed ABDELFATTAH Andrew BITAR Vaughn BETZ. 2 Module 1 Module 2 Module 3 Module 4 FPGAs are big! Design big systems High on-chip communication.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
WAN Technologies. 2 Large Spans and Wide Area Networks MAN networks: Have not been commercially successful.
On-time Network On-Chip: Analysis and Architecture CS252 Project Presentation Dai Bui.
Network-on-Chip Paradigm Erman Doğan. OUTLINE SoC Communication Basics  Bus Architecture  Pros, Cons and Alternatives NoC  Why NoC?  Components 
How to Train your Dragonfly
NoC: Network OR Chip? Israel Cidon Technion.
Israel Cidon, Ran Ginosar and Avinoam Kolodny
Virtual-Channel Flow Control
On-time Network On-chip
Presentation transcript:

Module R R RRR R RRRRR RR R R R R Efficient Link Capacity and QoS Design for Wormhole Network-on-Chip Zvika Guz, Isask ’ har Walter, Evgeny Bolotin, Israel Cidon, Ran Ginosar and Avinoam Kolodny Technion, Israel Institute of Technology

DATE ’ 06NoC Capacity Allocation2 Problem Essence How much capacity [bits/sec] should be assigned to each link? -All flows must meet delay requirements -Minimize total resources R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation3 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -Capacity allocation algorithm Design examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation4 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -capacity allocation algorithm Design examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation5 IP1 Interface IP2 Wormhole Switching Interface Suits on chip interconnect Small number of buffers Low latency Virtual Channels -interleaving packets on the same link

DATE ’ 06NoC Capacity Allocation6 Wormhole Switching Suits on chip interconnect Small number of buffers Low latency Virtual Channels -interleaving packets on the same link IP1 Interface IP3 IP2 Interface

DATE ’ 06NoC Capacity Allocation7 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -Capacity allocation algorithm Design examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation8 Module NoC Design Flow Define inter- module traffic Place modules Allocate link capacities Verify QoS and cost R R RRR R RRRRR RR R R R RRR R RRRRR RR RRRR RR R RR R R R R

DATE ’ 06NoC Capacity Allocation9 Module R R RRR R RRRRR RR R R R R NoC Design Flow Module R R RRR R RRRRR RR R R R R Define inter- module traffic Place modules Allocate link capacities Verify QoS and cost Too low capacity results in poor QoS Too high capacity wastes power/area Module R R RRR R RRRRR RR R R R R R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation10 Capacity Allocation Problem Simulation takes too long  a simulation based solution is not scalable If no simulations are used: -How to extract flows ’ delays? -How to reassign capacity? Our solution: -Analytical model to forecast QoS -Capacity allocation algorithm that exploit the model

DATE ’ 06NoC Capacity Allocation11 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -Capacity allocation algorithm Design examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation12 Delay Analysis s1s1 d2d2 s2s2 d1d1 R R RRR R RRRRR RR R R R R Approximate per-flow latencies Given: -Network topology -Link capacities -Communication demands

DATE ’ 06NoC Capacity Allocation13 Because they assume: -Symmetrical communication demands -No virtual channels -Identical link capacity! Generally, they calculate the delay of an “ average flow ” -A per-flow analysis is needed Why Previous Models Do Not Apply?

DATE ’ 06NoC Capacity Allocation14 IP1 Interface IP2 Interface Wormhole Delay Analysis The delivery resembles a pipeline pass Packet transmission can be divided into two separated phases: -Path acquisition -Packet delivery We focus on packet delivery phase

DATE ’ 06NoC Capacity Allocation15 IP1 Interface IP2 Interface Packet delivery time is dominated by the slowest link -Transmission rate -Link sharing Packet Delivery Time Low-capacity link

DATE ’ 06NoC Capacity Allocation16 IP1 Interface IP2 Packet Delivery Time Packet delivery time is dominated by the slowest link -Transmission rate -Link sharing IP3

DATE ’ 06NoC Capacity Allocation17 Analysis Basics Determines the flow ’ s effective bandwidth Per link Account for interleaving tt

DATE ’ 06NoC Capacity Allocation18 - mean time to deliver a flit of flow i over link j [sec] - capacity of link j [bits per sec] - flit length [bits/flit] - total flit injection rate of all flows sharing link j except for flow i [flits/sec] Single Hop Flow, no Sharing t

DATE ’ 06NoC Capacity Allocation19 - mean time to deliver a flit of flow i over link j [sec] - capacity of link j [bits per sec] - flit length [bits/flit] - total flit injection rate of all flows sharing link j except for flow i [flits/sec] Bandwidth used by other flows on link j Single Hop Flow, with Sharing tt

DATE ’ 06NoC Capacity Allocation20 The Convoy Effect Consider inter-link dependencies -Wormhole backpressure -Traffic jams down the road Link Load Account for all subsequent hops Basic delay weighted by distance

DATE ’ 06NoC Capacity Allocation21 Weakest link dominates packet delivery time Total Packet Transmission Time - mean packet latency for flow i [sec] Packet size [flits/packet] Account for weakest link = - mean packet latency for flow i [sec]

DATE ’ 06NoC Capacity Allocation22 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -Capacity allocation algorithm Design examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation23 Greedy, iterative algorithm Capacity Allocation Algorithm For each src-dst pair: Use delay model to identify most sensitive link Increase its capacity Repeat until delay requirements are met

DATE ’ 06NoC Capacity Allocation24 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -Capacity allocation algorithm Design examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 0625 Capacity Allocation – Example#1 Before optimization After optimization Total capacity reduced by 7% Uniform traffic with identical requirements Uniform allocation: 74.4Gbit/sec Capacity allocation algorithm: 69Gbit/sec

DATE ’ 0626 After optimization Before optimization Capacity Allocation – Example#2 A SoC-like system -Heterogeneous traffic demands and delay requirements Uniform allocation: 41.8Gbit/sec Total capacity reduced by 30% Capacity allocation algorithm: 28.7Gbit/sec

DATE ’ 06NoC Capacity Allocation27 Outline Wormhole based NoC The problem of link capacity allocation Solution: -Wormhole delay model -Capacity allocation algorithm Design Examples Summary Module R R RRR R RRRRR RR R R R R

DATE ’ 06NoC Capacity Allocation28 Summary SoCs need non uniform link capacities -Capacity allocation Wormhole delay analysis -Heterogeneous link capacities -Heterogeneous communication demands -Multiple VCs Greedy allocation algorithm Design examples -NoC cost considerably reduced

DATE ’ 06NoC Capacity Allocation29 Questions? QNoC Research Group Group Research QNoC

DATE ’ 06NoC Capacity Allocation30 Backup

DATE ’ 06NoC Capacity Allocation31 Grid topology Packet-switched Wormhole switching Fixed path XY routing Heterogeneous link capacities Quality-of-Service QNoC Architecture Module R R R R RR R R R RRRRR RRRRR RRRRR R Router Link E. Bolotin, I. Cidon, R. Ginosar, A. Kolodny, “ QoS Architecture and Design Process for Cost-Effective Network on Chip ”, Journal of Systems Architecture, 2004

DATE ’ 06NoC Capacity Allocation32 Analysis and Simulation vs. Load Normalized Delay Utilization Analytical model was validated using simulations -Different link capacities -Different communication demands Analysis Validation

DATE ’ 06NoC Capacity Allocation33 Slack Elimination Packet Delay Slack Slack [%] Flow