Technion – Israel Institute of Technology Qualcomm Corp. Research and Development, San Diego, California Leveraging Application-Level Requirements in the.

Slides:



Advertisements
Similar presentations
Best of Both Worlds: A Bus-Enhanced Network on-Chip (BENoC) Ran Manevich, Isask har (Zigi) Walter, Israel Cidon, and Avinoam Kolodny Technion – Israel.
Advertisements

A Novel 3D Layer-Multiplexed On-Chip Network
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
REAL-TIME COMMUNICATION ANALYSIS FOR NOCS WITH WORMHOLE SWITCHING Presented by Sina Gholamian, 1 09/11/2011.
1 Traffic Engineering (TE). 2 Network Congestion Causes of congestion –Lack of network resources –Uneven distribution of traffic caused by current dynamic.
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
Handling Global Traffic in Future CMP NoCs Ran Manevich, Israel Cidon, and Avinoam Kolodny. Group Research QNoC Electrical Engineering Department Technion.
Networks on Chip : a very quick introduction! Jeremy Chan 11 May 2005.
Module R R RRR R RRRRR RR R R R R Technion – Israel Institute of Technology The Era of Many-Module SoC: Revisiting the NoC Mapping Problem Isask’har (Zigi)
Reporter: Bo-Yi Shiu Date: 2011/05/27 Virtual Point-to-Point Connections for NoCs Mehdi Modarressi, Arash Tavakkol, and Hamid Sarbazi- Azad IEEE TRANSACTIONS.
Module R R RRR R RRRRR RR R R R R Efficient Link Capacity and QoS Design for Wormhole Network-on-Chip Zvika Guz, Isask ’ har Walter, Evgeny Bolotin, Israel.
Packet-Switched vs. Time-Multiplexed FPGA Overlay Networks Kapre et. al RC Reading Group – 3/29/2006 Presenter: Ilya Tabakh.
Multiple constraints QoS Routing Given: - a (real time) connection request with specified QoS requirements (e.g., Bdw, Delay, Jitter, packet loss, path.
1 Evgeny Bolotin – Efficient Routing, DATE 2007 Routing Table Minimization for Irregular Mesh NoCs Evgeny Bolotin, Israel Cidon, Ran Ginosar, Avinoam Kolodny.
ASWP – Ad-hoc Routing with Interference Consideration June 28, 2005.
Network based System on Chip Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
Network based System on Chip Students: Medvedev Alexey Shimon Ofir Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
MICRO-MODEM RELIABILITY SOLUTION FOR NOC COMMUNICATIONS Arkadiy Morgenshtein, Evgeny Bolotin, Israel Cidon, Avinoam Kolodny, Ran Ginosar Technion – Israel.
MINIMISING DYNAMIC POWER CONSUMPTION IN ON-CHIP NETWORKS Robert Mullins Computer Architecture Group Computer Laboratory University of Cambridge, UK.
NoC: Network OR Chip? Israel Cidon Technion. Israel Cidon, Technion Technion’s NoC Research: PIs  Israel Cidon (networking)  Ran Ginosar (VLSI)  Idit.
GeNoLator – Generic Network Simulator Final Presentation Students: Gal Ben-Haim, Dan Blechner Supervisor: Isask'har Walter Winter 08/09 18/08/2009.
1 Link Division Multiplexing (LDM) for NoC Links IEEE 2006 LDM Link Division Multiplexing Arkadiy Morgenshtein, Avinoam Kolodny, Ran Ginosar Technion –
Module R R RRR R RRRRR RR R R R R Quality of Service in Network on Chip Isask’har (Zigi) Walter Supervised by: Prof. Israel Cidon, Prof. Ran Ginosar and.
1 Evgeny Bolotin – ClubNet Nov 2003 Network on Chip (NoC) Evgeny Bolotin Supervisors: Israel Cidon, Ran Ginosar and Avinoam Kolodny ClubNet - November.
1 E. Bolotin – The Power of Priority, NoCs 2007 The Power of Priority : NoC based Distributed Cache Coherency Evgeny Bolotin, Zvika Guz, Israel Cidon,
1 1 Networks on Chips (NoC) – Keeping up with Rent’s Rule and Moore’s Law Avi Kolodny Technion – Israel Institute of Technology International Workshop.
1 Evgeny Bolotin – ICECS 2004 Automatic Hardware-Efficient SoC Integration by QoS Network on Chip Electrical Engineering Department, Technion, Haifa, Israel.
Architecture and Routing for NoC-based FPGA Israel Cidon* *joint work with Roman Gindin and Idit Keidar.
An Optimization Problem in Adaptive Virtual Environments Ananth I. Sundararaj Manan Sanghi Jack R. Lange Peter A. Dinda Prescience Lab Department of Computer.
Trace-Driven Optimization of Networks-on-Chip Configurations Andrew B. Kahng †‡ Bill Lin ‡ Kambiz Samadi ‡ Rohit Sunkam Ramanujam ‡ University of California,
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
Modeling and Evaluation of Fibre Channel Storage Area Networks Xavier Molero, Federico Silla, Vicente Santonia and Jose Duato.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
TitleEfficient Timing Channel Protection for On-Chip Networks Yao Wang and G. Edward Suh Cornell University.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
IEEE Globecom 2010 Tan Le Yong Liu Department of Electrical and Computer Engineering Polytechnic Institute of NYU Opportunistic Overlay Multicast in Wireless.
Improving Capacity and Flexibility of Wireless Mesh Networks by Interface Switching Yunxia Feng, Minglu Li and Min-You Wu Presented by: Yunxia Feng Dept.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
Network on Chip - Architectures and Design Methodology Natt Thepayasuwan Rohit Pai.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
50 th Annual Allerton Conference, 2012 On the Capacity of Bufferless Networks-on-Chip Alex Shpiner, Erez Kantor, Pu Li, Israel Cidon and Isaac Keslassy.
Towards Efficient Large-Scale VPN Monitoring and Diagnosis under Operational Constraints Yao Zhao, Zhaosheng Zhu, Yan Chen, Northwestern University Dan.
Run-time Adaptive on-chip Communication Scheme 林孟諭 Dept. of Electrical Engineering National Cheng Kung University Tainan, Taiwan, R.O.C.
Flow and Congestion Control for Reliable Multicast Communication In Wide-Area Networks Supratik Bhattacharyya Department of Computer Science University.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
Yu Cai Ken Mai Onur Mutlu
Dynamic Traffic Distribution among Hierarchy Levels in Hierarchical Networks-on-Chip Ran Manevich, Israel Cidon, and Avinoam Kolodny Group Research QNoC.
Module R R RRR R RRRRR RR R R R R Access Regulation to Hot-Modules in Wormhole NoCs Isask’har (Zigi) Walter Supervised by: Israel Cidon, Ran Ginosar and.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Bandwidth Balancing in Multi- Channel IEEE Wireless Mesh networks Claudio Cicconetti, Ian F. Akyildiz School of Electrical and Computer Engineering.
Heterogeneous Wireless Access in Large Mesh Networks Haiping Liu, Xin Liu, Chen-Nee Chuah, Prasant Mohapatra University of California, Davis IEEE MASS.
1 Lecture 22: Router Design Papers: Power-Driven Design of Router Microarchitectures in On-Chip Networks, MICRO’03, Princeton A Gracefully Degrading and.
Virtual-Channel Flow Control William J. Dally
Technion – Israel Institute of Technology Faculty of Electrical Engineering NOC Seminar Error Handling in Wormhole Networks Author: Amit Berman Mentor:
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
On-time Network On-Chip: Analysis and Architecture CS252 Project Presentation Dai Bui.
ProbeCast: MANET Admission Control via Probing Soon Y. Oh, Gustavo Marfia, and Mario Gerla Dept. of Computer Science, UCLA Los Angeles, CA 90095, USA {soonoh,
Network-on-Chip Paradigm Erman Doğan. OUTLINE SoC Communication Basics  Bus Architecture  Pros, Cons and Alternatives NoC  Why NoC?  Components 
Mohamed Abdelfattah Vaughn Betz
Modeling and Evaluation of Fibre Channel Storage Area Networks
How to Train your Dragonfly
NoC: Network OR Chip? Israel Cidon Technion.
Israel Cidon, Ran Ginosar and Avinoam Kolodny
EE 122: Lecture 7 Ion Stoica September 18, 2001.
An Optimization Problem in Adaptive Virtual Environments
Towards Predictable Datacenter Networks
Presentation transcript:

Technion – Israel Institute of Technology Qualcomm Corp. Research and Development, San Diego, California Leveraging Application-Level Requirements in the Design of a NoC for a 4G SoC – a Case Study Rudy Beraha, Isask’har (Zigi) Walter, Israel Cidon, Avinoam Kolodny March, 2010

Network on-Chip (NoC)  Introduction  Design Process NoC Design  A Case Study Outline 2

Why Network on-Chip? Buses scale badly  Power, area, performance  Testability, verification, timing closure, … Networks are replacing system buses Low area Low power Better scalability Higher parallelism Spatial reuse Unicast 3

Grid topology Packet-switched XY Routing Wormhole flow-control NoC Architecture Basics Module R R R R RR R R R RRRRR RRRRR RRRRR R Router Link 4

Module NoC Design Flow Map modules Allocate link capacities Evaluate QoS and cost R R RRR R RRRRR RR R R R RRR R RRRRR RR RRRR RR R RR R R R R inter-module traffic Synthesize+P&R 5

Module R R RRR R RRRRR RR R R R R NoC Design Flow Module R R RRR R RRRRR RR R R R R Map modules Allocate link capacities Evaluate QoS and cost inter-module traffic Synthesize+P&R Goal: Design a NoC for a 4G SoC Study design alternatives 6

Typical modeling  Latency and dynamic power proportional to distance  Dynamic power consumed by the NoC: Why is Mapping Important? 7 Cost of mapping π

Example PE1PE2 PE4PE5 PE3 PE Mapping π 1 Mapping π 2 8

Network on-Chip (NoC) NoC Design – a Case Study  Mapping  Link capacity allocation  Results Outline 9

Approached by Qualcomm R&D  Got a real, 4G Modem SoC design to analyze! Very few NoCs for real systems are described in the literature A Case Study… 10

Challenge: a Bus-Based 4G SoC Modules, ~100 flows 2 AXI buses Several modes of operation (Data, voice, data+voice, etc.)

Given:  Traffic pattern Optimize:  Mapping  Link capacities Synthesize+place&route Design Flow 12 Step A Step B

Traditional P2P traffic requirements Input Data – Traffic Pattern 13 Bandwidth demands [Mb/s]Point-to-point timing requirements [nSec] 'R' is for read operations, 'W' is for write operations

Minimize power subject to performance constraints Captures dynamic power and area (static power) Mapping Optimization - Goal 14 Static powerDynamic power

Scheme 1: Ignore timing requirements  Account for them in subsequent design phases Scheme 2: Use P2P timing requirements  Discard solutions that violate any requirement Scheme 3: Use application-level requirements Mapping Alternatives 15 New! LatencyDstSrc T1T1 CPUIO T2T2 DSPCPU T3T3 MEMDSP LatencyDstSrc T 1 + T 2 + T 3 MEMIO CPUMEMDSP

Assumption: latency  hop distance NP-hard  Use heuristic algorithm Simulated annealing Solving the Mapping Problem 16 Power optimized Power and point-to-point timing requirements Power and end-to-end timing requirements Scheme 1Scheme 2Scheme 3

Find minimal “NoC capacity” such that all timing requirements are met  Account for run-time effects finite router queues, backpressure mechanism, virtual channel multiplexing, network contention, etc.  Too much capacity: waste of resources  Too little capacity: insufficient performance Step 2: Setting Link Capacities 17

18 IP1 Interface IP2 Interface More difficult than off-chip networks Cannot set link capacity independently Link Capacity and Wormhole

Scheme 1: Uniform link capacity  Simulation based Scheme 2: Individually tuned, heuristic-based  Simulation based Capacity Allocation Alternatives 19 Result: 12 NoCs to compare (3 mappings)*(2 allocation schemes)*(2 VC configurations)

Network on-Chip (NoC) NoC Design – a Case Study  Mapping  Link capacity allocation  Results Outline 20

Using E2E requirements during the design process reduces the total capacity  Both for uniform and non-uniform link capacity allocation Results: Total NoC Capacity 21 Total Capacity Requirements [Gbps] Scheme 3 (Power+ETE Latency) Scheme 2 (Power+P2P Latency) Scheme 1 (Power only)

Synthesis Results 22 Up to 49% savings! Up to 40% savings! Scheme 1Scheme 2Scheme 3 Scheme 1Scheme 2Scheme 3 Total router areaTotal wiring area Mapping scheme 1: Ignore timing requirements during mapping Mapping scheme 2: map using P2P timing requirements Mapping scheme 3: map using application-level requirements

Evaluated the benefit of mapping using application-level requirements  Rather than P2P constraints Using two link capacity allocation schemes Real application  Meaningful savings To do  Analyze place&route results  Compare to a bus-based implementation Conclusions and Future Work 23

Thank you! Questions? Leveraging Application-Level Requirements in the Design of a NoC QNoC Research Group Group Research QNoC 24