Multi-objective Topology Synthesis and FPGA Prototyping Framework of Application Specific Network-on-Chip m5141153 Akram Ben Ahmed Xinyu LI, Omar Hammami.

Slides:



Advertisements
Similar presentations
Evaluation of On-Chip Interconnect Architectures for Multi-Core DSP Students : Haim Assor, Horesh Ben Shitrit 2. Shared Bus 3. Fabric 4. Network on Chip.
Advertisements

3D Graphics Content Over OCP Martti Venell Sr. Verification Engineer Bitboys.
Presenter : Cheng-Ta Wu Kenichiro Anjo, Member, IEEE, Atsushi Okamura, and Masato Motomura IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 39,NO. 5, MAY 2004.
© 2004 Wayne Wolf Topics Task-level partitioning. Hardware/software partitioning.  Bus-based systems.
NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
Internal Logic Analyzer Final presentation-part B
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
MotoHawk Training Model-Based Design of Embedded Systems.
A NoC Generation and Evaluation Framework
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
Mid semester Presentation Data Packages Generator & Flow Management Data Packages Generator & Flow Management Data Packages Generator & Flow Management.
1 實驗五:媒介存取協定模擬 教師: 助教:. 2 Outline  Background  Transmission Protocols  ALOHA  CSMA/CD  CSMA/CA  Network Devices  Hub  Switch  Access Point (AP)
Slide 1 ITC 2005 Gunnar Carlsson 1, David Bäckström 2, Erik Larsson 2 2) Linköpings Universitet Department of Computer Science Sweden 1) Ericsson Radio.
NoC Modeling Networks-on-Chips seminar May, 2008 Anton Lavro.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
Network based System on Chip Final Presentation Part B Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
Asynchronous Pipelined Ring Interconnection for SoC Final Presentation One semester project, Spring 2005 Supervisor: Nitzan Miron Students: Ziv Zeev Shwaitser.
1 ENTS689L: Packet Processing and Switching Buffer-less Switch Fabric Architectures Buffer-less Switch Fabric Architectures Vahid Tabatabaee Fall 2006.
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
Efficient Monitoring of QoS Parameters (EMQP) Authors: Vadim Drabkin Arie Orlovsky Constantine Elster Instructors: Dr. Danny Raz Mr. Ran Wolff.
1 Link Division Multiplexing (LDM) for NoC Links IEEE 2006 LDM Link Division Multiplexing Arkadiy Morgenshtein, Avinoam Kolodny, Ran Ginosar Technion –
Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Project performed by: Naor Huri Idan Shmuel.
ARTIST2 Network of Excellence on Embedded Systems Design cluster meeting –Bologna, May 22 nd, 2006 System Modelling Infrastructure Activity leader : Jan.
Interface of DSP to Peripherals of PC Spring 2002 Supervisor: Broodney, Hen | Presenting: Yair Tshop Michael Behar בס " ד.
1 RAMP Infrastructure Krste Asanovic UC Berkeley RAMP Tutorial, ISCA/FCRC, San Diego June 10, 2007.
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
Receiver-driven Layered Multicast Paper by- Steven McCanne, Van Jacobson and Martin Vetterli – ACM SIGCOMM 1996 Presented By – Manoj Sivakumar.
1 The SpaceWire Internet Tunnel and the Advantages It Provides For Spacecraft Integration Stuart Mills, Steve Parkes Space Technology Centre University.
Redes Inalámbricas Máster Ingeniería de Computadores 2008/2009 Tema 7.- CASTADIVA PROJECT Performance Evaluation of a MANET architecture.
1 VERILOG Fundamentals Workshop סמסטר א ' תשע " ה מרצה : משה דורון הפקולטה להנדסה Workshop Objectives: Gain basic understanding of the essential concepts.
SystemC: A Complete Digital System Modeling Language: A Case Study Reni Rambus Inc.
REXAPP Bilal Saqib. REXAPP  Radio EXperimentation And Prototyping Platform Based on NOC  REXAPP Compiler.
Chapter 3 Parallel Algorithm Design. Outline Task/channel model Task/channel model Algorithm design methodology Algorithm design methodology Case studies.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
SHA-3 Candidate Evaluation 1. FPGA Benchmarking - Phase Round-2 SHA-3 Candidates implemented by 33 graduate students following the same design.
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
1 Fly – A Modifiable Hardware Compiler C. H. Ho 1, P.H.W. Leong 1, K.H. Tsoi 1, R. Ludewig 2, P. Zipf 2, A.G. Oritz 2 and M. Glesner 2 1 Department of.
Distributed computing using Projective Geometry: Decoding of Error correcting codes Nachiket Gajare, Hrishikesh Sharma and Prof. Sachin Patkar IIT Bombay.
- 1 - EE898_HW/SW Partitioning Hardware/software partitioning  Functionality to be implemented in software or in hardware? No need to consider special.
Simulation of the OLSRv2 Protocol First Report Presentation.
1 Optical Packet Switching Techniques Walter Picco MS Thesis Defense December 2001 Fabio Neri, Marco Ajmone Marsan Telecommunication Networks Group
Blue Brain Project Carlos Osuna, Carlos Aguado, Fabien Delalondre.
Axel Jantsch 1 NOCARC Network on Chip Architecture KTH, VTT Nokia, Ericsson, Spirea TEKES, Vinnova.
CAS 721 Course Project Implementing Branch and Bound, and Tabu search for combinatorial computing problem By Ho Fai Ko ( )
XStream: Rapid Generation of Custom Processors for ASIC Designs Binu Mathew * ASIC: Application Specific Integrated Circuit.
6. A PPLICATION MAPPING 6.3 HW/SW partitioning 6.4 Mapping to heterogeneous multi-processors 1 6. Application mapping (part 2)
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
Performance Analysis of a JPEG Encoder Mapped To a Virtual MPSoC-NoC Architecture Using TLM 林孟諭 Dept. of Electrical Engineering National Cheng Kung.
Modeling and Simulating Time- Sensitive Networking Harri Laine.
High Performance Embedded Computing © 2007 Elsevier Chapter 7, part 3: Hardware/Software Co-Design High Performance Embedded Computing Wayne Wolf.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:
Mini scope one semester project Project final Presentation Svetlana Gnatyshchak Lior Haiby Advisor: Moshe Porian Febuary 2014.
Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter.
ISCUG Keynote May 2008 Acknowledgements to the TI-Nokia ESL forum (held Jan 2007) and to James Aldis, TI and OSCI TLM WG Chair 1 SystemC: Untapped Value.
1 Chapter 6 Reformulation-Linearization Technique and Applications.
System on a Programmable Chip (System on a Reprogrammable Chip)
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
DAC50, Designer Track, 156-VB543 Parallel Design Methodology for Video Codec LSI with High-level Synthesis and FPGA-based Platform Kazuya YOKOHARI, Koyo.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Mohamed Abdelfattah Vaughn Betz
Prototyping SoC-based Gate Drive Logic for Power Convertors by Generating code from Simulink models. Researchers Rounak Siddaiah, Graduate Student-University.
Chapter 5: Inter-VLAN Routing
Presentation transcript:

Multi-objective Topology Synthesis and FPGA Prototyping Framework of Application Specific Network-on-Chip m Akram Ben Ahmed Xinyu LI, Omar Hammami

Outline Introduction Multi-objective NoC Synthesis Flow Evaluation results Summary Research Paper Reading22010/7/12

Introduction NoC synthesis problem targets to generate NoC topology for different system design objectives such as system performance and area. Multi-objective NoC synthesis is aimed to optimize the NoC topology and supply Pareto solutions for NoC area and performance. 06/22/2011Research Progress Seminar3

Introduction Existing works: – Some focused on single objective synthesis, targeting one parameter without taking care of the remaining ones. – Some were based on multiple objectives synthesis without emphasizing on simulation and execution on FPGA platform. 06/22/2011Research Progress Seminar4

Outline Introduction Multi-objective NoC Synthesis Flow Evaluation results Summary Research Paper Reading52010/7/12

Multi-objective NoC synthesis workflow: Basic notions The application, represented in the core graph, is divided into Master cores (PEs) and Slave cores (memory units). Routers connecting these cores may have different number of input and output ports.

Multi-objective NoC synthesis workflow: Basic notions Multi-objective NoC synthesis aims to decide the convenient number of routers needed besides the number of input and output ports for each router. A maximum number of input output ports is set for each router and should not be exceeded.

Multi-objective NoC synthesis workflow Contains the application communication graph

Multi-objective NoC synthesis workflow: Core graph Core graph example Slave cores: Memory units Master cores: Processing elements a, b, c, d, e, f denote the communication bandwidth

Multi-objective NoC synthesis workflow Generates one initial generation of possible NoC architectures according to parameters

Multi-objective NoC synthesis workflow According to the core graph, TLM traffic generator is generated, which represents the master core with the same throughput and traffic distribution and TLM memory model is used, which represents the slave cores.

Multi-objective NoC synthesis workflow: Traffic generation According to the core graph Master cores are the responsible for traffic generation and try to communicate with slave cores using OCP IP protocol.

Multi-objective NoC synthesis workflow: Traffic generation Traffic generator: execute a c code for traffic generation

Multi-objective NoC synthesis workflow: Traffic generation sends the traffic throughput and target address through its Master Fast Simplex Link (MFSL) bus interface.

Multi-objective NoC synthesis workflow: Traffic generation Performance onitoring unit: (throughput, average and maximum latency)

Multi-objective NoC synthesis workflow: Traffic generation sends back the monitoring information to the processor

Multi-objective NoC synthesis workflow: Traffic generation Information can be also sent to Host Pc if needed

Multi-objective NoC synthesis workflow The area of each router is obtained from the router library according to their number of input and output ports.

Multi-objective NoC synthesis workflow Verifies whether the solution obtained satisfies the constraints

Multi-objective NoC synthesis workflow: Fitness verification Performance parameters (Latency, throughput, execution time…) and area fitness are verified. The maximum number of in-out constraint is also verified.

Multi-objective NoC synthesis workflow: Fitness verification For each solution violating those constraints, a penalty is added, and then forcing the MOEA to find legal solutions. The MOEA generates new solutions from the new generation and selects the solution that fits the most to the user objectives.

Multi-objective NoC synthesis workflow Final and optimal Pareto solutions are selected depending on the user objectives

Multi-objective NoC synthesis workflow FPGA prototyping for emulation and validation

Outline Introduction Multi-objective NoC Synthesis Flow Evaluation results Summary Research Paper Reading242010/7/12

Evaluation results 06/22/2011Research Progress Seminar25 Input Core graph: 8 masters and 4 slaves

Evaluation results 06/22/2011Research Progress Seminar26 3 Pareto solutions are obtained with TLM simulation (latency and area utilization)

Evaluation results 06/22/2011Research Progress Seminar27 More routers  larger area+ lower latency

Evaluation results 06/22/2011Research Progress Seminar28 Less routers  smaller area+ higher latency

Evaluation results 06/22/2011Research Progress Seminar29 Comparison between TLM Simulation and FPGA Implementation

Evaluation results The latency results of TLM simulation and FPGA emulation are different. The ASIC and FPGA implementation technologies are different. But the relative relationship between the final Pareto solutions rests the same, which means that we can still use TLM simulation to accelerate NoC synthesis, but the final Pareto solution must be validated on RTL. 06/22/2011Research Progress Seminar30

Outline Introduction Multi-objective NoC Synthesis Flow Evaluation results Summary Research Paper Reading312010/7/12

Summary Multi-objective NoC synthesis based on MOEA technology and prototyping workflow is presented. It offers flexible decisions according to different design objectives and budgets for the designer. 06/22/2011Research Progress Seminar32