MultiNoC  What is it?  a programmable on-chip multiprocessing platform using a network-on-chip (NoC) as communication media  To whom is it addressed?

Slides:



Advertisements
Similar presentations
Multiple Processor Systems
Advertisements

Multiple Processor Systems
System Busses / Networks-on-Chip
I/O Organization popo.
Reporter :LYWang We propose a multimedia SoC platform with a crossbar on-chip bus which can reduce the bottleneck of on-chip communication.
Super computers Parallel Processing By: Lecturer \ Aisha Dawood.
Processor System Architecture
Chapter 4 Conventional Computer Hardware Architecture
Multiple Processor Systems
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
CSCI 8150 Advanced Computer Architecture Hwang, Chapter 1 Parallel Computer Models 1.2 Multiprocessors and Multicomputers.
ATLAS: The Network-on-Chip Design Exploration Flow
Multiple Processor Systems Chapter Multiprocessors 8.2 Multicomputers 8.3 Distributed systems.
Packet-Switched vs. Time-Multiplexed FPGA Overlay Networks Kapre et. al RC Reading Group – 3/29/2006 Presenter: Ilya Tabakh.
Multiprocessors ELEC 6200: Computer Architecture and Design Instructor : Agrawal Name: Nam.
Network based System on Chip Students: Medvedev Alexey Shimon Ofir Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
Asynchronous Pipelined Ring Interconnection for SoC Final Presentation One semester project, Spring 2005 Supervisor: Nitzan Miron Students: Ziv Zeev Shwaitser.
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
Configurable System-on-Chip: Xilinx EDK
Presenter: Jyun-Yan Li Multiprocessor System-on-Chip Profiling Architecture: Design and Implementation Po-Hui Chen, Chung-Ta King, Yuan-Ying Chang, Shau-Yin.
Configuration. Mirjana Stojanovic Process of loading bitstream of a design into the configuration memory. Bitstream is the transmission.
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Chapter 6 Memory and Programmable Logic Devices
- 1 - A Powerful Dual-mode IP core for a/b Wireless LANs.
Input/Output. Input/Output Problems Wide variety of peripherals —Delivering different amounts of data —At different speeds —In different formats All slower.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
A Compact and Efficient FPGA Implementation of DES Algorithm Saqib, N.A et al. In:International Conference on Reconfigurable Computing and FPGAs, Sept.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Synchronization and Communication in the T3E Multiprocessor.
SHAPES scalable Software Hardware Architecture Platform for Embedded Systems Hardware Architecture Atmel Roma, INFN Roma, ST Microelectronics Grenoble,
REXAPP Bilal Saqib. REXAPP  Radio EXperimentation And Prototyping Platform Based on NOC  REXAPP Compiler.
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
Computer Architecture Lecture10: Input/output devices Piotr Bilski.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
J. Christiansen, CERN - EP/MIC
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
RICE UNIVERSITY ‘Stream’-based wireless computing Sridhar Rajagopal Research group meeting December 17, 2002 The figures used in the slides are borrowed.
L11: Lower Power High Level Synthesis(2) 성균관대학교 조 준 동 교수
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
COARSE GRAINED RECONFIGURABLE ARCHITECTURE FOR VARIABLE BLOCK SIZE MOTION ESTIMATION 03/26/
Computer Architecture Memory, Math and Logic. Basic Building Blocks Seen: – Memory – Logic & Math.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
A Monte Carlo Simulation Accelerator using FPGA Devices Final Year project : LHW0304 Ng Kin Fung && Ng Kwok Tung Supervisor : Professor LEONG, Heng Wai.
Performance Analysis of a JPEG Encoder Mapped To a Virtual MPSoC-NoC Architecture Using TLM 林孟諭 Dept. of Electrical Engineering National Cheng Kung.
Development of Programmable Architecture for Base-Band Processing S. Leung, A. Postula, Univ. of Queensland, Australia A. Hemani, Royal Institute of Tech.,
Axel Jantsch 1 Networks on Chip Axel Jantsch 1 Shashi Kumar 1, Juha-Pekka Soininen 2, Martti Forsell 2, Mikael Millberg 1, Johnny Öberg 1, Kari Tiensurjä.
Improving NoC-based Testing Through Compression Schemes Érika Cota 1 Julien Dalmasso 2 Marie-Lise Flottes 2 Bruno Rouzeyre 2 WNOC
1 Presenter: Min Yu,Lo 2015/12/21 Kumar, S.; Jantsch, A.; Soininen, J.-P.; Forsell, M.; Millberg, M.; Oberg, J.; Tiensyrja, K.; Hemani, A. VLSI, 2002.
EECB 473 Data Network Architecture and Electronics Lecture 1 Conventional Computer Hardware Architecture
Team LDPC, SoC Lab. Graduate Institute of CSIE, NTU Implementing LDPC Decoding on Network-On-Chip T. Theocharides, G. Link, N. Vijaykrishnan, M. J. Irwin.
SCORES: A Scalable and Parametric Streams-Based Communication Architecture for Modular Reconfigurable Systems Abelardo Jara-Berrocal, Ann Gordon-Ross NSF.
Spring EE 437 Lillevik 437s06-l22 University of Portland School of Engineering Advanced Computer Architecture Lecture 22 Distributed computer Interconnection.
1 Basic Processor Architecture. 2 Building Blocks of Processor Systems CPU.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
Research Interests  NOCs – Networks-on-Chip  Embedded Real-Time Software  Real-Time Embedded Operating Systems (RTOS)  System Level Modeling and Synthesis.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Tree-Based Networks Cache Coherence Dr. Xiao Qin Auburn University
System on a Programmable Chip (System on a Reprogrammable Chip)
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
ESE532: System-on-a-Chip Architecture
Computer Architecture
Multiple Processor Systems
RECONFIGURABLE NETWORK ON CHIP ARCHITECTURE FOR AEROSPACE APPLICATIONS
CS 286 Computer Organization and Architecture
Presentation transcript:

MultiNoC  What is it?  a programmable on-chip multiprocessing platform using a network-on-chip (NoC) as communication media  To whom is it addressed?  industrial - applications requiring high bandwidth and/or parallel processing  academic - undergraduate/graduate advanced disciplines  Why use it?  multiprocessed system in small FPGA  demonstrates that parallel processing can be easily prototyped in low cost FPGAs  NoCs are more scalable, more performant than busses MultiNoC Architecture  NUMA architecture  each processor has its local memory and may access other processor (remote) memories  wait/notify instructions to synchronize IPs Occupied Slices: Number of 4 input LUTs: Number of Block RAMs: MultiNoC Synthesis Chip Constrained Floorplan  Application 1: Parallel edge detection  Host computer sends an image line  Each embedded processor computes one gradient (gx and gy)  One embedded processor adds gx and gy and notifies the host  Host receives processed line and sends a new line  Application 2: Parallel Sort  Bubble sort algorithm executed in parallel by both processors on each vector half  Merge sort algorithm executed by only one processor, generating the final result from the two sorted halves. MultiNoC Demo Application MultiNoC System Flow Diagram 2,325 out of 2,352 98% 3,699 out of 4,704 78% 12 out of 14 85% R8 Simulator SpartanII-E Application

A Multiprocessing System Enabled by a Network on Chip Xilinx Design Contest CONTACT Leandro Möller, Graduate Computer Science Student Aline Mello, Graduate Computer Science Student Everton Carara, Undergraduate Computer Science Student Fernando Moraes, Professor Ney Calazans, Professor {moller, alinev, carara, moraes, SOME HERMES REFERENCES Moraes, F.; Mello, A; Möller, L; Ost, L.; Calazans, N. A Low Area Overhead Packet-switched Network on Chip: Architecture and Prototyping. In: IFIP VLSI SOC 2003, pp Moraes, F.; Calazans, N.; Mello, A.; Möller, L.; Ost, L. HERMES: an Infrastructure for Low Area Overhead Packet-switching Networks on Chip. Integration, the VLSI Journal, IN PRESS, ADDRESS Faculdade de Informática - PUCRS Av. Ipiranga, Prédio PORTO ALEGRE - BRASIL Telefone: FAX: pucrs.br/~gaph Grupo de Apoio ao Projeto de Hardware Hardware Design Support Group NoC – Network on a Chip PUCRS + - Parallelism Network-on- Chip (NoC) Shared Bus Dedicated Wires Interconnection Structures Lower, shorter wires Higher, longer wires Lower, shorter wires Power Consumption Scalability + - Reusability HERMES IP  2 x 2 mesh network  wormhole packet switching  no global address map - NUMA  8 different message formats: read memory, write memory, active processor, printf, scanf, scanf return, printf return, notify R8 embedded processor IP  load-store 16-bit processor architecture  16x16 bit register file  36 distinct instructions  1K 16-bit words local memory for program and data Memory IP  1K 16-bit words, using 4 Block RAMs Serial IP  RS-232 protocol  provides bidirectional communication with a host processor MultiNoC IPs