1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.

Slides:



Advertisements
Similar presentations
Best of Both Worlds: A Bus-Enhanced Network on-Chip (BENoC) Ran Manevich, Isask har (Zigi) Walter, Israel Cidon, and Avinoam Kolodny Technion – Israel.
Advertisements

SE-292 High Performance Computing
Layer 3 Switching. Routers vs Layer 3 Switches Both forward on the basis of IP addresses But Layer 3 switches are faster and cheaper However, Layer 3.
Week 1- Fall 2009 Dr. Kimberly E. Newman University of Colorado.
Chris Madill Molecular Structure and Function, Hospital for Sick Children Department of Biochemistry, University of Toronto Supervised by Dr. Paul Chow.
Performed by: Moshe Emmer, Harar Meir Instructor: Alkalay Daniel Cooperated with: AE faculty המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory.
Spring 2008 Network On Chip Platform Instructor: Yaniv Ben-Itzhak Students: Ofir Shimon Guy Assedou.
Multiprocessors ELEC 6200: Computer Architecture and Design Instructor : Agrawal Name: Nam.
Spring 08, Jan 15 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Fast Communication for Multi – Core SOPC Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab.
Proxy Cache Engine Performed by:Artyom Borzin Stas Lapchev Stas Lapchev Instructor: Hen Broodney In cooperation with Magnifier Ltd. הטכניון - מכון טכנולוגי.
Performed by : Rivka Cohen and Sharon Solomon Instructor : Walter Isaschar המעבדה למערכות ספרתיות מהירות High Speed Digital Systems Laboratory הטכניון.
1 Evgeny Bolotin – ClubNet Nov 2003 Network on Chip (NoC) Evgeny Bolotin Supervisors: Israel Cidon, Ran Ginosar and Avinoam Kolodny ClubNet - November.
Modern trends in computer architecture and semiconductor scaling are leading towards the design of chips with more and more processor cores. Highly concurrent.
Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Project performed by: Naor Huri Idan Shmuel.
1 Evgeny Bolotin – ICECS 2004 Automatic Hardware-Efficient SoC Integration by QoS Network on Chip Electrical Engineering Department, Technion, Haifa, Israel.
1 Fast Communication for Multi – Core SOPC Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab.
Device Driver for Generic ASC Module - Project Presentation - By: Yigal Korman Erez Fuchs Instructor: Evgeny Fiksman Sponsored by: High Speed Digital Systems.
Interface of DSP to Peripherals of PC Spring 2002 Supervisor: Broodney, Hen | Presenting: Yair Tshop Michael Behar בס " ד.
ECE 526 – Network Processing Systems Design
Measuring Network Performance of Multi-Core Multi-Cluster (MCMCA) Norhazlina Hamid Supervisor: R J Walters and G B Wills PUBLIC.
המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון - מכון טכנולוגי לישראל הפקולטה להנדסת חשמל Technion - Israel institute of technology.
Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Written by: Haim Natan Benny Pano Supervisor:
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
NPACI: National Partnership for Advanced Computational Infrastructure August 17-21, 1998 NPACI Parallel Computing Institute 1 Cluster Archtectures and.
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
NETWORKING CONCEPTS. Data Communication Communication is for sharing information Sharing can be local or remote Local communication between individuals.
Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Spring 2009.
LECTURE 9 CT1303 LAN. LAN DEVICES Network: Nodes: Service units: PC Interface processing Modules: it doesn’t generate data, but just it process it and.
Multi Core Processor Submitted by: Lizolen Pradhan
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
Repeaters and Hubs Repeaters: simplest type of connectivity devices that regenerate a digital signal Operate in Physical layer Cannot improve or correct.
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
There are Physical and logical network layout. Physical : Topology of a network refers to the configuration of cables, computers, and other peripherals.
Chapter 2 Parallel Architecture. Moore’s Law The number of transistors on a chip doubles every years. – Has been valid for over 40 years – Can’t.
Course Wrap-Up Miodrag Bolic CEG4136. What was covered Interconnection network topologies and performance Shared-memory architectures Message passing.
High Performance Embedded Computing © 2007 Elsevier Chapter 1, part 2: Embedded Computing High Performance Embedded Computing Wayne Wolf.
Hybrid Prototyping of MPSoCs Samar Abdi Electrical and Computer Engineering Concordia University Montreal, Canada
A Profiler for a Multi-Core Multi-FPGA System by Daniel Nunes Supervisor: Professor Paul Chow September 30 th, 2008 University of Toronto Electrical and.
80-Tile Teraflop Network-On- Chip 1. Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance.
Outline  Over view  Design  Performance  Advantages and disadvantages  Examples  Conclusion  Bibliography.
Multiprossesors Systems.. What are Distributed Databases ? “ A Logically interrelated collection of shared data ( and a description of this data) physically.
1 Introduction CEG 4131 Computer Architecture III Miodrag Bolic.
The Cosmic Cube Charles L. Seitz Presented By: Jason D. Robey 2 APR 03.
Performed by: Guy Assedou Ofir Shimon Instructor: Yaniv Ben-Yitzhak המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון - מכון.
Performance Analysis of a JPEG Encoder Mapped To a Virtual MPSoC-NoC Architecture Using TLM 林孟諭 Dept. of Electrical Engineering National Cheng Kung.
Network On Chip Platform
SCHOOL OF ELECTRICAL AND COMPUTER ENGINEERING | SCHOOL OF COMPUTER SCIENCE | GEORGIA INSTITUTE OF TECHNOLOGY MANIFOLD Manifold Execution Model and System.
Axel Jantsch 1 Networks on Chip Axel Jantsch 1 Shashi Kumar 1, Juha-Pekka Soininen 2, Martti Forsell 2, Mikael Millberg 1, Johnny Öberg 1, Kari Tiensurjä.
NET 324 D Networks and Communication Department Lec1 : Network Devices.
Rehab AlFallaj.  Network:  Nodes: Service units: PC Interface processing Modules: it doesn’t generate data, but just it process it and do specific task.
SYSTEM ADMINISTRATION Chapter 2 The OSI Model. The OSI Model was designed by the International Standards Organization (ISO) as a structural framework.
Multi-objective Topology Synthesis and FPGA Prototyping Framework of Application Specific Network-on-Chip m Akram Ben Ahmed Xinyu LI, Omar Hammami.
1 Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Instructor: Evgeny Fiksman Students: Meir.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
1 Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Instructor: Evgeny Fiksman Students: Meir.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
Spring EE 437 Lillevik 437s06-l22 University of Portland School of Engineering Advanced Computer Architecture Lecture 22 Distributed computer Interconnection.
1 of 14 Lab 2: Formal verification with UPPAAL. 2 of 14 2 The gossiping persons There are n persons. All have one secret to tell, which is not known to.
A Low-Area Interconnect Architecture for Chip Multiprocessors Zhiyi Yu and Bevan Baas VLSI Computation Lab ECE Department, UC Davis.
Constructing a system with multiple computers or processors 1 ITCS 4/5145 Parallel Programming, UNC-Charlotte, B. Wilkinson. Jan 13, 2016.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
1 Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Instructor: Evgeny Fiksman Students: Meir.
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
Fault-Tolerant NoC-based Manycore system: Reconfiguration & Scheduling
Israel Cidon, Ran Ginosar and Avinoam Kolodny
Chapter 1 Introduction.
Hybrid Programming with OpenMP and MPI
Types of Parallel Computers
Presentation transcript:

1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Performed by: Moshe Bino Alex Tikh Supervisor: Evgeny Fiksman Spring 2008

2 Single processor vs. multi-core Single core is reaching its performance limits:  Dynamic power consumption rises linearly with freq’  Heat dissipation ‘Power wall’ solution is keeping the frequency, while rising the number of transistors. Natural solution is Parallel computing Multi-Core efficiency depends on fast comm` between cores and network topology.

3 Fast communication Major inter-communication techniques:  Shared memory  Hardware memory synchronously accessed by multiple processors to provide inter-communication through data sharing.  Remote procedure calls  Inter-processors communication technology that allows one processor to cause a subroutine or a procedure to execute in another processor’s address space.  Message passing interface (MPI)  Programmable interface for advanced data passing

4 Network topologies - pros & cons Point to point (PTP) High speed x Bad scalability Bus Simple to implement x Low throughput Star Easily expendable network x Performance & scalability depend on hub capabilities Mesh - NoC Easy to expend the system efficiently x Difficult to troubleshoot

5 Chosen topology Mesh topology NoC Routing nodes Leaf processor’s cores MPI logically defines clusters Comm - cluster Rank - member Cores amount is limited only by chip resources NoC is the best choice for network topology

6 System architecture Router node Project examines FPGA chips category Main core connected to I/O Multi - clock domain

7 Logic design – Router node Time limited Round Robin arbiter. Port to Port & broadcasting Modular design Two main units: 1.Permission Unit 2.Port FSM

8 Software Layers Application MPI functions interface Network hardware independent implementation Data relies on message structure Physical designed for FSL bus Design modularity in hardware and software

9 Software Synchronization Parallelism: Several messages traverse in the system simultaneously Keep network clean: Processor forced to receive incoming message Ease network load: Maintenance: False/error messages dismissed locally by software Synchronization: Each processor DB synchronized locally by software

10 System development – Project workflow Hardware Test and Simulate Software Test and Simulate Hardware – Simulation environment Message generator Software – Development environment Timing

11 System verification & integration Hardware Test and Simulate Software Test and Simulate Software & Hardware Debug & Synth & PAR Send messages with different lengths Measure time - statistics Scalable application to measure network efficiency/ performance Optimization Performance App.

12 Summary Multi core solution for single core problem Energy consumption Heat dissipation NoC is the best choice for network topology Scalability - Area Performance - frequency Design modularity in hardware and software Easy to expend the system efficiently NoC implementation adjusted for FPGA platform Minimal lines and logic units Synchronous system Complete HW & SW solution for Multi-core Comm. system

13 References [1] I.Cidon & I.Keidar: Zooming in on Network on Chip Architectures. [2] E.Bolotin: NoC clubnet presentation.