Hardware/Software Co-Design

Slides:



Advertisements
Similar presentations
Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures Pree Thiengburanathum Advanced computer architecture Oct 24,
Advertisements

Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
History Copyright2 Research History 10 years 1989 FASTCHART Single Processor FASTHARD Single Processor 1996.
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Computer Hardware The Processing Unit.
PC hardware and x86 programming Lec 2 Jinyang Li.
CENTRAL PROCESSING UNIT. CPU Does the actual processing in the computer. A single chip called a microprocessor. Composed of an arithmetic and logic unit.
FPGA-Based System Design: Chapter 7 Copyright  2004 Prentice Hall PTR Topics n Hardware/software co-design.
What is a Microprocessor ? A microprocessor consists of an ALU to perform arithmetic and logic manipulations, registers, and a control unit Its has some.
Compiler Research How I spent my last 22 summer vacations Philip Sweany.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
CPU Central Processing Unit
A. Aloisio, R. Giordano Univ. of Naples ‘Federico II’
M. Bellato INFN Padova and U. Marconi INFN Bologna
The White Rabbit MCH Javier Serrano on behalf of Tomasz Włostowski
FPGAs for next gen DAQ and Computing systems at CERN
Chapter 10: Computer systems (1)
Hee-Seok Kim, Izzat El Hajj, John Stratton,
Immediate Addressing Mode
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
ECE354 Embedded Systems Introduction C Andras Moritz.
Introduction to parallel programming
Chilimbi, et al. (2014) Microsoft Research
Multi-core processors
System On Chip.
Central Processing Unit- CPU
Parallel Computing Lecture
Chapter 4: Multithreaded Programming
Lecture on Microcomputer
Assembly Language for Intel-Based Computers, 5th Edition
Multi-core processors
Chapter 3 Top Level View of Computer Function and Interconnection
Chapter III Desktop Imaging Systems & Issues
Microcomputer Architecture
CALICE/EUDET Electronics in 2007
CPU Central Processing Unit
CSCI206 - Computer Organization & Programming
Some challenges in heterogeneous multi-core systems
Hui Chen, Shinan Wang and Weisong Shi Wayne State University
Lecture 2: Performance Today’s topics: Technology wrap-up
Challenges Implementing Complex Systems with FPGA Components
Intro to Architecture & Organization
Hook was a famous year for what…..?
CPU and Embedded Systems
Overview Why VLSI? Moore’s Law. Why FPGAs?
Paper discussed in class: H. Zhang, V. Prabhu, V. George, M. Wan, M
ECEG-3202 Computer Architecture and Organization
Coe818 Advanced Computer Architecture
Bus-Based Computer Systems
Mid Term review CSC345.
The longest delay in reading a file sector is:
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
The Little Man Computer
Thread Synchronization
Chapter 1 Introduction.
On-Chip Communication Architectures
CSE 153 Design of Operating Systems Winter 19
Learning Objectives To be able to describe the purpose of the CPU
Chip&Core Architecture
Physical Implementation
William Stallings Computer Organization and Architecture 7th Edition
Computer Architecture
Research: Past, Present and Future
CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators
Chapter 3: Process Management
Optimal Co-design of FPGA Implementations for MPC
Resource Replication 6 Integer Units 4 FP units 8 Sets of architectural registers Renaming registers (Int/FP) HW Context (PC, Return Stack.
Overview Why VLSI? Moore’s Law. Why FPGAs?
We are working on developing “cheap” RTD monitoring channels for the CMS Phase-2 requirements. The channels will provide the complete readout for 4-wire.
Presentation transcript:

Hardware/Software Co-Design Nick Wieder

Heterogeneous Architecture Why to Use? Decreased total operation time Energy / Power vs. Cost Why Not to use? Other operation causes bottleneck Cost / Improvement

In general don’t execute instructions Accelerators In general don’t execute instructions May be ASICs or FPGAs Can be on Chip

Accelerators Design PC-Based Custom PCB Single Chip Commercial Board plugged into a PC-Buss Normally used for development Custom PCB More up front cost / Lower power Single Chip Commercial Includes FPGA and CPU on Single Chip Custom IC

CPU+ Accelerator Communicate using shared memory Requires Synchronization --Communicate using Shared memory for large amounts of data --Require Synchronization mechanisms

Single Vs Multi Thread Total Processing Time Single Multi

Performance Analysis Single Threaded Multi Threaded Performance may be increased Multi Threaded Performance Should be increased More calls to Accelerator = Greater increase Should be increased depending on delay to xfer data between ACC and CPU

Questions ? None = Good!