Processor Architectures and Program Mapping 5kk10 TU/e 2006 Henk Corporaal Jef van Meerbergen Bart Mesman.

Slides:



Advertisements
Similar presentations
Lab 2 – DSP software architecture and the real life DSP characteristics of signals that make it necessary.
Advertisements

DSPs Vs General Purpose Microprocessors
Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
Instruction-Level Parallel Processors {Objective: executing two or more instructions in parallel} 4.1 Evolution and overview of ILP-processors 4.2 Dependencies.
Modern CPU Architectures Design Principles and Assessments First Meeting
Final Class, ECE472 Midterm #2 due today – 1-5% extra credit for written report of Dally’s video Oral presentation of class project: today Graduate students:
Platform-based Design 5KK70 TU/e 2009 Henk Corporaal Bart Mesman.
Embedded Computer Architecture 5KK73 TU/e Henk Corporaal
Extensible Processors. 2 ASIP Gain performance by:  Specialized hardware for the whole application (ASIC). −  Almost no flexibility. −High cost.  Use.
Spring 2008, Jan. 14 ELEC / Lecture 2 1 ELEC / Computer Architecture and Design Spring 2007 Introduction Vishwani D. Agrawal.
Processor Architectures and Program Mapping TU/e 5kk10 Henk Corporaal Jef van Meerbergen Bart Mesman Exploiting DLP SIMD architectures.
Instruction Level Parallelism (ILP) Colin Stevens.
Platforms, ASIPs and LISATek Federico Angiolini DEIS Università di Bologna.
Define Embedded Systems Small (?) Application Specific Computer Systems.
Microprocessors Introduction to ia64 Architecture Jan 31st, 2002 General Principles.
Processor Design 5Z032 Henk Corporaal Eindhoven University of Technology 2011.
6/25/2015Platform Design H.Corporaal and B. Mesman1 Platform Design TU/e 5kk70 Henk Corporaal Bart Mesman ASIP Application Specific Instruction-set Processor.
Advanced Computer Architecture 5MD00 / 5Z033 Overview Henk Corporaal TUEindhoven 2009.
Embedded Systems in Silicon TD5102 Henk Corporaal Technical University Eindhoven DTI / NUS Singapore.
The Effect of Data-Reuse Transformations on Multimedia Applications for Different Processing Platforms N. Vassiliadis, A. Chormoviti, N. Kavvadias, S.
Reconfigurable Computing in the Undergraduate Curriculum Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina.
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
Winter-Spring 2001Codesign of Embedded Systems1 Introduction to HW/SW Codesign Part of HW/SW Codesign of Embedded Systems Course (CE )
Platform Design TU/e 5kk70 Henk Corporaal Bart Mesman ASIP Application Specific Instruction-set Processor.
Processor Architectures and Program Mapping Application domain specific processors (ADSP or ASIP) 5kk10 TU/e Henk Corporaal Jef van Meerbergen Bart Mesman.
Platform-based Design TU/e 5kk70 Henk Corporaal Bart Mesman ILP compilation (part b)
COM181 Computer Hardware Ian McCrumRoom 5B18,
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
L29:Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수,
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
DOP - A CPU CORE FOR TEACHING BASICS OF COMPUTER ARCHITECTURE Miloš Bečvář, Alois Pluháček and Jiří Daněček Department of Computer Science and Engineering.
Basics and Architectures
Hy-C A Compiler Retargetable for 2014 and beyond Philip Sweany 4/29/2014.
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
Fall 2015, Aug 17 ELEC / Lecture 1 1 ELEC / Computer Architecture and Design Fall 2015 Introduction Vishwani D. Agrawal.
Paper Review: XiSystem - A Reconfigurable Processor and System
CASTNESS11, Rome Italy © 2011 Target Compiler Technologies L 1 Ideas for the design of an ASIP for LQCD Target Compiler Technologies CASTNESS’11, Rome,
1 Advance Computer Architecture CSE 8383 Ranya Alawadhi.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
VTU – IISc Workshop Compiler, Architecture and HPC Research in Heterogeneous Multi-Core Era R. Govindarajan CSA & SERC, IISc
R2D2 team R2D2 team Reconfigurable and Retargetable Digital Devices  Application domains Mobile telecommunications  WCDMA/UMTS (Wideband Code Division.
Advanced Computer Architecture 5MD00 / 5Z033 Overview Henk Corporaal TUEindhoven 2007.
Embedded Computer Architecture ASIP Application Specific Instruction-set Processor 5KK73 Bart Mesman and Henk Corporaal.
Compilers for Embedded Systems Ram, Vasanth, and VJ Instructor : Dr. Edwin Sha Synthesis and Optimization of High-Performance Systems.
Hy-C A Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors Philip Sweany 8/30/2013.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
6. A PPLICATION MAPPING 6.3 HW/SW partitioning 6.4 Mapping to heterogeneous multi-processors 1 6. Application mapping (part 2)
The TM3270 Media-Processor. Introduction Design objective – exploit the high level of parallelism available. GPPs with Multi-media extensions (Ex: Intel’s.
Advanced Computer Architecture 5MD00 Overview Henk Corporaal TUEindhoven 2014.
Hy-C A Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors Philip Sweany 8/27/2010.
Multiprocessor SoC integration Method: A Case Study on Nexperia, Li Bin, Mengtian Rong Presented by Pei-Wei Li.
Spring 2016, Jan 13 ELEC / Lecture 1 1 ELEC / Computer Architecture and Design Spring 2016 Introduction Vishwani D. Agrawal.
Embedded Computer Architecture
CPS 258 Announcements –Lecture calendar with slides –Pointers to related material.
Winter-Spring 2001Codesign of Embedded Systems1 Essential Issues in Codesign: Architectures Part of HW/SW Codesign of Embedded Systems Course (CE )
1 of 14 Lab 2: Design-Space Exploration with MPARM.
Lecture 1: Introduction CprE 585 Advanced Computer Architecture, Fall 2004 Zhao Zhang.
Compiler Research How I spent my last 22 summer vacations Philip Sweany.
Computer Architecture Organization and Architecture
Evaluating Register File Size
Advanced Computer Architecture 5MD00 / 5Z033 Overview
Platform-based Design
Embedded Computer Architecture 5SIA0 Overview
A High Performance SoC: PkunityTM
Processor Architectures and Program Mapping
Embedded Computer Architecture 5SAI0 Wrap-Up, we are almost there...
Advanced Computer Architecture 5MD00 / 5Z033 Overview
Research: Past, Present and Future
ELEC / Computer Architecture and Design Fall 2014 Introduction
Presentation transcript:

Processor Architectures and Program Mapping 5kk10 TU/e 2006 Henk Corporaal Jef van Meerbergen Bart Mesman

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman2 Embedded Systems Courses We go through all the design steps of a complete multi- processor embedded system –(containing hardware and software) Discuss many design trade-offs 4 connected courses: –Designing Embedded Systems on Silicon: 5kk00 –Processor Architectures and Program Mapping: 5kk10 –Multiprocessors: 5kk20 –Design of an Embedded System: 5kk53 (lab course)

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman3 Processor Architectures and Program Mapping Objectives: Study the processing components of future multi- processor platforms, ranging from –highly flexible processors, to –highly computational-efficient processors Learn how to map applications to these components Learn how to exploit the (data) memory hierarchy

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman4 flexibility efficiency DS P Programmable CPU Programmable DSP Application specific instruction set processor (ASIP) Application specific processor Processor design spectrum

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman5 low medium high high medium low flexibility efficiency ASIC GP proc FPGA DSP ASIP Processor design spectrum

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman6 i386SX i486DX P microsparc Super sparc Ultra sparc P6 604e 21164a Turbosparc 604e DTV Query by humming Computational efficiency [MOPS/W] Feature size [  m] [Roza] Intrinsic computational efficiency ICE of silicon

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman7 Schedule for Processor Architectures and Program Mapping Lect 1, Nov 30: Bart Mesman Programmable CPU cores + hands-on 1 Lect 2, Dec 7: Bart Mesman Programmable DSP cores Lect 3, Dec 14: Henk Corporaal ILP Architectures Lect 4, Dec 21: Henk Corporaal VLIW compilers

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman8 Schedule for Processor Architectures and Program Mapping Lect 5, Jan 11: Bart Mesman ASIPs + hands-on 3 Lect 6, Feb 8: Bart Mesman SIMD + NoC Lect 7, Feb 15: Jef van Meerbergen Real processors and Platforms Lect 8, Feb 22: Jef van Meerbergen / Guest WSN and system level design aspects / trade-offs Lect 9,10, Mar 1,8: Henk Corporaal Data management (DTSE) + hands-on 3

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman9 Topic 1: Programmable CPU cores Exploiting PLP = pipeline parallelism = clock frequency Identify bottlenecks in CPU architectures for media applications (performance and power) Good for application development and prototyping Different criteria for stand-alone vs. embedded CPUs Introduction of the applications Hands-on

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman10 Topic 2: DSP cores Evolution from CPU to DSP architectures from a DSP application point of view: exploit ILP Compilation is complicated as a result of DSP architectural features

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman11 Topic 3: ASIPs Synthesis approach: design your own processor!! Tuning the instruction-set and architecture by adding application-specific computational units, registers, and busses Computational efficiency while maintaining flexibility Compiler should be flexible!: retargetable compilation

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman12 Topic 4: VLIW processing Exploiting ILP = Instruction/Operation level parallelism Architectures Code generation Limits of VLIW Disadvantage: code size

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman13 Topic 5: SIMD processing Exploiting Data level parallelism Advantage: code size Architectures Code generation Communication issues RC-SIMD and D-SIMD architectures

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman14 Topic 6: Real processors and Platforms

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman15 Topic 7: WSN and system level design aspects / trade-offs

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman16 Topic 8 details: Data management How to effectively exploit a data memory hierarchy for low energy and high performance Methodology: –Rewrite your program such that –data locality is exploited –find interesting 'copy candidates' –assign 'copy candidates' to local memories –schedule accesses to memories –exploit data life-time to overlap data structures in memory –use loop transformations

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman17 Hands-on We consider the mapping of 3 applications: –16-tap FIR filter (MIPS) –TTA –YUV2RGB conversion (DTSE) Map application to 1. Programmable RISC core 2. VLIW 3. Data memory hierarchy

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman18 Hands-on 1: RISC core Programmable RISC cores –MIPS assembly code –use of SPIM simulator

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman19 Hands-on 2: VLIW TTA –Application not yet decided

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman20 Hands-on 3: Use of Memory Hierarchy Data management –Code transformations –Exploiting the memory hierarchy for low power

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman21 Exam Oral exam –Course theory 40%, –Hands-on 60% MIPS assembly hands-on (only for exam admission) TTA DTSE hands-on Material: all slides, Jef’s homepage->education- >5p520 (embedded multimedia systems)

6/22/2015 Processor Architectures and Program Mapping H. Corporaal, J. van Meerbergen, and B. Mesman22 Other Website (under construction): Location: EH 2.19 Time: Thursday – hour