1 Architectural Analysis of a DSP Device, the Instruction Set and the Addressing Modes SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software.

Slides:



Advertisements
Similar presentations
Clare Smtih SHARC Presentation1 The SHARC Super Harvard Architecture Computer.
Advertisements

DSPs Vs General Purpose Microprocessors
Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
Intel Pentium 4 ENCM Jonathan Bienert Tyson Marchuk.
Jan 28, 2004Blackfin Compute Unit REV B A comparison of DSP Architectures BlackFin ADSP-BFXXX Compute Unit Based on a ENEL white paper prepared by.
Survey of Digital Signal Processors Michael Warner ECD: VLSI Communication Systems.
MICROPROCESSORS TWO TYPES OF MODELS ARE USED :  PROGRAMMER’S MODEL :- THIS MODEL SHOWS FEATURES, SUCH AS INTERNAL REGISTERS, ADDRESS,DATA & CONTROL BUSES.
Digital Signal Processors: fundamentals & system design Lecture 1
Real time DSP Professors: Eng. Julian S. Bruno Eng. Jerónimo F. Atencio Sr. Lucio Martinez Garbino.
Microprocessor or Microcontroller Not just a case of “you say tomarto and I say tomayto” M. Smith, ECE University of Calgary, Canada.
Overview of Popular DSP Architectures: TI, ADI, Motorola R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2003.
Processor System Architecture
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
May 2007Alon Slapak, Afeka college of 271 DSP: Digital Signal Processors Introduction EZ-KIT Fundamentals Architecture Developmen t flow Introduction At.
1 Analog Devices TigerSHARC® DSP Family Presented By: Mike Lee and Mike Demcoe Date: April 8 th, 2002.
1 2015/5/18 Digital Signal Processor /5/18 Analog to Digital Shift.
Blackfin ADSP Versus Sharc ADSP-21061
Analog Device Chipsets Featured In New Mobile Phones Eric Tong April 6, 2004.
TigerSHARC and Blackfin Different Applications. Introduction Quick overview of TigerSHARC Quick overview of Blackfin low power processor Case Study: Blackfin.
Muhammad Jahangir Ikram, March, Computer System Application: Design and Development Instructor: Muhammad Jahangir Ikram.
1-1 Microprocessor Engineering Microprocessor Systems Microcontrollers Infineon 16-bit Processor Family  specifically 167CS microcontroller C Programming.
Introduction.
1 SHARC ‘S’uper ‘H’arvard ‘ARC’hitecture Nagendra Doddapaneni ER hit HAR ect VARD ure SUP Arc.
Alyssa Concha Microprocessors Final Project ADSP – SHARC Digital Signal Processor.
PowerPC 601 Stephen Tam. To be tackled today Architecture Execution Units Fixed-Point (Integer) Unit Floating-Point Unit Branch Processing Unit Cache.
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
Ehsan Shams Saeed Sharifi Tehrani. What is DSP ? Digital Signal Processing (DSP) is used in a wide variety of applications, and it is hard to find a good.
DSP Development System
Embedded Systems Design
Real time DSP Professors: Eng. Julian Bruno Eng. Mariano Llamedo Soria.
Internal hardware and external components of a computer Three-box Model  Processor The brain of the system Executes programs A big finite state machine.
Using Analog Devices’ Blackfin for Embedded Processing Diana Franklin and John Seng.
PS - 87C51Mx2 - SLS-1 Philips Semiconductors 87C51Mx2 Microcontroller.
0 Blackfin BF52x / Low Power. 1 Performance MHz Performance MHz Power mW BF MHz 132 KB RAM HDMA BF MHz 132 KB RAM USB BF MHz 132 KB.
DSP Lecture Series DSP Memory Architecture Dr. E.W. Hu Nov. 28, 2000.
DSP Processors We have seen that the Multiply and Accumulate (MAC) operation is very prevalent in DSP computation computation of energy MA filters AR filters.
Developing Power-Aware Strategies for the Blackfin Processor Steven VanderSanden Giuseppe Olivadoti David Kaeli Richard Gentile Northeastern University.
Microprocessor-based Systems
ATtiny23131 A SEMINAR ON AVR MICROCONTROLLER ATtiny2313.
Overview of Super-Harvard Architecture (SHARC) Daniel GlickDaniel Glick – May 15, 2002 for V (Dewar)
Chapter 1 Introduction. Dr. Naim Dahnoun, Bristol University, (c) Texas Instruments 2002 Chapter 1, Slide 2 Learning Objectives  Why process signals.
SHARC DSPs SHARC is a DSP architecture designed and fabricated by Analog Devices, Inc. As of Spring 2005, they manufacture and sell 22 DSPs based on the.
Microprocessor Microprocessor (cont..) It is a 16 bit μp has a 20 bit address bus can access upto 220 memory locations ( 1 MB). It can support.
TI MSP430 MICROCONTROLLERS BY ADITYA PATHAK. THE MSP FAMILY Ultra-low power; mixed signal processors Widely used in battery operated applications Uses.
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
VLSI Algorithmic Design Automation Lab. THE TI OMAP PLATFORM APPROACH TO SOC.
IBM/Motorola/Apple PowerPC
Digital Signal Processors (DSPs). DSP Advanced signal processor circuits MAC (Multiply and Accumulate) unit (s) - provides fast multiplication of two.
AT91 Products Overview. 2 The Atmel AT91 Series of microcontrollers are based upon the powerful ARM7TDMI processor. Atmel has taken these cores, added.
DIGITAL SIGNAL PROCESSORS. What are Digital Signals? Digital signals have finite precision in both the time (sampled) and amplitude (quantized) domains.
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
Fundamentals of Programming Languages-II
MODULE 5 INTEL TODAY WE ARE GOING TO DISCUSS ABOUT, FEATURES OF 8086 LOGICAL PIN DIAGRAM INTERNAL ARCHITECTURE REGISTERS AND FLAGS OPERATING MODES.
1 Basic Processor Architecture. 2 Building Blocks of Processor Systems CPU.
2D-Graphic Accelerator
Seminar On 8085 microprocessor
Microprocessor or Microcontroller
Embedded Systems Design
Introduction.
Digital Signal Processor
Digital Signal Processors
Introduction to Microprocessors and Microcontrollers
The TMS320C6x Family of DSPs
Introduction to Digital Signal Processors (DSPs)
Chapter 1 Introduction.
Digital Signal Processors-1
Introduction to 5685x Series
ADSP 21065L.
Presentation transcript:

1 Architectural Analysis of a DSP Device, the Instruction Set and the Addressing Modes SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic

2 Outline FIR filter on ADPS-21x DSP Requirements Fast Multiply-Accumulates (Data-path) Extended Precision Accumulator Register (Data-path) Dual Operand Fetch (Memory) Circular Buffering (Addressing) Zero-Overhead Looping (Instruction set) Analog Devices Architectures and Programming SHARC Blackfin Performance Optimization

3 ADSP -21x Copied from [Kester03]

4 CALCULATING OUTPUTS OF 4-TAP FIR FILTER USING A CIRCULAR BUFFER y(3) = h(0) x(3) + h(1) x(2) + h(2) x(1) + h(3) x(0) y(4) = h(0) x(4) + h(1) x(3) + h(2) x(2) + h(3) x(1) y(5) = h(0) x(5) + h(1) x(4) + h(2) x(3) + h(3) x(2) Memory Location Read x(0) x(1) x(2) x(3) Write x(4) Read x(4) x(1) x(2) x(3) Write x(5) Read x(4) x(5) x(2) x(3) Copied from [Kester03]

5 FIR filter steps 1. Obtain a sample with the ADC; generate an interrupt 2. Detect and manage the interrupt 3. Move the sample into the input signal's circular buffer 4. Update the pointer for the input signal's circular buffer 5. Zero the accumulator 6. Control the loop through each of the coefficients 7. Fetch the coefficient from the coefficient's circular buffer 8. Update the pointer for the coefficient's circular buffer 9. Fetch the sample from the input signal's circular buffer 10. Update the pointer for the input signal's circular buffer 11. Multiply the coefficient by the sample 12. Add the product to the accumulator 13. Move the output sample (accumulator) to a holding buffer 14. Move the output sample from the holding buffer to the DAC Copied from [Kester03]

6 FIR filter steps (cont.) ADSP21xx Example code: CNTR = N-1; DO convolution UNTIL CE; convolution: MR = MR + MX0 * MY0(SS), MX0 = DM(I0,M1), MY0 = PM(I4,M5); Single Cycle Instruction Copied from [Kester03]

7 Outline FIR filter on ADPS-21x DSP Requirements Fast Multiply-Accumulates (Data-path) Extended Precision Accumulator Register (Data-path) Dual Operand Fetch (Memory) Circular Buffering (Addressing) Zero-Overhead Looping (Instruction set) Analog Devices Architectures and Programming SHARC Blackfin Performance Optimization

8 Copied from [Takala05]

9

10 Motorola DSP5600X Copied from [Takala05]

11 Copied from [Takala05]

12 Copied from [Takala05]

13 ADSP -21x MAC

14 Copied from [Takala05]

15 SHARC Architecture ADSP-2106X Copied from [Takala05]

16 Outline FIR filter on ADPS-21x DSP Requirements Fast Multiply-Accumulates (Data-path) Extended Precision Accumulator Register (Data-path) Dual Operand Fetch (Memory) Circular Buffering (Addressing) Zero-Overhead Looping (Instruction set) Analog Devices Architectures and Programming SHARC Blackfin Performance Optimization

17 Copied from [Takala05]

18 Copied from [Takala05]

19 Copied from [Takala05]

20 Outline FIR filter on ADPS-21x DSP Requirements Fast Multiply-Accumulates (Data-path) Extended Precision Accumulator Register (Data-path) Dual Operand Fetch (Memory) Circular Buffering (Addressing) Zero-Overhead Looping (Instruction set) Analog Devices Architectures and Programming SHARC Blackfin Performance Optimization

21 Copied from [Takala05]

22 Copied from [Takala05]

23 Hardware loops Software loop: MOVE #16,BInitialize loop counter B LOOP: MAC (R0)+,(R4)+,ARegister-indirect addressing with post-increment DEC B JNELOOP Hardware loops: no time is spent on –Decrementing counters –Checking to see if the loop is finished –Branching back to the top of the loop RPT #16 MAC (R0)+,(R4)+,A [Lapsley97]

24 Copied from [Kester03]

25 Upto 3000MMACS Image compression Digital Still/Video Camera MMOIP Telematics Biometrics Upto 160MMACS Wired Voice Wireless Voice VOIP/VON Industrial Control ADSP-218x/9x Power Efficient $5 - $10 ADSP-218x/9x Power Efficient $5 - $10 Upto 4800MMACS (16-bit) or 1200MMACS (32-bit) 2.5G/3G Infrastructure Medical Imaging Industrial Imaging Multiprocessing TigerSHARCHigh-Performance $35 - $200 TigerSHARCHigh-Performance Performance Blackfin Media Enabled $5 - $30 Blackfin Media Enabled $5 - $30 ADI General Purpose DSP Product Families Upto 600MMACS (32-bit) Audio Infotainment Industrial SHARCLow-Cost Floating Point $10 - $100 SHARCLow-Cost Floating Point $10 - $100

26 Outline FIR filter on ADPS-21x DSP Requirements Fast Multiply-Accumulates (Data-path) Extended Precision Accumulator Register (Data-path) Dual Operand Fetch (Memory) Circular Buffering (Addressing) Zero-Overhead Looping (Instruction set) Analog Devices Architectures and Programming SHARC Blackfin Performance Optimization

27 SHARC Architecture Copied from [Smith97]

28 SHARC Architecture - Features SHARCThe Super Harvard ARChitecture 100MHz Core / 300 MFLOPS Peak Parallel Operation of: Multiplier, ALU, 2 Address Generators & Sequencer –No Arithmetic Pipeline; All Computations Are Single-Cycle High Precision and Extended Dynamic Range –32/40-Bit IEEE Floating-Point Math –32-Bit Fixed-Point MAC’s with 64-Bit Product & 80-Bit Accumulation Single-Cycle Transfers with Dual-Ported Memory Structures –Supported by Cache Memory and Enhanced HarvardArchitecture Glueless Multiprocessing Features JTAG Test and Emulation Port DMA Controller, Serial Ports, Link Ports, External Bus, SDRAM Controller, Timers

29 ADSP-2106x Core Architecture

30 Example- Dot product C code Copied from [Smith97]

31 Example- Dot product - Assembly Copied from [Smith97]

32 Example- Dot product - Assembly Copied from [Smith97]

33 C or Assembly How complicated is the program? Are you pushing the maximum speed of the DSP? How many programmers will be working together? Which is more important, product cost or development cost? What is your background? What does the DSP's manufacturer suggest you use? Copied from [Smith97]

34 Outline FIR filter on ADPS-21x DSP Requirements Fast Multiply-Accumulates (Data-path) Extended Precision Accumulator Register (Data-path) Dual Operand Fetch (Memory) Circular Buffering (Addressing) Zero-Overhead Looping (Instruction set) Analog Devices Architectures and Programming SHARC Blackfin Performance Optimization

35 B LACK fin Processor Core Two 16-bit Multipliers Two 40-bit ALUs, Four 8-bit Video ALUs Barrel Shifter Sixteen 16-bit /Eight 32-bit Math Registers Two DAGs, byte addressing Eight 32-bit pointer registers Four Sets of 32-bit Index, Modify, Length, Base 16-bit Instructions, 32-bit Instructions Multi-Issue, 64-bit Instructions Interlocked Pipeline Micro Signal Architecture, developed with Intel

36 ADSP-BF535 BLACKfin Processor Architecture Great Performance Value Highest Frequency (350 MHz) 1.0V to 1.6V 260 PBGA High System Integration Address range 768Mbytes SPORTs support 8 Channels of I2S Audio (532Mbps) I/O Bandwidth, DMA Bandwidth & Memory Bandwidth Microcontroller features include WDT, PCI, USB1.1 SDRAM controller To 350 MHz BLACKfin Processor Core SDRAM FLASH/SRAM Interfaces Real Time Clock Watchdog JTAG System Peripherals 308 Kbytes On-Chip SRAM DMA SPI 2 UART 2 Timers 3 (32bit) GPIO 16 User Peripherals Dynamic Power Management SPORTs 2 PCI Memory PLL 264Kbytes On-Chip SRAM 48 Kbytes On-Chip Cache USB 1.1

37 Seminars about Blackfin

38 Seminars about Blackfin

39 Seminars about Blackfin

40 Seminars about Blackfin

41 Seminars about Blackfin

42 Seminars about Blackfin

43 Seminars about Blackfin

44 Seminars about Blackfin

45 Seminars about Blackfin

46 Seminars about Blackfin

47 Seminars about Blackfin

48 Seminars about Blackfin

49 Seminars about Blackfin

50 Seminars about Blackfin