DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –600 036

Slides:

Advertisements

Similar presentations

RAM (cont.) 220 bytes of RAM (1 Mega-byte) 20 bits of address Address

Advertisements

Microprocessors A Beginning.

DSPs Vs General Purpose Microprocessors

Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.

Instruction Set Design

Dr. Rabie A. Ramadan Al-Azhar University Lecture 3

CPU Review and Programming Models CT101 – Computing Systems.

POLITECNICO DI MILANO Parallelism in wonderland: are you ready to see how deep the rabbit hole goes? ILP: VLIW Architectures Marco D. Santambrogio:

CENTRAL PROCESSING UNIT

Microprocessor and Microcontroller Based Systems Instructor: Eng.Moayed N. EL Mobaied The Islamic University of Gaza Faculty of Engineering Electrical.

Chapter 6 สถาปัตยกรรมไมโครโพรเซสเซอร์แบบต่างๆ Processor Architectures

Processor Technology and Architecture

Instruction Level Parallelism (ILP) Colin Stevens.

Chapter 16 Control Unit Implemntation. A Basic Computer Model.

Chapter 4 Processor Technology and Architecture. Chapter goals Describe CPU instruction and execution cycles Explain how primitive CPU instructions are.

State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.

Vacuum tubes Transistor 1948 –Smaller, Cheaper, Less heat dissipation, Made from Silicon (Sand) –Invented at Bell Labs –Shockley, Brittain, Bardeen ICs.

11/11/05ELEC CISC (Complex Instruction Set Computer) Veeraraghavan Ramamurthy ELEC 6200 Computer Architecture and Design Fall 2005.

Processor Architecture Kieran Mathieson. Outline Memory CPU Structure Design a CPU Programming Design Issues.

GCSE Computing - The CPU

What’s on the Motherboard? The two main parts of the CPU are the control unit and the arithmetic logic unit. The control unit retrieves instructions from.

RISC and CISC. Dec. 2008/Dec. and RISC versus CISC The world of microprocessors and CPUs can be divided into two parts:

Real time DSP Professors: Eng. Julian Bruno Eng. Mariano Llamedo Soria.

Lecture#14. Last Lecture Summary Memory Address, size What memory stores OS, Application programs, Data, Instructions Types of Memory Non Volatile and.

CHAPTER 8: CPU and Memory Design, Enhancement, and Implementation

Basics and Architectures

Invitation to Computer Science 5th Edition

Levels of Architecture & Language CHAPTER 1 © copyright Bobby Hoggard / material may not be redistributed without permission.

Computer Architecture

1 4.2 MARIE This is the MARIE architecture shown graphically.

Introduction of Intel Processors

1Copyright © Prentice Hall 2000 The Central Processing Unit Chapter 3 What Goes on Inside the Computer.

DSP Processors We have seen that the Multiply and Accumulate (MAC) operation is very prevalent in DSP computation computation of energy MA filters AR filters.

Cis303a_chapt04.ppt Chapter 4 Processor Technology and Architecture Internal Components CPU Operation (internal components) Control Unit Move data and.

Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.

Chapter 16 Micro-programmed Control

Microprogrammed Control Chapter11:. Two methods for generating the control signals are: 1)Hardwired control o Sequential logic circuit that generates.

1 Computer Organization & Design Microcode for Control Sec. 5.7 (CDROM) Appendix C (CDROM) / / pdf / lec_3a_notes.pdf.

PART 6: (1/2) Enhancing CPU Performance CHAPTER 16: MICROPROGRAMMED CONTROL 1.

Ted Pedersen – CS 3011 – Chapter 10 1 A brief history of computer architectures CISC – complex instruction set computing –Intel x86, VAX –Evolved from.

Computer Architecture 2 nd year (computer and Information Sc.)

Electronic Analog Computer Dr. Amin Danial Asham by.

DIGITAL SIGNAL PROCESSORS. Von Neumann Architecture Computers to be programmed by codes residing in memory. Single Memory to store data and program.

Next Generation ISA Itanium / IA-64. Operating Environments IA-32 Protected Mode/Real Mode/Virtual Mode - if supported by the OS IA-64 Instruction Set.

Computer and Information Sciences College / Computer Science Department CS 206 D Computer Organization and Assembly Language.

Processor Structure and Function Chapter8:. CPU Structure  CPU must:  Fetch instructions –Read instruction from memory  Interpret instructions –Instruction.

Pentium Architecture Arithmetic/Logic Units (ALUs) : – There are two parallel integer instruction pipelines: u-pipeline and v-pipeline – The u-pipeline.

MICROPROGRAMMED CONTROL

Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.

Fundamentals of Programming Languages-II

Hardwired Control Department of Computer Engineering, M.S.P.V.L Polytechnic College, Pavoorchatram. A Presentation On.

The Processor & its components. The CPU The brain. Performs all major calculations. Controls and manages the operations of other components of the computer.

CISC. What is it?  CISC - Complex Instruction Set Computer  CISC is a design philosophy that:  1) uses microcode instruction sets  2) uses larger.

GCSE Computing - The CPU

Low-power Digital Signal Processing for Mobile Phone chipsets

Micro-programmed Control

Embedded Systems Design

The Central Processing Unit

Computer Organization & Design Microcode for Control Sec. 5

Scalable Processor Design

Digital Signal Processors

CISC AND RISC SYSTEM Based on instruction set, we broadly classify Computer/microprocessor/microcontroller into CISC and RISC. CISC SYSTEM: COMPLEX INSTRUCTION.

Morgan Kaufmann Publishers Computer Organization and Assembly Language

The ARM Instruction Set

GCSE Computing - The CPU

William Stallings Computer Organization and Architecture

Presentation transcript:

DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –

Figure 4.3(a) Block diagram of a barrel shifter

Figure 4.3(b) Implementation of a 4-bit, shift-right barrel shifter

Figure 4.5 A MAC unit with accumulator guard bits

Figure 4.6 A schematic diagram of the saturation logic

Figure 4.7 Block diagram of an arithmetic logic unit

Figure 4.9 Register pointer updating algorithm for circular buffer addressing mode: SAR = start address register contents, EAR = end address register contents, PNTR = pointer

Figure 4.10 Different cases that arise in updating the pointer in circular buffer addressing mode

Figure 4.10 Continued

Figure 4.11 Block diagram of an address generation unit

Bit-reversal Hardware

Figure 4.12 A conceptual diagram of a program sequencer

Instruction Level Parallelism VLIW architecture Each instruction specifies several operations to be done in parallel Advantages : Simple hardware compilers can spot ILP easily Disadvantages : Little compatibilty between generations Explicit NOPs bloat code size

Super scalar architecture Hardware responsible for finding ILP in a sequential program Advantage : Compatibility between generations Disadvantage : Very complex hardware

Explicitly Parallel Instruction Computing (EPIC) Combines VLIW and super scalar architectures Instructions are grouped into 3 operating blocks and a template block Template block tells hardware if instructions can be executed in parallel Also gives information whether the block can be executed in parallel

ILP versus Power Increasing instructions / cycle  Requires fewer cycles to execute a task  Uses longer clock for same performance  Uses lower supply voltage  And hence uses less power However, too many functional units and too many transitions per clock cycle increase power consumption.

Low Power architecture  Power consumed by additional circuits vs. ability to lower clock rate while maintaining performance  Circuits must be highly used  Move complexity into software  Voltage scaling : Reduce V dd  Clock gating : Turn off clock when chip is not in use ( applies to sub-modules of chip also)

 VLIW is more suitable than super scalar for low power - VLIW is smaller for same number of functional units - Compiler is better at finding parallelism than hardware  Put multiple processors on chip rather than lots of functional units in one processor  Helps in running independent tasks

General Purpose Microprocessor 2000  GHz clock speed  32-bit address or more  32-bit bus, 128-bit instructions  Complex MMU  Super scalar CPU  MMX instructions  On chip cache  Single cycle execution  32-bit floating point ALU on board  Very expensive  10s of watts of power

DSP in 2000  Clock 100 ~ 200 MHz  16-bit floating point or 32-bit floating point  bits address space  Large on-chip and off-chip memories  Single cycle execution of most instructions  Harvard architecture  Lots of special DSP instructions  50 mw to 2w power  Cheap

Future of DSP Microprocessor  Sufficiently unique for an independent class of applications (HDD, cell phone)  Low power consumption, low cost  High performance within power, cost constraints (MIPS/mw, MIPS/$)  Fixed point & floating point  Better compilers - but users must be informed  Hybrid DSP/ GP systems