Nov. 12, 1997 Bob Brodersen (http://infopad.eecs.berkeley.edu) CS 152 Computer Architecture and Engineering Introduction to Architectures for Digital.

Slides:



Advertisements
Similar presentations
DSPs Vs General Purpose Microprocessors
Advertisements

Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
EEE226 MICROPROCESSORBY DR. ZAINI ABDUL HALIM School of Electrical & Electronic Engineering USM.
Microprocessor 8085/8086 Lecturer M A Rahim Khan Computer Engineering and Networks Deptt.
Embedded Software Optimization for MP3 Decoder Implemented on RISC Core Yingbiao Yao, Qingdong Yao, Peng Liu, Zhibin Xiao Zhejiang University Information.
L27:Lower Power Algorithm for Multimedia Systems 성균관대학교 조 준 동
Introduction to ARM Architecture, Programmer’s Model and Assembler Embedded Systems Programming.
Embedded Computing From Theory to Practice November 2008 USTC Suzhou.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Vacuum tubes Transistor 1948 –Smaller, Cheaper, Less heat dissipation, Made from Silicon (Sand) –Invented at Bell Labs –Shockley, Brittain, Bardeen ICs.
Embedded Systems Programming
Prardiva Mangilipally
© 2009 Acehub Vista Sdn. Bhd Introduction to ARM ® Processors.
Microcontroller: Introduction
Computer performance.
Embedded Systems Design ICT Embedded System What is an embedded System??? Any IDEA???
Real time DSP Professors: Eng. Julian Bruno Eng. Mariano Llamedo Soria.
Motivation Mobile embedded systems are present in: –Cell phones –PDA’s –MP3 players –GPS units.
Internal hardware and external components of a computer Three-box Model  Processor The brain of the system Executes programs A big finite state machine.
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
Classifying GPR Machines TypeNumber of Operands Memory Operands Examples Register- Register 30 SPARC, MIPS, etc. Register- Memory 21 Intel 80x86, Motorola.
The variety Of Processors And Computational Engines CS – 355 Chapter- 4 `
ARM for Wireless Applications ARM11 Microarchitecture On the ARMv6 Connie Wang.
Computer Organization & Assembly Language © by DR. M. Amer.
Introduction to Microprocessors
MICROPROCESSOR FUNCTION Technician Series Created Mar 2015 gmail.com.
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
What is a Microprocessor ? A microprocessor consists of an ALU to perform arithmetic and logic manipulations, registers, and a control unit Its has some.
Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University
Chapter 2 Turning Data into Something You Can Use
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
Fundamentals of Programming Languages-II
Capability of processor determine the capability of the computer system. Therefore, processor is the key element or heart of a computer system. Other.
CPU (Central Processing Unit). The CPU is the brain of the computer. Sometimes referred to simply as the processor or central processor, the CPU is where.
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
Introduction to Microcontroller Technology
Programmable Logic Devices
Computer Organization and Assembly Languages Yung-Yu Chuang
HISTORY OF MICROPROCESSORS
ECE354 Embedded Systems Introduction C Andras Moritz.
Low-power Digital Signal Processing for Mobile Phone chipsets
6. Structure of Computers
UNIT – Microcontroller.
Discovering Computers 2011: Living in a Digital World Chapter 4
Embedded Systems Design
Lecture on Microcomputer
Architecture & Organization 1
Dr. Michael Nasief Lecture 2
INTRODUCTION TO MICROPROCESSORS
Basic Computer Organization
Chapter 1: The 8051 Microcontrollers
Introduction to Microprocessors and Microcontrollers
Introduction to Digital Signal Processors (DSPs)
Architecture & Organization 1
BIC 10503: COMPUTER ARCHITECTURE
Microprocessor & Assembly Language
ECEG-3202 Computer Architecture and Organization
ECEG-3202 Computer Architecture and Organization
INTRODUCTION TO COMPUTER ARCHITECTURE
Computer Architecture
#2 Basic of Computer Architecture
Computer Evolution and Performance
Embedded Processors.
Introduction to Microprocessor Programming
Digital Signal Processors-1
Introduction to 5685x Series
CHAPTER 1 THE 8051 MICROCONTROLLERS
Computer Organization and Assembly Languages Yung-Yu Chuang 2008/11/17
ADSP 21065L.
ARM920T Processor This training module provides an introduction to the ARM920T processor embedded in the AT91RM9200 microcontroller.We’ll identify the.
Presentation transcript:

Nov. 12, 1997 Bob Brodersen (http://infopad.eecs.berkeley.edu) CS 152 Computer Architecture and Engineering Introduction to Architectures for Digital Signal Processing Nov. 12, 1997 Bob Brodersen (http://infopad.eecs.berkeley.edu)

Processor Applications General Purpose - high performance Pentiums, Alpha’s, SPARC Used for general purpose software Heavy weight OS - UNIX, NT Workstations, PC’s Embedded processors and processor cores ARM, 486SX, Hitachi SH7000, NEC V800 Single program Lightweight, often realtime OS DSP support Cellular phones, consumer electronics (e.g. CD players) Microcontrollers Extremely cost sensitive Small word size - 8 bit common Highest volume processors by far Automobiles, toasters, thermostats, ... Increasing Cost Increasing volume

The Processor Design Space Application specific architectures for performance Embedded processors Microprocessors Performance is everything & Software rules Performance Microcontrollers Cost is everything Cost

World’s Cellular Subscribers Millions Will provide a ubiquitous infrastructure for wireless data as well as voice Digital Analog Year Source: Ericsson Radio Systems, Inc.

Multimedia I/O Architecture Radio Modem Embedded Processor Sched ECC Pact Interface Low Power Bus FB Fifo Fifo Video Decomp SRAM Pen Data Flow Graphics Audio Video

Embedded applications E.g. Multimedia terminal electronics Uplink Radio Downlink Radio Graphics Out Video I/O Voice I/O Pen In Future chips will be a mix of processors, memory and dedicated hardware for specific algorithms and I/O µP DSP Coms Video Unit custom Memory

Requirements of the Embedded Processors Optimized for a single program - code often in on-chip ROM or off chip EPROM Minimum code size (one of the motivations initially for Java) Performance obtained by optimizing datapath Low cost Lowest possible area Technology behind the leading edge High level of integration of peripherals (reduces system cost) Fast time to market Compatible architectures (e.g. ARM) allows reuseable code Customizable core Low power if application requires portability

Area of processor cores = Cost Nintendo processor Cellular phones

Another figure of merit Computation per unit area Nintendo processor ??? Cellular phones

National Semiconductor - Embedded Processor Family Simple architecture 3 stage pipeline - fetch - decode - execute Minimum power and size Short pipeline avoids branch prediction and bypass Versions range from 8-64 bit - choose minimum that meets requirements

Code size If a majority of the chip is the program stored in ROM, then code size is a critical issue The Piranha has 3 sized instructions - basic 2 byte, and 2 byte plus 16 or 32 bit immediate

Example application (single chip system)

The DSP Module (DSPM) Vector instructions directly supported Pipelined datapath supprts single cycle: Multiply, Add, Shift, Load/Store and Pointer adjustment Operates in parallel to processor core Saturation, overflow and rounding for ALU operations Automatic support for cyclic buffers (modulo arithmetic)

The National DSP Module Architecture Three simultaneous addresses Zero overhead repeat X Y Z Single cycle MAC support is typical for DSP acceleration

The 486 “Embedded Processor”

The “Embedded” Features of the 486 GX Said to be designed “for embedded battery-operated and hand-held applications” (???) Fully static design (clock can stop and all state is kept) “Auto Clock Freeze” stops circuits which are not being used in a given instruction (gated clocks) Stop Clock (60 mW), Stop Grant - clock runs but no program execution (40-85 mW) Split power supply - 2.0-3.3 Volt core, 3.3V. I/O,

Power = C V2 fclock Note the clock rates Power 130 mW 350 mW 190 mW

Characterizing programs for their energy consumption Process Subframe 330mW ComputeLag 107mW ComputeLag(...) IFilterCodebook 63mW { R=dotprod(res,res); QuantizeGains 46mW for (lag=0..127) { CodebookSearch 44mW lp=getLT(lt); UpdateFilterState 8mW G = dotprod(lp, lp); } OrthogonalizeCodebook 6mW } ComputeWeightedInput 22mW ThetaToCodeword 8mW Top four functions account for 90 % of the power 65% of power dissipation in dot-vector products (data obtained from profiling of C++-code, weighted with estimated instruction energy costs)

An architecture optimized for multiply-accumulate AddressGen AddressGen Energy/Flexibility Tradeoff’s Arm 6 core (5V, 20 MHz): .02 MIPS/mW ZSP DSP Superscaler (3V, 200 MHz) .3 MOPS/mW Reconfigurable Dot-Vector Processor (1.5V, 30 MHz) 5.9 MIPS/mW * MOPS = millions of operations/sec = millions of MACS/sec Memory Memory MAC MAC L G C Control Processor