Low-power Digital Signal Processing for Mobile Phone chipsets

Slides:



Advertisements
Similar presentations
The CPU The Central Presentation Unit What is the CPU?
Advertisements

Microprocessors A Beginning.
DSPs Vs General Purpose Microprocessors
1/1/ / faculty of Electrical Engineering eindhoven university of technology Speeding it up Part 3: Out-Of-Order and SuperScalar execution dr.ir. A.C. Verschueren.
ISA Issues; Performance Considerations. Testing / System Verilog: ECE385.
POLITECNICO DI MILANO Parallelism in wonderland: are you ready to see how deep the rabbit hole goes? ILP: VLIW Architectures Marco D. Santambrogio:
MICROPROCESSORS TWO TYPES OF MODELS ARE USED :  PROGRAMMER’S MODEL :- THIS MODEL SHOWS FEATURES, SUCH AS INTERNAL REGISTERS, ADDRESS,DATA & CONTROL BUSES.
PDP-8 Jon Andrews Project Outline: To implement a real commercial processor into an Field Programmable Gate Array (FPGA). It should be able to run code.
Computer Architecture Abhinav Agarwal Veeramani V.
Computer Organization and Architecture
An instruction buffer for a low-power DSP 1 An Instruction Buffer for a Low Power DSP Mike Lewis AMULET group.
Midterm Wednesday Chapter 1-3: Number /character representation and conversion Number arithmetic Combinational logic elements and design (DeMorgan’s Law)
Data Manipulation Computer System consists of the following parts:
State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.
Chapter 12 CPU Structure and Function. Example Register Organizations.
GCSE Computing - The CPU
1 Combining the strengths of UMIST and The Victoria University of Manchester Asynchronous Signal Processing Systems Linda Brackenbury APT GROUP, Computer.
Motivation Mobile embedded systems are present in: –Cell phones –PDA’s –MP3 players –GPS units.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
Computer Architecture
Mahesh Sukumar Subramanian Srinivasan. Introduction Embedded system products keep arriving in the market. There is a continuous growing demand for more.
DSP Processors We have seen that the Multiply and Accumulate (MAC) operation is very prevalent in DSP computation computation of energy MA filters AR filters.
ARM for Wireless Applications ARM11 Microarchitecture On the ARMv6 Connie Wang.
Computer Architecture Memory, Math and Logic. Basic Building Blocks Seen: – Memory – Logic & Math.
8085. Microcomputer Major components of the computer - the processor, the control unit, one or more memory ICs, one or more I/O ICs, and the clock Major.
Computer Organization CDA 3103 Dr. Hassan Foroosh Dept. of Computer Science UCF © Copyright Hassan Foroosh 2002.
Reconfigurable Computing - Pipelined Systems John Morris Chung-Ang University The University of Auckland ‘Iolanthe’ at 13 knots on Cockburn Sound, Western.
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
ALU (Continued) Computer Architecture (Fall 2006).
Computer and Information Sciences College / Computer Science Department CS 206 D Computer Organization and Assembly Language.
System Hardware FPU – Floating Point Unit –Handles floating point and extended integer calculations 8284/82C284 Clock Generator (clock) –Synchronizes the.
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
CBP 2002ITY 270 Computer Architecture1 Module Structure Whirlwind Review – Fetch-Execute Simulation Instruction Set Architectures RISC vs x86 How to build.
The Processor & its components. The CPU The brain. Performs all major calculations. Controls and manages the operations of other components of the computer.
8085 INTERNAL ARCHITECTURE.  Upon completing this topic, you should be able to: State all the register available in the 8085 microprocessor and explain.
1 3 Computing System Fundamentals 3.2 Computer Architecture.
Riyadh Philanthropic Society For Science Prince Sultan College For Woman Dept. of Computer & Information Sciences CS 251 Introduction to Computer Organization.
STUDY OF PIC MICROCONTROLLERS.. Design Flow C CODE Hex File Assembly Code Compiler Assembler Chip Programming.
GCSE Computing - The CPU
Electrical and Computer Engineering University of Cyprus
Assembly language.
The 8085 Microprocessor Architecture
Basic Processor Structure/design
Variable Word Width Computation for Low Power
ECE354 Embedded Systems Introduction C Andras Moritz.
Central Processing Unit Architecture
Embedded Systems Design
The 8085 Microprocessor Architecture
Basics of digital systems
The 8085 Microprocessor Architecture
An Introduction to Microprocessor Architecture using intel 8085 as a classic processor
CISC AND RISC SYSTEM Based on instruction set, we broadly classify Computer/microprocessor/microcontroller into CISC and RISC. CISC SYSTEM: COMPLEX INSTRUCTION.
Morgan Kaufmann Publishers Computer Organization and Assembly Language
Control Unit Introduction Types Comparison Control Memory
Rocky K. C. Chang 6 November 2017
The Processor Lecture 3.1: Introduction & Logic Design Conventions
The 8085 Microprocessor Architecture
Computer Architecture
Computer Evolution and Performance
The ARM Instruction Set
ECE 352 Digital System Fundamentals
ARM ORGANISATION.
CPU Structure CPU must:
GCSE Computing - The CPU
Author: Xianghui Hu, Xinan Tang, Bei Hua Lecturer: Bo Xu
ADSP 21065L.
Computer Operation 6/22/2019.
Microprocessor Lecture 7 (8086 Registers).
COMPUTER ARCHITECTURE
Presentation transcript:

Low-power Digital Signal Processing for Mobile Phone chipsets Mike Lewis AMULET group

GSM (digital) mobile phones Huge (and increasing) market Highly competitive Battery size and lifetime are key

Low power DSP for GSM Chipsets GSM Chipsets typically based on microprocessor + DSP GEM301 baseband processor: ARM + OAK Microprocessor performs control tasks DSP performs intensive calculations How is power consumption divided?

Sources of power consumption In GEM301 chipset: DSP consumes 1mA per MIP Maximum DSP activity of 36 MIPS Maximum total consumption 55mA DSP consumes 36mA, 65% of total current DSP power consumption will increase with increased algorithmic complexity DSP power particularly important for wireless LANs (e.g. Bluetooth)

What’s needed from the DSP? >100 MIPS throughput OAK DSP maintains 40 MIPS for half-rate More intensive coding algorithms More additional applications required Choose 4x speed increase:160 MIPS

Sources of power consumption Program and data memory access Power dissipated by RAM Power dissipated in system buses Calculations on data Transitions on internal buses Transitions within arithmetic units Control overhead

How to keep the power down... Parallel structure Keep processing rate of each unit down Turn excess speed into power reduction VLIW preferable for low power Parallelism exploited by programmer But… VLIW involves long instructions!

Compressed Instructions DSP activity characterized by regular repetition of fixed algorithms Store the instructions in configuration memories internal to the DSP Execute instructions with 32 bit word System speed remains at 40MHz

Compressed instructions Instructions stored in each functional unit Split into operand and opcode memories Operands: register selections / immediates Opcode: operation selection Instruction buffer handles looping Complex algorithms can be executed with only one pass from program memory

How to keep the power down… (2) Use a large register file to reduce accesses to data memory Simple RISC-like load-store architecture Register file segmented into two 128 word banks (X and Y to match memory) Maximum data reuse Data accessed by 7-bit index register rather than 24-bit address register Results go to ALU accumulators

How to keep the power down… (3) Sign-magnitude number system chosen Investigation showed a power reduction of 10-50% over 2’s complement scheme Power saving particularly accentuated when long buses are driven

Asynchronous design Synchronous design All activity synchronised by common clock Latch

How to keep the power down… (4) Asynchronous design No clock distribution network No need for clock gating Exploit end-of-block idle time Reduced EMC problems Modular design

How to keep the power down… (5) DSPs are good at number-crunching… less good at control code Use DSP as a coprocessor Limited support for context switches Microprocessor prepares tasks and directs the DSP to perform them Simple interrupt structure to synchronize on arrival of data

A low-power DSP architecture Fetch unit- autonomous instruction fetch X/Y mem Register Bank (2x128x16 bit) Load-store unit P mem Fetch Buffer ALU ALU ALU ALU VLIW mem VLIW mem VLIW mem VLIW mem Decode int0, int1, nmi Operand Opcode Index reg. Index register values

A low-power DSP architecture Instruction buffer: 32 entry FIFO also handles loops X/Y mem Register Bank (2x128x16 bit) Load-store unit P mem Fetch Buffer ALU ALU ALU ALU VLIW mem VLIW mem VLIW mem VLIW mem Decode int0, int1, nmi Operand Opcode Index reg. Index register values

A low-power DSP architecture Decode instruction, read VLIW operand X/Y mem Register Bank (2x128x16 bit) Load-store unit P mem Fetch Buffer ALU ALU ALU ALU VLIW mem VLIW mem VLIW mem VLIW mem Decode int0, int1, nmi Operand Opcode Index reg. Index register values

A low-power DSP architecture Substitute and update index registers X/Y mem Register Bank (2x128x16 bit) Load-store unit P mem Fetch Buffer ALU ALU ALU ALU VLIW mem VLIW mem VLIW mem VLIW mem Decode int0, int1, nmi Operand Opcode Index reg. Index register values

A low-power DSP architecture Read registers and VLIW opcode X/Y mem Register Bank (2x128x16 bit) Load-store unit P mem Fetch Buffer ALU ALU ALU ALU VLIW mem VLIW mem VLIW mem VLIW mem Decode int0, int1, nmi Operand Opcode Index reg. Index register values

A low-power DSP architecture Perform operation X/Y mem Register Bank (2x128x16 bit) Load-store unit P mem Fetch Buffer ALU ALU ALU ALU VLIW mem VLIW mem VLIW mem VLIW mem Decode int0, int1, nmi Operand Opcode Index reg. Index register values

Conclusions Mobile communications has special requirements from the DSP Multi-level power reduction strategy should dramatically reduce power Async design will give simple power management and low EM interference DSP architecture fully designed, circuit level design underway So, to conclude; Mobile communication devices require low power consumption from the DSP. The tasks required from the DSP are quite specialised, and so a number of power saving strategies are possible which are implemented in the new DSP architecture. Power reduction is a multi-level process: there’s no ‘magic bullet’ that will give low power consumption. However, the strategies adopted should dramatically reduce power consumption. A simulation model of this new architecture has been completed, and work is now underway on the circuit-level design. So, we hope soon to be able to demonstrate how this new DSP can prolong the battery life of the next generation of mobile products! OK, does anybody have any questions?