Presentation is loading. Please wait.

Presentation is loading. Please wait.

ARM7 TDMI INTRODUCTION.

Similar presentations


Presentation on theme: "ARM7 TDMI INTRODUCTION."— Presentation transcript:

1 ARM7 TDMI INTRODUCTION

2 The History of ARM Developed at Acorn Computers Limited,
of Cambridge, England, between 1983 and 1985 ARM follows RISC Mechanism It is used for small size and high performance applications. Simple architecture – low power consumption.

3 ARM7 TDMI Processor The ARM7TDMI processor is a member of the Advanced RISC machine family of general purpose 32-bit microprocessor What does mean ARM7TDMI ? ARM bit Advanced RISC Machine T - Thumb architecture extension Two separate instruction sets, 32-bit ARM instructions and 16-bit Thumb instructions D - Debug extension M - Enhanced multiplier I - Embedded ICE macrocell extension The ARM7TDMI is a member of the Advanced RISC Machines (ARM) family of general purpose 32-bit microprocessors, which offer high performance for very low power consumption and price.

4 ARM7 TDMI Block Diagram Von Neumann Architecture 3-stage pipeline
fetch, decode, execute 32-bit Data Bus 32-bit Address Bus bit registers 32-bit ARM instruction set 16-bit THUMB instruction set 32x8 Multiplier Barrel Shifter The processor architecture is Von Neumann load/store architectury which is characterized by a single data and address bus for instructions and data. The ARM7TDMI is a 3-stage pipeline 32-bit RISC processor. Pipelining is employed so that all parts of the processing and memory systems can operate continuously. Typically, while one instruction is being executed, its successor is being decoded, and a third instruction is being fetched from memory. The ARM7TDMI processor is built around a bank of bit registers including six status registers. Its outstanding feature is the 16-bit THUMB subset of the most commonly used 32-bit instructions. This gives 16-bit code density coupled with 32-bit processor performance. It features an integrated 32x8 multiplier and 32-bit barrel shifter. Five independent internal buses allow a high degree of parallelism in instruction execution.

5 COMPARISION BETWEEN ARM AND 8051 Microcontroller
1. ARM executes almost all the instruction in only one cycle where as 8051 micro controller takes more than one cycles in almost all the instruction except register transfer. Ex: conditional jump takes 3 cycles for execution ex: DJNZ in 8051 conditional jump takes 1 cycles for execution ex: BNEQ in ARM 2. ARM is a RISC based architecture is a CISC but having less number of instruction as compared to ARM which is RISC. 3. ARM is based on load store architecture i.e data processing instruction can not access memory directly , data has to be stored in a register before processing can access memory directly . 4. ARM have conditional data processing instruction whereas 8051 does not .

6 ARM Architecture Typical RISC architecture:
Large uniform register file Load/store architecture Simple addressing modes Uniform and fixed-length instruction fields

7 Data Sizes and Instruction Sets
When used in relation to the ARM: Byte means 8 bits Half word means 16 bits (two bytes) Word means 32 bits (four bytes) Most ARM’s implement two instruction sets 32-bit ARM Instruction Set 16-bit Thumb Instruction Set The cause of confusion here is the term “word” which will mean 16-bits to people with a 16-bit background. In the ARM world 16-bits is a “halfword” as the architecture is a 32-bit one, whereas “word” means 32-bits. There are actually three instruction sets in modern ARM cores (such as the version 7 cores – A8, M3, and R4, and even the M1): you have Thumb, Thumb2 and ARM. One core in particular, the M3, executes only Thumb2 code, not ARM. All the instructions are 16 bits. Java bytecodes are 8-bit instructions designed to be architecture independent. Jazelle transparently executes most bytecodes in hardware and some in highly optimized ARM code. This is due to a tradeoff between hardware complexity (power consumption & silicon area) and speed.

8 Registers Description
ARM has 37 registers all of which are 32-bits long. 1 dedicated program counter 1 dedicated current program status register 5 dedicated saved program status registers 30 general purpose registers 20 registers are hidden from program at different times(Which are called as Banked Out Registers)

9 The ARM Register Set Current Visible Registers Banked out Registers
r15 (pc) cpsr r13 (sp) r14 (lr) spsr r8 r9 r10 r11 r12 Current Visible Registers Banked out Registers User IRQ SVC Undef Abort FIQ Mode SVC Mode r0 r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 r11 r12 r15 (pc) cpsr r13 (sp) r14 (lr) spsr Current Visible Registers Banked out Registers User FIQ IRQ Undef Abort User Mode r0 r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 r11 r12 r15 (pc) cpsr r13 (sp) r14 (lr) spsr Current Visible Registers Banked out Registers Abort FIQ IRQ SVC Undef Undef Mode r0 r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 r11 r12 r15 (pc) cpsr r13 (sp) r14 (lr) spsr Current Visible Registers Banked out Registers User FIQ IRQ SVC Abort r0 r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 r11 r12 r13 (sp) r14 (lr) r15 (pc) cpsr spsr FIQ IRQ SVC Undef Abort User Mode Current Visible Registers Banked out Registers IRQ Mode r0 r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 r11 r12 r15 (pc) cpsr r13 (sp) r14 (lr) spsr Current Visible Registers Banked out Registers User FIQ SVC Undef Abort This animated slide shows the way that the banking of registers works. On the left the currently visible set of registers are shown for a particular mode. On the right are the registers that are banked out whilst in that mode. Each key press will switch mode: user -> FIQ ->user -> IRQ -> user ->SVC -> User -> Undef -> User -> Abort and then back to user. The following slide then shows this in a more static way that is more useful for reference

10 Operating Modes Seven operating modes: exception modes
User(Non Privileged mode) Privileged: System FIQ IRQ Abort Undefined Supervisor exception modes

11 User : unprivileged mode under which most tasks run
FIQ : entered when a high priority (fast) interrupt is raised IRQ : entered when a low priority (normal) interrupt is raised Supervisor : Entered on reset and when a Software Interrupt instruction is executed Abort : used to handle memory access violations Un def : used to handle undefined instructions System : privileged mode using the same registers as user mode

12 Register Organization Summary
User FIQ IRQ SVC Undef Abort r8 r9 r10 r11 r12 r13 (sp) r14 (lr) r15 (pc) cpsr r0 r1 r2 r3 r4 r5 r6 r7 User mode r0-r7, r15, and cpsr User mode r0-r12, r15, and cpsr User mode r0-r12, r15, and cpsr User mode r0-r12, r15, and cpsr User mode r0-r12, r15, and cpsr Thumb state Low registers r8 r9 Thumb state High registers r10 r11 r12 This slide shows the registers visible in each mode - basically in a more static fashion than the previous animated slide that is more useful for reference. The main point to state here is the splitting of the registers in Thumb state into Low and High registers. ARM register banking is the minimum necessary for fast handling of overlapping exceptions of different types (e.g. ABORT during SWI during IRQ). For nested exceptions of the same type (e.g. re-entrant interrupts) some additional pushing of registers to the stack is required. r13 (sp) r13 (sp) r13 (sp) r13 (sp) r13 (sp) r14 (lr) r14 (lr) r14 (lr) r14 (lr) r14 (lr) spsr spsr spsr spsr spsr Note: System mode uses the User mode register set

13 Exception types, sorted by Interrupt Vector addresses
Exceptions Exception Mode Priority IV Address Reset Supervisor 1 0x Undefined instruction Undefined 6 0x Software interrupt 0x Pre fetch Abort Abort 5 0x C Data Abort 2 0x Interrupt IRQ 4 0x Fast interrupt FIQ 3 0x C Exception types, sorted by Interrupt Vector addresses

14 Progr am Status Register
27 31 N Z C V Q 28 6 7 I F T mode 16 23 8 15 5 4 24 f s x c U n d e f i n e d J Condition code flags N = Negative result from ALU Z = Zero result from ALU C = ALU operation Carried out V = ALU operation Overflowed Sticky Overflow flag - Q flag Architecture 5TE/J only Indicates if saturation has occurred J bit Architecture 5TEJ only J = 1: Processor in Jazelle state Interrupt Disable bits. I = 1: Disables the IRQ. F = 1: Disables the FIQ. T Bit Architecture x T only T = 0: Processor in ARM state T = 1: Processor in Thumb state Mode bits Specify the processor mode

15 Undefined Instruction
Exception Handling When an exception occurs, the ARM: Copies CPSR into SPSR_<mode> Sets appropriate CPSR bits Change to ARM state Change to exception mode Disable interrupts (if appropriate) Stores the return address in LR_<mode> Sets PC to vector address To return, exception handler needs to: Restore CPSR from SPSR_<mode> Restore PC from LR_<mode> This can only be done in ARM state. 0x1C 0x18 0x14 0x10 0x0C 0x08 0x04 0x00 FIQ IRQ (Reserved) Data Abort Prefetch Abort Software Interrupt Exception handling on the ARM is controlled through the use of an area of memory called the vector table. This lives (normally) at the bottom of the memory map from 0x0 to 0x1c. Within this table one word is allocated to each of the various exception types. This word will contain some form of ARM instruction that should perform a branch. It does not contain an address. Reset - executed on power on Undef - when an invalid instruction reaches the execute stage of the pipeline SWI - when a software interrupt instruction is executed Prefetch - when an instruction is fetched from memory that is invalid for some reason, if it reaches the execute stage then this exception is taken Data - if a load/store instruction tries to access an invalid memory location, then this exception is taken IRQ - normal interrupt FIQ - fast interrupt When one of these exceptions is taken, the ARM goes through a low-overhead sequence of actions in order to invoke the appropriate exception handler. The current instruction is always allowed to complete (except in case of Reset). IRQ is disabled on entry to all exceptions; FIQ is also disabled on entry to Reset and FIQ. Undefined Instruction Reset Vector Table Vector table can be at 0xFFFF0000 on ARM720T and on ARM9/10 family devices

16 Using a Barrel Shifter:The 2nd Operand
Register, optionally with shift operation Shift value can be either be: 5 bit unsigned integer Specified in bottom byte of another register. Used for multiplication by constant Immediate value 8 bit number, with a range of Rotated right through even number of positions Allows increased range of 32-bit constants to be loaded directly into registers Result Operand 1 Barrel Shifter Operand 2 ALU

17 Pipeline Organization
3-stage pipeline: Fetch – Decode - Execute Three-cycle latency, one instruction per cycle throughput instruction i Fetch Decode Execute i+1 Fetch Decode Execute i+2 Fetch Decode Execute cycle t t+1 t+2 t+3 t+4

18 AMBA bus architecture The ARM7 family processors are designed for use with the Advanced Microcontroller Bus Architecture (AMBA). The AMBA specification defines two buses: Advanced High-performance Bus (AHB) Advanced Peripheral Bus (APB).

19 An Example AMBA System APB AHB High Performance ARM processor UART
Bandwidth External Memory Interface Timer AHB APB Bridge Keypad High-bandwidth on-chip RAM DMA Bus Master PIO AMBA systems are based around two buses, a high performance system bus and a lower performance peripheral bus. The high performance bus (AHB) should connect all of the high performance, high bandwidth modules, such as the ARM Processor, any DMA engines, perhaps some fast, 32 bit wide local RAM (or wider to suit the ARM Bus Master being used), an external memory interface, and the interface to the lower performance bus. The number of modules connected here should be kept to a minimum to reduce the bus loading on this high performance bus and allow it to run much faster. The bulk of the design modules are placed on the lower performance Peripheral Bus (APB). Modules placed here are not accessed as frequently as AHB modules, and as a result of the APB timing, need not consume power (in their APB interfaces) when there is no APB activity. The address and data bus widths on the APB only need to be as wide as required, compared to the AHB where the maximum address and data widths are specified to maximize system performance in this critical area. So when a design is being partitioned, the main design criteria are AHB APB High Performance Low Power Frequent Access Simple Interface Pipelined timing Non-pipelined We don’t go into multi-layer AHB, AHB Lite or any other AMBA configurations in this presentation. Low Power Non-pipelined Simple Interface High Performance Pipelined Burst Support Multiple Bus Masters

20 Queries


Download ppt "ARM7 TDMI INTRODUCTION."

Similar presentations


Ads by Google