1 ARM University Program Copyright © ARM Ltd 2013 Using Direct Memory Access to Improve Performance.

Slides:



Advertisements
Similar presentations
DMA Controller (8237 Programming Examples)
Advertisements

Interrupts, Low Power Modes and Timer A (Chapters 6 & 8)
Chapter 10 Input/Output Organization. Connections between a CPU and an I/O device Types of bus (Figure 10.1) –Address bus –Data bus –Control bus.
8253/54 Timer Section /54 Timer Description and Initialization
Direct Memory Access Introduction to 8237
Instructor: Dr Aleksandar Milenkovic Lecture Notes
CS-334: Computer Architecture
Interrupts What is an interrupt? What does an interrupt do to the “flow of control” Interrupts used to overlap computation & I/O – Examples would be console.
6-1 I/O Methods I/O – Transfer of data between memory of the system and the I/O device Most devices operate asynchronously from the CPU Most methods involve.
Computer System Structures memory memory controller disk controller disk controller printer controller printer controller tape-drive controller tape-drive.
Timers and Interrupts Shivendu Bhushan Summer Camp ‘13.
Unit-5 CO-MPI autonomous
Group 7 Jhonathan Briceño Reginal Etienne Christian Kruger Felix Martinez Dane Minott Immer S Rivera Ander Sahonero.
2. Methods for I/O Operations
The 8051 Microcontroller and Embedded Systems
INTERRUPTS PROGRAMMING
NS Training Hardware. System Controller Module.
CS-334: Computer Architecture
The computer system’s I/O architecture is its interface to the outside world. This architecture provides a systematic means of controlling interaction.
Chapter 7 Input/Output Luisa Botero Santiago Del Portillo Ivan Vega.
General System Architecture and I/O.  I/O devices and the CPU can execute concurrently.  Each device controller is in charge of a particular device.
Operating System Concepts Ku-Yaw Chang Assistant Professor, Department of Computer Science and Information Engineering Da-Yeh University.
Interrupts. What Are Interrupts? Interrupts alter a program’s flow of control  Behavior is similar to a procedure call »Some significant differences.
Chapter 10: Input / Output Devices Dr Mohamed Menacer Taibah University
8254 Programmable Interval Timer
7/23 Timers in Coldfire Processor Computer Science & Engineering Department Arizona State University Tempe, AZ Dr. Yann-Hang Lee (480)
PIT: Programmable Interval Timer
Timer Peripherals.
The 8253 Programmable Interval Timer
Interrupts and DMA CSCI The Role of the Operating System in Performing I/O Two main jobs of a computer are: –Processing –Performing I/O manage and.
CHAPTER 3 TOP LEVEL VIEW OF COMPUTER FUNCTION AND INTERCONNECTION
1-1 Embedded Network Interface (ENI) API Concepts Shared RAM vs. FIFO modes ENI API’s.
1 ARM University Program Copyright © ARM Ltd 2013 Timer Peripherals.
Lecture 14 DMA Controller & Serial Communications Interface (UART)
Top Level View of Computer Function and Interconnection.
Timers.
Using Direct Memory Access to Improve Performance
I/O management is a major component of operating system design and operation Important aspect of computer operation I/O devices vary greatly Various methods.
COMPUTER ORGANIZATIONS CSNB123 NSMS2013 Ver.1Systems and Networking1.
Interrupts, Buses Chapter 6.2.5, Introduction to Interrupts Interrupts are a mechanism by which other modules (e.g. I/O) may interrupt normal.
Direct Memory Access (DMA) Microprocessors I -1. Topics to be discussed  Basic DMA Concept Basic DMA Concept  DMA pins and timing DMA pins and timing.
Computer Architecture Lecture 2 System Buses. Program Concept Hardwired systems are inflexible General purpose hardware can do different tasks, given.
EEE440 Computer Architecture
Direct Memory Access Module MTT M DIRECT MEMORY ACCESS MODULE (DMA)
Accessing I/O Devices Processor Memory BUS I/O Device 1 I/O Device 2.
© 2008, Renesas Technology America, Inc., All Rights Reserved 1 Module Introduction Purpose  This training module provides an overview of the peripherals.
The computer system’s I/O architecture is its interface to the outside world. This architecture provides a systematic means of controlling interaction.
CE Operating Systems Lecture 2 Low level hardware support for operating systems.
© 2008, Renesas Technology America, Inc., All Rights Reserved 1 Course Introduction  Purpose:  This course provides an overview of the Direct Memory.
Silberschatz, Galvin and Gagne  Applied Operating System Concepts Chapter 2: Computer-System Structures Computer System Architecture and Operation.
CE Operating Systems Lecture 2 Low level hardware support for operating systems.
Dr Mohamed Menacer College of Computer Science and Engineering, Taibah University CE-321: Computer.
Interrupt driven I/O Computer Organization and Assembly Language: Module 12.
Time Management.  Time management is concerned with OS facilities and services which measure real time.  These services include:  Keeping track of.
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
Amdahl’s Law & I/O Control Method 1. Amdahl’s Law The overall performance of a system is a result of the interaction of all of its components. System.
Introduction to Exceptions 1 Introduction to Exceptions ARM Advanced RISC Machines.
EE 345S Real-Time Digital Signal Processing Lab Fall 2008 Lab #3 Generating a Sine Wave Using the Hardware & Software Tools for the TI TMS320C6713 DSP.
Homework Reading Machine Projects Labs
Timer and Interrupts.
DMA CONTROLLER 8257 Features: It is a 4-channel DMA.
Interrupt and Exception Programming
CPE 323 Introduction to Embedded Computer Systems: DMA Controller
8253 – PROGRAMMABLE INTERVAL TIMER (PIT). What is a Timer? Timer is a specialized type of device that is used to measure timing intervals. Timers can.
Chapter 13: I/O Systems.
Presentation transcript:

1 ARM University Program Copyright © ARM Ltd 2013 Using Direct Memory Access to Improve Performance

2 ARM University Program Copyright © ARM Ltd 2013 Overview  Basic Concepts  DMA Peripherals in Cortex-M0+  DMA Applications  Data Transfer  Replacing ISRs

3 ARM University Program Copyright © ARM Ltd 2013 Basic Concepts  Special hardware to read data from a source and write it to a destination  Various configurable options  Number of data items to copy  Source and destination addresses can be fixed or change (e.g. increment, decrement)  Size of data item  When transfer starts  Operation  Initialization: Configure controller  Transfer: Data is copied  Termination: Channel indicates transfer has completed

4 ARM University Program Copyright © ARM Ltd 2013 DMA Controller Features  4 independent channels  Channel 0 has highest priority  8-, 16- or 32-bit transfers, data size can differ between source and destination  Modulo addressable  Can trigger with hardware signal or software  Can run continuously or periodically (“cycle-stealing”)  Hardware acknowledge/done signal

5 ARM University Program Copyright © ARM Ltd 2013 DMA Controller

6 ARM University Program Copyright © ARM Ltd 2013 Registers  DMA_SARn  Source address register,  Valid values 0 to 0x000f ffff  DMA_DARn  Destination address register  Valid values 0 to 0x000f ffff

7 ARM University Program Copyright © ARM Ltd 2013 Status Register/Byte Count Register DMA_DSR_BCRn  Status flags: 1 indicates error  CE: Configuration error  BES: Bus error on source  BED: Bus error on destination  REQ: A transfer request is pending (more transfers to perform)  BSY: DMA channel is busy  DONE: Channel transfers have completed or an error occurred. Clear this bit in an ISR.

8 ARM University Program Copyright © ARM Ltd 2013 Byte Count Register  BCR: Bytes remaining to transfer  Decremented by 1, 2 or 4 after completing write (determined by destination data size)

9 ARM University Program Copyright © ARM Ltd 2013 DMA Control Register (DMA_DCRn)  EINT: Enable interrupt on transfer completion  ERQ: Enable peripheral request to start transfer  CS: Cycle steal  0: Greedy - DMA makes continuous transfers until BCR == 0  1: DMA shares bus, performs only one transfer per request  AA: Autoalign

10 ARM University Program Copyright © ARM Ltd 2013 DMA Control Register (DMA_DCRn)  EADRQ – Enable asynchronous DMA requests when MCU is in Stop mode  SINC/DINC – Increment SAR/DAR by 1,2 or 4 based on size of data  SSIZE/DSIZE – Source/Destination data size.  Don’t need to match – controller will perform extra reads or writes as needed (e.g. read one word, write two bytes).  00: longword (32 bits)  01: byte (8 bits)  10: word (16 bits)  START – Write 1 to start transfer

11 ARM University Program Copyright © ARM Ltd 2013 DMA Control Register (DMA_DCRn)  SMOD, DMOD – Source/Destination address modulo  When non-zero, supports circular data buffer – address wraps around after 2 n+3 bytes (16 bytes to 64 kilobytes)  When zero, circular buffer is disabled  D_REQ: If 1, then when BCR reaches zero ERQ bit will be cleared  LINKCC: Enables this channel to trigger another channel  00: Disabled  01: Two stages:  Link to channel LCH1 after each cycle-steal transfer  Link to channel LCH2 after BCR reaches 0  10: Link to channel LCH1 after each cycle-steal transfer  11: Link to channel LCH1 after BCR reaches 0  LCH1, LCH2: Values 00 to 11 specify linked DMA channel (0 to 3)

12 ARM University Program Copyright © ARM Ltd 2013 Basic Use of DMA  Enable clock to DMA module (in SIM register SCGC7)  Initialize control registers  Load SARn with source address  Load DARn with destination address  Load BCRn with number of bytes to transfer  Clear DSRn[DONE]  Start transfer by setting DCRn[START]  Wait for end of transfer  Interrupt generated if DCRn[EINT] is set (DMAn_IRQHandler)  Poll DSRn[DONE]

13 ARM University Program Copyright © ARM Ltd 2013 Demonstration: Memory Copy  Software-triggered  void Copy_Longwords(uint32_t * source, uint32_t * dest, uint32_t count)  Could use as a fast version of memcpy function, but need to handle all cases  Alignment of source and destination  Data size  Detecting overlapping buffers

14 ARM University Program Copyright © ARM Ltd 2013 Triggering DMA Activity Using Peripherals  Can use trigger events from peripherals to start DMA transfer  Upon triggering, DMA can perform:  One transfer (cycle steal mode)  All transfers until BCR == 0 (continuous mode)  DMA Multiplexer (DMAMUX)  Selects which source will trigger a DMA channel  Each DMA channel n has a configuration register DMAMUX_CHCFGn  ENBL: Enable DMA channel  TRIG: Enables triggering of DMA channel  SOURCE: Selects triggering source

15 ARM University Program Copyright © ARM Ltd 2013 Trigger Sources  Trigger sources are chip-specific, so see KL25 Sub-Family Reference Manual (rev. 1), Chapter 3 - Chip Configuration, Section DMA MUX Request Sources Source #ModuleDescription 0-Disabled 2-7UART0,1, 2Receive, Transmit 16-19SPI0, 1Receive, Transmit 22-23I 2 C0, TPM0Channels TPM1-2Channels ADC0 42CMP0 45DAC Port Control ModulePort A-E 54-56TPM0-2Overflow 57TSI 60-63DMAMUXAlways enabled

16 ARM University Program Copyright © ARM Ltd 2013 Triggering DMA Transfers with PITs  DMA Channels 0, 1 can be triggered by PITs 0 and 1 as well as the previously listed sources  PIT generates a trigger pulse each time it reaches 0  Disable interrupt for PITn by clearing TIE (interrupt enable) bit.  Clear ENBL and TRIG bits in CHCFGn register  Configure DMA Controller channel  Configure timer  Select trigger source with CHCFGn register and set ENBL and TRIG bits

17 ARM University Program Copyright © ARM Ltd 2013 Demonstration of ISR Replacement: DAC Playback  TPM0 expires each time we need to update the DAC  When DMA is triggered by TPM:  Copy one 16-bit data element from wave table (source) to DAC data register (DAC0- >DAT[0])  Increment source address, decrement byte count  When DMA finishes last transfer  Generate interrupt  IRQ handler will clear interrupt flags, restart DMA transfers  DMA Configuration  Source, destination sizes = 2 bytes: SSIZE = DSIZE = 2  Enable peripheral request: ERQ = 1  Select hardware trigger  One transfer per peripheral request, so use cycle-steal mode: CS = 1  Two bytes per transfer: BCR = 2 bytes  Increment source register: SINC = 1  Do not increment destination register: DINC = 0  TPM Configuration  Don’t enable timer interrupt  TPM will still generate trigger event

18 ARM University Program Copyright © ARM Ltd 2013 Performance Comparison  Traces  Yellow: ISR is executing when trace is low  Blue: DAC output (PTE30, pin 11 of J10)  Without DMA: Interrupt per sample  4.7 microseconds per 620 microseconds  0.758% of processor’s time  With DMA: Interrupt per cycle  5.0 microseconds per 20 milliseconds  0.025% of processor’s time  How is this useful?  Saves CPU time  Reduces timing vulnerability to interrupts being disabled  Enables CPU to sleep longer, wake up less often (20 milliseconds vs. 620 microseconds)