System Design Tricks for Low-Power Video Processing Jonah Probell, Director of Multimedia Solutions, ARC International.

Slides:



Advertisements
Similar presentations
Main MemoryCS510 Computer ArchitecturesLecture Lecture 15 Main Memory.
Advertisements

Chapter 5 Internal Memory
Computer Organization and Architecture
+ CS 325: CS Hardware and Software Organization and Architecture Internal Memory.
Anshul Kumar, CSE IITD CSL718 : Main Memory 6th Mar, 2006.
COEN 180 DRAM. Dynamic Random Access Memory Dynamic: Periodically refresh information in a bit cell. Else it is lost. Small footprint: transistor + capacitor.
Power Reduction Techniques For Microprocessor Systems
Memory Chapter 3. Slide 2 of 14Chapter 1 Objectives  Explain the types of memory  Explain the types of RAM  Explain the working of the RAM  List the.
Main Mem.. CSE 471 Autumn 011 Main Memory The last level in the cache – main memory hierarchy is the main memory made of DRAM chips DRAM parameters (memory.
CS.305 Computer Architecture Memory: Structures Adapted from Computer Organization and Design, Patterson & Hennessy, © 2005, and from slides kindly made.
1 The Basic Memory Element - The Flip-Flop Up until know we have looked upon memory elements as black boxes. The basic memory element is called the flip-flop.
DRAM. Any read or write cycle starts with the falling edge of the RAS signal. –As a result the address applied in the address lines will be latched.
Memory Hierarchy.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output.
Processor Frequency Setting for Energy Minimization of Streaming Multimedia Application by A. Acquaviva, L. Benini, and B. Riccò, in Proc. 9th Internation.
8-5 DRAM ICs High storage capacity Low cost Dominate high-capacity memory application Need “refresh” (main difference between DRAM and SRAM) -- dynamic.
Jamie Unger-Fink John David Eriksen. Outline Intro to LCDs Power Issues Energy Model New Reduction Techniques Results Conclusion.
©Wen-mei W. Hwu and David Kirk/NVIDIA, ECE408/CS483/ECE498AL, University of Illinois, ECE408/CS483 Applied Parallel Programming Lecture 7: DRAM.
Computer Architecture Part III-A: Memory. A Quote on Memory “With 1 MB RAM, we had a memory capacity which will NEVER be fully utilized” - Bill Gates.
Faculty of Information Technology Department of Computer Science Computer Organization and Assembly Language Chapter 5 Internal Memory.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
CSIE30300 Computer Architecture Unit 07: Main Memory Hsin-Chou Chi [Adapted from material by and
Raster-scan system In addition to the central processing unit a special purpose processor called the video controller or display controller is used to.
Survey of Existing Memory Devices Renee Gayle M. Chua.
Memory Systems Embedded Systems Design and Implementation Witawas Srisa-an.
Chapter 5 Internal Memory. Semiconductor Memory Types.
Systems Overview Computer is composed of three main components: CPU Main memory IO devices Refers to page
Chapter 3 Internal Memory. Objectives  To describe the types of memory used for the main memory  To discuss about errors and error corrections in the.
CPEN Digital System Design
University of Tehran 1 Interface Design DRAM Modules Omid Fatemi
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
ATtiny23131 A SEMINAR ON AVR MICROCONTROLLER ATtiny2313.
Computer Architecture Lecture 24 Fasih ur Rehman.
Physical Memory and Physical Addressing By Alex Ames.
CS/EE 5810 CS/EE 6810 F00: 1 Main Memory. CS/EE 5810 CS/EE 6810 F00: 2 Main Memory Bottom Rung of the Memory Hierarchy 3 important issues –capacity »BellÕs.
How do you model a RAM in Verilog. Basic Memory Model.
Semiconductor Memory Types
COMP541 Memories II: DRAMs
1 Adapted from UC Berkeley CS252 S01 Lecture 18: Reducing Cache Hit Time and Main Memory Design Virtucal Cache, pipelined cache, cache summary, main memory.
Contemporary DRAM memories and optimization of their usage Nebojša Milenković and Vladimir Stanković, Faculty of Electronic Engineering, Niš.
Chapter 5 Internal Memory. contents  Semiconductor main memory - organisation - organisation - DRAM and SRAM - DRAM and SRAM - types of ROM - types of.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
CS35101 Computer Architecture Spring 2006 Lecture 18: Memory Hierarchy Paul Durand ( ) [Adapted from M Irwin (
Computer Architecture Chapter (5): Internal Memory
RAM RAM - random access memory RAM (pronounced ramm) random access memory, a type of computer memory that can be accessed randomly;
COMP541 Memories II: DRAMs
Chapter 5 Internal Memory
William Stallings Computer Organization and Architecture 7th Edition
Address – 32 bits WRITE Write Cache Write Main Byte Offset Tag Index Valid Tag Data 16K entries 16.
COMP541 Memories II: DRAMs
William Stallings Computer Organization and Architecture 7th Edition
William Stallings Computer Organization and Architecture 8th Edition
William Stallings Computer Organization and Architecture 7th Edition
William Stallings Computer Organization and Architecture 8th Edition
BIC 10503: COMPUTER ARCHITECTURE
Computer Evolution and Performance
AKT211 – CAO 07 – Computer Memory
William Stallings Computer Organization and Architecture 8th Edition
Memory Considerations
Presentation transcript:

System Design Tricks for Low-Power Video Processing Jonah Probell, Director of Multimedia Solutions, ARC International

2 About Jonah Probell Author of VideoBits.ORG Digital video expert Processor architect Semiconductor IP developer Entrepreneur Consultant Solutions architect for ARC International

3 Consumer Video Device Diagram

4 Tricks Sleep mode Image enhancement SRAM vs SDRAM Data bus width System in package Frame buffer tiling

5 Presentation Format Basis Action Cost

6 The worst frame cycle count often exceeds the average by 2x or more. Trick: Sleep mode Run the clock fast enough to process the worst frame without dropping frames. Most frames finish early. For these, enter a power saving sleep mode until it is time to process the next frame. Chip design complexity due to clock gating and multiple voltage islands.

7 Example frame processing CPU work / sleep time

8 Trick: Image enhancement LCD display illumination is the largest consumer of power in most mobile video devices. A sharper, brighter, higher contrast image can be discerned with less illumination. Digitally enhance the brightness, contrast, and edge sharpness of the image in the CPU and dim the display illumination to save power. CPU power consumption is increased to perform the extra calculations.

9 Example original image

10 Example enhanced image

11 Increase on-chip cache and buffer SRAMs to decrease off-chip SDRAM accesses. Processor chip size increases. Trick: SRAM vs SDRAM Off-chip memory access bandwidth can be reduced by increasing on-chip memory.

12 SRAM vs SDRAM example

13 Trick: Data bus width Increase data bus width to decrease transfer cycles. Increased pin count and pad ring die area. A significant component of SDRAM power consumption is proportional to the number of data transfer cycles.

14 Data transfer power at different bus widths DRAM Bank Array sense amps DRAM Bank Array DRAM Bank Array DRAM Bank Array X16 bits DRAM Bank Array DRAM Bank Array DRAM Bank Array DRAM Bank Array X32 bits 32-bit wide DRAM→ ←16-bit wide DRAM

15 Trick: System in package Include processor chip and DRAM chip within the same package to increase speed and reduce power consumption. Increased packaging complexity and component cost. Traces within a package are smaller and have lower capacitance than traces on a board.

16 System in package diagram

17 Trick: Frame buffer tiling Store reference frame data ordered by rectangular tiles, rather than in raster order, to probabilistically reduce the number of SDRAM rows accessed for each prediction block read. Rasterization step required before display. A large portion of SDRAM access time for motion compensation block reads is due to row address switching (RAS) delay.

18 Frame buffer data order Raster scan order Tiled order

19 Conclusion Correct design can significantly reduce the power consumption of digital video systems. Making optimal design trade-offs requires a software programmable, configurable, video optimized processor. Sleep mode – software programmable Image enhancement – software programmable SRAM vs SDRAM – configurable Data bus width – configurable System in package – configurable Frame buffer tiling – video optimized