Power Efficient Rapid System Prototyping Using CoDeL: The 2D DWT Using Lifting Nainesh Agarwal & Nikitas Dimopoulos University of Victoria, Canada PacRim,

Slides:



Advertisements
Similar presentations
VHDL Design of Multifunctional RISC Processor on FPGA
Advertisements

FPGA (Field Programmable Gate Array)
TOPIC : SYNTHESIS DESIGN FLOW Module 4.3 Verilog Synthesis.
MP3 Optimization Exploiting Processor Architecture and Using Better Algorithms Mancia Anguita Universidad de Granada J. Manuel Martinez – Lechado Vitelcom.
VADA Lab.SungKyunKwan Univ. 1 L3: Lower Power Design Overview (2) 성균관대학교 조 준 동 교수
High Level Languages: A Comparison By Joel Best. 2 Sources The Challenges of Synthesizing Hardware from C-Like Languages  by Stephen A. Edwards High-Level.
Digital Design with VHDL Presented by: Amir Masoud Gharehbaghi
University Of Vaasa Telecommunications Engineering Automation Seminar Signal Generator By Tibebu Sime 13 th December 2011.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
Copyright 2001, Agrawal & BushnellDay-1 PM Lecture 4a1 Design for Testability Theory and Practice Lecture 4a: Simulation n What is simulation? n Design.
Behavioral Design Outline –Design Specification –Behavioral Design –Behavioral Specification –Hardware Description Languages –Behavioral Simulation –Behavioral.
Mehdi Amirijoo1 Power estimation n General power dissipation in CMOS n High-level power estimation metrics n Power estimation of the HW part.
ELEN468 Lecture 11 ELEN468 Advanced Logic Design Lecture 1Introduction.
Using CoDeL to rapidly prototype network processsor extensions Nainesh Agarwal and Nikitas J. Dimopoulos Department of Electrical and Computer Engineering.
1 Chapter 7 Design Implementation. 2 Overview 3 Main Steps of an FPGA Design ’ s Implementation Design architecture Defining the structure, interface.
Power, Energy and Delay Static CMOS is an attractive design style because of its good noise margins, ideal voltage transfer characteristics, full logic.
VLSI Design Lab Introduction
Digital System Design EEE344 Lecture 1 INTRODUCTION TO THE COURSE
(1) Introduction © Sudhakar Yalamanchili, Georgia Institute of Technology, 2006.
(1) Modeling Digital Systems © Sudhakar Yalamanchili, Georgia Institute of Technology, 2006.
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
CSET 4650 Field Programmable Logic Devices
Chap. 1 Overview of Digital Design with Verilog. 2 Overview of Digital Design with Verilog HDL Evolution of computer aided digital circuit design Emergence.
Ronny Krashinsky Seongmoo Heo Michael Zhang Krste Asanovic MIT Laboratory for Computer Science SyCHOSys Synchronous.
ASIC/FPGA design flow. FPGA Design Flow Detailed (RTL) Design Detailed (RTL) Design Ideas (Specifications) Design Ideas (Specifications) Device Programming.
System Arch 2008 (Fire Tom Wada) /10/9 Field Programmable Gate Array.
Automated Design of Custom Architecture Tulika Mitra
Section 10: Advanced Topics 1 M. Balakrishnan Dept. of Comp. Sci. & Engg. I.I.T. Delhi.
1 H ardware D escription L anguages Modeling Digital Systems.
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
COE 405 Design and Modeling of Digital Systems
Array Synthesis in SystemC Hardware Compilation Authors: J. Ditmar and S. McKeever Oxford University Computing Laboratory, UK Conference: Field Programmable.
1 Fly – A Modifiable Hardware Compiler C. H. Ho 1, P.H.W. Leong 1, K.H. Tsoi 1, R. Ludewig 2, P. Zipf 2, A.G. Oritz 2 and M. Glesner 2 1 Department of.
Modern VLSI Design 3e: Chapters 3 & 8Partly from 2002 Prentice Hall PTR week6-1 Lectures 16 Transfer Characteristics (Delay and Power) Feb. 10, 2003.
Power Estimation and Optimization for SoC Design
Slide 1 2. Verilog Elements. Slide 2 Why (V)HDL? (VHDL, Verilog etc.), Karen Parnell, Nick Mehta, “Programmable Logic Design Quick Start Handbook”, Xilinx.
1 Hardware Description Languages: a Comparison of AHPL and VHDL By Tamas Kasza AHPL&VHDL Digital System Design 1 (ECE 5571) Spring 2003 A presentation.
Evaluating and Improving an OpenMP-based Circuit Design Tool Tim Beatty, Dr. Ken Kent, Dr. Eric Aubanel Faculty of Computer Science University of New Brunswick.
Introduction to VHDL Simulation … Synthesis …. The digital design process… Initial specification Block diagram Final product Circuit equations Logic design.
Modern VLSI Design 4e: Chapter 8 Copyright  2008 Wayne Wolf Topics Modeling with hardware description languages (HDLs).
Modern VLSI Design 3e: Chapter 8 Copyright  1998, 2002 Prentice Hall PTR Topics n Modeling with hardware description languages (HDLs).
IMPLEMENTATION OF MIPS 64 WITH VERILOG HARDWARE DESIGN LANGUAGE BY PRAMOD MENON CET520 S’03.
ELEE 4303 Digital II Introduction to Verilog. ELEE 4303 Digital II Learning Objectives Get familiar with background of HDLs Basic concepts of Verilog.
04/26/20031 ECE 551: Digital System Design & Synthesis Lecture Set : Introduction to VHDL 12.2: VHDL versus Verilog (Separate File)
FPGA-Based System Design Copyright  2004 Prentice Hall PTR Topics n Modeling with hardware description languages (HDLs).
VLSI Design of 2-D Discrete Wavelet Transform for Area-Efficient and High- Speed Image Computing - End Presentation Presentor: Eyal Vakrat Instructor:
Implementing JPEG Encoder for FPGA ECE 734 PROJECT Deepak Agarwal.
LOW POWER DESIGN METHODS
DAC50, Designer Track, 156-VB543 Parallel Design Methodology for Video Codec LSI with High-level Synthesis and FPGA-based Platform Kazuya YOKOHARI, Koyo.
SUBJECT : DIGITAL ELECTRONICS CLASS : SEM 3(B) TOPIC : INTRODUCTION OF VHDL.
IAY 0600 Digital Systems Design
ASIC Design Methodology
VLSI Testing Lecture 5: Logic Simulation
EEE2135 Digital Logic Design Chapter 1. Introduction
Introduction to Programmable Logic
Topics Modeling with hardware description languages (HDLs).
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
VLSI Testing Lecture 5: Logic Simulation
Vishwani D. Agrawal Department of ECE, Auburn University
Topics Modeling with hardware description languages (HDLs).
Introduction to cosynthesis Rabi Mahapatra CSCE617
Reconfigurable Computing
Field Programmable Gate Array
Field Programmable Gate Array
Hardware Acceleration of the Lifting Based DWT
Field Programmable Gate Array
Lesson 4 Synchronous Design Architectures: Data Path and High-level Synthesis (part two) Sept EE37E Adv. Digital Electronics.
Image Transforms for Robust Coding
VHDL Introduction.
Digital Designs – What does it take
Presentation transcript:

Power Efficient Rapid System Prototyping Using CoDeL: The 2D DWT Using Lifting Nainesh Agarwal & Nikitas Dimopoulos University of Victoria, Canada PacRim, August, 2005

PacRim /21/2015 Outline Motivation Power Dissipation Clock Gating Hardware Description Languages  System Level Design Languages CoDeL Power Savings Analysis Framework Evaluation: DWT Conclusion

PacRim /21/2015 Motivation Increase in portable systems that run on batteries, such as cell phones, PDAs, digital cameras DSP techniques needed to process data, and transmit or display this data As processing algorithms become complex, power requirements increase Higher power requirements means  Low battery life  Expensive cooling and packaging techniques, which may increase the size of the device  Lower circuit density  Shorter component life

PacRim /21/2015 Motivation (contd.) Long design cycles for hardware architectures  Can take up to a year for a team of engineers to develop an ASIC Emergence of System-Design Languages (SLDLs)  Do not address power dissipation Power efficient architecture design is tricky by hand and requires even longer lead times.

PacRim /21/2015 Power Dissipation CMOS circuits Static Dissipation  Steady state  No Switching Dynamic Dissipation  Switching  Changes in digital state

PacRim /21/2015 Static Dissipation Ideal static dissipation = 0 Reverse biased diodes between pn junctions Sub-threshold current when gate to source voltage is below the threshold Becoming significant Source: Kursun and Friedman, Sleep Switch Dual Threshold Voltage Domino Logic with Reduced Standby Leakage Current. IEEE Trans. VLSI, Vol. 12, No. 5, May 2004.

PacRim /21/2015 Dynamic Dissipation Short-circuit dissipation  When both n- and p-type transistors are on for a brief moment, there is a short current pulse  Not significant Current required to charge and discharge the capacitive load  Significant Activity factor Capacitive load Source voltage Circuit frequency

PacRim /21/2015 Clock Gating Reduce dynamic power dissipation Reduce the clock switching activity Enable clock only when a useful write is needed

PacRim /21/2015 Hardware Description Languages Describe the temporal and spatial behaviour of a circuit Common targets: ASIC and FPGA VHDL and Verilog  Design at Register Transfer Level (RTL) Abstraction level too low

PacRim /21/2015 System Level Design Languages Started late 1990s Provide a high level of abstraction for system development Categories  Extend existing HDLs: SystemVerilog  Extend existing software languages: SystemC, SpecC, Handel-C, JHDL  Newly created languages: Rosetta, CoDeL Algorithmic level design  Only CoDeL and Handel-C Assembly Language High Level Languages: C, Java HDL (RTL) SLDL Higher Abstraction Fast development Easy to learn Platform independence

PacRim /21/2015 CoDeL - Overview CoDeL (Controller Description Language), targets the specification and design at the behavioral level. Order of the statements implicitly represents the sequence of activities. Extracts the data and control flow from the program automatically, assigns the necessary hardware blocks and exploits inherent parallelism. Similar to the C language, so easy to learn. Includes a library of I/O protocols that simplify (sub)system interaction. Compiler produces synthesizable VHDL code which can be targeted to any technology including FPGA or ASIC.

PacRim /21/2015 CoDeL – Ports and Protocols CoDeL abstracts module interaction through ports and protocols. Protocols define the sequence of events necessary to transfer information from one module to another.

PacRim /21/2015 CoDeL – Simple Counter A very simple counter

PacRim /21/2015 CoDeL – Clock Gating Example shows write in state x Gate turned on in state x-1, off in state x+1 State x - 1 State xState x + 1 Clk Enable GClk Data Latched

PacRim /21/2015 Power Savings Analysis Framework Power saved = + Power saved in avoiding useless switching + Power saved in avoiding clock switching - Power required for clock gating (overhead)

PacRim /21/2015 Evaluation: 2D DWT Key component in JPEG2000 image compression Lossy compression using MIT 9/7 wavelet Lossless compression using Le Gall 5/3 integer- to-integer wavelet  Integer to integer mapping  No quantization needed  Exact recovery of input signal

PacRim /21/2015 DWT Structure Successive pair of low-pass and high-pass filters, followed by factor 2 down-sampling Analysis stage decomposes, while synthesis reconstructs h 0 is the low-pass filter and h 1 is the high-pass filter Low-pass signal recursively decomposed for full, dyadic transform h0h0 h0h0 h1h1 h1h1 22 22 g0g0 g0g0 g1g1 g1g1 22 22 x(n) ^ Analysis Filter BankSynthesis Filter Bank

PacRim /21/2015 DWT - Lifting Reduction in memory and computational complexity In-place computation of the wavelet coefficients Output is identical to a direct filter bank convolution Lazy Transform Predict Update - + Input Low-pass output High-pass output Even samples Odd samples Predict Update

PacRim /21/2015 Implementation fStart fReady Start Ready Start Ready iStart iReady EndPt Step EndPt Step EndPt Step Start Ready N (Cols) DWT Module Synthesis Filter Bank Module Analysis Filter Bank Module Forward/Inverse M (Rows) Size (M*N) StartPt Register File

PacRim /21/2015 Code Complexity Analysis and synthesis filter bank modules  120 lines of CoDeL code each  Generate about 1000 lines of VHDL code each DWT module  110 lines of CoDeL code  Generates 560 lines of VHDL Synthesized on a Xilinx 2v2000ff896-4 FPGA  About 7% area used  Maximum clock frequency of 103 MHz Eight element DWT takes 3.9μs

PacRim /21/2015 Power Savings Estimation No useless switching found Analysis & Synthesis filter bank modules  85% area  17% power saved DWT modules  15% area  8% power saved Use area complexity as an approximation for power complexity 16% total power saved

PacRim /21/2015 Future Work Clock gating  Verify analytical framework using simulation and ASIC implementation  Efficient clock gating mechanism CoDel compiler  Automated clock gating  Register and state reuse  Allow explicit parallelism (similar to technique used in OpenMP and Handel-C)

PacRim /21/2015 Questions