1 An Exploration of the MPEG Algorithm Using Latency Insensitive Design EE249 Presentation (12/04/1999) Trevor Meyerowitz Mentored by: Luca Carloni.

Slides:



Advertisements
Similar presentations
3D Graphics Content Over OCP Martti Venell Sr. Verification Engineer Bitboys.
Advertisements

Presenter : Shao-Chieh Hou VLSI Design, Automation and Test, VLSI-DAT 2007.
Copyright  2003 Dan Gajski and Lukai Cai 1 Transaction Level Modeling: An Overview Daniel Gajski Lukai Cai Center for Embedded Computer Systems University.
Computer Organization and Architecture
ECOE 560 Design Methodologies and Tools for Software/Hardware Systems Spring 2004 Serdar Taşıran.
Anurag Dwivedi.  Verilog- Hardware Description Language  Modules  Combinational circuits  assign statement  Control statements  Sequential circuits.
Sumitha Ajith Saicharan Bandarupalli Mahesh Borgaonkar.
ECE 734: Project Presentation Pankhuri May 8, 2013 Pankhuri May 8, point FFT Algorithm for OFDM Applications using 8-point DFT processor (radix-8)
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
Evolution and History of Programming Languages Software/Hardware/System.
Instruction-Level Parallelism (ILP)
Hardware Implementation of Antenna Beamforming using Genetic Algorithm Kevin Hsiue Bryan Teague.
Power-Window Control A design exploration using Felix VCC Claudio Pinello EE249 Fall 1998 Prof. Alberto Sangiovanni-Vincentelli Mentor: Dr. Alberto Ferrari.
STARI: A Case Study in Compositional and Hierarchical Timing Verification Serdar Tasiran, Prof. Robert K. Brayton Department of Electrical Engineering.
The Design Process Outline Goal Reading Design Domain Design Flow
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
Conversion Between Video Compression Protocols Performed by: Dmitry Sezganov, Vitaly Spector Instructor: Stas Lapchev, Artyom Borzin Cooperated with:
02/02/20091 Logic devices can be classified into two broad categories Fixed Programmable Programmable Logic Device Introduction Lecture Notes – Lab 2.
Behavioral Design Outline –Design Specification –Behavioral Design –Behavioral Specification –Hardware Description Languages –Behavioral Simulation –Behavioral.
Huffman Encoder Project. Howd - Zur Hung Eric Lai Wei Jie Lee Yu - Chiang Lee Design Manager: Jonathan P. Lee Huffman Encoder Project Final Presentation.
1 Pupil Detection and Tracking System Lior Zimet Sean Kao EE 249 Project Mentors: Dr. Arnon Amir Yoshi Watanabe.
Lab for Reliable Computing Generalized Latency-Insensitive Systems for Single-Clock and Multi-Clock Architectures Singh, M.; Theobald, M.; Design, Automation.
1/31/20081 Logic devices can be classified into two broad categories Fixed Programmable Programmable Logic Device Introduction Lecture Notes – Lab 2.
Logic Design Outline –Logic Design –Schematic Capture –Logic Simulation –Logic Synthesis –Technology Mapping –Logic Verification Goal –Understand logic.
Verilog Code Generation for an ASIP Design Environment Nathan Kitchen Vinay Krishnan Mentor: Scott Weber (MESCAL) EE 244/249 Joint Project Fall 2002.
Automatic Interface Generation P.I.G. : Presented by Trevor Meyerowitz Sonics: Presented by Michael Sheets EE249 Discussion November 30, 1999.
From Miles to Millimeter: On-chip Communication Networks Alessandro Pinto U.C. Berkeley
Transaction Level Modeling Definitions and Approximations Trevor Meyerowitz EE290A Presentation May 12, 2005.
Principle of Functional Verification Chapter 1~3 Presenter : Fu-Ching Yang.
Pipelining By Toan Nguyen.
HW/SW CODESIGN OF THE MPEG-2 VIDEO DECODER Matjaz Verderber, Andrej Zemva, Andrej Trost University of Ljubljana Faculty of Electrical Engineering Trzaska.
HW/SW Co-Design of an MPEG-2 Decoder Pradeep Dhananjay Kiran Divakar Leela Kishore Kothamasu Anthony Weerasinghe.
HW/SW CODESIGN OF THE MPEG-2 VIDEO DECODER Matjaz Verderber, Andrej Zemva, Andrej Trost University of Ljubljana Faculty of Electrical Engineering Trzaska.
CSE679: MPEG r MPEG-1 r MPEG-2. MPEG r MPEG: Motion Pictures Experts Group r Standard for encoding videos/movies/motion pictures r Evolving set of standards.
1 Chapter 2. The System-on-a-Chip Design Process Canonical SoC Design System design flow The Specification Problem System design.
TM Efficient IP Design flow for Low-Power High-Level Synthesis Quick & Accurate Power Analysis and Optimization Flow JAN Asher Berkovitz Yaniv.
Processor Structure & Operations of an Accumulator Machine
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
1 WORLD CLASS – through people, technology and dedication High level modem development for Radio Link INF3430/4431 H2013.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
HW/SW PARTITIONING OF FLOATING POINT SOFTWARE APPLICATIONS TO FIXED - POINTED COPROCESSOR CIRCUITS - Nalini Kumar Gaurav Chitroda Komal Kasat.
1 Sixth Lecture: Chapter 3: CISC Processors (Tomasulo Scheduling and IBM System 360/91) Please recall:  Multicycle instructions lead to the requirement.
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
Hardware/Software Co-design Design of Hardware/Software Systems A Class Presentation for VLSI Course by : Akbar Sharifi Based on the work presented in.
Fast Simulation Techniques for Design Space Exploration Daniel Knorreck, Ludovic Apvrille, Renaud Pacalet
- 1 - EE898_HW/SW Partitioning Hardware/software partitioning  Functionality to be implemented in software or in hardware? No need to consider special.
Design & Co-design of Embedded Systems Next Step: Transaction-Level Modeling Maziar Goudarzi.
Correct-by-construction asynchronous implementation of modular synchronous specifications Jacky Potop Benoît Caillaud Albert Benveniste IRISA, France.
LZRW3 Decompressor dual semester project Part A Mid Presentation Students: Peleg Rosen Tal Czeizler Advisors: Moshe Porian Netanel Yamin
Hardware Image Signal Processing and Integration into Architectural Simulator for SoC Platform Hao Wang University of Wisconsin, Madison.
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
Sub-Nyquist Sampling Algorithm Implementation on Flex Rio
PROJECT - ZYNQ Yakir Peretz Idan Homri Semester - winter 2014 Duration - one semester.
DEVICES AND DESIGN : ASIC. DEFINITION Any IC other than a general purpose IC which contains the functionality of thousands of gates is usually called.
1 6-Performance Analysis of Embedded System Designs: Digital Camera Case Study (cont.)
1 Copyright  2001 Pao-Ann Hsiung SW HW Module Outline l Introduction l Unified HW/SW Representations l HW/SW Partitioning Techniques l Integrated HW/SW.
Fundamentals of Programming Languages-II
Application Domains for Fixed-Length Block Structured Architectures ACSAC-2001 Gold Coast, January 30, 2001 ACSAC-2001 Gold Coast, January 30, 2001.
Automated Software Generation and Hardware Coprocessor Synthesis for Data Adaptable Reconfigurable Systems Andrew Milakovich, Vijay Shankar Gopinath, Roman.
15-740/ Computer Architecture Lecture 3: Performance
Andrea Acquaviva, Luca Benini, Bruno Riccò
Digital readout architecture for Velopix
Elementary Microarchitecture Algebra
Multiscalar Processors
Interfacing Memory Interfacing.
Progress Report Algorithm Mapping onto Hardware in Simulink – IBOB/ROACH Environment Yuta Toriyama November 19, 2010.
High Level Synthesis Overview
Formal Verification of Partial Good Self-Test Fencing Structures
Transaction Level Modeling: An Overview
Presentation transcript:

1 An Exploration of the MPEG Algorithm Using Latency Insensitive Design EE249 Presentation (12/04/1999) Trevor Meyerowitz Mentored by: Luca Carloni

2 Outline n Motivation n Explanation of Latency Insensitive Design n Why MPEG is interesting? n Current Implementation n Conclusions and Future work

3 Motivation: Doomsday for IC design? n Die Size, Design complexity, and Clock speed are all increasing. n Clock propagation is an increasing problem. –Multiple clock cycles to cross a chip –Verification becomes harder –Buffering the clock isn’t enough

4 The Current Situation n Traditional design flow uses direct connections between individual blocks n The wires might be too long. n Blocks are highly coupled. –Difficult to connect independent IP’s

5 Solution: Latency Insensitive Design Remove wires and put shells around blocks RS Remove wires and put shells around blocks Connect shells with an arbitrary number of relay stations Data coherency is preserved

6 Latency Insensitive Design: The Relay Station n Data- the signal n Void- whether the signal is legitimate data n Stop- tells previous relay station whether it’s ready or not to receive data control Data_in, Void_in Data_out, Void_out Stop_OutStop_In Source: A Methodology for Correct by Construction Latency Insensitive Design (carloni et al)

7 Current Status: n Proof of concept (Carloni, McMillan, Sangiovanni-Vincentelli 1999) n Functional Example n Case Study: The PDLX micro-processor n Different examples are required –Demonstrate SOC feasibility –Especially Multimedia

8 Sample SOC idea MPEG Decode & Wireless Comm Chip HDTV (picture from High MHz High Power and Bandwidth Cell Phone (picture from Low MHz Low Power and Bandwidth

9 Why MPEG Decode is Good n Complicated & Highly Data Dependant –Different resolutions, encoding techniques n Not as arbitrary or complicated as Encode n Real Time Constraints n Great Target for HW/SW Partitioning/Co-Design n Relevant for SOC today

10 Description of MPEG Decode n Aside from headers can be classified as 4 types of blocks –D-Block - (control information) –I-Block - Full Picture –P-Block - Picture Dependant on prior picture –B-Block - Picture Dependant on future picture n Each non-D-Block is split up into 8x8 chunks

11 Description of MPEG Decode Header Decode VLC/RLC Decode Inverse Quantize Inverse DCT Inverse ZigZag Add Picture Input Stream Picture Store (I)Picture Store (P) Forward Predictor Backward Predictor Decoded Picture Memory Bus Source: MPEG-2 John Watkinson hardware software ?

12 Current Implementation n Have translated hardware portions from MPEG-2 Decoder C file to Verilog n Hooked up them up using relay stations n Verified Functionality Inverse Quantize Inverse ZigZag Inverse DCT Relay Station

13 Results: n Relay Stations do add a startup delay similar to pipelining –Basic Simulation confirms 4.5* cycle delay n Software-ish portions are a nightmare to translate to Verilog –Memory access and pointers –Software is sequential, HW isn’t usually

14 Conclusion n What Remains to be Done: –More thorough verification –Further exploration of results –Perhaps add VLC/RLC to hardware –Possibly interface with PLI n Possible Future Work: –Exploration of interfacing with software –Further Performance Analysis –Automatic Shell Synthesis –Instantiation of Multiple Execution Units