High Dynamic Range Emeka Ezekwe M11 Christopher Thayer M12 Shabnam Aggarwal M13 Charles Fan M14 Manager: Matthew Russo 6/26/2015 1.

Slides:



Advertisements
Similar presentations
Computer Architecture
Advertisements

Programmable FIR Filter Design
H.264 Intra Frame Coder System Design Özgür Taşdizen Microelectronics Program at Sabanci University 4/8/2005.
[M2] Traffic Control Group 2 Chun Han Chen Timothy Kwan Tom Bolds Shang Yi Lin Manager Randal Hong Wed. Oct. 29 Overall Project Objective : Dynamic Control.
Idongesit Ebong (1-1) Jenna Fu (1-2) Bowei Gai (1-3) Syed Hussain (1-4) Jonathan Lee (1-5) Design Manager: Myron Kwai Overall Project Objective: Design.
Room: E-3-31 Phone: Dr Masri Ayob TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Computer Performance.
1 Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 12 MAD MAC th April, 2006 Short Final Presentation.
Virtual Wallet Gates Winkler Yin Shen Jordan Samuel Fei /23/2009 A handheld device that saves time and money through smart budget management and.
1 Team M-2: Charles Norman (M2-1) Julio Segundo (M2-2) Nan Li (M2-3) Shanshan Ma (M2-4) Design Manager: Zack Menegakis Presentation 13: Final Presentation.
1 Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 7 MAD MAC th March, 2006 Functional Block.
Noise Canceling in 1-D Data: Presentation #13 Seri Rahayu Abd Rauf Fatima Boujarwah Juan Chen Liyana Mohd Sharipp Arti Thumar M2 April 20 th, 2005 Short.
[M2] Traffic Control Group 2 Chun Han Chen Timothy Kwan Tom Bolds Shang Yi Lin Manager Randal Hong Wed. Oct. 22 Overall Project Objective : Dynamic Control.
EE 141 Project 2May 8, Outstanding Features of Design Maximize speed of one 8-bit Division by: i. Observing loop-holes in 8-bit division ii. Taking.
Team W3: Anthony Marchetta Derek Ritchea David Roderick Adam Stoler Milestone 3: Feb. 4 th Size Estimates/Floorplan Overall Project Objective: Design an.
Viterbi Decoder: Presentation #11 M1 Overall Project Objective: Design a high speed Viterbi Decoder Stage 11: 12 th April 2004 Short Final Presentation.
Huffman Encoder Project. Howd - Zur Hung Eric Lai Wei Jie Lee Yu - Chiang Lee Design Manager: Jonathan P. Lee Huffman Encoder Project Final Presentation.
Noise Canceling in 1-D Data: Presentation #10 Seri Rahayu Abd Rauf Fatima Boujarwah Juan Chen Liyana Mohd Sharipp Arti Thumar M2 Mar 28 rd, 2005 Chip Level.
[M2] Traffic Control Group 2 Chun Han Chen Timothy Kwan Tom Bolds Shang Yi Lin Manager Randal Hong Wed. Oct. 27 Overall Project Objective : Dynamic Control.
Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 4 MAD MAC th February, 2006 Gate Level Design.
1 Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 5 MAD MAC nd February, 2006 Top Level Integration.
Random Number Generator Dmitriy Solmonov W1-1 David Levitt W1-2 Jesse Guss W1-3 Sirisha Pillalamarri W1-4 Matt Russo W1-5 Design Manager – Thiago Hersan.
Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 3 MAD MAC th February, 2006 Size estimates/Floor.
1 EECS Components and Design Techniques for Digital Systems Lec 21 – RTL Design Optimization 11/16/2004 David Culler Electrical Engineering and Computer.
Sprinkler Buddy Presentation #8: “Testing/Finalization of all Modules and Global Placement” 3/26/2007 Team M3 Kartik Murthy Panchalam Ramanujan Sasidhar.
1. 2 Farhan Mohamed Ali Jigar Vora Sonali Kapoor Avni Jhunjhunwala 1 st May, 2006 Final Presentation MAD MAC 525 Design Manager: Zack Menegakis Design.
Sprinkler Buddy Presentation #7: “Redesign of Adder Parts And Layout of Other Major Blocks” 3/07/2007 Team M3 Kalyan Kommineni Kartik Murthy Panchalam.
Noise Canceling in 1-D Data: Presentation #5 Seri Rahayu Abd Rauf Fatima Boujarwah Juan Chen Liyana Mohd Sharipp Arti Thumar M2 Feb 21 st, 2005 Component.
Evolutions of GPU Architectures Andrew Coile CMPE220 3/2007.
1 Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 8 MAD MAC nd March, 2006 Functional Block.
1 3/22/02 Benchmark Update u Carnegie Cell Library: “Free to all who Enter” s Need to build scaling model of standard cell library s Based on our open.
Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Siven Seth (W2-5) Presentation 1 MAD MAC th January, 2006.
Team W1 Design Manager: Rebecca Miller 1. Bobby Colyer (W11) 2. Jeffrey Kuo (W12) 3. Myron Kwai (W13) 4. Shirlene Lim (W14) Stage II: February 4 th 2004.
Team W1 Design Manager: Rebecca Miller 1. Bobby Colyer (W11) 2. Jeffrey Kuo (W12) 3. Myron Kwai (W13) 4. Shirlene Lim (W14) Stage II: 26 th January 2004.
Final Presentation: April 25 th, 2005 Seri Abd Rauf Fatima Boujarwah Juan Chen Liyana Sharipp Arti Thumar : Integrated Circuit Design Project, Spring.
Viterbi Decoder: Presentation #4 Omar Ahmad Prateek Goenka Saim Qidwai Lingyan Sun M1 Overall Project Objective: Design of a high speed Viterbi Decoder.
1 GPS Waypoint Navigation Team M-2: Charles Norman (M2-1) Julio Segundo (M2-2) Nan Li (M2-3) Shanshan Ma (M2-4) Design Manager: Zack Menegakis Presentation.
Camera Auto Focus Group W1 Tom Goff Dave Hwang Kate Killfoile Greg Look Design Manager: Bowei Gai Final Presentation, April 30 th, 2007 Project Objective:
Random Number Generator Dmitriy Solmonov W1-1 David Levitt W1-2 Jesse Guss W1-3 Sirisha Pillalamarri W1-4 Matt Russo W1-5 Design Manager – Thiago Hersan.
Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Shiven Seth (W2-5) Presentation 1 MAD MAC st February,
From Concept to Silicon How an idea becomes a part of a new chip at ATI Richard Huddy ATI Research.
1 Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 9 MAD MAC th March, 2006 Functional Block.
More Basics of CPU Design Lecture for CPSC 5155 Edward Bosworth, Ph.D. Computer Science Department Columbus State University.
Graphics Hardware Display (CRT, LCD,…) Graphics accelerator
Computer performance.
Virtual Wallet Gates Winkler Yin Shen Jordan Fei Project Manager: Prajna Shetty /02/2009 A handheld device that saves time and money through smart.
Practical PC, 7th Edition Chapter 17: Looking Under the Hood
Registers CPE 49 RMUTI KOTAT.
CAD for Physical Design of VLSI Circuits
Interactive Time-Dependent Tone Mapping Using Programmable Graphics Hardware Nolan GoodnightGreg HumphreysCliff WoolleyRui Wang University of Virginia.
Sprinkler Buddy Presentation #3: “System Level View and Floor Plan / Sizing” 2/07/2007 Team M3 Kartik Murthy Kalyan Kommineni Panchalam Ramanujan Sasidhar.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
Eng.Samra Essalaimeh Philadelphia University 2013/ nd Semester PIC Microcontrollers.
Tone Mapping on GPUs Cliff Woolley University of Virginia Slides courtesy Nolan Goodnight.
A Reconfigurable Low-power High-Performance Matrix Multiplier Architecture With Borrow Parallel Counters Counters : Rong Lin SUNY at Geneseo
Lecture 11, Advance Digital Design
Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University
EE141 Arithmetic Circuits 1 Chapter 14 Arithmetic Circuits Rev /12/2003 Rev /05/2003.
Howd - Zur Hung Eric Lai Wei Jie Lee Yu - Chiang Lee Design Manager: Jonathan P. Lee [M2] Huffman Encoder Project Presentation #3 February 7 th, 2007 Overall.
HDR- Design Presentation Team M1: Emeka Ezekwe (M11) Chris Thayer (M12) Shabnam Aggarwal (M13) Charles Fan (M14) Team M1 Manager: Matthew Russo.
ADPCM Adaptive Differential Pulse Code Modulation
Architecture & Organization 1
Alpha Blending and Smoothing
Components of Computer
Multipliers Multipliers play an important role in today’s digital signal processing and various other applications. The common multiplication method is.
Architecture & Organization 1
Topics Multipliers..
Chapter 1 Introduction.
Computer Evolution and Performance
RADEON™ 9700 Architecture and 3D Performance
Arithmetic Building Blocks
Presentation transcript:

High Dynamic Range Emeka Ezekwe M11 Christopher Thayer M12 Shabnam Aggarwal M13 Charles Fan M14 Manager: Matthew Russo 6/26/2015 1

Agenda 2  Project Description Charles  MarketingShabnam  Behavioral DescriptionEmeka  Design ProcessChris  Floorplan EvolutionShabnam  Design SpecificationsChris  LayoutCharles  ConclusionEmeka

Charles Fan Project Description 3

4  High Dynamic Range??  Bright colors are BRIGHT  Dark colors are DARK  Details are seen CLEARLY  Otherwise…  Colors and lights look distorted & bland  FP HDR Format requires 48 bits per pixel  Problem: Too much storage space & memory bandwidth!!  Solution: HDR encoding yields 6:1 compression  OUR GOAL: Implement efficient HDR decoding in hardware  6:1 pixel compression  Increases useable storage space by 6 fold  decrease memory bandwidth by 6 fold  Effectively increases performance

Shabnam Aggarwal Marketing 6

7  AMD’s ATI Mobility Radeon X1900  48-bit floating point HDR HDR Compression is currently NOT supported Performance hit deters developers  Windows Vista also now requires a high end GPU to realize its full graphics potential.  Laptops & portable devices are using dedicated processors for graphics  OLED (Organic Light Emitting Diode) Displays are being developed by Sony  Contrast Ratio: :1

Marketing 9  Our decoder is designed to interface between specially encoded textures stored on the GPU’s memory and one of the GPU’s texture caches that feed into the shader processor.  Each ROP on (**ATI) is capable of processing 4 pixels per clock cycle. We plan for our hardware to decode the texture information for 4 pixels during each clock cycle.  This decoder will allow smaller textures to be stored in the GPU’s memory, which will allow graphics cards to provide the same functions with less memory.  Ultimately, this decoder can provide savings in cost, power consumption, heat dissipation, and size in current graphics cards. Our HDR Decoder!!

Marketing 10  Our HDR Decoder:  Smaller textures stored in GPU’s memory  Same functions…less memory  Savings in:  Cost  Power consumption  Heat dissipation  Size  HDR is the next generation of display technology

Emeka Ezekwe Behavioral & Algorithmic Description 11

Algorithmic Description  Encoding  Break texture into 4X4 pixel blocks.  Extract luminance value of each pixel.  Normalize red and blue values and average over each 2X2 block. Green can be recalculated while decoding.  Allocate more bits to luminance values.  After encoding, a 4X4 block of pixels can be compressed from 48 bpp to 8 bpp.

Algorithmic Description  Decoding (Luminance values)  Reconstruct Lp 1 Logical shift 1 Integer addition  Calculate GQ 1 Integer addition  Calculate final pixel values 3 floating-point multiplications  Total calculations 1 logical shift + 2 Integer additions + 3 floating-point multiplications

Data Flow 14 Find G Reg Compute 1 pixel Compute 1 pixel Compute 1 pixel Compute 1 pixel Int to FP Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Reg 16 Serialize output Serialize output Serialize output Serialize output

Chris Thayer Design Process 15

Design Process 16  Goal: Speed  400 MHz  4 pixels per cycle, 4 cycles per block  Architectural decisions  No denormal support in Floating Point Multiplier  Pipelined design  Storing input values  Integer Multiplication  Wallace trees  Booth encoding  Critical adders  Carry select  Integer- Floating Point Conversion

 Circuit level decisions  Mirror FA’s to reduce carry-chain delay  Two different HA’s  AOI/OAI gates  Gate sizing along critical paths  Utilize Q and ~Q outputs from registers  Clock buffers built into register blocks  Double/Triple strapped VDD and GND  Repeaters to break up long wires  Balanced clock tree  Device Folding Design Process

Verification Process 18  C Implementation  Structural Verilog  Gate Level Schematic  Layout  Major Modules  Pipeline Stages  Global Signals

Shabnam Aggarwal Floorplan Evolution 19

Floorplan Evolution

Chris Thayer Design Specifications 21

Design Specifications 22  Delays  Stage one pipeline: 1.8 ns  Stage two pipeline: 1.53ns  Stage three pipeline: 2.479ns  Skew  Stage one: x  Stage two: x  Stage three: x  Resulting Clock Speed: 500 MHz  2 BILLION pixels per second  Size: 442x453 microns  Aspect Ratio: 1:1.024  Transistors: 42,772  Density: 0.21 T/micron^2

Charles Fan Layout 23

Floating Point Multiplier Layout 24 Pretty beautiful

Floating Point Multiplier Data Flow

Poly Layer 26

Metal One Layer 27

Metal Two Layer 28

Metal Three Layer 29

Metal Four Layer 30

Questions?