Caltech CS184 Winter2003 -- DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 8: January 27, 2003 Empirical Cost Comparisons.

Slides:



Advertisements
Similar presentations
Commercial FPGAs: Altera Stratix Family Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Advertisements

Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 14: March 19, 2014 Compute 2: Cascades, ALUs, PLAs.
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day6: October 11, 2000 Instruction Taxonomy VLSI Scaling.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day17: November 20, 2000 Time Multiplexing.
CS294-6 Reconfigurable Computing Day 5 September 8, 1998 Comparing Computing Devices.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 21: April 2, 2007 Time Multiplexing.
CS294-6 Reconfigurable Computing Day 6 September 10, 1998 Comparing Computing Devices.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day10: October 25, 2000 Computing Elements 2: Cascades, ALUs,
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 20: March 28, 2007 Retiming 2: Structures and Balance.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 9: February 7, 2007 Instruction Space Modeling.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 26: April 18, 2007 Et Cetera…
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day4: October 4, 2000 Memories, ALUs, and Virtualization.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
CS294-6 Reconfigurable Computing Day 2 August 27, 1998 FPGA Introduction.
Trends toward Spatial Computing Architectures Dr. André DeHon BRASS Project University of California at Berkeley.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 10: February 12, 2007 Empirical Comparisons.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 18: February 21, 2003 Retiming 2: Structures and Balance.
CS294-6 Reconfigurable Computing Day 14 October 7/8, 1998 Computing with Lookup Tables.
CS294-6 Reconfigurable Computing Day 16 October 20, 1998 Retiming Structures.
Penn ESE Spring DeHon 1 FUTURE Timing seemed good However, only student to give feedback marked confusing (2 of 5 on clarity) and too fast.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 5: January 24, 2007 ALUs, Virtualization…
1. 2 FPGAs Historically, FPGA architectures and companies began around the same time as CPLDs FPGAs are closer to “programmable ASICs” -- large emphasis.
Comparing Computing Machines Dr. André DeHon UC Berkeley November 3, 1998.
Field Programmable Gate Array (FPGA) Layout An FPGA consists of a large array of Configurable Logic Blocks (CLBs) - typically 1,000 to 8,000 CLBs per chip.
1 DIGITAL DESIGN I DR. M. MAROUF FPGAs AUTHOR J. WAKERLY.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 4, 2005 Interconnect 1: Requirements.
Reconfigurable Computing. Lect-02.2 Course Schedule Introduction to Reconfigurable Computing FPGA Technology, Architectures, and Applications FPGA Design.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 15: February 12, 2003 Interconnect 5: Meshes.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 11: March 3, 2014 Instruction Space Modeling 1.
CBSSS 2002: DeHon Costs André DeHon Wednesday, June 19, 2002.
J. Christiansen, CERN - EP/MIC
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR FPGA Fabric n Elements of an FPGA fabric –Logic element –Placement –Wiring –I/O.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 7: January 24, 2003 Instruction Space.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 13: February 20, 2002 Routing 1.
Penn ESE534 Spring DeHon ESE534: Computer Organization Day 15: March 24, 2014 Empirical Comparisons.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day18: November 22, 2000 Control.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day7: October 16, 2000 Instruction Space (computing landscape)
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day16: November 15, 2000 Retiming Structures.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 6, 2003 Interconnect 3: Richness.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 10: January 31, 2003 Compute 2:
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 5: January 17, 2003 What is required for Computation?
Caltech CS184 Winter DeHon CS184a: Computer Architecture (Structure and Organization) Day 4: January 15, 2003 Memories, ALUs, Virtualization.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
Penn ESE534 Spring DeHon ESE534: Computer Organization Day 14: March 12, 2012 Empirical Comparisons.
EEL 5722 FPGA Design Fall 2003 Digit-Serial DSP Functions Part I.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 11: February 20, 2012 Instruction Space Modeling.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 12: February 5, 2003 Interconnect 2: Wiring Requirements.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 22: April 16, 2014 Time Multiplexing.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 10: January 28, 2005 Empirical Comparisons.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 20: February 27, 2005 Retiming 2: Structures and Balance.
ESE532: System-on-a-Chip Architecture
ESE534: Computer Organization
ESE534: Computer Organization
CS184a: Computer Architecture (Structure and Organization)
ESE532: System-on-a-Chip Architecture
CS184a: Computer Architecture (Structure and Organization)
ESE534: Computer Organization
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
ESE534: Computer Organization
CS184a: Computer Architecture (Structure and Organization)
ESE534: Computer Organization
ESE534: Computer Organization
CS184a: Computer Architecture (Structure and Organization)
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 8: January 27, 2003 Empirical Cost Comparisons

Caltech CS184 Winter DeHon 2 Previously Instruction Design Space –Define –Area and Yield Model

Caltech CS184 Winter DeHon 3 Today Empirical Data –Processors –FPGAs –Custom Gate Array Std. Cell Full

Caltech CS184 Winter DeHon 4 Empirical Comparisons

Caltech CS184 Winter DeHon 5 Empirical Ground modeling in some concretes Start sorting out –custom vs. configurable –spatial configurable vs. temporal

Caltech CS184 Winter DeHon 6 MPGA vs. Custom? AMI CICC’83 –MPGA 1.0 –Std-Cell 0.7 –Custom 0.5 Toshiba DSP –Custom 0.3 Mosaid RAM –Custom 0.2 GE CICC’86 –MPGA 1.0 –Std-Cell FF/counter 0.7 FullAdder 0.4 RAM 0.2 MPGA = Metal Programmable Gate Array (traditional Gate Array)

Caltech CS184 Winter DeHon 7 Metal Programmable Gate Arrays

Caltech CS184 Winter DeHon 8 MPGAs Modern -- “Sea of Gates” yield % maybe 5k  /gate ? –(quite a bit of variance)

Caltech CS184 Winter DeHon 9 FPGA Table

Caltech CS184 Winter DeHon 10 Modern FPGAs APEX 20K1500E  52K LEs  0.18  m  24mm  22mm  1.25M 2 /LE XC2V1000  10.44mm x 9.90mm [source: Chipworks]  0.15  m  11,520 4-LUTs  1. 5M 2 /4-LUT [Both also have RAM in cited area]

Caltech CS184 Winter DeHon 11 Conventional FPGA Tile K-LUT (typical k=4) w/ optional output Flip-Flop

Caltech CS184 Winter DeHon 12 Toronto FPGA Model

Caltech CS184 Winter DeHon 13 How many gates?

Caltech CS184 Winter DeHon 14 “gates” in 2-LUT

Caltech CS184 Winter DeHon 15 Now how many?

Caltech CS184 Winter DeHon 16 Which gives: More usable gates? More gates/unit area?

Caltech CS184 Winter DeHon 17 Gates Required? Depth=3, Depth=2048?

Caltech CS184 Winter DeHon 18 Gate metric for FPGAs? Day5: several components for computations –compute element –interconnect: space time –instructions Not all applications need in same balance Assigning a single “capacity” number to device is an oversimplification

Caltech CS184 Winter DeHon 19 MPGA vs. FPGA MPGA (SOG GA) –5K 2 /gate –35-70% usable (50%) –7-17K 2 /gate net Ratio: (5) Xilinx XC4K –1.25M 2 /CLB – gates (26?) –26-73K 2 /gate net Adding ~2x Custom/MPGA, Custom/FPGA ~10x

Caltech CS184 Winter DeHon 20 MPGA vs. FPGA MPGA (SOG GA)   gd ~1ns Ratio: 1--7 (2.5) Xilinx XC4K   gates in 7ns 2-3 gates typical

Caltech CS184 Winter DeHon 21 Processors vs. FPGAs

Caltech CS184 Winter DeHon 22 Processors and FPGAs

Caltech CS184 Winter DeHon 23 Component Example XC4085XL-09 3,136 CLBs 4.6ns 682 Bit Ops/ns Alpha  64b ALUs 2.3ns 55.7 Bit Ops/ns Single die in 0.35  m [1 “bit op” = 2 gate evaluations]

Caltech CS184 Winter DeHon 24 Processors and FPGAs

Caltech CS184 Winter DeHon 25 Raw Density Summary Area –MPGA 2-3x Custom –FPGA 5x MPGA Area-Time –Gate Array 6-10x Custom –FPGA 15-20x Gate Array –Processor 10x FPGA

Caltech CS184 Winter DeHon 26 Raw Density Caveats Processor/FPGA may solve more specialized problem Problems have different resource balance requirements –…can lead to low yield of raw density

Caltech CS184 Winter DeHon 27 Task Comparisons

Caltech CS184 Winter DeHon 28 Broadening Picture Compare larger computations For comparison –throughput density metric: results/area-time normalize out area-time point selection high throughput density  most in fixed area  least area to satisfy fixed throughput target

Caltech CS184 Winter DeHon 29 Multiply

Caltech CS184 Winter DeHon 30 Example: FIR Filtering Application metric: TAPs = filter taps multiply accumulate Y i =w 1 x i +w 2 x i

Caltech CS184 Winter DeHon 31 IIR/Biquad Simplest IIR: Y i =A  X i +B  Y i-1

Caltech CS184 Winter DeHon 32 DES Keysearch

Caltech CS184 Winter DeHon 33 DNA Sequence Match Problem: “cost” of transform S 1  S 2 Given: cost of insertion, deletion, substitution Relevance: similarity of DNA sequences –evolutionary similarity –structure predict function Typically: new sequence compared to large databse

Caltech CS184 Winter DeHon 34 DNA Sequence Match

Caltech CS184 Winter DeHon 35 Floating-Point Add (single prec.)

Caltech CS184 Winter DeHon 36 Floating-Point Mpy (single prec.)

Caltech CS184 Winter DeHon 37 Degrade from Peak

Caltech CS184 Winter DeHon 38 Degrade from Peak: FPGAs Long path length  not run at cycle Limited throughput requirement –bottlenecks elsewhere limit throughput req. Insufficient interconnect Insufficient retiming resources (bandwidth)

Caltech CS184 Winter DeHon 39 Degrade from Peak: Processors Ops w/ no gate evaluations (interconnect) Ops use limited word width Stalls waiting for retimed data

Caltech CS184 Winter DeHon 40 Degrade from Peak: Custom/MPGA Solve more general problem than required –(more gates than really need) Long path length Limited throughput requirement Not needed or applicable to a problem

Caltech CS184 Winter DeHon 41 Degrade Notes We’ll cover these issues in more detail as we get into them later in the course

Caltech CS184 Winter DeHon 42 Big Ideas [MSB Ideas] Raw densities: custom:ga:fpga:processor –1:5:100:1000 –close gap with specialization

Caltech CS184 Winter DeHon 43 Big Ideas [MSB-1 Ideas] Special-purpose functional units in processors/DSPs, much lower net benefit since need to control and interconnect