Steve Poret RCS – ENG 6530 June 10, 2008. [1] Measuring the Gap between FPGAs and ASICs  Ian Kuon and Jonathan Rose  The Edward S. Rogers Sr. Department.

Slides:



Advertisements
Similar presentations
FPGA (Field Programmable Gate Array)
Advertisements

Dan Lander Haru Yamamoto Shane Erickson (EE 201A Spring 2004)
1 Cleared for Open Publication July 30, S-2144 P148/MAPLD 2004 Rea MAPLD 148:"Is Scaling the Correct Approach for Radiation Hardened Conversions.
A Survey of Logic Block Architectures For Digital Signal Processing Applications.
EELE 367 – Logic Design Module 2 – Modern Digital Design Flow Agenda 1.History of Digital Design Approach 2.HDLs 3.Design Abstraction 4.Modern Design Steps.
Programmable Logic Devices
Architecture Design Methodology. 2 The effects of architecture design on metrics:  Area (cost)  Performance  Power Target market:  A set of application.
Digital Signal Processing and Field Programmable Gate Arrays By: Peter Holko.
EECE579: Digital Design Flows
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
FPGA chips and DSP Algorithms By Emily Fabes. 2 Agenda FPGA Background Reasons to use FPGA’s Advantages and disadvantages of using FPGA’s Sample VHDL.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Programmable logic and FPGA
ASIC vs. FPGA – A Comparisson Hardware-Software Codesign Voin Legourski.
1 Chapter 12 Advanced Topics--Introduction. 2 Overview To achieve higher growth Additional features, software and IP offerings Application: consumer electronics,
February 4, 2002 John Wawrzynek
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
Introduction to FPGA and DSPs Joe College, Chris Doyle, Ann Marie Rynning.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
GPGPU platforms GP - General Purpose computation using GPU
Future FPGA Development Duane McDonald Digital Electronics 3.
General FPGA Architecture Field Programmable Gate Array.
Xilinx at Work in Hot New Technologies ® Spartan-II 64- and 32-bit PCI Solutions Below ASSP Prices January
Section I Introduction to Xilinx
BR 1/001 Implementation Technologies We can implement a design with many different implementation technologies - different implementation technologies.
April 15, Synthesis of Signal Processing on FPGA Hongtao
Programmable Solutions in Video Capture/Editing. Overview  Xilinx - Industry Leader in FPGAs/CPLDs High-density, high-speed, programmable, low cost logic.
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
EE4OI4 Engineering Design Programmable Logic Technology.
Power Reduction for FPGA using Multiple Vdd/Vth
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 4 Programmable.
Ch.9 CPLD/FPGA Design TAIST ICTES Program VLSI Design Methodology Hiroaki Kunieda Tokyo Institute of Technology.
Open Discussion of Design Flow Today’s task: Design an ASIC that will drive a TV cell phone Exercise objective: Importance of codesign.
Introduction 1 Introduction. 2 Why Programmable Logic ?  Custom logic without NRE —needed for product differentiation  Fast time to market —shorter.
SHA-3 Candidate Evaluation 1. FPGA Benchmarking - Phase Round-2 SHA-3 Candidates implemented by 33 graduate students following the same design.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
J. Christiansen, CERN - EP/MIC
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
Programmable Logic Devices
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
EE3A1 Computer Hardware and Digital Design
Present – Past -- Future
1 Leakage Power Analysis of a 90nm FPGA Authors: Tim Tuan (Xilinx), Bocheng Lai (UCLA) Presenter: Sang-Kyo Han (ECE, University of Maryland) Published.
1 Carnegie Mellon University Center for Silicon System Implementation An Architectural Exploration of Via Patterned Gate Arrays Chetan Patel, Anthony Cozzie,
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
FPGA-Based System Design: Chapter 1 Copyright  2004 Prentice Hall PTR Moore’s Law n Gordon Moore: co-founder of Intel. n Predicted that number of transistors.
EE141 © Digital Integrated Circuits 2nd Introduction 1 Principle of CMOS VLSI Design Introduction Adapted from Digital Integrated, Copyright 2003 Prentice.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
Programmable Logic Device Architectures
Overview of VLSI 魏凱城 彰化師範大學資工系. VLSI  Very-Large-Scale Integration Today’s complex VLSI chips  The number of transistors has exceeded 120 million 
Click to edit Master title style Literature Review Measuring the Gap Between FPGAs and ASICs Ian Kuon, Jonathan Rose University of Toronto IEEE TCAD/ICAS.
Delivered by.. Love Jain p08ec907. Design Styles  Full-custom  Cell-based  Gate array  Programmable logic Field programmable gate array (FPGA)
FPGA Field Programmable Gate Arrays Shiraz University of shiraz spring 2012.
FPGA Technology Overview Carl Lebsack * Some slides are from the “Programmable Logic” lecture slides by Dr. Morris Chang.
1 Introduction to Engineering Fall 2006 Lecture 17: Digital Tools 1.
Programmable Logic Devices
Presenter: Darshika G. Perera Assistant Professor
Programmable Logic Device Architectures
Introduction to Programmable Logic
Electronics for Physicists
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
XILINX CPLDs The Total ISP Solution
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
Electronics for Physicists
Measuring the Gap between FPGAs and ASICs
Programmable logic and FPGA
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

Steve Poret RCS – ENG 6530 June 10, 2008

[1] Measuring the Gap between FPGAs and ASICs  Ian Kuon and Jonathan Rose  The Edward S. Rogers Sr. Department of Electrical and Computer Engineering  University of Toronto, 2006 [2] (When) Will FPGAs Kill ASICs?  Rajeev, Jayaraman  Xilinx Inc.  DAC-2001 [3] ASICs Verses FPGAs  Frank J. Bartos  Control Engineering  6/1/2005 [4] Structured ASICs  Dan Lander, Haru Yamamoto, Shane Erickson  UCLA - EE 201A  Spring 2004

[5] Signal, Timing Integrity Assured with FPGAs - Technology Information  Rich Sevcik  Electronic News  July 24, 2000 [6] Navigating the Silicon Jungle: FPGA or ASIC?  Blyler, John  Chip Design Magazine  June/July 2005 [7] FPGA vs. ASIC  Xilinx Inc.  Getting Started with Xilinx  2006 [8] Xilinx: Virtex-5 Family Overview  Xilinx Inc. datasheet  DS100(v4.2) May 7, 2008

 Introduction  Performance Perspective ◦ Measuring the Gap between FPGAs and ASICs, [1]  Cost/Risk/Lifecycle Perspective ◦ (When) Will FPGAs Kill ASICs?, [2] ◦ ASICs verses FPGAs, [3]  Summary  Future Technology ◦ Structured/Platform ASICs, [4]  Conclusion

ASIC (Application Specific Integrated Circuit) ◦ An IC designed for a particular use ◦ Standard cell, full-custom ASIC Applications ◦ Processors ◦ RAM ◦ ROM

Why are we interested? ◦ The size of transistors are shrinking to sub-micron levels Deep Sub-Micron (DSM) designs have problems  Two main fundamental issues: ◦ Signal Integrity ◦ Timing Closure  Is an ASIC still the best architecture?

FPGA (Field Programmable Gate Array) ◦ Logic blocks and programmable interconnects ◦ Synthesis tools are designed to counter DSM issues Applications ◦ ASIC Prototyping ◦ Massive parallelism (code breaking, cryptography) ◦ DSP, SDR, medical imaging ◦ FFT

 Implementation medium: FPGA or ASIC? ◦ Based on:  Area  Performance  Power consumption  Cost  Time-to-market  Design Cycle  Complexity of design  How do we measure the gap between FPGAs and ASICs?

 New FPGA vs. ASIC comparison, (Kuon, Rose)  System Architecture: ◦ Altera Stratix II (FPGA) ◦ STM CMOS090 Design Platform standard cell (ASIC)  Look at previous comparisons: inadequate  Authors took many considerations to ensure most accurate possible comparison

 Chose 23 benchmark designs  Implement all benchmarks in both FPGA and ASIC  Compare: ◦ Silicon Area ◦ Maximum Operating Frequency ◦ Power Consumption

Silicon Area  Hard DSP blocks significantly reduce the area gap (40 -> 28)  Memories slightly reduced (40 -> 37)  All components utilized (40 -> 21)  Introduction of heterogeneous blocks are very important in decreasing FPGA area

Speed  Hard DSP blocks increase delay? ◦ The multipliers are fixed size, thus will slightly decrease performance, but the additional time comes with extra routing to accommodate for fixed positions of DSPs.  Memories ◦ 3.2 -> 2.3, offer speed-up vs. ASIC design, but slow low power memory is used in the ASIC, and is no real advantage vs. newer memory  Overall, the memory blocks offer the same advantage as the DSP blocks: primary benefit is improved area efficiency

Speed  Fastest speed is useful for understanding the best case solution, but not fair to ASIC.  ASICs are generally designed for worst-case process  As seen the performance gaps are respectively larger, confirming that ASICs perform faster then FPGAs.  2.8 times faster

Power Consumption  FPGAs consume 9-to-12 times the amount of power as an ASIC  Area savings suggest a slight power reduction as less wires & components are used  Introducing DSP and/or memory blocks to the FPGA reduce power consumption

 Presented empirical measurements quantifying the gap between FPGAs and ASICs  FPGA design is times larger than an standard-cell ASIC design  FPGA is times slower than a standard-cell ASIC design  FPGA consumes 9-12 times more power than ASIC

Clearly ASICs have better performance than FPGAs, though they lack the flexibility So why are FPGAs even used?

Non-performance factors:  Unit costs  Non Recurring Engineering (NRE) costs  Time to market  System reconfigurability  Design cycle  Volume/Gate Count/Freq/IP requirements

 Unit cost analysis ◦ ASIC: lower unit costs for high volumes ◦ ASIC: design tools tend to cost more ◦ FPGA: No upfront NRE - costs typically associated with an ASIC design

 Exploding (NRE) ASIC Cost High mask costs as process geometry decreases

 Time to market

 System reconfigurability Lack of reconfigurability is a large opportunity cost of ASICs as FPGAs offer flexible design cycle management

 Design Cycle ASIC: very unforgiving (no late changes) FPGA: flexible to allow late design changes

 Volume Requirements (Unit cost) ASICs are cost effective for large volumes (> 250,000)

 Gate Count Requirements FPGAs have limited gate count: 3 million (2000)

 Performance Requirements (Speed) FPGAs can operate up to 200Mhz (2000) Note: 550Mhz Xilinx Virtex-5 (2008)

 IP in FPGAs ◦ 1995 – Only gates and routing ◦ 2000 – Multiple I/O standards, clock management, RAM, multipliers, processors ◦ 2008 – Ethernet, GTP/GTX transceivers, microprocessors, 65-nm technology, DCM, PLLs, 12 routing layers, triple-oxide for reduced power consumption ◦ 20xx – Power down individual sections of FPGAs?

 Traditionally, ASICs are used for large projects and FPGAs for smaller projects that need to get to market faster, or can benefit from remote upgrades  Improved FPGA performance, density, and fabrication cost are pushing the ASICs out of the market; as the key remains FPGAs quick time-to-market value  2 years for ASIC, verses 9 months for FPGA

 Earlier FPGAs were only viable for prototyping or low-density applications; now they see very high- volume usage in consumer products and other moderate volume high-density applications  Highest-density FPGAs (90 nm) still have a definitive higher unit price than ASICs  However, cost trade-offs often favour FPGAs even with these highest density applications, when development and NRE charges are factored in David Greenfield, senior director of high-density FPGAs at Altera Corp

 Upfront development investment is higher with a cell-based ASIC approach  At high volumes, ROI is significantly better due to smaller die size and lower per unit costs.  FPGAs tend to be a better choice where unit price is less important, or time-to-market, or low initial development cost drives the solution  FPGAs and structured ASICs are suitable for low volume, short lifetime applications where customers can compromise on functionality and performance while still achieving their system objectives John DiFilippo, silicon architect for TI's ASIC Communications Infrastructure Business Unit

 FPGAs are closing in on ASICs for performance values, even though ASICs are smaller, faster and more efficient at the moment  FPGAs provide cost, time, reconfigurability and flexibility over ASIC designs which make them attractive  Gap is narrowing between the technologies

Structured ASICs  Newer term in the industry

 The logic mask layers of a device are predefined by the ASIC vendor  Design differentiation and customization is achieved by creating custom metal layers that create custom connections between predefined lower layer logic elements  Because only a small number of chip layers must be custom-produced, structured ASIC designs have much smaller NRE costs than other ASIC chips, which require that a full mask set be produced for every design

 Both manufacturing cycle time and design cycle time are reduced compared to standard cell-based ASIC by:  Pre-defined metal layers  Pre-characterization of what is on the silicon  Pre-defined power, clock, test structures Pre-Routed Layer Routing Layer

 FPGA vendors have also designed their own version of the structured ASICs:  Altera (HardCopy) ◦ Same design cells as FPGA, but programmable routing replaced with fixed wire interconnects  Xilinx (EasyPath) ◦ “customer specific FPGA”; 30-70% cheaper than standard FPGA, same standard FPGA cells  Both designs have the programming capabilities removed

Advantages  Mainly used for mid-density designs  High performance (close to standard-cell)  Low power consumption  Less complex (fewer layers to fabricate)  Small time-to-market (pre-defined cell blocks)

Disadvantages  Design tools  Expensive  Immature architecture ◦ Therefore have not been formally evaluated and compared;  Tradeoffs within the architecture (LUTs, RAM size)

ParameterFPGA*Structured ASICStandard-cell ASIC Area40101 Speed Power consumed1221 Unit costsHighMediumLow (high V) NRE costLowMediumHigh Time-to-marketLowLow-MediumHigh ReconfigurabilityFullNo Market VolumeLow-mediumMediumHigh *used worst-case values (from Kuon, Rose) Structured ASICs have many of the advantages of both FPGAs and standard ASICs

 FPGAs, Structured ASICs and ASICs each have their own advantages and disadvantages ◦ ASICs  High costs, high performance  Low flexibility  Difficult and long design cycle ◦ FPGAs  Low cost, low performance  Reconfigurable  Quick and easy for designers

 Do you have any questions?  Project …..

Project Outline:  Start with 1-bit ALU design using VHDL (Xilinx)  Modify an ALU from a 4-bit to 16-bits  Add other capabilities to the ALU (Multiplication, Division)  Duplicate the ALU to max capacity of the FPGA  The ALUs are connected using different connections: ◦ Bus ◦ Point-to-point matrix connection  Comparisons between the two implementations  Solve Benchmarks

Project Outline:  Start with 1-bit ALU design  Re-learn VHDL code  Modify an ALU from a 4-bit to 16-bits  Add other capabilities to the ALU (Multiplication, Division)  Duplicate the ALU to max capacity of the FPGA  The ALUs are connected using different connections: ◦ Bus ◦ Point-to-point matrix connection  Comparisons between the two implementations  Solve Benchmarks