® www.xilinx.com Virtex-E Extended Memory Technical Overview and Applications.

Slides:



Advertisements
Similar presentations
Programmable FIR Filter Design
Advertisements

ECE 506 Reconfigurable Computing ece. arizona
Lecture 15 Finite State Machine Implementation
Architecture-Specific Packing for Virtex-5 FPGAs
Commercial FPGAs: Altera Stratix Family Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Reconfigurable Computing (EN2911X, Fall07) Lecture 04: Programmable Logic Technology (2/3) Prof. Sherief Reda Division of Engineering, Brown University.
Digital Design Copyright © 2006 Frank Vahid 1 FPGA Internals: Lookup Tables (LUTs) Basic idea: Memory can implement combinational logic –e.g., 2-address.
Xilinx CPLDs and FPGAs Module F2-1. CPLDs and FPGAs XC9500 CPLD XC4000 FPGA Spartan FPGA Spartan II FPGA Virtex FPGA.
Spartan II Features  Plentiful logic and memory resources –15K to 200K system gates (up to 5,292 logic cells) –Up to 57 Kb block RAM storage  Flexible.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Implementing Logic Gates and Circuits Discussion D5.1.
Computes the partial dot products for only the diagonal and upper triangle of the input matrix. The vector computed by this architecture is added to the.
Implementing Logic Gates and Circuits Discussion D5.3 Section 11-2.
Lecture 2: Field Programmable Gate Arrays I September 5, 2013 ECE 636 Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays I.
1 Lecture 16B Memories. 2 Memories in General Computers have mostly RAM ROM (or equivalent) needed to boot ROM is in same class as Programmable Logic.
The Spartan 3e FPGA. CS/EE 3710 The Spartan 3e FPGA  What’s inside the chip? How does it implement random logic? What other features can you use?  What.
Programmable logic and FPGA
ECE 331 – Digital System Design Tristate Buffers, Read-Only Memories and Programmable Logic Devices (Lecture #16) The slides included herein were taken.
1 Lecture 16B Memories. 2 Memories in General RAM - the predominant memory ROM (or equivalent) needed to boot ROM is in same class as Programmable Logic.
Real Time Image Feature Vector Generator Employing Functional Cache Memory for Edge Takuki Nakagawa, Department of Electronic Engineering The University.
1. 2 FPGAs Historically, FPGA architectures and companies began around the same time as CPLDs FPGAs are closer to “programmable ASICs” -- large emphasis.
Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved The Digital Logic Level.
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
The Xilinx Spartan 3 FPGA EGRE 631 2/2/09. Basic types of FPGA’s One time programmable Reprogrammable (non-volatile) –Retains program when powered down.
Approaching Ideal NoC Latency with Pre-Configured Routes George Michelogiannakis, Dionisios Pnevmatikatos and Manolis Katevenis Institute of Computer Science.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 7 Programmable.
8088 I/F with basic IO, RAM and 8255
Highest Performance Programmable DSP Solution September 17, 2015.
DLS Digital Controller Tony Dobbing Head of Power Supplies Group.
Open Discussion of Design Flow Today’s task: Design an ASIC that will drive a TV cell phone Exercise objective: Importance of codesign.
Electronics in High Energy Physics Introduction to Electronics in HEP Field Programmable Gate Arrays Part 1 based on the lecture of S.Haas.
10/10/20151 DIF – Digital Imaging Fast Ali Nuhi and Everett Salley EEL4924 Senior Design Date: 02 March 2011.
J. Christiansen, CERN - EP/MIC
The Xilinx Spartan 3 FPGA EGRE 631 2/2/09. Basic types of FPGA’s One time programmable Reprogrammable (non-volatile) –Retains program when powered down.
FPGA Implementations for Volterra DFEs
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
Architecture and Features
CPEN Digital System Design
Digital Logic Design Instructor: Kasım Sinan YILDIRIM
® Spartan-II High Volume Solutions Overview. ® High Performance System Features Software and Cores Smallest Die Size Lowest Possible Cost.
A Reconfigurable Low-power High-Performance Matrix Multiplier Architecture With Borrow Parallel Counters Counters : Rong Lin SUNY at Geneseo
Fabric System Architecture Design: two distinct board designs; replicate and connect –modularity allows us to build any configuration of size 2 n Board.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
® /1 The E is the Edge. ® /2 Density Leadership Virtex XCV1000 Density (system gates) 10M Gates In 2002 Virtex-E.
CORE Generator System V3.1i
A Programmable Single Chip Digital Signal Processing Engine MAPLD 2005 Paul Chiang, MathStar Inc. Pius Ng, Apache Design Solutions.
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
ADC – FIR Filter – DAC KEVIN COOLEY. Overview  Components  Schematic  Hardware Design Considerations  Digital Filters/FPGA Design Tools  Questions.
DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:
PARBIT Tool 1 PARBIT Partial Bitfile Configuration Tool Edson L. Horta Washington University, Applied Research Lab August 15, 2001.
Chapter Microcontroller
1 “A picture speaks a thousand words.” Art By Ranjith & Waquas Islamiah Evening College.
ETE Digital Electronics
Author: Yun R. Qu, Shijie Zhou, and Viktor K. Prasanna Publisher:
Backprojection Project Update January 2002
Video RAM Presented by GHOLAMREZA KAKAMANSHADI
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
Head-to-Head Xilinx Virtex-II Pro Altera Stratix 1.5v 130nm copper
SLP1 design Christos Gentsos 9/4/2014.
Electronics for Physicists
Spartan FPGAs مرتضي صاحب الزماني.
The Xilinx Virtex Series FPGA
XC4000E Series Xilinx XC4000 Series Architecture 8/98
Graphics Hardware: Specialty Memories, Simple Framebuffers
The Xilinx Virtex Series FPGA
Implementing Logic Gates and Circuits
FPGA’s 9/22/08.
Programmable logic and FPGA
Presentation transcript:

® Virtex-E Extended Memory Technical Overview and Applications

® BRAM CLBs BRAM XCV812E 0 Virtex-E Extended Memory  CLB Array is 56 height and 84 wide  Same architecture as Virtex-E —BRAM starts at both sides —Each block is 4 CLBs high  Except —BRAM is inserted every 4th CLB column —Except for the middle 12 columns of CLBs

® Virtex-E Extended Memory  14 blocks per column and 20 columns total (starting with 0) = 280 blocks  280 blocks x 4096 bits = (1120K) bits of total BRAM  XCV600E has 72 blocks and XCV1000E has 96 blocks

® BRAM CLBs CLBs BRAM XCV405E 0 Virtex-E Extended Memory  CLB Array is 40 height and 60 wide  Same architecture as Virtex E —BRAM starts at both sides —Each block is 4 CLBs high  Except —BRAM is inserted every 4th CLB column —Except for the middle 12 columns of CLBs

® Virtex-E Extended Memory  10 blocks per column and 14 columns total (starting with 0) = 140 blocks  140 blocks x 4096 bits = (560K) bits of total BRAM  XCV400E has 40 blocks

® Virtex-EM Applications  Video and image enhancement  Larger memory-to-logic ratio required by some DSP functions  Improved memory bandwidth for high performance video applications  2D FIR Filter with on-board line buffers used in image processing

® PCI VRAM JPEG 2D FIR Miixer Field Buffer Resize VIDEO VRAM Potential Applications Image Enhancement

® Line of N pixels M bits wide N lines per image Basic Operation  Square kernel mask is moved across an image pixel-by- pixel and line-by-line  New center pixel is created based on neighborhood  Kernel masks of various sizes Image

® C3 C2 C1 Line Buffers From Frame Buffer Basic Operation  Line buffers store one line of pixels  Line buffers dramatically reduce pin count  Coefficients are symmetric for most video filters  Each pixel is multiplied by a coefficient  Pixels are summed and create a new center pixel value

®  Vertical and horizontal FIR filters are shared through muxing  Typical image is 1024x1024x12 (8) (e.g. Medical Graphics)  Typical kernel is up to 15x15 ( 196,608 Bits=Virtex E)  In high end applications 31x31 (393,216 Bits=Virtex EM 405)  Or even 63x63 are desirable (786,432 Bits=Virtex EM 812) 2D FIR Filter with build-in Buffers

® Implementation Block Diagram

® RAM Requirements Pixels per Line Bits per Pixel XCV405EXCV812E Number of lines 12 bits/pixel

® Conclusion  High performance arithmetic functions allow logic to be shared  On-chip line buffers improve performance and reduce the number of IO required  More functions can be integrated on the same chip  Large memory to logic ratio enables high end graphic systems  High memory bandwidth

® 16 x16 OC-192 Buffered Crossbar Switch Virtex-EM Applications

® N x N Buffered Crossbar Switch  N input ports and N output ports are connected through an N x N matrix of switching elements  Buffers, typically FIFOs are at the core of the switching element  Output port contention addressed by storing packets in FIFOs at each crosspoint  Reduced Head-of-Line (HOL) blocking  Increased thoughput

® 16x16 OC-192 Buffered Crossbar Switch using XCV812E devices

® Switch Performance  Speed requirements —For each port, 128-bit data is sliced into 16 8-bit slices. Each data slice is processed by a XCV812E device —OC-192 line rate = 10Gbit/sec —Byte-wide FIFOs must run at 10/128 = 78 MHz minimum —With internal speedup factor, required FIFO speed > 78 MHz  XCV812E supports > 150 MHz FIFOs  Switch speed = 16 ports x 10 Gbit/sec/port = 160 Gbit/sec is easily achieved

® Switch Performance (contd.)  Memory requirements —Min. buffer size = 2 * port-speed/per-slice * round-trip delay in the control feedback signal —For OC Gbit/sec, 16 data slices, 800ns (approx.) round trip delay: –Min. buffer size = 2 x10 Gb/s x 800ns / 16 = 1000 bits —Total buffer memory needed = 16 x 16 x 1 Kb —Total number of buffers = 256 x 1Kb buffers  XCV812E has 280 blocks of 4Kb Block RAM

® Floorplan Layout of the Design

® Conclusion  Virtex-EM architecture and features are highly suited to the buffered crossbar switch application  High ratio of Block SelectRAM to logic  Select I/O+; up to 20 I/O standards including differential signaling standards supported  High speed synchronous (>150 MHz) FIFOs