DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:

Slides:



Advertisements
Similar presentations
Nios Multi Processor Ethernet Embedded Platform Final Presentation
Advertisements

6-April 06 by Nathan Chien. PCI System Block Diagram.
Fast A/D sampler FINAL presentation
Verifying Performance of a HDL design block
NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
Yaron Doweck Yael Einziger Supervisor: Mike Sumszyk Spring 2011 Semester Project.
Sumitha Ajith Saicharan Bandarupalli Mahesh Borgaonkar.
Internal Logic Analyzer Final presentation-part B
CSC 4250 Computer Architectures December 8, 2006 Chapter 5. Memory Hierarchy.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Super Fast Camera System Performed by: Tokman Niv Levenbroun Guy Supervised by: Leonid Boudniak.
1 Performed By: Khaskin Luba Einhorn Raziel Einhorn Raziel Instructor: Rivkin Ina Spring 2004 Spring 2004 Virtex II-Pro Dynamical Test Application Part.
© 2004 Xilinx, Inc. All Rights Reserved Implemented by : Alon Ben Shalom Yoni Landau Project supervised by: Mony Orbach High speed digital systems laboratory.
Reliable Data Storage using Reed Solomon Code Supervised by: Isaschar (Zigi) Walter Performed by: Ilan Rosenfeld, Moshe Karl Spring 2004 Part A Final Presentation.
t Popularity of the Internet t Provides universal interconnection between individual groups that use different hardware suited for their needs t Based.
Configurable System-on-Chip: Xilinx EDK
Students:Gilad Goldman Lior Kamran Supervisor:Mony Orbach Mid-Semester Presentation Spring 2005 Network Sniffer.
Reliable Data Storage using Reed Solomon Code Supervised by: Isaschar (Zigi) Walter Performed by: Ilan Rosenfeld, Moshe Karl Spring 2004 Midterm Presentation.
Final Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
1 Fast Communication for Multi – Core SOPC Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab.
Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
Interface of DSP to Peripherals of PC Spring 2002 Supervisor: Broodney, Hen | Presenting: Yair Tshop Michael Behar בס " ד.
1 RAMP Infrastructure Krste Asanovic UC Berkeley RAMP Tutorial, ISCA/FCRC, San Diego June 10, 2007.
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
Winter 2013 Independent Internet Embedded System - Final A Preformed by: Genady Okrain Instructor: Tsachi Martsiano Duration: Two semesters
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
General Purpose FIFO on Virtex-6 FPGA ML605 board midterm presentation
Final presentation Encryption/Decryption on embedded system Supervisor: Ina Rivkin students: Chen Ponchek Liel Shoshan Winter 2013 Part A.
Programmable Logic- How do they do that? 1/16/2015 Warren Miller Class 5: Software Tools and More 1.
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Final A Presentation By: Vova Menis-Lurie Sonia Gershkovich.
- 1 - A Powerful Dual-mode IP core for a/b Wireless LANs.
Students: Oleg Korenev Eugene Reznik Supervisor: Rolf Hilgendorf
General Purpose FIFO on Virtex-6 FPGA ML605 board Students: Oleg Korenev Eugene Reznik Supervisor: Rolf Hilgendorf 1 Semester: spring 2012.
Xilinx at Work in Hot New Technologies ® Spartan-II 64- and 32-bit PCI Solutions Below ASSP Prices January
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Midterm Presentation By: Vova Menis-Lurie Sonia Gershkovich.
“ Analyzer for 40Gbit Ethernet “ (Bi-semestrial project) Executers: פריד מחאג ' נה Farid Mahajna Husam Kadan חוסאם קעדאן Instructor:
Matrix Multiplication on FPGA Final presentation One semester – winter 2014/15 By : Dana Abergel and Alex Fonariov Supervisor : Mony Orbach High Speed.
Upgrade to Real Time Linux Target: A MATLAB-Based Graphical Control Environment Thesis Defense by Hai Xu CLEMSON U N I V E R S I T Y Department of Electrical.
DLS Digital Controller Tony Dobbing Head of Power Supplies Group.
HW/SW PARTITIONING OF FLOATING POINT SOFTWARE APPLICATIONS TO FIXED - POINTED COPROCESSOR CIRCUITS - Nalini Kumar Gaurav Chitroda Komal Kasat.
Hardware Implementation of Fast Forwarding Engine using Standard Memory and Dedicated Circuit Kazuya ZAITSU, Shingo ATA, Ikuo OKA (Osaka City University,
J. Christiansen, CERN - EP/MIC
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
Buffer-On-Board Memory System 1 Name: Aurangozeb ISCA 2012.
PROCStar III Performance Charactarization Instructor : Ina Rivkin Performed by: Idan Steinberg Evgeni Riaboy Semestrial Project Winter 2010.
1 Abstract & Main Goal המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory The focus of this project was the creation of an analyzing device.
Performed by: Yaron Recher & Shai Maylat Supervisor: Mr. Rolf Hilgendorf המעבדה למערכות ספרתיות מהירות הטכניון - מכון טכנולוגי לישראל הפקולטה להנדסת חשמל.
LZRW3 Data Compression Core Dual semester project April 2013 Project part A final presentation Shahar Zuta Netanel Yamin Advisor: Moshe porian.
1 DSP handling of Video sources and Etherenet data flow Supervisor: Moni Orbach Students: Reuven Yogev Raviv Zehurai Technion – Israel Institute of Technology.
PROJECT - ZYNQ Yakir Peretz Idan Homri Semester - winter 2014 Duration - one semester.
Final Presentation Implementation of DSP Algorithm on SoC Student : Einat Tevel Supervisor : Isaschar Walter Accompanying engineer : Emilia Burlak The.
A Monte Carlo Simulation Accelerator using FPGA Devices Final Year project : LHW0304 Ng Kin Fung && Ng Kwok Tung Supervisor : Professor LEONG, Heng Wai.
L/O/G/O Input Output Chapter 4 CS.216 Computer Architecture and Organization.
Connecting EPICS with Easily Reconfigurable I/O Hardware EPICS Collaboration Meeting Fall 2011.
Copyright © 2004, Dillon Engineering Inc. All Rights Reserved. An Efficient Architecture for Ultra Long FFTs in FPGAs and ASICs  Architecture optimized.
Ethernet Bomber Ethernet Packet Generator for network analysis
Multi-objective Topology Synthesis and FPGA Prototyping Framework of Application Specific Network-on-Chip m Akram Ben Ahmed Xinyu LI, Omar Hammami.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
Project D02209: FPGA Bridge between High Speed Channel & External Network Mid Semester Presentation 30/05/10 Supervisor: Mony Orbach Students: Alex Blecherov.
Performed By: Tal Goihman & Irit Kaufman Instructor: Mony Orbach Bi-semesterial Spring /04/2011.
Lab Environment and Miniproject Assignment Spring 2009 ECE554 Digital Engineering Laboratory.
PROCStar III Performance Charactarization Instructor : Ina Rivkin Performed by: Idan Steinberg Evgeni Riaboy Semestrial Project Winter 2010.
ROD Activities at Dresden Andreas Glatte, Andreas Meyer, Andy Kielburg-Jeka, Arno Straessner LAr Electronics Upgrade Meeting – LAr Week September 2009.
Parallel compressing system for satellite on programmable chip Yifat Manzor Yifat Manzor & Reshef Dahan Supervisor: Eran Segev Part A.
Presenter: Darshika G. Perera Assistant Professor
Backprojection Project Update January 2002
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
COMP755 Advanced Operating Systems
NetFPGA - an open network development platform
Presentation transcript:

DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester: spring 2013

CONTENT 1.Project overview 2.Specifications 3.Part B goals 4.Memory on ml605 5.AXI4 6.Part A overview 7.Part B overview Design Performance 2

PROJECT OVERVIEW Design and implementation of General Purpose FIFO IP core which allows usage of external memory (DDR3) as FIFO storage on Xilinx FPGA device Design and implement generic IP core of FIFO Design and implement GUI generator of IP core on PC Create sample design with implemented IP core 3

SPECIFICATIONS Hardware Xilinx Virtex-6 ML605 FPGA Evaluation Kit DDR III memory Ethernet interface PC with Ethernet interface Software ISE Design Suite Logic Edition Version 14.3 Modelsim Wireshark Winpcap library 4

OUR GOALS Create design with configurable in/out word size depth Develop software generator of GP FIFO IP core Achieve best performance Minimize usage of FPGA resources Make our world a better place 5

MEMORY ON ML605 DDR3 memory Capacity: 512MB Max theoretical bandwidth: 800MT/s*64bit = 48.8Gb/sec Xilinx provides us with DDR3 controller which has AXI Memory Mapped interface AXI bus data width up to 1024 bit 256 bit for max memory performance, assuming bus works with 200Mhz 6

4AXI Xilinx provides us with AXI4 Memory Mapped bus, which is a standard bus used in modern ARM SoC. Features Separate Address/Control and Data Phases burst-based transactions with only start address issued separate read and write data channels 7

CONTROLLER Native Fifo Interface MEMORY TO FIFO FIFOFIFO FIFO TO MEMORY FIFOFIFO GENERAL PURPOSE FIFO AXI4 Interface

GENERAL PURPOSE FIFO INTERFACE GP FIFO Native FIFOAXI4

PART A OVERVIEW GENERAL PURPOSE FIFO Interface GP FIFO has native FIFO interface In Word size is 32 bit Out Word size is 32 bit Utilizes DDR3 memory through AXI interconnect 32 bit wide Depth is limited only by available RAM memory on DDR3 Same clock domain for IN and OUT Internal architecture Internal FIFO-To-Memory controller Internal Memory-To-FIFO controller Internal Bypass controller Arbitration between GP FIFOs is managed by AXI interconnect 10

MAC Module EXAMPLE DESIGN BLOCK DIAGRAM 11 MEMORY CONTROLLER AXI4 BUS (INTERCONNECT) General Purpose Fifo Example Design MAC Controller HOST General Purpose Fifo

Interface Utilizes Ethernet MAC level of communication Software has simple and intuitive interface Software handles all the data transmission in between host and GP FIFO Internal architecture MAC module GP FIFO Simple example of a logic which uses GP FIFO Example design of part A is dedicated to verify GP FIFO core under maximum performance condition. In part B we tested different configurations (variable word size). 12 PART A OVERVIEW EXAMPLE DESIGN

OUR GOALS PART B Generalize our GP FIFO Develop software generator for GP FIFO IP core Write detailed manual for IP core and for it’s generator Develop simple design to test performance Simulate and test all the possible configurations and their exceptional case Make possible optimizations 13

PART B OVERVIEW GENERAL PURPOSE FIFO Interface GP FIFO has native FIFO interface GP FIFO has configurable internal word size: 64, 128, 256 bit In/Out word size can be 8, 16, 32, 64, 128, 256 bit Maximum ratio of internal word size to In/Out is x8 Depth is defined by two addresses: start and end. Those two addresses define continuous region of memory which is used for GP FIFO and thus its depth. Same clock domain for reader and writer Internal architecture Input and output buffers has fixed depth of 128 words to maximize performance 14

PART B OVERVIEW SOFTWARE GENERATOR FOR GP FIFO IP CORE Developed in Java Available GP FIFO core configuration parameters Name Start address of utilized RAM memory End address of utilized RAM memory In word size Out word size Internal word size: 64b, 128b, 256b Maximum ratio of internal word size to In/Out is x8 15

PART B OVERVIEW SOFTWARE GENERATOR FOR GP FIFO IP CORE 16

PART B OVERVIEW SOFTWARE GENERATOR FOR GP FIFO IP CORE Provides user with a convenient way of GP FIFO parameterization Prepares simulation of generated GP FIFO 17

MAC Module EXAMPLE DESIGN BLOCK DIAGRAM 18 MEMORY CONTROLLER AXI4 BUS (INTERCONNECT) General Purpose Fifo Example Design MAC Controller HOST

Example design of part B is dedicated to measure maximum performance of GP FIFO core with different configurations. Interface with host Utilizes Ethernet MAC level of communication Software developed in Part A Internal architecture MAC module GP FIFO Simple logic Simultaneously writes/reads GP FIFO. Sends word count to Host every second. 19 PART B OVERVIEW EXAMPLE DESIGN

PART B OVERVIEW PERFORMANCE OF DIFFERENT CONFIGURATIONS We defined three stages in exploring performance of GP FIFO under different configurations. GP FIFO of size 64Mb In/Out is symmetric (variable) Example Design and GP FIFO are running with 200MHz clock Performance dependence on depth of In/Out buffer Internal word size 64 bit In/Out buffer depth is variable Performance dependence on internal word size Internal word size: 128, 256 bit is variable In/Out buffer depth is 128 words 20

PART B OVERVIEW PERFORMANCE DEPENDENCE ON IN/OUT BUFFER DEPTH 21

PART B OVERVIEW PERFORMANCE DEPENDENCE ON IN/OUT BUFFER DEPTH 22

PART B OVERVIEW PERFORMANCE DEPENDENCE ON IN/OUT BUFFER DEPTH 23

PART B OVERVIEW PERFORMANCE DEPENDENCE ON INTERNAL WORD SIZE 24

PART B OVERVIEW PERFORMANCE DEPENDENCE ON INTERNAL WORD SIZE 25

PART B OVERVIEW PERFORMANCE DEPENDENCE ON INTERNAL WORD SIZE 26

PART B OVERVIEW PERFORMANCE DEPENDENCE ON INTERNAL WORD SIZE 27

PART B OVERVIEW PERFORMANCE OF DIFFERENT CONFIGURATIONS Performance dependence on depth of In/Out buffer Deeper buffer results in higher average speed only when in/out word size is greater/equals internal word size Performance dependence on internal word size Increasing internal word size provides significant speedup Optimal internal word size is twice as in/out word size Experiment shows that speed of GP FIFO in real design correlates well with simulated values Conclusion Changing buffer depth can hide some latency in case of not optimal internal word size, does not provide significant speedup to the system Optimal internal word size is x2 max{in, out} word size 28

PROJECT OVERVIEW Our design meets the requirements of the project. GP FIFO is fast limited only by interconnect bus available bandwidth GP FIFO has configurable depth and word size GP FIFO is simple, utilizes small amount of resources on the board GP FIFO gives you another easy way to pass big chunks of data from one module to another. GP FIFO core generator provides an easy way to configure the core. Our example design provides you with efficient communication in between a host and the board. After one and a half year of hard work we finally reached all sgoals of the project 29