VAPRES A Virtual Architecture for Partially Reconfigurable Embedded Systems Presented by Joseph Antoon Abelardo Jara-Berrocal, Ann Gordon-Ross NSF Center.

Presentation transcript:

Slide 1: VAPRES: A Virtual Architecture for Partially Reconfigurable Embedded Systems

Presented by Joseph Antoon
Abelardo Jara-Berrocal, Ann Gordon-Ross
NSF Center for High-Performance Reconfigurable Computing (CHREC)
Department of Electrical and Computer Engineering, University of Florida

Slide 2: Adaptive Hardware Applications

- A Kalman filter is used for target tracking
  - Finds the likely location from noisy measurements
- The optimized filter depends on the target type
  - Slow target: low power, constant gain, low bandwidth, Kalman filter
  - Fast target: high power, constant gain, high bandwidth, Kalman filter
  - Airborne target: high power, variable gain, low bandwidth, multi-scale smoother
  - Noisy target: high power, variable gain, low bandwidth, Kalman filter
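The difference between the constant-gain and variable-gain filters listed on this slide can be illustrated with a scalar tracker. This is only a minimal sketch: the one-dimensional random-walk model, the noise variances `q` and `r`, the default gain, and both function names are assumptions made for illustration and do not come from the presentation.

```python
def constant_gain_filter(measurements, gain=0.2, x0=0.0):
    """Fixed-gain estimator: every measurement is blended in with the same weight."""
    x = x0
    estimates = []
    for z in measurements:
        x = x + gain * (z - x)
        estimates.append(x)
    return estimates


def kalman_filter_1d(measurements, q=1e-4, r=1.0, x0=0.0, p0=1.0):
    """Scalar Kalman filter for a random-walk state (assumed model).

    q is the process-noise variance, r the measurement-noise variance.
    The gain is recomputed at every step from the estimate variance p.
    """
    x, p = x0, p0
    estimates, gains = [], []
    for z in measurements:
        p = p + q            # predict: uncertainty grows by the process noise
        k = p / (p + r)      # update: gain depends on the current uncertainty
        x = x + k * (z - x)
        p = (1.0 - k) * p
        estimates.append(x)
        gains.append(k)
    return estimates, gains
```

The fixed-gain version needs one multiply and two add/subtract operations per sample, while the variable-gain version also carries the variance update, which is one plausible reason the slide pairs constant gain with the low-power case.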

Slide 3: Adaptive Hardware Applications

- FPGAs often outperform CPUs
  - Parallel computing power
  - Kalman filters scale well
- Partial Reconfiguration (PR)
  - Run-time hardware adaptation
  - Allows FPGA time-sharing
- Communication challenge
  - Transfers between modules can lock up the CPU
  - An inter-module network frees the CPU from handling these transfers

[Figure labels: Processor, CPU, FPGAs, FPGA Device, CPU, Memory, Filter A, Filter B]

Slide 4: Using Partial Reconfiguration

1. Define system (system specifications)
2. Platform Studio
3. Import into ISE
4. Divide project into mandated hierarchy (top, static, prr_a, prr_b)
5. Set PRRs as black boxes
6. Code PR region HDL
7. Synthesize!
8. Guess (estimate) a good floorplan
9. Map on to PlanAhead
10. Create "configurations"
11. Implement!
12. Write software

"Could you make it just a bit different…"

Slide 5: Identifying Issues With PR

- Support
  - Only supported by Xilinx
  - Altera support announced
- Lack of abstraction
  - Manual partitioning
  - Manual floor-planning
- App-specific architectures
  - Increased time-to-market
  - Reduced flexibility
- Frustrating design flow!

In this work, we propose VAPRES, a Virtual Architecture for PR Embedded Systems:

- Abstracts the base system from the application
- Automates design flow and floor-planning
- Scalable, flexible features

Slide 6: VAPRES Architecture

- PR Regions (PRRs)
  - Independent clocks
  - FIFO-based I/O
  - Online placement
  - Created separately
- MACS
  - Intermodule network
- Flexible, scalable
  - PR region count
  - PR region size
  - MACS bandwidth
    - Module channel width
    - Left-to-right channel width
    - Right-to-left channel width
  - IO module count

[Figure labels: MicroBlaze CPU, PR Region 1, PR Region 2, PLB Bus, DCR Bridge, PR Socket, FSL (Fast Simplex Links), Switch 1, Switch 2, IF, IO Module, To IO]

Slide 7: PR Region Connectivity

[Figure labels: PR Region, MicroBlaze, MACS Switch, FSL (Fast Simplex Links), Producer / Consumer Queues, Slice Macros, PR Socket, Device Control Register (DCR), Clock, Macro, PRR, FSL, Enable, Reset, Clock Select, Regional Clock Buffer (BUFR), Fast Clock, Slow Clock, Clock Multiplexer (BUFGMUX)]

Slide 8: MACS - Intermodule Network

- Minimal Adaptive-Routing Circuit-Switched network
- Circuit based
  - Uses streaming channels
  - Circuit set by first word in channel
  - Fast setup (<10 cycles)

[Figure labels: Switch 2, IF, Module 2, Module 1, Switch 2, IF, Module 3, dst, end]
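The idea that a circuit is set by the first word in a channel can be sketched as a toy software model. Everything below is an assumption for illustration: the class and method names, the use of a destination name as the first word, and the `END` marker (suggested only by the "dst" and "end" labels in the figure). Adaptive routing across several switches and the real word format are not modelled.

```python
END = object()  # hypothetical end-of-stream marker (assumption, not from the slides)


class CircuitSwitch:
    """Toy model of one circuit-switched hop.

    The first word pushed on an idle channel names the destination and sets
    up the circuit; later words stream to that destination until END.
    """

    def __init__(self, ports):
        self.outputs = {port: [] for port in ports}
        self.circuit = {}  # input channel -> destination port currently connected

    def push(self, channel, word):
        if channel not in self.circuit:
            # Idle channel: this word is the destination, not payload.
            if word not in self.outputs:
                raise ValueError(f"unknown destination: {word!r}")
            self.circuit[channel] = word
        elif word is END:
            # Tear the circuit down so the next word sets up a new one.
            del self.circuit[channel]
        else:
            self.outputs[self.circuit[channel]].append(word)
```

Because the destination travels only once per stream, payload words carry no per-word addressing overhead, which is the usual argument for circuit switching on streaming data.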

Slide 9: Design Methodology

- Two separate design flows
  - Base system
  - Application
- Applications made independently
  - Only base system specs needed

[Figure labels: Base Flow, App Flow, Base system specifications]

Slide 10: Base System Design Flow

- User feeds specs to VAPRES
- Base design created from specs
  - Parametric templates used
- System files generated
  - Floorplan and constraints
  - Embedded Development Kit (EDK) files
  - HDL
- Synthesis
- Implementation
- Bitstream generated
- System downloaded to the board

[Figure: base system flow. Labels: System Specs, Base Design Templates, HDL, Floorplan, Synthesis, Implementation, Generate Bitstream]

Slide 11: Application Design Flow

- Partition app
  - Hardware
  - Software
- Software flow
  - Compile
  - Link
- Hardware flow
  - Synthesize
  - Implement
  - Bitstream gen
- Download app

[Figure: application flow. Labels: System Specs, Application Decomposition, HDL Source Code, Synthesis, Implementation, Generate Bitstream, API, Compile, Link, Executable]

Slide 12: Revisiting Target Tracking

[Figure labels: MicroBlaze CPU, Blank PR Region, PLB Bus, DCR Bridge, PR Socket, Switch 2, IF, IO Module, Sensor, ICAP, Filter Storage, Aerospace Kalman Filter. Callout: "Looks like a spaceship"]

Slide 13: Seamless Filter Swapping

- Filter tracks target
- Target slows down
  - Filter swap needed
- First load new filter
  - Spare region used
  - Old filter continues
- Redirect traffic
- Downtime is now negligible
  - Previously in seconds

[Figure labels: MicroBlaze CPU, Blank Module, SW2, IF, IO Module, High Power Kalman Filter, Low Power Kalman Filter. Callout: "The target changed!"]
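The ordering of the swap on this slide (load the new filter into a spare region while the old one keeps running, then redirect traffic) can be sketched as follows. The `Region` class, the `swap_filter` function, and the final step of blanking the old region are assumptions made for illustration; in the real system the load would be a partial reconfiguration and the redirect a change to the inter-module network.

```python
class Region:
    """Hypothetical stand-in for one PR region; module is None when blank."""

    def __init__(self, name, module=None):
        self.name = name
        self.module = module


def swap_filter(regions, active, new_module, log):
    """Swap filters without stopping the active one first.

    Returns the region that is active after the swap and appends each step
    to `log` so the ordering is visible.
    """
    spare = next((r for r in regions if r is not active and r.module is None), None)
    if spare is None:
        raise RuntimeError("no blank PR region available")

    # Step 1: configure the spare region; the old filter keeps processing.
    log.append(f"load {new_module} into {spare.name}")
    spare.module = new_module

    # Step 2: only now move the data stream over to the new filter.
    log.append(f"redirect traffic from {active.name} to {spare.name}")

    # Step 3 (assumption): the old region is blanked and becomes the next spare.
    log.append(f"blank {active.name}")
    active.module = None
    return spare
```

The point of the ordering is that the slow step, loading the new module, overlaps with useful work, so the visible downtime shrinks to the redirect itself.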

Slide 14: Experimental Setup - Resources

- Implemented on ML401 board
  - Virtex-4 LX25 FPGA
- VAPRES
  - Two PR regions
  - 16x11 CLB region size
  - Two IOMs
- MACS
  - Four switches
  - 32-bit channels
  - Two channels left to right
  - Two channels right to left

[Figure: Floor Plan, Base System View (post place and route)]
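The parameters of this experimental setup can be collected in a small record, which also makes some derived quantities easy to compute. This is a sketch only: the class name, field names, and helper methods are hypothetical and do not reflect the actual VAPRES specification format, and treating "16x11" as a CLB count of 16 times 11 per region is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VapresSpec:
    """Hypothetical container for base-system parameters (names are assumptions)."""

    prr_count: int
    prr_size_clbs: tuple          # CLB dimensions of one PR region, as listed on the slide
    io_module_count: int
    switch_count: int
    channel_width_bits: int
    channels_left_to_right: int
    channels_right_to_left: int

    def clbs_per_prr(self) -> int:
        a, b = self.prr_size_clbs
        return a * b

    def total_prr_clbs(self) -> int:
        return self.prr_count * self.clbs_per_prr()

    def inter_switch_channel_bits(self) -> int:
        """Combined width of all channels between two neighbouring switches."""
        channels = self.channels_left_to_right + self.channels_right_to_left
        return channels * self.channel_width_bits


# Values taken from the slide; only the field names are invented.
ML401_SETUP = VapresSpec(
    prr_count=2,
    prr_size_clbs=(16, 11),
    io_module_count=2,
    switch_count=4,
    channel_width_bits=32,
    channels_left_to_right=2,
    channels_right_to_left=2,
)
```

Under these assumptions each PR region spans 176 CLBs, the two regions together reserve 352 CLBs, and each inter-switch link carries 128 channel bits.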

Slide 15: Results - Resource Usage

[Chart panels: LX25, LX60, LX100]

Slide 16: Experimental Setup - Timing

- Two methods to reconfigure, both implemented in software
  1) Write bitfile in one stage
  2) Write bitfile in two stages
- One-stage method
  - Load Flash sector to BRAM
  - Write to ICAP
  - Repeat until bitfile is loaded
  - Less RAM required
- Two-stage method
  - Load bitfile into BRAM
  - Write bitfile to ICAP
  - Load once, write often

[Figure labels: Flash (board peripheral), BRAM and ICAP (FPGA structures)]
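The two reconfiguration methods on this slide can be contrasted in a short sketch. It models Flash as a byte string and the ICAP as a `bytearray` sink; the function names and the 512-byte sector size are assumptions for illustration, and the real implementation runs as embedded software against the actual Flash and ICAP interfaces.

```python
def reconfigure_one_stage(flash, icap, sector_size=512):
    """One-stage: copy one Flash sector to a small buffer, write it to ICAP, repeat.

    Returns the largest buffer size that was needed.
    """
    peak_buffer = 0
    for offset in range(0, len(flash), sector_size):
        bram = flash[offset:offset + sector_size]   # buffer holds a single sector
        peak_buffer = max(peak_buffer, len(bram))
        icap.extend(bram)
    return peak_buffer


def load_bitfile(flash):
    """Two-stage, stage 1: copy the whole bitfile from Flash into a buffer."""
    return bytes(flash)


def write_bitfile(bram, icap):
    """Two-stage, stage 2: write the buffered bitfile to ICAP."""
    icap.extend(bram)
```

In this model the one-stage method never buffers more than one sector, while the two-stage method needs room for the entire bitfile but can call `write_bitfile` repeatedly without touching Flash again.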

Slide 17: Results - Reconfiguration Time

[Chart panels: Two-Stage Time Breakdown, One-Stage Time Breakdown]

ICAP write reduced to ms

Slide 18: Experimental Setup - Scaling

Four VAPRES systems set up:

System     PRRs   Width    Height   MACS
Small      1      10 CLB   1 row    No
Medium     1      10 CLB   2 rows   No
Large      2      16 CLB   2 rows   Yes
Populous   3      16 CLB   1 row    Yes

Slide 19: Results - Scalability

[Chart annotations: Increased PRR Size, Added PRR, Decreased PRR Size]

Slide 20: Results - Scalability

All designs meet the 100 MHz timing constraint

Slide 21: Conclusions

- We developed VAPRES
  - Virtual Architecture for Partially Reconfigurable Embedded Systems
- Contributions
  - Modular design methodology
  - PR regions with independent, selectable clocks
  - Highly parametric design
  - Seamless filter swapping
- Future work
  - Algorithms for runtime module placement
  - Tools to assist system design formulation
  - Context save and restore for modules

Slide 22: Thank you for attending

Questions?