Bitstream Relocation with Local Clock Domains for Partially Reconfigurable FPGAs Adam Flynn, Ann Gordon-Ross, Alan D. George NSF Center for High-Performance.

Slides:



Advertisements
Similar presentations
PARTIAL RECONFIGURATION USING FPGAs: ARCHITECTURE
Advertisements

Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
2009 Midyear Workshop F4-09: Virtual Architecture and Design Automation for Partial Reconfiguration All Hands Meeting November 10th, 2009 Dr. Ann Gordon-Ross.
Computer Architecture (EEL4713, Fall 2013) Partial Reconfiguration Not just a half baked job of reconfiguring Rohit Kumar Research Student University of.
Run-Time FPGA Partial Reconfiguration for Image Processing Applications Shaon Yousuf Ph.D. Student NSF CHREC Center, University of Florida Dr. Ann Gordon-Ross.
Reconfigurable Computing (EEL4930/5934) Partial Reconfiguration Not just a half baked job of reconfiguring Rohit Kumar Joseph Antoon Research Students.
Implementation Approaches with FPGAs Compile-time reconfiguration (CTR) CTR is a static implementation strategy where each application consists of one.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
Lecture 7 FPGA technology. 2 Implementation Platform Comparison.
ENG6530 Reconfigurable Computing Systems Dynamic Run Time Reconfiguration Operating System Support & Embedded Systems.
HTR: On-Chip Hardware Task Relocation for Partially Reconfigurable FPGAs + Also Affiliated with NSF Center for High- Performance Reconfigurable Computing.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
QUIZ What does ICAP stand for ? What is its main use ? Why is Partition Pin preferred over Bus Macro? 1.
Memory Management Chapter 7.
Committee Members: Annie S. Wu, Jooheung Lee, and Ronald F. DeMara Committee Members: Annie S. Wu, Jooheung Lee, and Ronald F. DeMara Optimizing Dynamic.
FAULT TOLERANCE IN FPGA BASED SPACE-BORNE COMPUTING SYSTEMS Niharika Chatla Vibhav Kundalia
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. Yu August 15, 2005.
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. Yu August 15, 2005.
BIST for Logic and Memory Resources in Virtex-4 FPGAs Sachin Dhingra, Daniel Milton, and Charles Stroud Electrical and Computer Engineering Auburn University.
Customizing Virtual Networks with Partial FPGA Reconfiguration
1 Student: Khinich Fanny Instructor: Fiksman Evgeny המעבדה למערכות ספרתיות מהירות High Speed Digital Systems Laboratory הטכניון - מכון טכנולוגי לישראל.
1 Students: Lin Ilia Khinich Fanny Instructor: Fiksman Evgeny המעבדה למערכות ספרתיות מהירות High Speed Digital Systems Laboratory הטכניון - מכון טכנולוגי.
1 Performed by: Lin Ilia Khinich Fanny Instructor: Fiksman Eugene המעבדה למערכות ספרתיות מהירות High Speed Digital Systems Laboratory הטכניון - מכון טכנולוגי.
Configurable System-on-Chip: Xilinx EDK
FPGA Defect Tolerance: Impact of Granularity Anthony YuGuy Lemieux December 14, 2005.
Institute of Technology, Sligo Dept of Computing Semester 3, version Semester 3 Chapter 3 VLANs.
Virtual Architecture For Partially Reconfigurable Embedded Systems (VAPRES) Architecture for creating partially reconfigurable embedded systems Module.
Mohamed ABDELFATTAH Vaughn BETZ. 2 Why NoCs on FPGAs? Embedded NoCs Area & Power Analysis Comparison Against P2P/Buses 4 4.
Connecting LANs, Backbone Networks, and Virtual LANs
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
Chapter 1: Hierarchical Network Design
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
Benefits of Partial Reconfiguration Reducing the size of the FPGA device required to implement a given function, with consequent reductions in cost and.
Uniform Reconfigurable Processing Module for Design and Manufacturing Integration V. Kirischian, S. Zhelnokov, P.W. Chun, L. Kirischian and V. Geurkov.
Partially Reconfigurable System-on-Chips for Adaptive Fault Tolerance Shaon Yousuf Adam Jacobs Ph.D. Students NSF CHREC Center, University of Florida Dr.
Coarse and Fine Grain Programmable Overlay Architectures for FPGAs
DAPR: Design Automation for Partially Reconfigurable FPGAs Shaon Yousuf Ph.D. Student NSF CHREC Center, University of Florida Dr. Ann Gordon-Ross Associate.
Heng Tan Ronald Demara A Device-Controlled Dynamic Configuration Framework Supporting Heterogeneous Resource Management.
Memory Management 1 Tanenbaum Ch. 3 Silberschatz Ch. 8,9.
Embedded Runtime Reconfigurable Nodes for wireless sensor networks applications Chris Morales Kaz Onishi 1.
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
Design Framework for Partial Run-Time FPGA Reconfiguration Chris Conger, Ann Gordon-Ross, and Alan D. George Presented by: Abelardo Jara-Berrocal HCS Research.
Exploiting Partially Reconfigurable FPGAs for Situation-Based Reconfiguration in Wireless Sensor Networks Rafael Garcia, Dr. Ann Gordon-Ross, Dr. Alan.
Partial Region and Bitstream Cost Models for Hardware Multitasking on Partially Reconfigurable FPGAs + Also Affiliated with NSF Center for High- Performance.
CHAPTER 5 Configuration, Reconfiguration and Security.
Task Graph Scheduling for RTR Paper Review By Gregor Scott.
A Physical Resource Management Approach to Minimizing FPGA Partial Reconfiguration Overhead Heng Tan and Ronald F. DeMara University of Central Florida.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
Reconfigurable Embedded Processor Peripherals Xilinx Aerospace and Defense Applications Brendan Bridgford Brandon Blodget.
FPGA Partial Reconfiguration Presented by: Abelardo Jara-Berrocal HCS Research Laboratory College of Engineering University of Florida April 10 th, 2009.
M. ALSAFRJALANI D. DZENITIS Runtime PR for Software Radio 2/26/2010 UFL ECE Dept 1 PARTIAL RECONFIGURATION (PR)
VAPRES A Virtual Architecture for Partially Reconfigurable Embedded Systems Presented by Joseph Antoon Abelardo Jara-Berrocal, Ann Gordon-Ross NSF Center.
SCORES: A Scalable and Parametric Streams-Based Communication Architecture for Modular Reconfigurable Systems Abelardo Jara-Berrocal, Ann Gordon-Ross NSF.
PARBIT Tool 1 PARBIT Partial Bitfile Configuration Tool Edson L. Horta Washington University, Applied Research Lab August 15, 2001.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 1: Hierarchical Network Design Connecting Networks.
A Survey of Fault Tolerant Methodologies for FPGA’s Gökhan Kabukcu
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
Runtime Temporal Partitioning Assembly to Reduce FPGA Reconfiguration Time Abelardo Jara-Berrocal, Ann Gordon-Ross HCS Research Laboratory College of Engineering.
Voice Manipulator Department of Electrical & Computer Engineering
Evaluating Partial Reconfiguration for Embedded FPGA Applications
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Abelardo Jara-Berrocal Joseph Antoon Ph.D. Students
Aurelio Morales-Villanueva and Ann Gordon-Ross+
Shaon Yousuf Ph.D. Student NSF CHREC Center, University of Florida
University of Florida, Gainesville, Florida, USA
Hardware Assisted Fault Tolerance Using Reconfigurable Logic
Computer Architecture
Voice Manipulator Department of Electrical & Computer Engineering
Introduction to Partial Reconfiguration
Presentation transcript:

Bitstream Relocation with Local Clock Domains for Partially Reconfigurable FPGAs Adam Flynn, Ann Gordon-Ross, Alan D. George NSF Center for High-Performance Reconfigurable Computing (CHREC) Department of Electrical and Computer Engineering University of Florida

2 Partial Reconfiguration Full Reconfiguration Bitstream for entire FPGA is loaded onto FPGA Partial Reconfiguration (PR) Only certain portion(s) of FPGA are reprogrammed with smaller application modules Advantages –Shorter reconfiguration time –Less power –Smaller bitstreams –Rest of FPGA remains operational FPGA Peripheral links can be maintained during reconfiguration Design A Reconfigurable Region Design B Module A Reconfigurable Region Module B Module C Module D Radio Link Video Link

3 PR Terminology Partial Reconfiguration Region (PRR) FPGA fabric is partitioned into multiple reconfigurable regions. Partial Reconfiguration Module (PRM) Application modules Loaded into PRRs on-the-fly Multiple PRMs can map to each PRR Static Region Fixed, base design logic Remains operational during reconfiguration Bus Macro Pre-placed, pre-routed macro to route signals between PRMs and the static region PRR Stat i c S t a t i c PRM A PRM C PRM B PRM D PRM B PRM D

4 Basic Problem with PR Each PRM to PRR mapping requires unique partial bitstream Multiple bitstreams must be stored for each module Unnecessary overhead Bitstream Storage Static PRR 2 PRR 1 PRMA_PRR1 PRMB_PRR1 PRMA_PRR2 PRMB_PRR2 FPGA

5 Solution - Bitstream Relocation Partial bitstream is manipulated to change physical location on FPGA [write] - Frame Address (FAR) register. Top/Bottom: Top. Resource type: CLB/IO/CLK. Row number: 2. Column number: 2. Minor address 3. Changing FAR address column from 2 to 3. Changing FAR address row from 2 to 0. Sample SW Terminal Output PRR FPGA Module Static Module

6 Bitstream Relocation (BR) Benefits Increased flexibility in time-multiplexing FPGA resources Reduced bitstream storage requirements Bitstream migration between devices Ability to move modules away from faults Bitstream Storage PRM A PRM B Static FPGA PRR 1 RRR 2 PRR 3 PRR 4 Bitstream Relocator Bitstream Relocator Bitstream Storage PRM B/PRR 4 PRM B/PRR 3 PRM B/PRR 2 PRM B/PRR 1 PRM A/PRR 4 PRM A/PRR 3 PRM A/PRR 2 PRM A/PRR 1 Eliminates need for multiple copies of same partial bitstream No relocation With relocation

7 Motivating Application Domains Module A Decision Logic Module A Module B 2x Triple Modular Redundancy (TMR) Module A Decision Logic Module A Module B Module C 3x Self Checking Pair (SCP) Reconfigurable fault tolerance & adaptable component-level protection Fault tolerance mode can adapt to current conditions PRRs can be reconfigured to desired protection/performance level Virtual Architecture for PR Framework for online module placement and scheduling Peripheral interface(s) and inter- module communication infrastructure statically defined Multiple benefits of bitstream relocation:  Flexibility for on-the-fly placement and scheduling of modules  Reduced bitstream storage/communication requirements Multiple benefits of bitstream relocation:  Flexibility for on-the-fly placement and scheduling of modules  Reduced bitstream storage/communication requirements

8 Clock Domains Local Clock Domains (LCDs) One global clock signal Leverage regional clock resources in Virtex 4/5 FPGAs Upgrade to existing clock infrastructure techniques necessary for bitstream relocation Clock frequency specified at PRM level Provides a finer-grained control of multiple clock domains Multiple global clock domains Each unique PRR clock frequency requires a global clock signal MUX in each PRR selects intended frequency Imposes overheads such as increased power consumption PRR Multiple Clock Domains w/o LCDs PRR Static MUX (CLBS) PRR Multiple Clock Domains w/ LCDs PRR Static BUFR

9 Power Consumption Improvement Power consumption improvement scales at approximately 3-4 mW/domain * Clock frequency was held constant across domains for comparison Number of Clock Domains

10 Increased Max Clock Frequency 10-15% average increase for LCD vs. Bus Macro Max Clock Frequency for PR Audio Filter Design Modules Global: clock signals routed globally Not compatible with BR (no consistent clock signal interface across PRRs) LCD: clock signals routed through regional clock resources Compatible with BR (regional clock resources provide consistent signal interface) Bus Macro: clock signals routed through bus macros Compatible with BR (bus macros provide consistent signal interface) Introduces routing inefficiencies that degrade max frequency