Published by Kerrie Butler. Modified over 9 years ago.
1
Bitstream Relocation with Local Clock Domains for Partially Reconfigurable FPGAs
Adam Flynn, Ann Gordon-Ross, Alan D. George
NSF Center for High-Performance Reconfigurable Computing (CHREC)
Department of Electrical and Computer Engineering
University of Florida
2
Partial Reconfiguration
Full Reconfiguration
– Bitstream for the entire FPGA is loaded onto the FPGA
Partial Reconfiguration (PR)
– Only certain portion(s) of the FPGA are reprogrammed with smaller application modules
Advantages
– Shorter reconfiguration time
– Less power
– Smaller bitstreams
– Rest of FPGA remains operational
Figure: Peripheral links can be maintained during reconfiguration (labels: FPGA, Design A, Design B, Reconfigurable Region, Module A, Module B, Module C, Module D, Radio Link, Video Link)
3
PR Terminology
Partial Reconfiguration Region (PRR)
– FPGA fabric is partitioned into multiple reconfigurable regions
Partial Reconfiguration Module (PRM)
– Application modules
– Loaded into PRRs on-the-fly
– Multiple PRMs can map to each PRR
Static Region
– Fixed, base design logic
– Remains operational during reconfiguration
Bus Macro
– Pre-placed, pre-routed macro to route signals between PRMs and the static region
Figure labels: PRR, Static, PRM A, PRM B, PRM C, PRM D
4
Basic Problem with PR
Each PRM-to-PRR mapping requires a unique partial bitstream
– Multiple bitstreams must be stored for each module
– Unnecessary overhead
Figure labels: Bitstream Storage (PRMA_PRR1, PRMB_PRR1, PRMA_PRR2, PRMB_PRR2); FPGA (Static, PRR 1, PRR 2)
5
Solution - Bitstream Relocation
Partial bitstream is manipulated to change its physical location on the FPGA
Sample SW Terminal Output:
[write] - Frame Address (FAR) register. Top/Bottom: Top. Resource type: CLB/IO/CLK. Row number: 2. Column number: 2. Minor address 3.
Changing FAR address column from 2 to 3.
Changing FAR address row from 2 to 0.
Figure labels: FPGA, PRR, Module, Static
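The address change shown in the sample terminal output can be sketched as a small bit-field rewrite. This is only an illustration, not the authors' relocation tool: the field positions in `FIELDS` assume a Virtex-4-style FAR layout and should be checked against the configuration user guide for the target device, and the names `FrameAddress`, `decode_far`, `encode_far`, and `relocate_far` are hypothetical.

```python
from dataclasses import dataclass, replace

# ASSUMPTION: (shift, width) of each FAR field, Virtex-4 style.
# Verify against the device configuration guide before reuse.
FIELDS = {
    "bottom": (22, 1),      # assumed encoding: 0 = top half, 1 = bottom half
    "block_type": (19, 3),  # resource type, e.g. CLB/IO/CLK
    "row": (14, 5),
    "column": (6, 8),
    "minor": (0, 6),
}


@dataclass(frozen=True)
class FrameAddress:
    bottom: int
    block_type: int
    row: int
    column: int
    minor: int


def decode_far(word: int) -> FrameAddress:
    """Split a raw FAR word into its named fields."""
    return FrameAddress(**{
        name: (word >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    })


def encode_far(far: FrameAddress) -> int:
    """Pack named fields back into a raw FAR word, rejecting overflow."""
    word = 0
    for name, (shift, width) in FIELDS.items():
        value = getattr(far, name)
        if not 0 <= value < (1 << width):
            raise ValueError(f"{name}={value} does not fit in {width} bits")
        word |= value << shift
    return word


def relocate_far(word: int, row: int, column: int) -> int:
    """Return the FAR word with only the row and column fields changed."""
    return encode_far(replace(decode_far(word), row=row, column=column))


# Mirrors the sample output: row 2 -> 0, column 2 -> 3, minor address kept at 3.
original = encode_far(FrameAddress(bottom=0, block_type=0, row=2, column=2, minor=3))
moved = relocate_far(original, row=0, column=3)
print(decode_far(moved))
```

A complete relocator would also have to find each FAR write in the bitstream's packet stream and deal with the bitstream's CRC words; the sketch covers only the address arithmetic.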
6
Bitstream Relocation (BR) Benefits
– Increased flexibility in time-multiplexing FPGA resources
– Reduced bitstream storage requirements
– Bitstream migration between devices
– Ability to move modules away from faults
Eliminates need for multiple copies of the same partial bitstream
Figure (No relocation): Bitstream Storage holds PRM A/PRR 1, PRM A/PRR 2, PRM A/PRR 3, PRM A/PRR 4, PRM B/PRR 1, PRM B/PRR 2, PRM B/PRR 3, PRM B/PRR 4
Figure (With relocation): Bitstream Storage holds PRM A and PRM B; a Bitstream Relocator targets PRR 1, PRR 2, PRR 3, PRR 4 on the FPGA alongside the Static region
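The storage saving in the figure can be checked with a few lines of arithmetic. This is a minimal sketch that assumes every PRM may be placed in every PRR and that one stored copy per PRM is enough once relocation is available; the function name `bitstreams_stored` is hypothetical.

```python
def bitstreams_stored(num_prms: int, num_prrs: int, relocation: bool) -> int:
    """Count the partial bitstreams that must be kept in storage.

    Without relocation, each PRM-to-PRR mapping needs its own bitstream.
    With relocation, one bitstream per PRM is assumed to be sufficient.
    """
    return num_prms if relocation else num_prms * num_prrs


# Figure case: 2 PRMs (A, B) and 4 PRRs.
print(bitstreams_stored(2, 4, relocation=False))  # 8
print(bitstreams_stored(2, 4, relocation=True))   # 2
```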
7
Motivating Application Domains
Reconfigurable fault tolerance & adaptable component-level protection
– Fault tolerance mode can adapt to current conditions
– PRRs can be reconfigured to desired protection/performance level
Virtual Architecture for PR
– Framework for online module placement and scheduling
– Peripheral interface(s) and inter-module communication infrastructure statically defined
Multiple benefits of bitstream relocation:
– Flexibility for on-the-fly placement and scheduling of modules
– Reduced bitstream storage/communication requirements
Figure labels: Triple Modular Redundancy (TMR), Self Checking Pair (SCP), Decision Logic, Module A, Module B, Module C, 2x, 3x
8
Clock Domains
Local Clock Domains (LCDs)
– One global clock signal
– Leverage regional clock resources in Virtex 4/5 FPGAs
– Upgrade to existing clock infrastructure techniques necessary for bitstream relocation
– Clock frequency specified at PRM level
– Provides finer-grained control of multiple clock domains
Multiple global clock domains
– Each unique PRR clock frequency requires a global clock signal
– MUX in each PRR selects intended frequency
– Imposes overheads such as increased power consumption
Figure (Multiple Clock Domains w/o LCDs): PRR, Static, MUX (CLBs)
Figure (Multiple Clock Domains w/ LCDs): PRR, Static, BUFR
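The difference in global clock usage between the two schemes can be sketched as a simple count. The function name `global_clocks_needed` and the example frequencies are hypothetical; the only facts taken from the slide are that each unique PRR frequency needs its own global clock without LCDs, and that LCDs use one global clock signal.

```python
def global_clocks_needed(prr_frequencies_mhz: list[float], use_lcd: bool) -> int:
    """Count global clock signals needed to serve the given PRR frequencies."""
    if not prr_frequencies_mhz:
        return 0
    if use_lcd:
        # One global clock; per-PRR frequencies come from regional resources.
        return 1
    # One global clock per unique frequency, selected by a MUX in each PRR.
    return len(set(prr_frequencies_mhz))


# Hypothetical design: four PRRs running at three distinct frequencies.
frequencies = [50.0, 100.0, 100.0, 200.0]
print(global_clocks_needed(frequencies, use_lcd=False))  # 3
print(global_clocks_needed(frequencies, use_lcd=True))   # 1
```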
9
Power Consumption Improvement
Power consumption improvement scales at approximately 3-4 mW/domain
* Clock frequency was held constant across domains for comparison
Figure: power consumption improvement plotted against Number of Clock Domains
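As a rough worked example of the stated slope, the extra improvement from adding clock domains can be bracketed by the 3-4 mW/domain range. This assumes the trend stays linear over the range of interest, which the slide only reports approximately; the function name `added_improvement_mw` is hypothetical.

```python
def added_improvement_mw(added_domains: int,
                         rate_mw_per_domain: tuple[float, float] = (3.0, 4.0)
                         ) -> tuple[float, float]:
    """Return (low, high) extra power improvement in mW for added domains.

    ASSUMPTION: improvement grows linearly at the slide's 3-4 mW/domain.
    """
    low, high = rate_mw_per_domain
    return added_domains * low, added_domains * high


# Adding four clock domains under the linear assumption.
print(added_improvement_mw(4))  # (12.0, 16.0)
```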
10
Increased Max Clock Frequency
10-15% average increase for LCD vs. Bus Macro
Figure: Max Clock Frequency for PR Audio Filter Design Modules
Global: clock signals routed globally
– Not compatible with BR (no consistent clock signal interface across PRRs)
LCD: clock signals routed through regional clock resources
– Compatible with BR (regional clock resources provide consistent signal interface)
Bus Macro: clock signals routed through bus macros
– Compatible with BR (bus macros provide consistent signal interface)
– Introduces routing inefficiencies that degrade max frequency