1 Kadlec MAPLD05/P148 Reconfigurable Floating Point Co-Processor for Atmel FPSLIC Jiri Kadlec Institute of Information Theory and Automation (UTIA), Academy.

Presentation transcript:

1 Kadlec MAPLD05/P148
Reconfigurable Floating Point Co-Processor for Atmel FPSLIC
Jiri Kadlec
Institute of Information Theory and Automation (UTIA), Academy of Sciences of the Czech Republic, Prague, CZ

2 Presentation outline
- Principle of partial dynamic reconfiguration on Atmel FPSLIC
- Support infrastructure: SW view and SW/HW co-design view
- Dynamically reconfigurable scalable floating-point unit
  - Where it comes from
  - Parallel operations: ADD, MUL, FX2FP, FP2FX
  - Sequential DIV, SQRT
- Case study 1: 32-bit pipelined FP multiplier sliced into 2 contexts
- Case study 2: 24-bit FP adder and 24-bit FP multiplier as 2 D_macros
- Lessons learned and thanks

3 Principle of partial dynamic reconfiguration on Atmel FPSLIC
[Figure: block diagram. Labels: HW IP 1, HW IP 2, software application, SW data, 8-bit RISC MCU, X[7:0], Y[7:0], Z[7:0], D[7:0], write 32 bits (X, Y, Z), RSA, internal SRAM, internal processor, small internal FPGA, internal modification of LUTs.]
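
The "write 32 bits" label suggests that each configuration update is a single transaction made of three address bytes (X, Y, Z) and one data byte (D). The Python sketch below is a toy model of that idea, not FPSLIC code: the `ConfigMemory` class, the `diff_stream` helper, and the example addresses and values are all hypothetical. It only illustrates how a stored difference bit-stream could be replayed by the processor to swap HW IP 1 for HW IP 2.

```python
from typing import Dict, Iterable, List, Tuple

Address = Tuple[int, int, int]      # (x, y, z), one byte each
Write = Tuple[int, int, int, int]   # (x, y, z, d): 32 bits per write


class ConfigMemory:
    """Toy model of byte-addressed FPGA configuration SRAM (hypothetical)."""

    def __init__(self) -> None:
        self.cells: Dict[Address, int] = {}
        self.writes = 0

    def write(self, x: int, y: int, z: int, d: int) -> None:
        for value in (x, y, z, d):
            if not 0 <= value <= 0xFF:
                raise ValueError("X, Y, Z and D are 8-bit values")
        self.cells[(x, y, z)] = d
        self.writes += 1

    def load(self, stream: Iterable[Write]) -> None:
        for x, y, z, d in stream:
            self.write(x, y, z, d)


def diff_stream(old: Dict[Address, int], new: Dict[Address, int]) -> List[Write]:
    """Writes needed to turn `old` into `new`.

    Assumes both contexts occupy the same set of addresses.
    """
    return [(x, y, z, d) for (x, y, z), d in sorted(new.items())
            if old.get((x, y, z)) != d]


# Two made-up contexts that differ in two of three configuration bytes.
ip1 = {(0, 0, 0): 0x3C, (0, 1, 0): 0xA5, (1, 0, 0): 0xFF}
ip2 = {(0, 0, 0): 0x3C, (0, 1, 0): 0x5A, (1, 0, 0): 0x0F}

fpga = ConfigMemory()
fpga.load((x, y, z, d) for (x, y, z), d in ip1.items())  # full load of HW IP 1
delta = diff_stream(ip1, ip2)                            # reduced bit-stream
fpga.load(delta)                                         # partial reconfiguration
```

Only the bytes that differ are rewritten, which is why a difference bit-stream can be much shorter than a full one.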

4 Partially reconfigurable scalable floating-point unit
- Source code has been derived from the Celoxica floating-point library. From DK3.1 we use:
  - RTL simulator
  - Generic VHDL
  - C++ bit-exact models, which can be exported to Matlab/Simulink test benches
- VHDL code is recompiled for Atmel. The back end is the free Figaro P&R with extensions developed in the EU RECONF project.
- Our test bench: the blue block is a bit-exact representation of the floating-point adder. The identical model supports several widths of mantissa and exponent.
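
What a model with "several widths of mantissa and exponent" might look like can be sketched in a few lines of Python. This is not the Celoxica or UTIA model: the field layout (sign, biased exponent, mantissa with a hidden leading one) is an IEEE-754-like assumption, rounding is plain truncation, and the function names and the 8/15 split used for a 24-bit word are hypothetical.

```python
import math


def encode(value: float, exp_bits: int, man_bits: int) -> int:
    """Pack a float into sign | exponent | mantissa fields (truncating).

    Subnormals, infinities and NaN are not modeled.
    """
    if value == 0.0:
        return 0
    sign = 1 if value < 0 else 0
    frac, exp = math.frexp(abs(value))   # abs(value) == frac * 2**exp, 0.5 <= frac < 1
    bias = (1 << (exp_bits - 1)) - 1
    biased = exp - 1 + bias              # exponent of the 1.xxx form
    if not 0 < biased < (1 << exp_bits) - 1:
        raise OverflowError("value outside the normal range of this format")
    mantissa = int((frac * 2 - 1) * (1 << man_bits))  # drop hidden one, truncate
    return (sign << (exp_bits + man_bits)) | (biased << man_bits) | mantissa


def decode(word: int, exp_bits: int, man_bits: int) -> float:
    """Inverse of `encode` for zero and normal numbers."""
    mantissa = word & ((1 << man_bits) - 1)
    biased = (word >> man_bits) & ((1 << exp_bits) - 1)
    sign = (word >> (exp_bits + man_bits)) & 1
    if biased == 0 and mantissa == 0:
        return 0.0
    bias = (1 << (exp_bits - 1)) - 1
    value = (1 + mantissa / (1 << man_bits)) * 2.0 ** (biased - bias)
    return -value if sign else value


# The same two functions serve a 32-bit (8, 23) and a 24-bit (8, 15) word.
word32 = encode(1.5, 8, 23)
word24 = encode(-2.75, 8, 15)
```

Keeping the widths as parameters means the same reference model can check a 32-bit and a 24-bit hardware core against identical test vectors.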

5 Support infrastructure, SW view for AVR programmer

6 SW/HW view
[Figure: diagram. Labels: SW/HW cores; FLASH data; FLASH pgm.; FLASH bst.; PGM overlay; API for macros; bit-stream formatter (PC); guidelines for macros and top.]

7 Case study 1: A 32-bit pipelined multiplier sliced manually into 2 smaller dynamically reconfigurable contexts to fit in
D_reconfigurable super-macro with 2 contexts and dual-ported SRAM in the top static design. Target: AT40/94.

8 Sliced 32-bit pipelined multiplier (2)
- Top-level placement of the 32-bit I/O registers (ra, rb, rz) (left) and the 40-bit, 32-word dual-port SRAM (right).
- I/Os reflecting the “cut” of the floating-point macro.
- Top-level placement of the super-macro, aligned with the dual-port SRAM.
- Nets of the static part with registers ra, rb, rz and the 40-bit, 32-word dual-port Atmel FreeRAM.

9 Sliced 32-bit pipelined multiplier (3)
- Pipelined 32-bit floating-point multiplier, AT94K40. Left: macro, 1581 Logic Cells. Right: top with nets for the single FP macro. Maximal clock 16.8 MHz. Latency 7. Dense nets result in a lower maximal clock frequency. Logic Cell utilization: 75%.
- Stage 1 of the sliced 32-bit FP multiplier, AT94K40. Left: macro, 1083 Logic Cells. Right: top with nets for the Stage 1 context. Maximal clock 18.6 MHz. Latency 5. The 40-bit cut bus from S1 to S2 uses 32 words of 40-bit DP RAM. Logic Cell utilization: 55%.
- Stage 2 of the sliced 32-bit FP multiplier, AT94K40. Left: macro, 512 Logic Cells. Right: top with the Stage 2 context. Maximal clock 20.5 MHz. Latency 4. The BE flow reserves an identical subset of cells for both contexts. Logic Cell utilization: 55%.

10 Sliced 32-bit pipelined multiplier (4)
Interface of the sliced multiplier super-macro in the FPSLIC test bench. It helps to define one-to-one connections from the static to the dynamic part of the design.

11 Sliced 32-bit pipelined multiplier (5)
- Placement of the super-macro in the AT94K40. 8-bit registers replaced the DP SRAM.
- Locking the whole area forced the automatic placement of the rest of the top-level design into the unlocked “south”.
- The top design with nets takes 506 Logic Cells (22%).
- Left: top with the Stage 1 context. Right: top with the Stage 2 context.
Reconfiguration time:
- Stage 1 -> Stage 2: 16 ms
- Stage 2 -> Stage 1: 16 ms
Reconfiguration times are for 18 MHz, Mode 4. Reduced-size bit-streams (differences of bit-streams) are stored in FLASH and downloaded from it by the AVR processor.
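
A rough feel for how the 16 ms context swaps compare with the compute time can be had from a back-of-the-envelope estimate in Python. The batching scheme is an assumption made purely for illustration: a batch of operands passes through Stage 1, the context is swapped, the batch passes through Stage 2, and the context is swapped back. The function names and the batch size of 32 (the depth of the dual-port RAM mentioned for the cut bus) are hypothetical; only the clock, latency, and 16 ms figures are taken from the slides.

```python
def pipeline_time_s(items: int, latency_cycles: int, clock_hz: float) -> float:
    """Time for `items` results from a pipeline: fill once, then one per clock."""
    return (latency_cycles + items - 1) / clock_hz


def sliced_batch_time_s(items: int, reconfig_s: float = 16e-3) -> float:
    """Stage 1, swap to Stage 2, Stage 2, swap back to Stage 1 (assumed flow)."""
    stage1 = pipeline_time_s(items, latency_cycles=5, clock_hz=18.6e6)
    stage2 = pipeline_time_s(items, latency_cycles=4, clock_hz=20.5e6)
    return stage1 + reconfig_s + stage2 + reconfig_s


batch = 32                                   # assumed batch size
total = sliced_batch_time_s(batch)           # about 32 ms, almost all of it swapping
unsliced = pipeline_time_s(batch, latency_cycles=7, clock_hz=16.8e6)
reconfig_share = (2 * 16e-3) / total
```

Under these assumptions the two context swaps account for more than 99.9% of the batch time, so slicing trades throughput for the ability to fit a design that would otherwise be too large.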

12 Case study 2: Reconfigurable 24-bit floating-point ADD/MUL cores
[Figures: placement and routing of the 24-bit FP adder; placement and routing of the 24-bit FP multiplier.]
- Reconfiguration time: 50 ms for 4 MHz AVR clock
- Partial bit-stream size: 20k 32-bit configuration words
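
The two figures above imply a cost per configuration word, which a short Python calculation makes explicit. It assumes that "20k" means 20,000 words and that the AVR is the only bottleneck; the function and variable names are hypothetical.

```python
def cycles_per_word(reconfig_ms: int, clock_khz: int, words: int) -> float:
    """AVR clock cycles spent per 32-bit configuration word."""
    total_cycles = reconfig_ms * clock_khz   # ms * kHz = cycles
    return total_cycles / words


words = 20_000                               # assumed meaning of "20k"
cpw = cycles_per_word(reconfig_ms=50, clock_khz=4_000, words=words)
stream_bytes = words * 4                     # 32-bit words stored in FLASH
```

With these assumptions each word costs 10 AVR cycles and the partial bit-stream occupies 80,000 bytes, so the reconfiguration time would be expected to scale inversely with the AVR clock.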

13 Lessons learned and thanks
++ Best for data streaming operations
++ Low-cost external FLASH can store tens of HW and SW overlays
++ Partial dynamic reconfiguration on FPSLIC can result in a low-cost and low-power solution
++ It is a valid path for small groups who cannot go for an ASIC
-- High complexity needs support in a predefined SW/HW infrastructure
-- Performance and size of the FPGA part are limited
This work has been partially supported by the EU project RECONF.