DIPARTIMENTO DI ELETTRONICA E INFORMAZIONE Novel, Emerging Computing System Technologies Smart Technologies for Effective Reconfiguration: The FASTER approach.

Presentation transcript:

DIPARTIMENTO DI ELETTRONICA E INFORMAZIONE
Novel, Emerging Computing System Technologies
Smart Technologies for Effective Reconfiguration: The FASTER approach
International Conference on IC Design and Technology, Pavia, Italy, May 29th – 31st, 2013
M. D. Santambrogio, C. Pilato, D. Pnevmatikatos, K. Papadimitriou, D. Stroobandt, D. Sciuto

Reconfigurable Technology
Technology for adaptable hardware systems
  - Can add/remove components at run-time or over the product lifetime
  - Flexibility at hardware speed (not quite ASIC)
  - Parallelism at hardware level (depending on application)
  - Ideally: alter function and interconnection of blocks
Implementation in:
  - FPGAs: fine grain, complex gates plus memory and DSP blocks
  - Coarse-grain (custom) chips: multiple ALUs, multiple (simple) programmable processing blocks, etc.

An issue as a new opportunity
Programming has become very difficult
  - Impossible to balance all constraints manually
More computational horsepower than ever before
  - Cores are free, reconfigurable logic is available on chip, cores can be heterogeneous
Energy is a new constraint
  - Software must become energy and space aware
Modern computing systems need to be flexible and adaptive
  - To optimize and meet their requirements, taking advantage as much as possible of the underlying complex heterogeneous architectures

FASTER Motivation
Creating reconfigurable systems is not straightforward!
  - Reconfiguration cost is substantial (use wisely)
  - Tool support for these tasks is still quite basic
  - Resource management is up to the user
  - The designer has to:
      - Identify portions to be reconfigured
      - Establish a schedule that (a) respects dependencies and (b) meets performance and other constraints
      - Manage the system resources (also at run-time)
      - Verify a changing system!
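
The scheduling obligation on this slide can be illustrated with a minimal Python sketch. It assumes a single reconfigurable region that runs one task at a time and a fixed reconfiguration time; the function name `schedule`, the task tuples and the numbers are hypothetical and are not taken from the FASTER tool flow.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def schedule(tasks, deps, reconfig_time):
    """Serial schedule on one reconfigurable region (illustrative assumption).

    tasks: name -> (module, exec_time)
    deps:  name -> set of predecessor task names
    Returns a list of (name, start, finish) tuples.
    """
    # A topological order guarantees every task starts after its predecessors.
    order = TopologicalSorter(deps).static_order()
    loaded = None   # module currently configured in the region
    clock = 0       # time at which the region becomes free
    plan = []
    for name in order:
        module, exec_time = tasks[name]
        start = clock
        if module != loaded:
            # Pay the reconfiguration cost only when the module changes.
            start += reconfig_time
            loaded = module
        finish = start + exec_time
        plan.append((name, start, finish))
        clock = finish
    return plan


tasks = {"T1": ("A", 2), "T2": ("A", 3), "T3": ("B", 4)}
deps = {"T1": set(), "T2": {"T1"}, "T3": {"T2"}}
plan = schedule(tasks, deps, reconfig_time=5)
```

In this example T2 reuses module A and starts immediately after T1, while T3 needs module B and is delayed by one reconfiguration. This is why the slide advises using reconfiguration wisely.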

FASTER Goals and Innovation
Include reconfigurability as an explicit design concept in computing systems design, along with methods and tools that support run-time reconfiguration in the entire design methodology
  - Provide a framework for analysis, synthesis and verification of a reconfigurable system
  - Provide efficient and transparent runtime support for partial and dynamic reconfiguration, including micro-reconfiguration
Demonstrate usability & performance on commercial applications and platforms (Maxeler, ST Microelectronics, Synelixis)

FASTER Platforms
Bridging the gap between HPC and embedded systems
  - Opportunities and challenges of reconfiguration in both domains
High-Performance Computing Systems
  - Maxeler MPC MaxWorkstation
FPGA-based Embedded Systems
  - Xilinx University Program Board (XUPV5-LX110T)
  - AVNET Zedboard (SoC XC7Z020)

FASTER: Overall Methodology

Design Phase and Runtime Support
Define a reconfiguration-aware design methodology that exploits FPGAs:
  - Generate hardware and software components (including runtime support) on top of existing vendor flows
Exploit dynamic reconfigurability for different target reconfigurable architectures
  - Both HPC and embedded systems
Define and implement a new generation of self-reconfigurable architectures based on Linux

System analysis and design
[Figure: design flow diagram. Blocks: Task Graph Generation, High Level Analysis, DFG Extraction, Static Baseline Scheduling, Partitioning and Optimizations, Mapping and Floorplanning, Run-time Support and Verification. Labels: .c and .xml files; Architecture; Additional application information; Annotated source code (C+OpenMP); Source code for CPU; DFGs for HW blocks; Mapping Configurations; HLS; System generation.]

Identifying Level of Reconfigurability
Assigning each task of the application to the “best” processing element
  - Reconfiguration is implicitly considered
Based on a metaheuristic iterative algorithm
Objectives: function of occupation area, execution time, power, number of reconfigurations, etc.
[Figure labels: tasks T1 to T5; Mapping (MAP); Iterative, multi-objective: runtime, power, area, ...; Convergence; Architectural Template (XML); Library (XML); Platform Specification (XML).]
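
The slide does not name the metaheuristic, so the sketch below swaps in a plain hill-climbing loop to show the general shape of an iterative, multi-objective mapping step. The cost model (a weighted sum of time, area and one reconfiguration per task placed on the FPGA), the processing-element names `cpu` and `fpga`, and all numbers are assumptions made for illustration only.

```python
import random


def cost(mapping, tasks, weights):
    """Weighted sum of execution time, area, and number of reconfigurations."""
    time = sum(tasks[t][pe]["time"] for t, pe in mapping.items())
    area = sum(tasks[t][pe]["area"] for t, pe in mapping.items())
    # Assumption: every task mapped to the FPGA needs its own configuration.
    reconfigs = sum(1 for pe in mapping.values() if pe == "fpga")
    return (weights["time"] * time
            + weights["area"] * area
            + weights["reconf"] * reconfigs)


def improve_mapping(tasks, weights, iterations=200, seed=0):
    """Hill climbing: move one task at a time, keep the move if cost drops."""
    rng = random.Random(seed)
    names = sorted(tasks)
    mapping = {t: "cpu" for t in names}  # all-software starting point
    best = cost(mapping, tasks, weights)
    for _ in range(iterations):
        task = rng.choice(names)
        candidate = dict(mapping)
        candidate[task] = rng.choice(sorted(tasks[task]))
        c = cost(candidate, tasks, weights)
        if c < best:
            mapping, best = candidate, c
    return mapping, best


tasks = {
    "T1": {"cpu": {"time": 10, "area": 0}, "fpga": {"time": 2, "area": 3}},
    "T2": {"cpu": {"time": 4, "area": 0}, "fpga": {"time": 3, "area": 5}},
    "T3": {"cpu": {"time": 8, "area": 0}, "fpga": {"time": 1, "area": 2}},
}
weights = {"time": 1.0, "area": 0.5, "reconf": 1.0}
baseline = cost({t: "cpu" for t in tasks}, tasks, weights)
mapping, best = improve_mapping(tasks, weights)
```

With these weights, T2 is expected to stay in software because its speed-up does not pay for the area and the extra reconfiguration. A real flow would replace the weighted sum with whatever objective function the architectural template and platform specification define.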

Micro-reconfiguration Optimization
In some applications we can identify hardware accelerators with slow-changing “parameters”
  - Filter coefficients
Parameters trigger a small-scale reconfiguration
Design of cores based on Tunable FPGA blocks:
  - Identify parameters
  - Create bitfile with “holes”
  - Parameter values => reconfiguration bits for missing “holes”
  - Fine grain, faster reconfiguration time!
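
As a rough illustration of the "bitfile with holes" idea, the Python sketch below treats the template as a byte string and the holes as a list of bit positions that one parameter fills in. The layout is purely hypothetical: it does not reflect any real FPGA bitstream format, and the names `specialize` and `changed_bytes` are invented for this example.

```python
def specialize(template, holes, coefficient):
    """Fill each hole with one bit of the coefficient (least significant first).

    template:    bytes, configuration data with the hole bits left at 0
    holes:       list of absolute bit positions inside the template
    coefficient: integer parameter value, e.g. a filter coefficient
    """
    bits = bytearray(template)
    for i, pos in enumerate(holes):
        byte, offset = divmod(pos, 8)
        bit = (coefficient >> i) & 1
        # Clear the hole, then write the parameter bit into it.
        bits[byte] = (bits[byte] & ~(1 << offset) & 0xFF) | (bit << offset)
    return bytes(bits)


def changed_bytes(old, new):
    """Indices of bytes that differ, i.e. what would need to be rewritten."""
    return [i for i, (a, b) in enumerate(zip(old, new)) if a != b]


template = bytes(8)                 # 64 configuration bits, all zero
holes = [4, 5, 16, 17]              # four tunable bits (hypothetical positions)
config_a = specialize(template, holes, 0b1010)
config_b = specialize(template, holes, 0b1011)
delta = changed_bytes(config_a, config_b)
```

Changing the coefficient from `0b1010` to `0b1011` touches a single byte of the eight, which mirrors the slide's point: only the bits tied to the parameter are rewritten, so the reconfiguration is fine-grained and fast.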

Verifying Reconfigurable Systems
Study design validation approaches: simulation, emulation and formal verification
Extend symbolic simulation to dynamic aspects of reconfigurable design
In some cases static approaches may not be able to verify the entire RC system
  - We use run-time verification
Address and minimize impact on: speed, area and power
Light-weight architectural support

Run-time System
Evaluate reconfiguration overhead
Propose advanced mechanisms to support
  - Scheduling
  - Dynamic reconfiguration (including micro-reconfiguration)
  - Run-time verification
Provide run-time support for dynamic reconfiguration based on static analysis
  - Extension of OS capabilities
  - Efficient on-line scheduling and placement of task modules
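
One simple way to evaluate reconfiguration overhead is a break-even estimate: how many calls to a hardware module are needed before the time saved per call repays the time spent loading it. The Python sketch below assumes reconfiguration time is bitstream size divided by configuration-port throughput; the function names and every number are illustrative assumptions, not measurements from the project.

```python
import math


def reconfig_time_s(bitstream_bytes, port_bytes_per_s):
    """Time to load a partial bitstream through the configuration port."""
    return bitstream_bytes / port_bytes_per_s


def break_even_calls(t_reconf, t_sw, t_hw):
    """Smallest number of calls for which hardware plus reconfiguration wins.

    Returns None when the hardware version is not faster per call,
    because reconfiguring can then never pay off.
    """
    if t_hw >= t_sw:
        return None
    return math.ceil(t_reconf / (t_sw - t_hw))


# Illustrative numbers: 400 kB partial bitstream, 400 MB/s configuration port.
t_reconf = reconfig_time_s(400_000, 400_000_000)    # 1 ms
calls = break_even_calls(t_reconf, t_sw=0.0005, t_hw=0.0001)
```

With these figures the module must be used at least three times per load to be worthwhile. An on-line scheduler could use such an estimate to decide whether to reconfigure or to keep running a task in software.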

OS-based Management
Provide software support for dynamic partial reconfiguration on Linux-based operating systems
  - Reconfiguration process managed from the OS in a transparent way
Hardware-independent interface for software developers based on GNU/Linux
  - Addition and removal of reconfigurable components
Easier programming interface for specific drivers
  - OS customization for specific architectures

Expected Results and Conclusions
FASTER is a focused project that builds on the partners' combined expertise as well as on past research work and projects
We focus on (and hope to demonstrate):
  - productivity improvement in implementation and verification of dynamically changing systems
  - total ownership cost reduction (NIDS and RTM systems)
  - performance improvement under power constraints for Global Illumination and Image Analysis applications

Challenges & Opportunities
Tool support for analysis and system definition
Specification of changing system(s)
Reconfigurable granularity: influenced by tools and applications
Architectural support for reconfiguration (vendor?)
Metrics: include design effort/time, total ownership cost