VLSI-SoC 2001 IFIP - LIRMM Stream-based Arrays: Converging Design Flows for both, Reiner Hartenstein University of Kaiserslautern December 2- 4, 2001,

Slides:



Advertisements
Similar presentations
CASES 2002 Intl Conference on Compilers, Architectures and Synthesis for Embedded Systems Embedded Architectures: Configurable, Re-configurable, or what?
Advertisements

Reconfigurable Computing After a Decade: A New Perspective and Challenges For Hardware-Software Co-Design and Development Tirumale K Ramesh, Ph.D. Boeing.
Enabling Technologies for Reconfigurable Computing Reiner Hartenstein University of Kaiserslautern November 21, 2001, Tampere, Finland Enabling Technologies.
FPGA (Field Programmable Gate Array)
ECOE 560 Design Methodologies and Tools for Software/Hardware Systems Spring 2004 Serdar Taşıran.
Enabling Technologies for Reconfigurable Computing Enabling Technologies for Reconfigurable Computing Part 2: Stream-based Computing for RC Wednesday,
TIE Extensions for Cryptographic Acceleration Charles-Henri Gros Alan Keefer Ankur Singla.
The von Neumann Syndrome Reiner Hartenstein TU Kaiserslautern TU Delft, Sept 28, (v.2)
Reconfigurable Supercomputing means to brave the paradigm chasm Reiner Hartenstein HiPEAC Workshop on Reconfigurable Computing Ghent, Belgium January 28,
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
The Microprocessor is no more General Purpose. Design Gap.
ICECS 2002 IEEE 9th International Conference on Electronics, Circuits and Systems Trends in Reconfigurable Logic and Reconfigurable Computing (invited.
Enabling Technologies for Reconfigurable Computing Enabling Technologies for Reconfigurable Computing and Software / Configware Co-Design Part 2: Data-Stream-based.
An Introduction to Reconfigurable Computing Mitch Sukalski and Craig Ulmer Dean R&D Seminar 11 December 2003.
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
Enabling Technologies for System-on-Chip Development Reconfigurable Computing Architectures and Methodologies for System-on-Chip Monday, November 19, 10:15.
Graduate Computer Architecture I Lecture 16: FPGA Design.
The 5th IEEE Workshop on Design & Diagnosis of Electronic Circuits & Systems (DDECS'02)DDECS'02 Configware / Software Co-Design: be prepared for the Next.
Course-Grained Reconfigurable Devices. 2 Dataflow Machines General Structure:  ALU-computing elements,  Programmable interconnections,  I/O components.
Reconfigurable Supercomputing: Hindernisse und Chancen Reiner Hartenstein TU Kaiserslautern Universität Mannheim, 13. Dez
IPDPS 2004 Software or Configware? About the Digital Divide of Parallel Computing Reiner Hartenstein TU Kaiserslautern Santa Fe, NM, April , 2004.
From Organic Computing to Reconfigurable Computing Reiner Hartenstein TU Kaiserslautern PASA, Frankfurt, March 16, 2006.
Reconfigurable HPC Reconfigurable HPC part 1 Introduction Reiner Hartenstein TU Kaiserslautern May 14, 2004, TU Tallinn, Estonia.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Spring 08, Jan 15 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Seminar at Kyushu University Reconfigurable Technologies (1) Reiner Hartenstein TU Kaiserslautern July 23, 2004, Fukuoka, Japan.
Define Embedded Systems Small (?) Application Specific Computer Systems.
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
SSS 4/9/99CMU Reconfigurable Computing1 The CMU Reconfigurable Computing Project April 9, 1999 Mihai Budiu
Enabling Technologies for System-on-Chip Development Reconfigurable Computing Architectures and Methodologies for System-on-Chip Monday, November 19, 10:15.
Seven Minute Madness: Reconfigurable Computing Dr. Jason D. Bakos.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
L29:Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수,
The Transdisciplinary Responsibility of CS Curricula Reiner Hartenstein TU Kaiserslautern San Diego, CA, USA, June , 2006 THE NINTH WORLD CONFERENCE.
Development in hardware – Why? Option: array of custom processing nodes Step 1: analyze the application and extract the component tasks Step 2: design.
DRRA Dynamically Reconfigurable Resource Array
1 Chapter 1 Parallel Machines and Computations (Fundamentals of Parallel Processing) Dr. Ranette Halverson.
Making FPGAs a Cost-Effective Computing Architecture Tom VanCourt Yongfeng Gu Martin Herbordt Boston University BOSTON UNIVERSITY.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
Paper Review: XiSystem - A Reconfigurable Processor and System
Extreme Makeover for EDA Industry
Automated Design of Custom Architecture Tulika Mitra
DSD2001 Reconfigurable Computing: the Roadmap to a New Business Model – and its Impact on SoC Design TS4: Tuesday, hrs Reiner Hartenstein University.
J. Christiansen, CERN - EP/MIC
ISSS 2001, Montréal1 ISSS’01 S.Derrien, S.Rajopadhye, S.Sur-Kolay* IRISA France *ISI calcutta Combined Instruction and Loop Level Parallelism for Regular.
Computer Architecture And Organization UNIT-II General System Architecture.
Compilers for Embedded Systems Ram, Vasanth, and VJ Instructor : Dr. Edwin Sha Synthesis and Optimization of High-Performance Systems.
EE3A1 Computer Hardware and Digital Design
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
COARSE GRAINED RECONFIGURABLE ARCHITECTURES 04/18/2014 Aditi Sharma Dhiraj Chaudhary Pruthvi Gowda Rachana Raj Sunku DAY
Evaluating and Improving an OpenMP-based Circuit Design Tool Tim Beatty, Dr. Ken Kent, Dr. Eric Aubanel Faculty of Computer Science University of New Brunswick.
Development of Programmable Architecture for Base-Band Processing S. Leung, A. Postula, Univ. of Queensland, Australia A. Hemani, Royal Institute of Tech.,
The von Neumann Syndrome calls for a Revolution Reiner Hartenstein TU Kaiserslautern Reno, NV, November 11, HPRCTA'07 - First.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
Computing Systems: Next Call for Proposals Dr. Panagiotis Tsarchopoulos Computing Systems ICT Programme European Commission.
Mapping of Regular Nested Loop Programs to Coarse-grained Reconfigurable Arrays – Constraints and Methodology Presented by: Luis Ortiz Department of Computer.
Survey of Reconfigurable Logic Technologies
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
Enabling Technologies for Reconfigurable Computing Enabling Technologies for Reconfigurable Computing and Software / Configware Co-Design Part 2: Data-Stream-based.
Computer Organization IS F242. Course Objective It aims at understanding and appreciating the computing system’s functional components, their characteristics,
IPDPS 2004 Software or Configware? About the Digital Divide of Parallel Computing Reiner Hartenstein TU Kaiserslautern Santa Fe, NM, April , 2004.
1 The user’s view  A user is a person employing the computer to do useful work  Examples of useful work include spreadsheets word processing developing.
DSD2001 Reconfigurable Computing: a New Business Model – and its Impact on SoC Design Reiner Hartenstein University of Kaiserslautern Warzaw, Sept. 4 -
Computer Organization and Architecture Lecture 1 : Introduction
Dynamo: A Runtime Codesign Environment
ECE354 Embedded Systems Introduction C Andras Moritz.
Memory Organisation for Datastream-based Reconfigurable Computing
Embedded Architectures: Configurable, Re-configurable, or what?
HIGH LEVEL SYNTHESIS.
Presentation transcript:

VLSI-SoC 2001 IFIP - LIRMM Stream-based Arrays: Converging Design Flows for both, Reiner Hartenstein University of Kaiserslautern December 2- 4, 2001, Montpellier, France Reconfigurable and Hardwired....

© 2001, University of Kaiserslautern 2 >> Stream-based Computing Stream-based Computing Stream-based Compilation Techniques Use in Co-Design Now it’s up to You !

© 2001, University of Kaiserslautern 3 XPU family (IP cores): PACT Corp., Munich commercial rDPAs: rDPA (coarse grain) becoming important XPU128 **) bought ** flexible array: MorphICs CALISTO: Silicon Spice CS2000 family: Chameleon Systems MECA family: Malleable FIPSOC: SIDSA ACM: Quicksilver Tech CHESS array: Elixent MorphoSys: Morpho Tech

© 2001, University of Kaiserslautern 4 array size: 10 x 16 = 160 rDPUs SNN filter Example: KressArray Family not used backbus connect KressArray Xplorer: rout thru only You may use it on your Netscape

© 2001, University of Kaiserslautern 5 Rapidly toward the Break-through replaceConcurrent Processes by more efficient parallelism: stream-based DPAs 1 **) reconfigurable 2 ) KressArray** [1995] and others [later] terms: DPU: datpath unit DPA: data path array rDPU: reconfigurable DPU rDPA: reconfigurable DPA Kress: a generalization of systolic array synthesis: stream-based r DPAs 2 ____ *) hardwired 1 ) systolic array* [1980] [ Broderson ] Bee Project chip-on-a-day* [2000] Generalization of the Systolic Array super systolic synthesis

© 2001, University of Kaiserslautern 6 compare Concurrent Computing DPU instruction sequencer DPU instruction sequencer DPU instruction sequencer DPU instruction sequencer.... Bus (es) or switch box CPU extremely inefficient massive bottleneck phenomena at run time control flow overhead instruction fetch / interpretation overhead address computation overhead - may be massive

© 2001, University of Kaiserslautern 7... with Stream-based Computing: (r) DPA for both, reconfigurable, and hardwired [ Brodersen ] DPU transport-triggered execution driven by data stream fr. / to memory or, fr. / to peripheral interface no instruction sequencer inside ! avoids run time overhead and bottleneck phenomena rDPA: drastically reduced reconfigurability overhead „instruction fetch“: at compile time

© 2001, University of Kaiserslautern 8 Soft rDPA ? Memory soft CPU miscellanous softDPUarraysoftDPUarray HLL Compiler 50 mio system gates soon even large rDPAs as soft IPs become feasible by >2005: don’t care about area efficiency ?

© 2001, University of Kaiserslautern 9 >> Stream-based Compilation Techniques Stream-based Computing Stream-based Compilation Techniques Use in Co-Design Now it’s up to You !

© 2001, University of Kaiserslautern 10 no routing! equations linear projection or algebraic mapping DPU architecture y + * x a placement Systolic Stream-based Computing System linear pipelines and uniform arrays only The Mathematician’s Synthesis Method Systolic Array [ H. T. Kung, 1980 ] : a DPA (Data Path Array) computing in space placement computing in time systolic arrays etc. and other transformations migration by re-timing this dichotomy is completely ignored by our CS curricula y 1 0  y 2 0 y y 1 y 2 y x 1 x 2 x data streams

© 2001, University of Kaiserslautern 11 2 General Stream-based Computing System heterogenous DPA or rDPA simulated annealing free form pipe network Mapper expression tree DPU architectures y + * x a simultaneous placement & routing * * * sh * xf Scheduler data streams 4 2

© 2001, University of Kaiserslautern 12 an example by Nageldinger’s KressArray Xplorer Memory Communication Architecture … hot research topic in embedded systems storage context transformations [ Cathoor, Herz, Kougia, Soudris ] Synthesizable Memory Communication Architecture startups provide memory IPs or generators application not used Legend: sequencers memory ports Optimized Parallel Memory Controller GAG generic sequencer methodology available Herz

© 2001, University of Kaiserslautern 13 >> Use in Co-Design Stream-based Computing Stream-based Compilation Techniques Use in Co-Design Now it’s up to You !

© 2001, University of Kaiserslautern 14 data counter (s) program counter : state register Compiler Memory Datapath hardwired Sequencer Computer tightly coupled by compact instruction code “von Neumann” does not support soft data paths does not support soft data paths Datapath reconfigurable Xputer Scheduler Compiler Memory (multiple) sequencer Datapath Array University of Kaiserslautern loosely coupled by decision data bits only Xputer: The Soft Machine Paradigm reconfigurable Computer: the wrong Machine Paradigm “von Neumann” also for hardwired [ Broderson ] enabling technology published  10 years ago now a hot topic area full day course last week at Tampere, Finland

© 2001, University of Kaiserslautern 15 partitioning compiler high level programming language source Co-Compilation Analyzer / Profiler supporting different platforms Resource Parameters Xputer “Soft” Machine Paradigm Configware running on interface Reconfigurable Accelerators X-C compiler KressArray DPSS GNU C compiler X-C Partitioner Hardware / Software Co-Design turns to Configware / Software Co-Design Jürgen Becker’s Co-DE-X Co-Compiler [ASP-DAC’95] Computer Machine Paradigm Software running on  Processor

© 2001, University of Kaiserslautern 16 Loop Transformation Examples loop 1-8 body endloop loop 9-16 body endloop fork join strip mining loop 1-4 trigger endloop loop 1-2 trigger endloop loop 1-16 body endloop sequential processes: loop 1-8 trigger endloop reconf.array: host: resource parameter driven Co-Compilation loop 1-8 body endloop loop unrolling

© 2001, University of Kaiserslautern 17 >> Now it’s up to You ! Stream-based Computing Stream-based Compilation Techniques Use in Co-Design Now it’s up to You !

© 2001, University of Kaiserslautern 18 However, current CS Education …. Hardware invisible: under the surface … is based on the Submarine Model Brain usage: procedural-only Software Faculty Colleagues shy away from the Paradigm Shift: their Brain hurts? - can’t be: this Half has been amputated Algorithm Assembly Language procedural high level Programming Language Hardware Software This model disables...

© 2001, University of Kaiserslautern 19 Hardware, Configware... this model disables Hardware and Software as Alternatives Algorithm Software partitioning Software only Software & Hardw/Configw procedural structural Brain Usage: both Hemispheres Hardw/Configw only

© 2001, University of Kaiserslautern 20 The Dominance of the Submarine Model... Hardware... indicates, that our CS education system produces zillions of mentally disabled Persons (procedural) structurally disabled … completely disabled to cope with solutions other than software only It‘s time to attack the software faculty dictatorship. Get involved!

© 2001, University of Kaiserslautern 21 >>> thank you thank you for listening It’s up to You !

© 2001, University of Kaiserslautern 22 >>> END END

© 2001, University of Kaiserslautern 23 The Impact of Reconfigurable Logic Reconfigurable platforms bring a new dimension to digital system development and have a strong impact on SoC design. A rapidly growing large user base of HDL-savvy designers with FPGA experience. Flexibility promises spin-around times downto minutes instead of months for real time in-system debugging, profiling, verification, tuning, field-maintenance, and field upgrades A New Business Model (in-field debugging and upgrading... ) A Fundamental Paradigm Shift in Silicon Application Revenue / month Time / months Update 1 Product Update ASIC Product reconfigurable Product with download 30 [T. Kean]

© 2001, University of Kaiserslautern 24 The History of Paradigm Shifts “Mainstream Silicon Application is switching every 10 Years” TTL µproc., memory custom standard Makimoto’s Wave ASICs, accel’s LSI, MSI ? ? “The Programmable System-on-a-Chip is the next wave“ reconfigurable Published in 1989

© 2001, University of Kaiserslautern 25 How’s next Wave ? 2007 custom standard procedural programming algorithm: variable resources: fixed Tredennick’s Paradigm Shifts hardwired algorithm: fixed resources: fixed 2007 FPGAs structural programming algorithm: variable resources: variable no further wave ! Coarse grain RAs Hartenstein’s Curve

© 2001, University of Kaiserslautern 26 The Impact of Makimoto’s Paradigm Shifts TTL µproc., memory custom standard ASICs, accel’s LSI, MSI reconfigurable Procedural personalization via RAM-based Machine Paradigm structural personalization: RAM-based before run time Dr. Makimoto: FPL 2000 keynote Software Industry’s Secret of Success Configware Success Story by new Machine Paradigm

© 2001, University of Kaiserslautern 27 The History of Paradigm Shifts “Mainstream Silicon Application is switching every 10 Years” custom standard Makimoto’s Wave TTL µproc., memory FPGAs ASICs, accel’s LSI, MSI coarse grain

© 2001, University of Kaiserslautern 28 KressArray Family generic Fabrics: a few examples Examples of 2 nd Level Interconnect: layouted over rDPU cell - no separate routing areas ! + rout-through and function rout- through only more NNports: rich Rout Resources Select Function Repertory select Nearest Neighbour (NN) Interconnect: an example rDPU Select mode, number, width of NNports Wired by Abutment