DRRA Dynamically Reconfigurable Resource Array

Slides:



Advertisements
Similar presentations
FPGA (Field Programmable Gate Array)
Advertisements

DSPs Vs General Purpose Microprocessors
Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
Implementation Approaches with FPGAs Compile-time reconfiguration (CTR) CTR is a static implementation strategy where each application consists of one.
Altera FLEX 10K technology in Real Time Application.
An Overview of Software-Defined Network Presenter: Xitao Wen.
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
Programmable logic and FPGA
An Overview of Software-Defined Network
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Chapter 6 Memory and Programmable Logic Devices
Dynamic Hardware Software Partitioning A First Approach Komal Kasat Nalini Kumar Gaurav Chitroda.
An Overview of Software-Defined Network Presenter: Xitao Wen.
Lecture 12 Today’s topics –CPU basics Registers ALU Control Unit –The bus –Clocks –Input/output subsystem 1.
Embedded Systems Design ICT Embedded System What is an embedded System??? Any IDEA???
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
1 Hardware Support for Collective Memory Transfers in Stencil Computations George Michelogiannakis, John Shalf Computer Architecture Laboratory Lawrence.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
REXAPP Bilal Saqib. REXAPP  Radio EXperimentation And Prototyping Platform Based on NOC  REXAPP Compiler.
A RISC ARCHITECTURE EXTENDED BY AN EFFICIENT TIGHTLY COUPLED RECONFIGURABLE UNIT Nikolaos Vassiliadis N. Kavvadias, G. Theodoridis, S. Nikolaidis Section.
TRIPS – An EDGE Instruction Set Architecture Chirag Shah April 24, 2008.
Automated Design of Custom Architecture Tulika Mitra
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
Frank Casilio Computer Engineering May 15, 1997 Multithreaded Processors.
J. Christiansen, CERN - EP/MIC
Eng.Samra Essalaimeh Philadelphia University 2013/ nd Semester PIC Microcontrollers.
Pipes & Filters Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
EE3A1 Computer Hardware and Digital Design
Chapter 4 MARIE: An Introduction to a Simple Computer.
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
A Systematic Approach to the Design of Distributed Wearable Systems Urs Anliker, Jan Beutel, Matthias Dyer, Rolf Enzler, Paul Lukowicz Computer Engineering.
COARSE GRAINED RECONFIGURABLE ARCHITECTURES 04/18/2014 Aditi Sharma Dhiraj Chaudhary Pruthvi Gowda Rachana Raj Sunku DAY
Development of Programmable Architecture for Base-Band Processing S. Leung, A. Postula, Univ. of Queensland, Australia A. Hemani, Royal Institute of Tech.,
Axel Jantsch 1 Networks on Chip Axel Jantsch 1 Shashi Kumar 1, Juha-Pekka Soininen 2, Martti Forsell 2, Mikael Millberg 1, Johnny Öberg 1, Kari Tiensurjä.
Computer Organization CDA 3103 Dr. Hassan Foroosh Dept. of Computer Science UCF © Copyright Hassan Foroosh 2002.
UltraSPARC III Hari P. Ananthanarayanan Anand S. Rajan.
A Programmable Single Chip Digital Signal Processing Engine MAPLD 2005 Paul Chiang, MathStar Inc. Pius Ng, Apache Design Solutions.
Graphical Design Environment for a Reconfigurable Processor IAmE Abstract The Field Programmable Processor Array (FPPA) is a new reconfigurable architecture.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
PART 4: (1/2) Central Processing Unit (CPU) Basics CHAPTER 12: P ROCESSOR S TRUCTURE AND F UNCTION.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #23 – Function.
Introduction to Computer Organization Pipelining.
Fast VLSI Implementation of Sorting Algorithm for Standard Median Filters Hyeong-Seok Yu SungKyunKwan Univ. Dept. of ECE, Vada Lab.
Data and Computer Communications Eighth Edition by William Stallings Chapter 15 – Local Area Network Overview.
Introduction to Field Programmable Gate Arrays (FPGAs) EDL Spring 2016 Johns Hopkins University Electrical and Computer Engineering March 2, 2016.
VLSI SP Course 2001 台大電機吳安宇 1 Why Systolic Architecture ? H. T. Kung Carnegie-Mellon University.
System on a Programmable Chip (System on a Reprogrammable Chip)
Reconfigurable Computing1 Reconfigurable Computing Part II.
By Abhishek.S 8 th Sem,CS Under the guidance of Mrs. Annapurna B.E, MTech Lecturer of NIE.
Programmable Hardware: Hardware or Software?
System-on-Chip Design Homework Solutions
Chapter 4 Objectives Learn the components common to every modern computer system. Be able to explain how each component contributes to program execution.
Chap 7. Register Transfers and Datapaths
Embedded Systems Design
NOCs: Past, Present and Future
Architecture & Organization 1
CGRA Express: Accelerating Execution using Dynamic Operation Fusion
CDA 3101 Spring 2016 Introduction to Computer Organization
Parallel and Multiprocessor Architectures
Architecture & Organization 1
Characteristics of Reconfigurable Hardware
A High Performance SoC: PkunityTM
CprE / ComS 583 Reconfigurable Computing
Arithmetic Circuits.
ADSP 21065L.
Chapter 4 The Von Neumann Model
Presentation transcript:

DRRA Dynamically Reconfigurable Resource Array Ahmed Hemani Dept. Of ES, School of ICT, KTH Kista Email: hemani@kth.se Website: www.it.kth.se/~hemani

Dynamically Reconfigurable Resource Array DRRA Dynamically Reconfigurable Resource Array ASIC like arbitrary parallelism, local and hierarchical control Software like flexibility Regular, seamless topology Easy mapping Perfect energy and performance prediction Full custom will compensate the reconfiguration overheads Private execution environments Minimal movement of data Move Logic Not Data Vectorising, Symbolic Assembler under development C/Matlab compiler to be developed

3. DRRA System Architecture DRRA System Concept + Reconfigurable Arithmetic Resource Pool . . . Reconfigurable Control Logic Resource Pool Register File Pool Protocol Processor Pool 3D Distributed Memory Pool Application Processor - RISC Run Time Management System Processor -RISC External Data Exchange Interconnect Control and Configuration Interconnect External IF Manager Resource Application Runtime Management System Power Management Application Controller (SW) Protocol Processing Layers Controller (SW+CW) Algorithm Datapath Physical Layer Controllers (HW) Modem/Codec etc. Controller Tx/Encode etc. Ctrlr Rx/Decode etc. Ctrlr Applications run concurrently in their private execution environments in DRRA Application 1 Application 2 Application 3 Private execution environments for three applications look at http://web.it.kth.se/~hemani/DRRA%20Summary.pdf 3. DRRA System Architecture

DRRA PHY Layer Fabric 12 16b buses 7 X 2 16b buses Regiser File 16b X 64 2 Read , 2 Write Ports Burst Mode, Shift and Circular Buffer Mode Addrs Gen. Unit Context Memory Morphable DPU 16 bit, 32 bit accumulator 4 Inputs, 2 Outputs MAC, Butterfly, 5 stage pipeline ADD/SUB tree Sum of Difference, Difference of Sum LFSR, Counter, Comparators Windowed Truncation, Saturation Context Memory Sequencer Simple sequential flow Conditional and Counter based Loop and branching Controls Register File, mDPU and Interconnect switches Interconnect 3 hop, Sliding window Fully connected Segmented 7 hop Wires Circuit Switchd Network Local Config Memory Fabric Scalabe, Regular Topology Interfaces to Distributed Memory and Central Controller

Initial Results Layout of 7X2 mDPUs, Regfiles and Sequencers, Corresponding to the fabric shown on the previous slide 90 nm, 720 MHz 64 point FFT – radix -2 DIT ~50 nJ, ~300 ns. Unoptimised N.B. 4 mDPUs and 4 Regfiles used 90 nm, 720 MHz 11 tap FIR– Symmetric ~300 pJ, ~6 ns. Unoptimised N.B. 1 mDPU and 1 Regfile used Long term goal is to come very close to ASIC with full custom datapath

Vectorising Symbolic Assembler N is the order of the filter. M is the degree of parallelism All serial parallel solutions are concisely captured by this pattern

The mapping for N=101, M = 7 x0 x6 x100 x94 x7 x13 x93 x87 x14 x20 x86 + × C7-C13 C0-C6 C14-C20 C21-C27 C28-C34 C35-C42 C43-C50 refi_0_0 refi_0_1 refi_0_2 refi_0_3 refi_0_4 refi_0_5 refi_0_6 refi_1_0 refi_1_1 refi_1_2 refi_1_3 refi_1_4 refi_1_5 refi_1_6 p0 p1 p2 p3 p4 p5 p6 adderTree4 newSample convSum

Protocol Processor Concept API To Higher Layer Constants Registers Frag / Defrag Error Chk Encryp/ Decryp Memory Bit Field Analysis Control and Timing API To Lower Layer For explaination look at http://web.it.kth.se/~hemani/DRRA%20Summary.pdf Go to section 5: Protocol Processor Architecture

REXAPP Radio Experimentation & Prototyping Platform Baseband Rx Protocol Processing Layers Application Layer Baseband Tx RF/Analog Tx RF/Analog Rx Configurable Chanel Model Control, Configuration, Debug and Monitoring Control, Configuration, Debug and Monitoring Resources Host Interface Configurable RF/Analog Impairment Models High Capacity, High Bandwidth Storage

Vision for the future Don’t stop dreaming ! RF/Analog/Sensors/peripherals Logic Tile - Giga Gates Volatile Memory - GigaBytes Non-Volatile Memory - TeraBytes Don’t stop dreaming !