February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.

Slides:



Advertisements
Similar presentations
FPGA (Field Programmable Gate Array)
Advertisements

A Search Memory Substrate for High Throughput and Low Power Packet Processing Sangyeun Cho, Michel Hanna and Rami Melhem Dept. of Computer Science University.
A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
A Survey of Logic Block Architectures For Digital Signal Processing Applications.
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
PLD Technology Basics. Basic PAL Architecture DQ Q CLK OE Fuse.
Reconfigurable Computing: What, Why, and Implications for Design Automation André DeHon and John Wawrzynek June 23, 1999 BRASS Project University of California.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Introduction to Reconfigurable Computing CS61c sp06 Lecture (5/5/06) Hayden So.
Lecture 26: Reconfigurable Computing May 11, 2004 ECE 669 Parallel Computer Architecture Reconfigurable Computing.
CS294-6 Reconfigurable Computing Day 6 September 10, 1998 Comparing Computing Devices.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
Evolution of implementation technologies
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
Trends toward Spatial Computing Architectures Dr. André DeHon BRASS Project University of California at Berkeley.
Embedded DRAM for a Reconfigurable Array S.Perissakis, Y.Joo 1, J.Ahn 1, A.DeHon, J.Wawrzynek University of California, Berkeley 1 LG Semicon Co., Ltd.
Penn ESE Spring DeHon 1 FUTURE Timing seemed good However, only student to give feedback marked confusing (2 of 5 on clarity) and too fast.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
Router Architectures An overview of router architectures.
General FPGA Architecture Field Programmable Gate Array.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Introduction to Computing: Lecture 4
Reconfigurable Computing. Lect-02.2 Course Schedule Introduction to Reconfigurable Computing FPGA Technology, Architectures, and Applications FPGA Design.
Reconfigurable Devices Presentation for Advanced Digital Electronics (ECNG3011) by Calixte George.
A RISC ARCHITECTURE EXTENDED BY AN EFFICIENT TIGHTLY COUPLED RECONFIGURABLE UNIT Nikolaos Vassiliadis N. Kavvadias, G. Theodoridis, S. Nikolaidis Section.
Amalgam: a Reconfigurable Processor for Future Fabrication Processes Nicholas P. Carter University of Illinois at Urbana-Champaign.
Modern VLSI Design 4e: Chapter 6 Copyright  2008 Wayne Wolf Topics Memories: –ROM; –SRAM; –DRAM; –Flash. Image sensors. FPGAs. PLAs.
SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.
Frank Casilio Computer Engineering May 15, 1997 Multithreaded Processors.
J. Christiansen, CERN - EP/MIC
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR FPGA Fabric n Elements of an FPGA fabric –Logic element –Placement –Wiring –I/O.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
Basic Sequential Components CT101 – Computing Systems Organization.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 7: January 24, 2003 Instruction Space.
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
“Politehnica” University of Timisoara Course No. 2: Static and Dynamic Configurable Systems (paper by Sanchez, Sipper, Haenni, Beuchat, Stauffer, Uribe)
EE3A1 Computer Hardware and Digital Design
Orange Coast College Business Division Computer Science Department CS 116- Computer Architecture Multiprocessors.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
Development of Programmable Architecture for Base-Band Processing S. Leung, A. Postula, Univ. of Queensland, Australia A. Hemani, Royal Institute of Tech.,
M U N - February 17, Phil Bording1 Computer Engineering of Wave Machines for Seismic Modeling and Seismic Migration R. Phillip Bording February.
CPLD Vs. FPGA Positioning Presentation
February 12, 1999 Architecture and Circuits: 1 Interconnect-Oriented Architecture and Circuits William J. Dally Computer Systems Laboratory Stanford University.
M.Mohajjel. Why? TTM (Time-to-market) Prototyping Reconfigurable and Custom Computing 2Digital System Design.
Full and Para Virtualization
FPGA-Based System Design: Chapter 1 Copyright  2004 Prentice Hall PTR Moore’s Law n Gordon Moore: co-founder of Intel. n Predicted that number of transistors.
What is a Microprocessor ? A microprocessor consists of an ALU to perform arithmetic and logic manipulations, registers, and a control unit Its has some.
Specialized Virtual Configurable Arrays Dominique Lavenier - Frederic Raimbault IRISA Rennes, France UBS Vannes, France
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
Caltech CS184 Winter DeHon CS184a: Computer Architecture (Structure and Organization) Day 4: January 15, 2003 Memories, ALUs, Virtualization.
Lecture 17: Dynamic Reconfiguration I November 10, 2004 ECE 697F Reconfigurable Computing Lecture 17 Dynamic Reconfiguration I Acknowledgement: Andre DeHon.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
FPGA Technology Overview Carl Lebsack * Some slides are from the “Programmable Logic” lecture slides by Dr. Morris Chang.
Lecture 4: Contrasting Processors: Fixed and Configurable September 20, 2004 ECE 697F Reconfigurable Computing Lecture 4 Contrasting Processors: Fixed.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #22 – Multi-Context.
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
Architecture & Organization 1
Architecture & Organization 1
Chapter 1 Introduction.
ESE534: Computer Organization
Presentation transcript:

February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and Computer Science Ohio University

February 12, 1998 Aman Sareen 2 What’s going to be covered ?? Part 1 Technology Trends Application Outlook Some Developed Reconfigurable Engines Applications of Reconfigurable Logic Common Objectives of Reconfigurable Devices Limitations of the Current Systems

February 12, 1998 Aman Sareen 3 What’s going to be covered ?? (cont.) Part 2 Uniform Computational Array Model FPGA SIMD Arrays Hybrid Arrays DPGA Applications Benefits DPGA Prototype Highlights Architecture Implementation

February 12, 1998 Aman Sareen 4 What’s going to be covered ?? (cont...) Part 3 DPGA Coupled Processor Applications Costs and Benefits of Reconfiguration Challenges Conclusion

February 12, 1998 Aman Sareen 5 Technology Trends What's going on in the industry?? Operational performance of microprocessors is increasing by 60% each year. More and more transistors (25% increase per year) on a single chip. 12 million transistors on a single chip are estimated by the end of the century. Disadvantages ?? High performance is not we get always. Cost ineffective. Risks overspecialization. Reduced volume utilization per design investment. So what do we do ??=>Reconfigurable Design What does it do ?? Application acceleration. Implement system specific functions.

February 12, 1998 Aman Sareen 6 Application Outlook There’s always a scope of additions/modifications So what do we do ??=>Reconfigurable Design What does it do ?? It allows applications to specialize the hardware.

February 12, 1998 Aman Sareen 7 Some Developed Reconfigurable Engines PRISM ( Processor Reconfiguration through Instruction-Set Metamorphosis) built by Athanas and Silverman. * couples a programmable element with a microprocessor. * each application synthesizes new processor instructions for acceleration. CM-2 built at the Supercomputing Research Center by Cuccaro and Reese. * the processor is augmented with reconfigurable logic to perform common operations. SPLASH built at the Supercomputing Research Center. * used in genome sequence matching.

February 12, 1998 Aman Sareen 8 Applications of Reconfigurable Logic Binary Operations. Arithmetic. Encryption/Decryption/Compression. Sequence and string matching. Sorting. Physical system simulation. Video and image processing.

February 12, 1998 Aman Sareen 9 Common Objectives in Reconfigurable Applications High performance. Clear potential for application acceleration. Exploring bit-level parallel computation. High performance through parallelism. Customize data paths.

February 12, 1998 Aman Sareen 10 Limitations of the Current Systems Low Bandwidth and High Latency Interface Expected acceleration not achievable. Prevents close cooperation between fixed and reconfigurable logic circuits. Expensive. Limits throughput. High Reconfiguration Overhead Single configuration must be maintained throughout an application. Multitasking/Time sharing not possible.

February 12, 1998 Aman Sareen 11 Unified Computational Array Model Array Element Computational Unit Inputs from local state or from other array elements Outputs to local state or to other array elements Instruction Computational Block of AE

February 12, 1998 Aman Sareen 12 Unified Computational Array Model Lookup Models for AE Computational Unit Lookup Table (Memory) Inputs from local state or from other array elements Instruction Outputs to local state or to other array elements Data Outputs Address Inputs Instruction = Memory Programming Outputs to local state or to other array elements Data Outputs Inputs from local state or from other array elements Address Inputs Lookup Table (Memory)

February 12, 1998 Aman Sareen 13 Ideally, different instruction for each AE on each computational cycle Drawback: Instruction distribution resource requirement increases. Instruction bandwidth becomes unmanageable. P * log 2 (N f ) t cycle I BW = Unified Computational Array Model Instruction Distribution P = 100, N f = 64, Operational Freq. = 10 MHz I BW => 6 Gbits/sec

February 12, 1998 Aman Sareen 14 Unified Computational Array Model Weakening Instruction Distribution SIMD Array Global Instruction (common to all elements in array) FPGA Instruction / AE Uniform in time Slow programming phase SIMD Array Instruction / cycle Uniform in space Array Element Computational Unit Inputs from local state or from other array elements Outputs to local state or to other array elements Instruction FPGA Static Instruction ( distinct for each array element efficiently constant during operation)

February 12, 1998 Aman Sareen 15 FPGA v/s SIMD Computation FPGA Fixed Function in Time Spatially Varying Computation Bit-Parallel Computation Build Computation Spatially * Low-latency SIMD Array Operation Varies in Time Homogenous Computation in Space Bit-Serial Computation Build Computation in Time * High Throughput on Homogenous data

February 12, 1998 Aman Sareen 16 Dynamically Programmable Gate Arrays Hybrid Model Multiple Context FPGA Broadcast a Context Identifier Indirect Instruction Lookup Features: Rapid Context Switch Exploits local, on-chip Bandwidth Spatially and Temporally Varying Computation High Logic Density Reuse Gates and Wires in Time

February 12, 1998 Aman Sareen 17 Dynamically Programmable Gate Arrays Configurable Instruction-Store View of DPGA AE Computational Unit (Lookup Table) Inputs from local state or from other array elements Outputs to local state or to other array elements Data Outputs Address Inputs Data Outputs Address Inputs Instruction Store (Lookup Table) Configurational Unit function is configured by Instruction Store output Programming may differ for each array element Global Context Identifier (common to all elements) Instruction

February 12, 1998 Aman Sareen 18 Dynamically Programmable Gate Arrays Applications Rapid Context Switch FPGA Time-Slice Computation Temporal Pipelining Operation Cache Processor Assistance Multi-Stream SIMD Boundary Condition handling Virtual Cells

February 12, 1998 Aman Sareen 19 DPGA Prototype - Highlights 4 on-chip configuration contexts DRAM configuration cells Automatic refresh of dynamic memory elements Non-intrusive background loading Wide bus architecture for high-speed context loading Two-level routing architecture

February 12, 1998 Aman Sareen 20 DPGA Prototype - Overview

February 12, 1998 Aman Sareen 21 DPGA Prototype - Context Memory

February 12, 1998 Aman Sareen 22 DPGA Prototype - Array Element

February 12, 1998 Aman Sareen 23 DPGA Prototype - Local Interconnect

February 12, 1998 Aman Sareen 24 DPGA Prototype - Subarray Interconnect

February 12, 1998 Aman Sareen 25 DPGA Prototype - Areas 3 metal, 1µ drawn 0.85µ effective CMOS process

February 12, 1998 Aman Sareen 26 DPGA Prototype - Area Percentages

February 12, 1998 Aman Sareen 27 DPGA Prototype - Estimated Timings t cycle = t mem + n l * t lut + n x * t xbar

February 12, 1998 Aman Sareen 28 DPGA-Coupled Processor Applications General-Purpose Workstations and Personal Computers. Special-Purpose Computing Machines. Embedded Systems. Multiprocessor Systems

February 12, 1998 Aman Sareen 29 Costs and Benefits of Reconfiguration Specialized design limits range of application. Moving exception handling into reconfigurable logic. * Feature Interaction. * Migrating critical control of fixed resources to reconfigurable logic

February 12, 1998 Aman Sareen 30 Challenges Processor reconfigurable logic interfacing. Grain Size. Area and Pin allocation. Multitasking and state interaction.

February 12, 1998 Aman Sareen 31 Conclusion Prototype demonstrates that efficient DPGAs can be implemented DPGAs allow computation to vary both spatially and temporally DPGAs require no additional bandwidth Both bit-parallel and bit-serial computation in a single array structure Higher performance Higher flexibility Lower part count Microprocessors with tightly integrated, rapidly reconfigurable logic promise to be prime commodity building block.