ECE 636 Reconfigurable Computing Lecture 1 Course Introduction Prof

Slides:



Advertisements
Similar presentations
Day - 3 EL-313: Samar Ansari. INTEGRATED CIRCUITS Integrated Circuit Design Methodology EL-313: Samar Ansari Programmable Logic Programmable Array Logic.
Advertisements

FPGA (Field Programmable Gate Array)
ECOE 560 Design Methodologies and Tools for Software/Hardware Systems Spring 2004 Serdar Taşıran.
TIE Extensions for Cryptographic Acceleration Charles-Henri Gros Alan Keefer Ankur Singla.
A Survey of Logic Block Architectures For Digital Signal Processing Applications.
Modern VLSI Design 3e: Chapter 10 Copyright  2002 Prentice Hall Adapted by Yunsi Fei ECE 300 Advanced VLSI Design Fall 2006 Lecture 24: CAD Systems &
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
Reconfigurable Computing: What, Why, and Implications for Design Automation André DeHon and John Wawrzynek June 23, 1999 BRASS Project University of California.
Extensible Processors. 2 ASIP Gain performance by:  Specialized hardware for the whole application (ASIC). −  Almost no flexibility. −High cost.  Use.
Some Thoughts on Technology and Strategies for Petaflops.
Lecture 26: Reconfigurable Computing May 11, 2004 ECE 669 Parallel Computer Architecture Reconfigurable Computing.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
SSS 4/9/99CMU Reconfigurable Computing1 The CMU Reconfigurable Computing Project April 9, 1999 Mihai Budiu
Lecture 1: Course Introduction September 3, 2013 ECE 636 Reconfigurable Computing Lecture 1 Course Introduction Prof. Russell Tessier.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
General FPGA Architecture Field Programmable Gate Array.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
ECE 465 Introduction to CPLDs and FPGAs Shantanu Dutt ECE Dept. University of Illinois at Chicago Acknowledgement: Extracted from lecture notes of Dr.
Automated Design of Custom Architecture Tulika Mitra
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
EE3A1 Computer Hardware and Digital Design
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Lecture 1: Course Introduction September 8, 2004 ECE 697F Reconfigurable Computing Lecture 1 Course Introduction Prof. Russell Tessier.
M.Mohajjel. Why? TTM (Time-to-market) Prototyping Reconfigurable and Custom Computing 2Digital System Design.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
Introduction to Field Programmable Gate Arrays (FPGAs) EDL Spring 2016 Johns Hopkins University Electrical and Computer Engineering March 2, 2016.
Reconfigurable Computing Nehir Sönmez
SUBJECT : DIGITAL ELECTRONICS CLASS : SEM 3(B) TOPIC : INTRODUCTION OF VHDL.
Introduction to the FPGA and Labs
ENG6530 Reconfigurable Computing Systems
Programmable Logic Devices
Dynamo: A Runtime Codesign Environment
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
ECE354 Embedded Systems Introduction C Andras Moritz.
ELEC 7770 Advanced VLSI Design Spring 2016 Introduction
James Coole PhD student, University of Florida Aaron Landy Greg Stitt
Adapted by Dr. Adel Ammar
Give qualifications of instructors: DAP
Instructor: Dr. Phillip Jones
Electronics for Physicists
Introduction to Reconfigurable Computing
ELEC 7770 Advanced VLSI Design Spring 2014 Introduction
Reconfigurable Computing
We will be studying the architecture of XC3000.
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Lecture 41: Introduction to Reconfigurable Computing
Dynamically Reconfigurable Architectures: An Overview
ELEC 7770 Advanced VLSI Design Spring 2012 Introduction
ELEC 7770 Advanced VLSI Design Spring 2010 Introduction
Introduction to Micro Controllers & Embedded System Design
A High Performance SoC: PkunityTM
Chapter 1 Introduction.
HIGH LEVEL SYNTHESIS.
Programmable Logic- How do they do that?
Electronics for Physicists
Give qualifications of instructors: DAP
NAND and XOR Implementation
Digital Designs – What does it take
Circuit Analysis Procedure by Dr. M
CS295: Modern Systems What Are FPGAs and Why Should You Care
Programmable logic and FPGA
Reconfigurable Computing (EN2911X, Fall07)
CS 151 Digital Systems Design Lecture 1 Course Overview
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

ECE 636 Reconfigurable Computing Lecture 1 Course Introduction Prof ECE 636 Reconfigurable Computing Lecture 1 Course Introduction Prof. Russell Tessier Give qualifications of instructors: DAP teaching computer architecture at Berkeley since 1977 Co-athor of textbook used in class Best known for being one of pioneers of RISC currently author of article on future of microprocessors in SciAm Sept 1995 RY took 152 as student, TAed 152,instructor in 152 undergrad and grad work at Berkeley joined NextGen to design fact 80x86 microprocessors one of architects of UltraSPARC fastest SPARC mper shipping this Fall

What is Reconfigurable Computing? Computation using hardware that can adapt at the logic level to solve specific problems Why is this interesting? Many applications are poorly suited to microprocessors VLSI “explosion” provides increasing resources. How can we use them? “field-programmable” devices Allows for high performance, bug fixes, and fast time-to market for a selection of applications

Background needed for this course Basic VLSI – transistors, delay models. Basic algorithms – graph algorithms, searches Computer Architecture – ALU, microprocessor Digital Design – adder, counter, etc. Topic self-contained! Reconfigurable Computing is a lot more than just devices

Course Organization Homework assignments (15%) Final project (25%) Mid-term (25%) Final exam (30%) Class participation/attendance (5%) Students expected to participate in class discussion No required text – readings will be assigned from research papers. We will use material from the three survey papers on the course web site a lot

What characterizes Reconfigurable Computing? Parallelism, specialization, hardware-level adaptation Parallelism customized to meet design objectives Logic specialized to perform specific function Functionality changed as problem requirements change

Microprocessor-based Systems (Temporal) Data Storage (Register File) A B C ALU 64 Generalized to perform many functions well. Operates on fixed data sizes. Inherently sequential Constrained even with multiple data paths.

Reconfigurable Computing if (A > B) { H = A; L = B; } else { H = B; L = A; A B Functional Unit H L Create specialized hardware for each application. Functional units optimized to perform a special task.

Example: Bubblesort (Spatial) H L H L A B A B H L H L A B H L Smallest Largest Adapt interconnect to problem. Take advantage of parallelism.

Implementation Spectrum Microprocessor Reconfigurable Hardware ASIC ASIC gives high performance at cost of inflexibility. Processor is very flexible but not tuned to the application. Reconfigurable hardware is a nice compromise.

Reconfigurable Hardware Look-up table (LUT) A B Out C D A B C D = out Each LUT operates on four one-bit inputs. Output is one data bit. Can perform any boolean function of four inputs 2 = 64K functions (4096 patterns) 2 4

Logic Element

Field-Programmable Gate Array Logic Element Tracks LE Each logic element outputs one data bit. Interconnect programmable between elements. Interconnect tracks grouped into channels.

FPGA Architecture Issues Logic Element Need to explore architectural issues. How much functionality should go in a logic element? How many routing tracks per channel? Switch “population”?

Xilinx XC4000 Cell 2 4-input look-up tables 1 3-input look-up table 2 D flip flops

Xilinx XC4000 Routing 25

Altera Stratix Logic Element

Altera Stratix Logic Array Blocks (Clusters)

Routing Connections S S Based on the switch and wire parasitic, interconnect routes can be modeled as RC networks. Other issues: Power Routability

Design abstractions specification behavior logic circuit layout register- transfer logic circuit layout English Executable program Sequential machines Logic gates transistors rectangles Throughput, design time Function units, clock cycles Literals, logic depth nanoseconds microns function Spec. level

High-level Compilers Difficult to estimate hardware resources. Some parts of program more appropriate for processor (hardware/software codesign). Compiler must parallelize computation across many resources. Engineers like to write in C rather than pushing little blocks around. for (i = 0; i<n, i++) { c[i] = a[i] + b[i] } Some success stories

Translating a Design to an FPGA RTL . C = A+B Circuit A B + C Array CAD to translate circuit from text description to physical implementation well understood. Most current FPGA designers use register-transfer level specification (allocation and scheduling) Same basic steps as ASIC design.

Circuit Compilation Technology Mapping Placement LUT Routing ? Assign a logical LUT to a physical location. Select wire segments And switches for Interconnection.

Technology Mapping: A Simple Example Made of Full Adders FA A B Co Ci S A+B = D Logic synthesis tool reduces circuit to SOP form S = ABCi + ABCi + ABCi + ABCi A A B LUT Co B LUT S Ci Ci Co = ABCi + ABCi + ABCi + ABCi

Processor + FPGA Three possibilities daughtercard Proc FPGA chip Backplane bus (e.g. PCI) 1. FPGA serves as coprocessor for data intensive applications – possible project. Proc FPGA chip 2. FPGA serves as embedded computer for low latency transfer. “Reconfigurable Functional Unit”

Processor + FPGA (cont..) 3. Processor integration RF ALU FPGA Processor FPGA logic embedded inside processor. A number of problems with 2 and 3. Process technology an issue. ALU much faster than FPGA generally. FPGA much faster than the entire processor.

Programmable System on a Chip Specialized hardware is commonly used for compute-intensive applications Coprocessors (FPU, graphics, sound, …) Accelerator boards (FPGA boards, I/O cards, …) FPGAs also perform well for many of these apps Reconfigurable logic in System-On-A-Chip RAM FPGA DSP CPU Custom Logic Analog DSP RAM CPU Custom Logic FPGA

A Success Story: Logic Emulation F F F F F F F F F Most applications don’t fit on one device. Create need for partitioning designs across many devices. Effectively a “netlist computer” Each FPGA is a logic processor interconnected in a given topology.

Dynamic Reconfiguration L What if I want to exchange part of the design in the device with another piece? Need to create architectures and software to incrementally change designs. Effectively a “configuration cache” Examples: encryption, filtering.

Classifying Reconfiguration Reconfiguration methodology Static Partially static (=partial reconfiguration) Dynamic

Types of Reconfiguration Static reconfiguration – application is not running Semi-static – different portions of application time-sliced on same hardware fabric Dynamic reconfiguration – application modified in response to changing environmental issues What is reconfigured? Processor reconfiguration – customized instruction sets, pipelining, bit width Parameter reconfiguration Control reconfiguration Can you think of other applications?

Hot Reconfigurable Computing Research Areas Determining what fixed components should go in an FPGA (multiplexers, routers, network interfaces, etc) Important new applications for reconfigurable devices (especially embedded applications and security) Better understanding the role of standard microprocessors, GPUs, and reconfigurable hardware. They all have benefits Programming environments are improving Computing in the cloud Compiler / run-time environments Broadening of programming (Altera: OpenCL, Xilinx: Vivado) Operating system management / virtual machine

Course Software: Versatile Place and Route Performs FPGA placement and routing. Written in C Runs on Linux and PCs We support execution on quark.ecs.umass.edu Generally straightforward to install and use. Estimates device sizes and performance Very widely used in FPGA research community More advanced version of VPR are available. Appropriate for use in course projects

Ideas for the Course Project Evaluate an application using a microprocessor and an FPGA Consider performance and power consumption Modify/update CAD algorithms associated with VPR Model modern FPGAs which include multiple dice. FPGA security: encryption, hardware obfuscation High-level compilation Adapt and evaluate LegUp (high-level synthesis tool) Use OpenCL compiler to target multiple platforms Topic must involve experimentation.

Reconfigurable computing relies heavily on new VLSI technology Summary Reconfigurable computing relies heavily on new VLSI technology Device architectures have matured. New features constantly considered Application development progressing at rapid pace Integration of hardware and software a difficult challenge FPGAs are a multi-billion dollar a year business Devices are much different now than 20 years ago Active area of research at UMass.