FPL 2003 - Sept. 2, 2003 Software Decelerators Eric Keller, Gordon Brebner and Phil James-Roxby Xilinx Research Labs.

Slides:



Advertisements
Similar presentations
© 2003 Xilinx, Inc. All Rights Reserved Course Wrap Up DSP Design Flow.
Advertisements

Sumitha Ajith Saicharan Bandarupalli Mahesh Borgaonkar.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
Week 1- Fall 2009 Dr. Kimberly E. Newman University of Colorado.
MEMOCODE 2007 HW/SW Co-design Contest Documentation of the submission by Eric Simpson Pengyuan Yu Sumit Ahuja Sandeep Shukla Patrick Schaumont Electrical.
Some Thoughts on Technology and Strategies for Petaflops.
1 Performed By: Khaskin Luba Einhorn Raziel Einhorn Raziel Instructor: Rivkin Ina Spring 2004 Spring 2004 Virtex II-Pro Dynamical Test Application Part.
Mahapatra-Texas A&M-Fall'001 cosynthesis Introduction to cosynthesis Rabi Mahapatra CPSC498.
Configurable System-on-Chip: Xilinx EDK
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
Chapter 1 and 2 Computer System and Operating System Overview
Logic Design Outline –Logic Design –Schematic Capture –Logic Simulation –Logic Synthesis –Technology Mapping –Logic Verification Goal –Understand logic.
State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
George Mason University ECE 448 – FPGA and ASIC Design with VHDL Finite State Machines State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts,
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
Technion Digital Lab Project Performance evaluation of Virtex-II-Pro embedded solution of Xilinx Students: Tsimerman Igor Firdman Leonid Firdman.
Mahesh Sukumar Subramanian Srinivasan. Introduction Face detection - determines the locations of human faces in digital images. Binary pattern-classification.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
Using FPGAs with Embedded Processors for Complete Hardware and Software Systems Jonah Weber May 2, 2006.
Microcontroller: Introduction
1 A survey on Reconfigurable Computing for Signal Processing Applications Anne Pratoomtong Spring2002.
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
1 DSP Implementation on FPGA Ahmed Elhossini ENGG*6090 : Reconfigurable Computing Systems Winter 2006.
Future FPGA Development Duane McDonald Digital Electronics 3.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
An Introduction Chapter Chapter 1 Introduction2 Computer Systems  Programmable machines  Hardware + Software (program) HardwareProgram.
Computer Architecture ECE 4801 Berk Sunar Erkay Savas.
Lecture 18 Lecture 18: Case Study of SoC Design ECE 412: Microcomputer Laboratory.
E0001 Computers in Engineering1 The System Unit & Memory.
Add on cards. Also known as Expansion card or interface adapter. It can be inserted into an expansion slot of a motherboard to add functionality to a.
2 Systems Architecture, Fifth Edition Chapter Goals Discuss the development of automated computing Describe the general capabilities of a computer Describe.
Principles Of Digital Design Chapter 1 Introduction Design Representation Levels of Abstraction Design Tasks and Design Processes CAD Tools.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
TRIPS – An EDGE Instruction Set Architecture Chirag Shah April 24, 2008.
Computers Are Your Future Eleventh Edition Chapter 2: Inside the System Unit Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall1.
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
Computer Design Basics
IEEE ICECS 2010 SysPy: Using Python for processor-centric SoC design Evangelos Logaras Elias S. Manolakos {evlog, Department of Informatics.
ESL and High-level Design: Who Cares? Anmol Mathur CTO and co-founder, Calypto Design Systems.
COMPUTER ORGANIZATIONS CSNB123. COMPUTER ORGANIZATIONS CSNB123 Why do you need to study computer organization and architecture? Computer science and IT.
Programming a Hyper-Programmable Architectures for Networked Systems Eric Keller and Gordon Brebner Xilinx Research Labs, USA.
Design of a Novel Bridge to Interface High Speed Image Sensors In Embedded Systems Tareq Hasan Khan ID: ECE, U of S Term Project (EE 800)
Part A Presentation Implementation of DSP Algorithm on SoC Student : Einat Tevel Supervisor : Isaschar Walter Accompanying engineer : Emilia Burlak The.
Computer Organization & Assembly Language © by DR. M. Amer.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Computer Architecture Lecture 32 Fasih ur Rehman.
Motherboard A motherboard allows all the parts of your computer to receive power and communicate with one another.
Lecture 11: FPGA-Based System Design October 18, 2004 ECE 697F Reconfigurable Computing Lecture 11 FPGA-Based System Design.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Chapter 5A Transforming Data Into Information.
PTII Model  VHDL Codegen Verification Project Overview 1.Generate VHDL descriptions for Ptolemy models. 2.Maintain bit and cycle accuracy in implementation.
Introduction to Field Programmable Gate Arrays Lecture 1/3 CERN Accelerator School on Digital Signal Processing Sigtuna, Sweden, 31 May – 9 June 2007 Javier.
WARP PROCESSORS ROMAN LYSECKY GREG STITT FRANK VAHID Presented by: Xin Guan Mar. 17, 2010.
Survey of Reconfigurable Logic Technologies
ECE 448 Lecture 6 Finite State Machines State Diagrams vs. Algorithmic State Machine (ASM) Charts.
Computer Operation. Binary Codes CPU operates in binary codes Representation of values in binary codes Instructions to CPU in binary codes Addresses in.
CoDeveloper Overview Updated February 19, Introducing CoDeveloper™  Targeting hardware/software programmable platforms  Target platforms feature.
Dynamic and On-Line Design Space Exploration for Reconfigurable Architecture Fakhreddine Ghaffari, Michael Auguin, Mohamed Abid Nice Sophia Antipolis University.
Chapter 7.2 Computer Architecture
Embedded Systems Design
Architecture & Organization 1
Chapter III Desktop Imaging Systems & Issues
Anne Pratoomtong ECE734, Spring2002
Introduction to cosynthesis Rabi Mahapatra CSCE617
Architecture & Organization 1
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
COMS 361 Computer Organization
ECE 448 Lecture 6 Finite State Machines State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts, and VHDL code ECE 448 – FPGA and ASIC Design.
Topics Bus interfaces. Platform FPGAs..
Presentation transcript:

FPL Sept. 2, 2003 Software Decelerators Eric Keller, Gordon Brebner and Phil James-Roxby Xilinx Research Labs

FPL 2003Software Decelerators 2 Talk Outline Background Software Decelerators Case Study: Finite State Machines Results Conclusions

FPL 2003Software Decelerators 3 Modern Platform FPGA High Performance Sync Dual-Port™ RAM SelectIO™-Ultra Technology Advanced FPGA Logic 18 Bit 36 Bit Embedded DSP Functionality PowerPC™ Processors 400+ MHz clock rate DCM Digital Clock Management Digitally Controlled Impedance High-speed Serial Transceivers 622 Mbps to Gbps

FPL 2003Software Decelerators 4 Hardware Accelerator Processor-Centric Algorithms executed on processor – key functions performed by hardware Goal: Increase overall performance ProcessorMemDWT JPEG2000 Tier 1 Coder RCT

FPL 2003Software Decelerators 5 Motherboard On A Chip Processor running an operating system Common board peripherals on FPGA – Ethernet MAC – SVGA controller

FPL 2003Software Decelerators 6 Logic-centric viewpoint Consistent with an interface-centric view that is appropriate for reactive systems - highly relevant for future ambient intelligence/ubiquitous computing Processors have no special status in systems, and indeed play only a secondary role as ‘function units’ Explicit ‘hardware-software co-design’ becomes lesser issue - certainly no top-level partitioning Hardware accelerators of processor-centric model are inverted and replaced by ‘software decelerators’

FPL 2003Software Decelerators 7 Software Decelerators Algorithms are executed in logic – Processor executes software to perform one or more services for programmable logic & inputs outputs * + + PPC

FPL 2003Software Decelerators 8 Motivation Emergence of platform FPGAs To increase overall system quality – by making use of services provided by processor Ease of designing a complex function Offload non time-critical logic – to achieve a better partition (e.g. saving area) Offload corner cases – e.g. in MIR IPv4 packets handled in logic, IPv6 handled in processor

FPL 2003Software Decelerators 9 Goals Overall area consumed by software decelerator should not be greater than logic counterpart Interfacing logic should consume minimal logic Interface should shield logic from processor – and vice versa Provide timing and resource usage information Implementation neutral method to capture design

FPL 2003Software Decelerators 10 Example: finite state machines Implement a general class of sequential functions that are recognizable in digital designs Processor determines next state and state outputs to meet schedule determined by logic-based system – possibility to support multiple state machines Hardware platform Software Timing report FSM decelerator generator Graphical Representation Textual Representation

FPL 2003Software Decelerators 11 Design Entry Graphical front end – e.g. StateCAD Textual intermediate representation – XML to support many design entry methods Define interface Define state

FPL 2003Software Decelerators 12 Logic-Processor Interface Rest of system doesn’t see processor signals Choice of interface – PowerPC’s native busses: PLB, OCM, DCR With only two nodes, optimizations are possible – interface logic always being addressed – No need for arbiter PowerPC

FPL 2003Software Decelerators 13 Clocking Polling/Interrupt on external clock – processing time for state must be less than clock period – processor uses polling to detect clock edges – clock edge causes an interrupt Software Generated – processor generates clock pulse using a memory mapped circuit – allows different states to take different processing time

FPL 2003Software Decelerators 14 Software Design General case is complex requiring timing analysis Assembly code generation – each state has same structure (clock/reset, equations, transitions) Execute out of cache – predictable memory accesses Accurate timing generation – count the exact number of cycles it will take for each state and transition

FPL 2003Software Decelerators 15 Results: Resource Usage *Ratio is the area of the decelerator as a percentage of area consumed by a logic implementation

FPL 2003Software Decelerators 16 Results: Performance

FPL 2003Software Decelerators 17 Conclusions Software decelerators – through example of FSM based design methodology – extendable to other functions – can provide an increased overall system quality Methodology applicable to subset of designs – achievable speeds vary with characteristics of FSM I/O takes a lot of processing time

FPL 2003Software Decelerators 18 Future Work Further study implications of logic centric model Automatic selection and synthesis of logic- processor interfaces Characteristics of hard/soft processors – e.g. I/O takes large percentage of time FSM based architectural components Domain-specific high-level design entry and tools