© 2007 Altera Corporation FPGA Coprocessing in Multi-Core Architectures for DSP J Ryan Kenny Bryce Mackin Altera Corporation 101 Innovation Drive San Jose,

Slides:

Advertisements

Similar presentations

AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.

Advertisements

Avalon Switch Fabric. 2 Proprietary interconnect specification used with Nios II Principal design goals – Low resource utilization for bus logic – Simplicity.

Nov COMP60621 Concurrent Programming for Numerical Applications Lecture 6 Chronos – a Dell Multicore Computer Len Freeman, Graham Riley Centre for.

OPTERON (Advanced Micro Devices). History of the Opteron AMD's server & workstation processor line 2003: Original Opteron released o 32 & 64 bit processing.

Week 1- Fall 2009 Dr. Kimberly E. Newman University of Colorado.

CS 213 Commercial Multiprocessors. Origin2000 System – Shared Memory Directory state in same or separate DRAMs, accessed in parallel Upto 512 nodes (1024.

Configurable System-on-Chip: Xilinx EDK

Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Project performed by: Naor Huri Idan Shmuel.

Cosc 2150 Current CPUs Intel and AMD processors. Notes The information is current as of Dec 5, 2014, unless otherwise noted. The information for this.

Complete CompTIA A+ Guide to PCs, 6e Chapter 2: On the Motherboard © 2014 Pearson IT Certification

HS06 on the last generation of CPU for HEP server farm Michele Michelotto 1.

Intel® 64-bit Platforms Platform Features. Agenda Introduction and Positioning of Intel® 64-bit Platforms Intel® 64-Bit Xeon™ Platforms Intel® Itanium®

© 2008 Altera Corporation—Public High-Performance Embedded Computing Workshop September 2008 Impact on High-Performance Applications: FPGA Chip Bandwidth.

Leveling the Field for Multicore Open Systems Architectures Markus Levy President, EEMBC President, Multicore Association.

GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.

Presenter MaxAcademy Lecture Series – V1.0, September 2011 Introduction and Motivation.

Processor Technology John Gordon, Peter Oliver e-Science Centre, RAL October 2002 All details correct at time of writing 09/10/02.

Basic Computer Structure and Knowledge Project Work.

1.  Project Goals.  Project System Overview.  System Architecture.  Data Flow.  System Inputs.  System Outputs.  Rates.  Real Time Performance.

Prof. JunDong Cho VADA Lab. Project.

Different CPUs CLICK THE SPINNING COMPUTER TO MOVE ON.

Advanced Processing Group ISO9001 Registered Octopus: A Multi-core implementation Export of this products is subject to U.S. export controls. Licenses.

Current Computer Architecture Trends CE 140 A1/A2 29 August 2003.

Company LOGO High Performance Processors Miguel J. González Blanco Miguel A. Padilla Puig Felix Rivera Rivas.

GBT Interface Card for a Linux Computer Carson Teale 1.

Real-Time HD Harmonic Inc. Real Time, Single Chip High Definition Video Encoder! December 22, 2004.

A+ Guide to Managing and Maintaining your PC, 6e Chapter 5 Processors and Chipsets (v0.9)

© 2005 Altera Corporation SOPC Builder: a Design Tool for Rapid System Prototyping on FPGAs Kerry Veenstra Workshop on Architecture Research using FPGA.

High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.

Essentials components in Mobo The more important part in a mobo is the chipset. It make the interconnexions between all the other parts on the mobo.

Advanced SW/HW Optimization Techniques for Application Specific MCSoC m Yumiko Kimezawa Supervised by Prof. Ben Abderazek Graduate School of Computer.

Annapolis Micro Systems, Inc. 190 Admiral Cochrane Dr., Ste 130, Annapolis, MD Web: HQ Phone: (410) HQ Fax: (410)

Reconfigurable Computing: A First Look at the Cray-XD1 Mitch Sukalski, David Thompson, Rob Armstrong, Curtis Janssen, and Matt Leininger Orgs: 8961 & 8963.

Copyright © 2009 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Principles of Parallel Programming First Edition by Calvin Lin Lawrence Snyder.

1 Unit 2: Computer Systems Session One Part One. 2 Aims: Discussion into what will be covered in this unit. Assessment Understand the basic principles.

Towards the Design of Heterogeneous Real-Time Multicore System Adaptive Systems Laboratory, Master of Computer Science and Engineering in the Graduate.

Experience Running Embedded EPICS on NI CompactRIO Eric Björklund Dolores Baros Scott Baily.

Virtualisation Front Side Buses SMP systems COMP Jamie Curtis.

Hyper Threading Technology. Introduction Hyper-threading is a technology developed by Intel Corporation for it’s Xeon processors with a 533 MHz system.

Workstations CPU David Josué Morales José Luis Micheri Marie Muñoz The Dinosaurs Electrical and Computer Engineering Department August 25, 2004.

XTCA projects (HW and SW) related to ATLAS LAr xTCA interest group - CERN 07/03/2011 Nicolas Letendre – Laurent Fournier - LAPP.

Design Criteria and Proposal for a CBM Trigger/DAQ Hardware Prototype Joachim Gläß Computer Engineering, University of Mannheim Contents –Requirements.

Network On Chip Platform

The Components of the System Unit

UClinux console (HyperTerminal) Memec V2MB1000 prototyping board running uClinux on embedded Xilinx® MicroBlaze™ processor Development system with Xilinx.

Ethernet Bomber Ethernet Packet Generator for network analysis

Succeeding with Technology Chapter 2 Hardware Designed to Meet the Need The Digital Revolution Integrated Circuits and Processing Storage Input, Output,

Content Project Goals. Workflow Background. System configuration. Working environment. System simulation. System synthesis. Benchmark. Multicore.

Understanding Parallel Computers Parallel Processing EE 613.

GBT-FPGA Interface Carson Teale. GBT New radiation tolerant ASIC for bidirectional 4.8 Gb/s optical links to replace current timing, trigger, and control.

Advanced SW/HW Optimization Techniques for Application Specific MCSoC m Yumiko Kimezawa Supervised by Prof. Ben Abderazek Graduate School of Computer.

WorldScape Defense Company, L.L.C. Company Proprietary Slide 1 An Ultra-High Performance Scalable Processing Architecture for HPC and Embedded Applications.

Computer Parts - Hardware There are many parts that work together to make a computer work.

Inside a PC Exploring the CPU. CPU: Objectives Students will identify and explain the function of the CPU Students will research various CPUs so they.

PROCESSOR Ambika | shravani | namrata | saurabh | soumen.

Central Processing Unit (CPU) The Computer’s Brain.

Hardware Architecture

Multi-Core CPUs Matt Kuehn. Roadmap ► Intel vs AMD ► Early multi-core processors ► Threads vs Physical Cores ► Multithreading and Multi-core processing.

Intel and AMD processors

CPU Central Processing Unit

Lab 0: Familiarization with Equipment and Software

Altera Stratix II FPGA Architecture

A Streaming FFT on 3GSPS ADC Data using Core Libraries and DIME-C

Central Processing Unit- CPU

Appro Xtreme-X Supercomputers

Unit 2 Computer Systems HND in Computing and Systems Development

Unit 2: Computer Systems

CPU Central Processing Unit

NetPerL Seminar Hardware/Software Co-Design

Impact on High-Performance Applications: FPGA Chip Bandwidth at 40 nm

Presentation transcript:

© 2007 Altera Corporation FPGA Coprocessing in Multi-Core Architectures for DSP J Ryan Kenny Bryce Mackin Altera Corporation 101 Innovation Drive San Jose, CA High Performance Embedded Computing Workshop September 2007

Algorithmic Gains with FPGA CoProcessing Many Off-Processor Acceleration Functions have been benchmarked for FPGA CoProcessing Acceleration Depends on Algorithm and Logic Division Process

C source code SOPC Builder Impulse C module HyperTransport interface Single and double-precision IEEE 754 floating-point arithmetic supported in FPGA (XDI Library) and automatically inferred from code All RTL for FPGA complete Altera project files XDI local memory interface Compile C to HDL Make script-driven calls to Altera tools Main loop AMD:16-Lane HT, 400Mhz DDR, 3.2 GB/s Example—Financial Solution (50x +) Main loop ESL: first step: PSP for ImpulseC All components connected via Avalon ™ fabric Intel: FSB at 1066 Mhz, ~ 8.5 GB/s Intel Module AMD Module

Solution Examples (Xtreme Data Dual & Notional Quad Architecture)  FPGA uses all motherboard resources meant for CPU:  HyperTransport Links, Memory interface, power supply, heat-sink  Usable in rack-mount or high- density, “blade” server systems, where PC boards don’t work  Process is Scalable for Quad Processors  Cores Available to interface with AMD HyperTransport  High Bandwidth, Low Latency  Applications and Benchmarks Coming Soon! Intel Xeon AMD Opteron AMD Module via HyperTransport Intel Module connects via Front Side Bus