Reconfigurable Computing MONARCH/MCHIP

Slides:



Advertisements
Similar presentations
Optimization of Parallel Task Execution on the Adaptive Reconfigurable Group Organized Computing System Presenter: Lev Kirischian Department of Electrical.
Advertisements

Vectors, SIMD Extensions and GPUs COMP 4611 Tutorial 11 Nov. 26,
Routers and Routing Basics WANs And Routers. Intro To WANs.
Computer Science and Engineering Laboratory, Transport-triggered processors Jani Boutellier Computer Science and Engineering Laboratory This.
Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
Spring 2008, Jan. 14 ELEC / Lecture 2 1 ELEC / Computer Architecture and Design Spring 2007 Introduction Vishwani D. Agrawal.
MPI in uClinux on Microblaze Neelima Balakrishnan Khang Tran 05/01/2006.
Reconfigurable Application Specific Computers RASCs Advanced Architectures with Multiple Processors and Field Programmable Gate Arrays FPGAs Computational.
ECE 232 L2 Basics.1 Adapted from Patterson 97 ©UCBCopyright 1998 Morgan Kaufmann Publishers ECE 232 Hardware Organization and Design Lecture 2 Computer.
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
Presenter MaxAcademy Lecture Series – V1.0, September 2011 Introduction and Motivation.
COMPUTER ORGANIZATIONS CSNB123 May 2014Systems and Networking1.
An Energy-Efficient Reconfigurable Multiprocessor IC for DSP Applications Multiple programmable VLIW processors arranged in a ring topology –Balances its.
Computer Architecture ECE 4801 Berk Sunar Erkay Savas.
Fall 2015, Aug 17 ELEC / Lecture 1 1 ELEC / Computer Architecture and Design Fall 2015 Introduction Vishwani D. Agrawal.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
PetaScale Execution Time Analysis Architecture/VLSI Chip Floorplan Monarch Chip Overview Computational Sciences Division Bob Lucas – Director Poster Participants:
The Morphware Stable Interface: A Software Framework for Polymorphous Computing Architectures D. Campbell 1, D. Cottel 2, R. Judd 2, K. MacKenzie 3, M.
2015/10/22\course\cpeg323-08F\Final-Review F.ppt1 Midterm Review Introduction to Computer Systems Engineering (CPEG 323)
COMPUTER ORGANIZATIONS CSNB123. COMPUTER ORGANIZATIONS CSNB123 Why do you need to study computer organization and architecture? Computer science and IT.
© 2004 Mercury Computer Systems, Inc. FPGAs & Software Components Graham Bardouleau & Jim Kulp Mercury Computer Systems, Inc. High Performance Embedded.
Computer Organization and Design Computer Abstractions and Technology
80-Tile Teraflop Network-On- Chip 1. Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance.
Parallel Programming on the SGI Origin2000 With thanks to Igor Zacharov / Benoit Marchand, SGI Taub Computer Center Technion Moshe Goldberg,
HPEC2002_Session1 1 DRM 11/11/2015 MIT Lincoln Laboratory Session 1: Novel Hardware Architectures David R. Martinez 24 September 2002 This work is sponsored.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
XStream: Rapid Generation of Custom Processors for ASIC Designs Binu Mathew * ASIC: Application Specific Integrated Circuit.
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
COARSE GRAINED RECONFIGURABLE ARCHITECTURES 04/18/2014 Aditi Sharma Dhiraj Chaudhary Pruthvi Gowda Rachana Raj Sunku DAY
Copyright  2005 SRC Computers, Inc. ALL RIGHTS RESERVED Overview.
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
MICROOCESSORS AND MICROCONTROLLER:
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
Spring 2016, Jan 13 ELEC / Lecture 1 1 ELEC / Computer Architecture and Design Spring 2016 Introduction Vishwani D. Agrawal.
(A) Approved for public release; distribution is unlimited. Experience and results porting HPEC Benchmarks to MONARCH Lloyd Lewins & Kenneth Prager Raytheon.
Page 1 Computer Architecture and Organization 55:035 Final Exam Review Spring 2011.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
HPEC-1 SMHS 7/7/2016 MIT Lincoln Laboratory Focus 3: Cell Sharon Sacco / MIT Lincoln Laboratory HPEC Workshop 19 September 2007 This work is sponsored.
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
Computer Organization and Architecture Lecture 1 : Introduction
PROGRAMMABLE LOGIC CONTROLLERS SINGLE CHIP COMPUTER
Dynamo: A Runtime Codesign Environment
World’s First Polymorphic Computer – MONARCH
CMSC 611: Advanced Computer Architecture
Parallel computer architecture classification
Rough Schedule 1:30-2:15 IRAM overview 2:15-3:00 ISTORE overview break
Embedded Systems Design
Presented by: Tim Olson, Architect
SmartCell: A Coarse-Grained Reconfigurable Architecture for High Performance and Low Power Embedded Computing Xinming Huang Depart. Of Electrical and Computer.
Session 4: Reconfigurable Computing
A Quantitative Analysis of Stream Algorithms on Raw Fabrics
Graphics Processing Unit
CS703 - Advanced Operating Systems
GSP 215 Competitive Success-- snaptutorial.com
GSP 215 Education for Service-- snaptutorial.com
GSP 215 Teaching Effectively-- snaptutorial.com
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Compiler Back End Panel
Computer Organization
Compiler Back End Panel
Operating Systems Chapter 5: Input/Output Management
Introduction and History of Cray Supercomputers
Course Outline for Computer Architecture
Chip&Core Architecture
Computer Architecture
ADSP 21065L.
What Are Performance Counters?
ELEC / Computer Architecture and Design Fall 2014 Introduction
Cluster Computers.
CIS 6930: Chip Multiprocessor: Parallel Architecture and Programming
Presentation transcript:

Reconfigurable Computing MONARCH/MCHIP High Efficiency Embeddable TeraFlops Polymorphous Computing Architecture Mike Vahey / John Granacki Raytheon / USC-ISI September 29, 2004 MOrphable Networked micro-ARCHitecture Exogi

MONARCH System on a Chip RISC, DRAM, Computing Array, Streaming I/O ED R P RIO Memory Interface CM ROM Port DIFLs XPIRX Polymorphous Architecture Multiple programming modes Reconfigurable, streaming DF RISC scalar RISC SIMD (Altivec like) 6 RISC processors Reconfigurable Computing 96 adders fixed and float 96 multipliers 124 dual port memories 248 address generators 12 MBytes on chip DRAM 14 DMA engines RapidIO interface 20 DIFL ports (1.3 GB/s ea) Power 8-50 W (nominal) Throughput 64 GFLOPS peak DIFL =Differential IFL Alternative to ASICS or custom hardware Demonstrated for RADAR, COM, EO Late algorithm freeze – retains programmability Energy efficiency: 3-6 GFLOPS/W

Application Development Environment & Workstation Multiple Computing Modes adapt to application needs: 1) RISC Scalar, 2) Wide Word, 3) Reconfigurable Data Flow Application SCE Application Development Station Compiler Simulator Component SW Libraries HLC Exogi RISC/WW LLC Application Dataflow LLC Machine Model LLC Metadata Assembler Assembler / Router HLC Metadata Build SCE Components Component Metadata RISC Binary Dataflow Binary Application Repository Component Repository Static Linker/Loader SCE/RTEMS

MONARCH Performance on Lincoln Lab Benchmark Suite Near Peak Performance with balanced Add/Multiply 64 GFLOPS 58 39 33 Throughput GFLOPS Throughput for Kernels Coded 6.30E+9 4.30E+9 3.26E+9 000.00E+0 1.00E+9 2.00E+9 3.00E+9 4.00E+9 5.00E+9 6.00E+9 7.00E+9 8.00E+9 FIR SVD CFAR Kernel Throughput (Flop/s/watts) FIR SVD Kernel CFAR 6.3 4.3 GFLOPS/Watt 3.2 FIR SVD Kernel CFAR