SAR-ATR/FOA Compiler for ACS SLAAC Retreat, March ‘99 Brad L. Hutchings Configurable Computing Lab Brigham Young University.

Slides:



Advertisements
Similar presentations
Implementation Approaches with FPGAs Compile-time reconfiguration (CTR) CTR is a static implementation strategy where each application consists of one.
Advertisements

1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
An Introduction to Reconfigurable Computing Mitch Sukalski and Craig Ulmer Dean R&D Seminar 11 December 2003.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
The Evolution of RISC A Three Party Rivalry By Jenny Mitchell CS147 Fall 2003 Dr. Lee.
Reconfigurable Computing: What, Why, and Implications for Design Automation André DeHon and John Wawrzynek June 23, 1999 BRASS Project University of California.
Introduction to Reconfigurable Computing CS61c sp06 Lecture (5/5/06) Hayden So.
Lecture 2: Field Programmable Gate Arrays I September 5, 2013 ECE 636 Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays I.
Configurable System-on-Chip: Xilinx EDK
Reconfigurable Computing S. Reda, Brown University Reconfigurable Computing (EN2911X, Fall07) Lecture 08: RC Principles: Software (1/4) Prof. Sherief Reda.
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.
Introduction to FPGA and DSPs Joe College, Chris Doyle, Ann Marie Rynning.
Face-Recognition In Intelligent Sureillence System Feng Zhao 2007 Nov.
General FPGA Architecture Field Programmable Gate Array.
1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.
SLAAC Hardware Status Brian Schott Provo, UT September 1999.
B212/MAPLD 2005 Craven1 Configurable Soft Processor Arrays Using the OpenFire Processor Stephen Craven Cameron Patterson Peter Athanas Configurable Computing.
1 © FASTER Consortium Catalin Ciobanu Chalmers University of Technology Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration.
Reconfigurable Devices Presentation for Advanced Digital Electronics (ECNG3011) by Calixte George.
Orchestration by Approximation Mapping Stream Programs onto Multicore Architectures S. M. Farhad (University of Sydney) Joint work with Yousun Ko Bernd.
University of Michigan Electrical Engineering and Computer Science 1 Extending Multicore Architectures to Exploit Hybrid Parallelism in Single-Thread Applications.
ACOE255 – Microprocessor Architecture
Softcore Vector Processor Team ASP Brandon Harris Arpith Jacob.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
A flexible FGPA based Data Acquisition Module for a High Resolution PET Camera Abdelkader Bousselham, Attila Hidvégi, Clyde Robson, Peter Ojala and Christian.
FPGA FPGA2  A heterogeneous network of workstations (NOW)  FPGAs are expensive, available on some hosts but not others  NOW provide coarse- grained.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
Implementing Codesign in Xilinx Virtex II Pro Betim Çiço, Hergys Rexha Department of Informatics Engineering Faculty of Information Technologies Polytechnic.
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
HW/SW PARTITIONING OF FLOATING POINT SOFTWARE APPLICATIONS TO FIXED - POINTED COPROCESSOR CIRCUITS - Nalini Kumar Gaurav Chitroda Komal Kasat.
Programming Concepts in GPU Computing Dušan Gajić, University of Niš Programming Concepts in GPU Computing Dušan B. Gajić CIITLab, Dept. of Computer Science.
Hardware Implementation of a Memetic Algorithm for VLSI Circuit Layout Stephen Coe MSc Engineering Candidate Advisors: Dr. Shawki Areibi Dr. Medhat Moussa.
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
VLSI Algorithmic Design Automation Lab. 1 Integration of High-Performance ASICs into Reconfigurable Systems Providing Additional Multimedia Functionality.
“Politehnica” University of Timisoara Course No. 2: Static and Dynamic Configurable Systems (paper by Sanchez, Sipper, Haenni, Beuchat, Stauffer, Uribe)
SAR ATR Challenge Problem Update SLAAC Retreat March 1999 Brian K. Bray Sandia National Laboratories
DIPARTIMENTO DI ELETTRONICA E INFORMAZIONE Novel, Emerging Computing System Technologies Smart Technologies for Effective Reconfiguration: The FASTER approach.
AEEC405 – Microprocessor Architecture. Some Information Instructor Details Main Book.
Task Graph Scheduling for RTR Paper Review By Gregor Scott.
Rinoy Pazhekattu. Introduction  Most IPs today are designed using component-based design  Each component is its own IP that can be switched out for.
FPL Sept. 2, 2003 Software Decelerators Eric Keller, Gordon Brebner and Phil James-Roxby Xilinx Research Labs.
Jason Li Jeremy Fowers 1. Speedups and Energy Reductions From Mapping DSP Applications on an Embedded Reconfigurable System Michalis D. Galanis, Gregory.
Sandia National Labs SAR ATR Hour for the SLAAC Fall ‘99 Retreat Intro/Module Performance Goals: Brian Bray FOA: Scott Hemmert SLD: Steve Crago CDI: Mike.
USC GOVERNMENT ELECTRONIC SYSTEMS L O C K H E E D M A R T I N SLAAC / ECMA Demonstration DARPA Thursday 25 March 1999.
SLAAC Systems Level Applications of Adaptive Computing Delivering ACS Technology to Applications DARPA/ITO Adaptive Computing Systems PI Meeting San Juan,
Cs 147 Spring 2010 Meg Genoar. History Started to emerge in mid-1970s 1988 – RISC took over workstation market.
SLAAC SLD Update Steve Crago USC/ISI September 14, 1999 DARPA.
Lecture 3 : Performance of Parallel Programs Courtesy : MIT Prof. Amarasinghe and Dr. Rabbah’s course note.
Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.
 Programming - the process of creating computer programs.
M.Mohajjel. Why? TTM (Time-to-market) Prototyping Reconfigurable and Custom Computing 2Digital System Design.
SLAAC / ECMA (Electronic CounterMeasures Analysis) Application of ACS
Backprojection and Synthetic Aperture Radar Processing on a HHPC Albert Conti, Ben Cordes, Prof. Miriam Leeser, Prof. Eric Miller
An Automated Development Framework for a RISC Processor with Reconfigurable Instruction Set Extensions Nikolaos Vassiliadis, George Theodoridis and Spiridon.
3/12/07CS Visit Days1 A Sea Change in Processor Design Uniprocessor SpecInt Performance: From Hennessy and Patterson, Computer Architecture: A Quantitative.
Chia-Ho Pan DSPIC/GIEE NTU
Advanced Rendering Technology The AR250 A New Architecture for Ray Traced Rendering.
Philipp Gysel ECE Department University of California, Davis
CoDeveloper Overview Updated February 19, Introducing CoDeveloper™  Targeting hardware/software programmable platforms  Target platforms feature.
Reconfigurable Computing1 Reconfigurable Computing Part II.
Automated Software Generation and Hardware Coprocessor Synthesis for Data Adaptable Reconfigurable Systems Andrew Milakovich, Vijay Shankar Gopinath, Roman.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Summary Remaining Challenges The Future Messages to Take Home.
Architecture & Organization 1
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Architecture & Organization 1
A High Performance SoC: PkunityTM
Presentation transcript:

SAR-ATR/FOA Compiler for ACS SLAAC Retreat, March ‘99 Brad L. Hutchings Configurable Computing Lab Brigham Young University

Joint STARS ‘97 SAR ATR Pipeline Identification Indexer (SLD) Focus of Attention Detection Annotated SAR Image Belief Management (Fusion Executive) MPM CRMMSELPM.. Joint STARS Advance Workstation (JAWS) ATR Results Display PGA ESAR Image SAR Image T72 *Not Joint STARS imagery

Joint STARS ‘97 SAR ATR Pipeline Identification Indexer (SLD) Focus of Attention Detection Annotated SAR Image Belief Management (Fusion Executive) MPM CRMMSELPM.. Joint STARS Advance Workstation (JAWS) ATR Results Display PGA ESAR Image SAR Image T72 Sandia National Labs *Not Joint STARS imagery Binary Morphology AIQ

What the ATR programmer wants... ATR Guy quant span span cover Go-Fast Box TM No circuit design Automated mapping. Fast compile times. 10x or better performance. Wish List

What the ATR programmer gets… (current state of the art) ATR Guy Box of Gates TM Must be circuit designer. No automated mapping. Slow compile times (days). 10x or better performance. Yikes!

We are getting close to the ideal. (.def files in, bitstreams out). ATR Guy ACS Platform BYU-FOA Compiler FOA.def file (binary morphology) span span cover quant span span cover , XC4062 Xilinx Bitstreams 8 minutes 10X Faster

Baseline -- ‘96 PPC PowerPC SLAAC SLAAC SAR/ATR Roadmap - 500:1 Acceleration vs. ‘96 Baseline 50x 25x 10x 8x 10x Homogeneous ACS FPGAs upgrade to latest technology. Heterogeneous ACS (e.g. memory on chip) Hybrid ACS (e.g. RISC core, RTR) Year Performance Moore’s Law Arch. Advantage Goal Fine-grain parallelism Fine-grain control Dynamic load balance Precision management Skip “0” Early outs Initial Arch. Advantage Fine-grain parallelism Coarse-grain control ACS SLAAC BYU ACS FOA Performance (10x PowerPC) (morphology portion)

Superquant •Adaptive image quantization (AIQ). –Histogram generation and search. –First stage of the FOA algorithm. –Memory-bandwidth limited, not compute-bound. •Implementation –Exploits distributed memories. –Fully parameterizable (uses.def file). • Status: –Histogram processors have been designed and verified. –Projected pixel rates: »estimated, Xilinx 4K: 6-7 Mpixel (30 MHz). June 99. »estimated, Xilinx Virtex: Mpixel (60 MHz). ?

FOA Future Vision (Virtex) Superquant Binary Morphology Binary Morphology Image Data Output +3 Chip Solution +1 board +30 Mpixel

Future Issues •Release source of BYU-FOA compiler to Sandia. •AIQ (adaptive image quantization) aka Superquant –Initial designs are complete. •CDI –Studying specification. •Study advantages of Xilinx Virtex. –Faster clocks, fewer chips, lower cost, yada, yada, yada...