Computer Science, University of Oklahoma Reconfigurable Versus Fixed Versus Hybrid Architectures John K. Antonio Oklahoma Supercomputing Symposium 2008.

Slides:



Advertisements
Similar presentations
Accelerators for HPC: Programming Models Accelerators for HPC: StreamIt on GPU High Performance Applications on Heterogeneous Windows Clusters
Advertisements

Distributed Systems CS
1 Computational models of the physical world Cortical bone Trabecular bone.
Computer Abstractions and Technology
PERFORMANCE ANALYSIS OF MULTIPLE THREADS/CORES USING THE ULTRASPARC T1 (NIAGARA) Unique Chips and Systems (UCAS-4) Dimitris Kaseridis & Lizy K. John The.
System Simulation Of 1000-cores Heterogeneous SoCs Shivani Raghav Embedded System Laboratory (ESL) Ecole Polytechnique Federale de Lausanne (EPFL)
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
Today’s topics Single processors and the Memory Hierarchy
1 BGL Photo (system) BlueGene/L IBM Journal of Research and Development, Vol. 49, No. 2-3.
Chapter Hardwired vs Microprogrammed Control Multithreading
Introduction What is Parallel Algorithms? Why Parallel Algorithms? Evolution and Convergence of Parallel Algorithms Fundamental Design Issues.
CAD and Design Tools for On- Chip Networks Luca Benini, Mark Hummel, Olav Lysne, Li-Shiuan Peh, Li Shang, Mithuna Thottethodi,
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
CPE 731 Advanced Computer Architecture Multiprocessor Introduction
Presenter MaxAcademy Lecture Series – V1.0, September 2011 Introduction and Motivation.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Part 1.  Intel x86/Pentium family  32-bit CISC processor  SUN SPARC and UltraSPARC  32- and 64-bit RISC processors  Java  C  C++  Java  Why Java?
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
RSC Williams MAPLD 2005/BOF-S1 A Linux-based Software Environment for the Reconfigurable Scalable Computing Project John A. Williams 1
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Archs, VHDL 3 Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
Paper Review: XiSystem - A Reconfigurable Processor and System
A RISC ARCHITECTURE EXTENDED BY AN EFFICIENT TIGHTLY COUPLED RECONFIGURABLE UNIT Nikolaos Vassiliadis N. Kavvadias, G. Theodoridis, S. Nikolaidis Section.
Claude Tadonki Mines ParisTech – CRI – Mathématiques et Systèmes Laboratoire de l’Accélérateur Linéaire/IN2P3/CNRS France 2nd.
Automated Design of Custom Architecture Tulika Mitra
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
Page 1 Reconfigurable Communications Processor Principal Investigator: Chris Papachristou Task Number: NAG Electrical Engineering & Computer Science.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
DIPARTIMENTO DI ELETTRONICA E INFORMAZIONE Novel, Emerging Computing System Technologies Smart Technologies for Effective Reconfiguration: The FASTER approach.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
1 Choices “Our object-oriented system architecture embodies the notion of customizing operating systems to tailor them to support particular hardware configuration.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Reconfigurable Computing Ender YILMAZ, Hasan Tahsin OĞUZ.
Development of Programmable Architecture for Base-Band Processing S. Leung, A. Postula, Univ. of Queensland, Australia A. Hemani, Royal Institute of Tech.,
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
Data Management for Decision Support Session-4 Prof. Bharat Bhasker.
Backprojection and Synthetic Aperture Radar Processing on a HHPC Albert Conti, Ben Cordes, Prof. Miriam Leeser, Prof. Eric Miller
Somervill RSC 1 125/MAPLD'05 Reconfigurable Processing Module (RPM) Kevin Somervill 1 Dr. Robert Hodson 1
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
Background Computer System Architectures Computer System Software.
BLUE GENE Sunitha M. Jenarius. What is Blue Gene A massively parallel supercomputer using tens of thousands of embedded PowerPC processors supporting.
Heterogeneous Processing KYLE ADAMSKI. Overview What is heterogeneous processing? Why it is necessary Issues with heterogeneity CPU’s vs. GPU’s Heterogeneous.
Programmable Logic Devices
Conclusions on CS3014 David Gregg Department of Computer Science
Programmable Hardware: Hardware or Software?
Topics Coarse-grained FPGAs. Reconfigurable systems.
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Multiprocessing.
Constructing a system with multiple computers or processors
Structural Simulation Toolkit / Gem5 Integration
Architecture & Organization 1
Presentation Title September 22, 2018
Anne Pratoomtong ECE734, Spring2002
by Manuel Saldaña, Daniel Nunes, Emanuel Ramalho, and Paul Chow
What is Parallel and Distributed computing?
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Architecture & Organization 1
Dynamically Reconfigurable Architectures: An Overview
Radar Processing with COTS Components
Chapter 17: Database System Architectures
CLUSTER COMPUTING.
Constructing a system with multiple computers or processors
Constructing a system with multiple computers or processors
A High Performance SoC: PkunityTM
Constructing a system with multiple computers or processors
Chapter 1 Introduction.
Chapter 4 Multiprocessors
Types of Parallel Computers
Research: Past, Present and Future
Presentation transcript:

Computer Science, University of Oklahoma Reconfigurable Versus Fixed Versus Hybrid Architectures John K. Antonio Oklahoma Supercomputing Symposium 2008 Norman, Oklahoma October 6, 2008

Computer Science, University of Oklahoma 2 Overview The (past) world of reconfigurable computing The (past) world of multi-core The (emerging) world of reconfigurable multi-core architectures Illustrative analysis Conclusions

Computer Science, University of Oklahoma 3 Drivers for reconfigurable computing Near performance of custom ASIC Near cost of commodity processor More flexible than custom ASIC “Programming” tools improving steadily Often used in embedded applications having high computational throughput requirements and strict SWAP constraints

SAR processing on a UAV “Predator” Jeffrey T. Muehring, “Optimal Configuration of a Parallel Embedded System for Synthetic Aperture Radar Processing,” MS Thesis, Texas Tech University, Dec

5 A prototype hybrid system Data Source PC Annapolis FPGA Subsystem (F) Custom Interface Cables SPARC Mercury DSP/GPP Subsystem Data Sink PC Annapolis FPGA Subsystem (B)

Data Sink PC Data Source PC Custom Interface Cables Mercury DSP/GPP Subsystem Annapolis FPGA Subsystem (F) Annapolis FPGA Subsystem (B) SPARC Data Sink PC Data Source PC Custom Interface Cables Mercury DSP/GPP Subsystem Annapolis FPGA Subsystem (F) Annapolis FPGA Subsystem (B) SPARC A prototype hybrid system

Minimum Power Configurations Resolution Velocity X T X r X a YTYrYaYTYrYa Jeffrey T. Muehring, “Optimal Configuration of a Parallel Embedded System for Synthetic Aperture Radar Processing,” MS Thesis, Texas Tech University, Dec

Minimum Power Jeffrey T. Muehring, “Optimal Configuration of a Parallel Embedded System for Synthetic Aperture Radar Processing,” MS Thesis, Texas Tech University, Dec

Computer Science, University of Oklahoma 9 Overview The (past) world of reconfigurable computing The (past) world of multi-core The (emerging) world of reconfigurable multi-core architectures Illustrative analysis Conclusions

Computer Science, University of Oklahoma 10 Drivers for multi-core technology path Single-core path leading to increased cost, heat, and power consumption Single-core path widens the pocessor/memory speed gap Multi-core path transparent to many application domain developers Multi-core path can improve performance of threaded software

11 Typical multi-core architecture* Dual Core Chip L2 Cache Core Memory Dual Core Chip L2 Cache Core *L. Chai, Q. Gao, D.K. Panda, “Understanding the Impact of Multi-Core Architecture in Cluster Computing: A Case Study with Intel Dual-Core System,” Seventh Int'l Symposium on Cluster Computing and the Grid (CCGrid), Rio de Janeiro - Brazil, May 2007.

Computer Science, University of Oklahoma 12 Overview The (past) world of reconfigurable computing The (past) world of multi-core The (emerging) world of reconfigurable multi-core architectures Illustrative analysis Conclusions

Computer Science, University of Oklahoma 13 Emerging drivers and requirements for multi-core architectures Scale to support massively data parallel (SPMD) applications Match coupling among cores with application granularity Power is a major challenge for large data centers and supercomputing facilities

Computer Science, University of Oklahoma 14 Hybrid architectural framework Multi-core Chip Reconfigurable Logic Core L2 Cache L2 Cache MU Core L2 Cache L2 Cache Core L2 Cache Core L2 Cache MU Reconfigurable logic

Computer Science, University of Oklahoma 15 Shared everything configuration Multi-core Chip Core L2 Cache L2 Cache MU Core L2 Cache L2 Cache Core L2 Cache Core L2 Cache MU Interconnection Network Reconfigurable logic

Computer Science, University of Oklahoma 16 Shared nothing configuration Multi-core Chip Core L2 Cache L2 Cache MU Core L2 Cache L2 Cache Core L2 Cache Core L2 Cache MU Co- Proc Co- Proc Co- Proc Co- Proc Co- Proc Co- Proc Reconfigurable logic

Computer Science, University of Oklahoma 17 Hybrid configuration Multi-core Chip Core L2 Cache L2 Cache MU Core L2 Cache L2 Cache Core L2 Cache Core L2 Cache MU Co-Proc Interconnection Network Reconfigurable logic

Computer Science, University of Oklahoma 18 Features of hybrid architecture Match core coupling and core processing capacity with application granularity –Fixed multiprocessor architecture not well matched with all application granularities –Proposed reconfigurable multi-core architecture can be configured to match core coupling with application granularity

Computer Science, University of Oklahoma 19 Mismatched SPMD execution core 1 core 2 core 3 core 4 core c time Core coupling too loose relative to application granularity Communication time Computation time

Computer Science, University of Oklahoma 20 core 1 core 2 core 3 core 4 core c Matched SPMD execution time Core coupling tightened to match application granularity Communication time Computation time

Computer Science, University of Oklahoma 21 Overview The (past) world of reconfigurable computing The (past) world of multi-core The (emerging) world of reconfigurable multi-core architectures Illustrative analysis Conclusions

Computer Science, University of Oklahoma 22 Illustrative Analysis Notation –Number of cores: c –Problem size: n –Sequential time complexity: –Parallel time complexity: –Computational complexity: –Communication complexity: –Core coupling ratio:

Computer Science, University of Oklahoma 23 Example Sequential Time: Parallel Time: Speedup: The value of K: related to core processing capacity The value of L: related to interconnection among cores

Computer Science, University of Oklahoma 24 K = 1.0, L = 1.0 Number of cores, c

Computer Science, University of Oklahoma 25 K = 1.5, L = 0.5 Number of cores, c

Computer Science, University of Oklahoma 26 K = 0.5, L = 1.5 Number of cores, c

Computer Science, University of Oklahoma 27 n = 1024 Number of cores, c

Computer Science, University of Oklahoma 28 Overview The (past) world of reconfigurable computing The (past) world of multi-core The (emerging) world of reconfigurable multi-core architectures Illustrative analysis Conclusions

Computer Science, University of Oklahoma 29 Conclusions Current multi-core approaches may not scale to support massive parallelism Hybrid reconfigurable multi-core approach enables trades between core coupling and core processing capacity More research needed in reconfigurable micro-architecture to support hybrid architectures