Heterogeneous Computing at USC Dept. of Computer Science and Engineering University of South Carolina Dr. Jason D. Bakos Assistant Professor Heterogeneous.

Slides:



Advertisements
Similar presentations
Spatial Computation Thesis committee: Seth Goldstein Peter Lee Todd Mowry Babak Falsafi Nevin Heintze Ph.D. Thesis defense, December 8, 2003 SCS Mihai.
Advertisements

TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Professur für Technische Informatik A Self Distributing Virtual Machine for FPGA Multicores Klaus Waldschmidt J. W. Goethe-University Technische Informatik.
All Programmable FPGAs, SoCs, and 3D ICs
Advanced Piloting Cruise Plot.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
ITRS Roadmap Design + System Drivers Makuhari, December 2007 Worldwide Design ITWG Good morning. Here we present the work that the ITRS Design TWG has.
Mourad Loulou, Tunisia Section Chair 1 Mourad Loulou Tunisia Section Chair IEEE Tunisia Section.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Year 6 mental test 5 second questions
Andrew Meade School of Biological Sciences.
2010 fotografiert von Jürgen Roßberg © Fr 1 Sa 2 So 3 Mo 4 Di 5 Mi 6 Do 7 Fr 8 Sa 9 So 10 Mo 11 Di 12 Mi 13 Do 14 Fr 15 Sa 16 So 17 Mo 18 Di 19.
Augmenting FPGAs with Embedded Networks-on-Chip
DOROTHY Design Of customeR dRiven shOes and multi-siTe factorY Product and Production Configuration Method (PPCM) ICE 2009 IMS Workshops Dorothy Parallel.
ABC Technology Project
The scale of IC design Small-scale integrated, SSI: gate number usually less than 10 in a IC. Medium-scale integrated, MSI: gate number ~10-100, can operate.
CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 ACM Principles and Practice of Parallel Programming, PPoPP, 2006 Panel Presentations Parallel Processing is.
VOORBLAD.
15. Oktober Oktober Oktober 2012.
Solving Equations How to Solve Them
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Copyright © 2013, 2009, 2006 Pearson Education, Inc.
© 2012 National Heart Foundation of Australia. Slide 2.
Copyright © 2013, 2009, 2006 Pearson Education, Inc. 1 Section 5.4 Polynomials in Several Variables Copyright © 2013, 2009, 2006 Pearson Education, Inc.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
25 seconds left…...
A Hardware Processing Unit For Point Sets S. Heinzle, G. Guennebaud, M. Botsch, M. Gross Graphics Hardware 2008.
Januar MDMDFSSMDMDFSSS
Week 1.
Analyzing Genes and Genomes
We will resume in: 25 Minutes.
Essential Cell Biology
Improved Census Transforms for Resource-Optimized Stereo Vision
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
How Cells Obtain Energy from Food
Energy Generation in Mitochondria and Chlorplasts
An Integrated Reduction Technique for a Double Precision Accumulator Krishna Nagar, Yan Zhang, Jason Bakos Dept. of Computer Science and Engineering University.
Heterogeneous Computing at USC Dept. of Computer Science and Engineering University of South Carolina Dr. Jason D. Bakos Assistant Professor Heterogeneous.
A Sparse Matrix Personality for the Convey HC-1 Dept. of Computer Science and Engineering University of South Carolina Krishna K Nagar, Jason D. Bakos.
Heterogeneous Computing: New Directions for Efficient and Scalable High-Performance Computing Dr. Jason D. Bakos.
Computes the partial dot products for only the diagonal and upper triangle of the input matrix. The vector computed by this architecture is added to the.
Heterogeneous Computing: New Directions for Efficient and Scalable High-Performance Computing CSCE 791 Dr. Jason D. Bakos.
Heterogeneous Computing at USC Dept. of Computer Science and Engineering University of South Carolina Dr. Jason D. Bakos Assistant Professor Heterogeneous.
Seven Minute Madness: Special-Purpose Parallel Architectures Dr. Jason D. Bakos.
FPGA vs. GPU for Sparse Matrix Vector Multiply Yan Zhang, Yasser H. Shalabi, Rishabh Jain, Krishna K. Nagar, Jason D. Bakos Dept. of Computer Science and.
V The DARPA Dynamic Programming Benchmark on a Reconfigurable Computer Justification High performance computing benchmarking Compare and improve the performance.
FPGA Acceleration of Phylogeny Reconstruction for Whole Genome Data Jason D. Bakos Panormitis E. Elenis Jijun Tang Dept. of Computer Science and Engineering.
Seven Minute Madness: Reconfigurable Computing Dr. Jason D. Bakos.
FPGA Acceleration of Gene Rearrangement Analysis Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina Columbia, SC USA.
Seven Minute Madness: Reconfigurable Computing Dr. Jason D. Bakos.
High-Performance Reconfigurable Computing for Genome Analysis Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina Columbia,
Heterogeneous Computing Dr. Jason D. Bakos. Heterogeneous Computing 2 “Traditional” Parallel/Multi-Processing Large-scale parallel platforms: –Individual.
Heterogeneous Computing: New Directions for Efficient and Scalable High-Performance Computing CSCE 791 Dr. Jason D. Bakos.
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
PERFORMANCE ANALYSIS cont. End-to-End Speedup  Execution time includes communication costs between FPGA and host machine  FPGA consistently outperforms.
Seven Minute Madness: Heterogeneous Computing Dr. Jason D. Bakos.
Seven Minute Madness: Heterogeneous Computing Dr. Jason D. Bakos.
Thilina Gunarathne, Bimalee Salpitkorala, Arun Chauhan, Geoffrey Fox
Presentation transcript:

Heterogeneous Computing at USC Dept. of Computer Science and Engineering University of South Carolina Dr. Jason D. Bakos Assistant Professor Heterogeneous and Reconfigurable Computing Lab (HeRC) This material is based upon work supported by the National Science Foundation under Grant Nos. CCF and CCF

Our Group: HeRC Applications work –Computational phylogenetics (FPGA) –High-throughput global sequence alignment for large-scale genomic clustering (GPU) –Sparse linear algebra (FPGA/GPU) –Frequent itemset mining (Multi- core/GPU) –Logic synthesis (GPU) System architecture –Multi-FPGA interconnects Tools –Automatic CPU/coprocessor partitioning (PATHS) –Micro-architectural simulation for code tuning Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 2

FPGA Platforms Annapolis Micro Systems WILDSTAR 2 PRO GiDEL PROCSTAR III Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 3

FPGA Platforms Convey HC-1 Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 4 Jason D. Bakos, “High-Performance Heterogeneous Computing with the Convey HC-1,” IEEE Computing in Science and Engineering, Nov/Dec’10.

GPU Platforms NVIDIA Tesla S1070 Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 5

Programming FPGAs Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 6

Phylogenies genus Drosophila Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 7

Our Projects FPGA-based co-processors for computational biology 1000X speedup! 10X speedup! 1.Tiffany M. Mintz, Jason D. Bakos, "A Cluster-on-a-Chip Architecture for High-Throughput Phylogeny Search," IEEE Trans. on Parallel and Distributed Systems, to appear. 2.Stephanie Zierke, Jason D. Bakos, "FPGA Acceleration of Bayesian Phylogenetic Inference," BMC Bioinformatics, BMC Bioinformatics 2010, 11: Jason D. Bakos, Panormitis E. Elenis, "A Special-Purpose Architecture for Solving the Breakpoint Median Problem," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Vol. 16, No. 12, Dec Jason D. Bakos, Panormitis E. Elenis, Jijun Tang, "FPGA Acceleration of Phylogeny Reconstruction for Whole Genome Data," 7th IEEE International Symposium on Bioinformatics & Bioengineering (BIBE'07), Boston, MA, Oct , Jason D. Bakos, “FPGA Acceleration of Gene Rearrangement Analysis,” 15th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM'07), April 23-25, GRAPPA: MP reconstruction based on gene-rearrangement model MrBayes: MCMCMC reconstruction based on (sequence data) likelihood model Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 8

Our Projects FPGA-based co-processors for sparse linear algebra –Accelerate sparse matrix operations to accelerate sparse numerical linear algebra –Problems: indirect addressing, double precision accumulation, memory bandwidth 1.Krishna.K. Nagar, Jason D. Bakos, "A High-Performance Double Precision Accumulator," IEEE International Conference on Field-Programmable Technology (IC- FPT'09), Dec. 9-11, Yan Zhang, Yasser Shalabi, Rishabh Jain, Krishna K. Nagar, Jason D. Bakos, "FPGA vs. GPU for Sparse Matrix Vector Multiply," IEEE International Conference on Field- Programmable Technology (IC-FPT'09), Dec. 9-11, Krishna K. Nagar, Yan Zhang, Jason D. Bakos, "An Integrated Reduction Technique for a Double Precision Accumulator," Proc. Third International Workshop on High- Performance Reconfigurable Computing Technology and Applications (HPRCTA'09), held in conjunction with Supercomputing 2009 (SC'09), Nov. 15, Jason D. Bakos, Krishna K. Nagar, "Exploiting Matrix Symmetry to Improve FPGA- Accelerated Conjugate Gradient," 17th Annual IEEE International Symposium on Field Programmable Custom Computing Machines (FCCM'09), April 5-8, Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 9

Double Precision Accumulation + + Mem Control Partial sums Basic Accumulator Architecture Adder Pipeline Required Design Reduction Ckt Feedback Loop Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 10

Our Projects: Automated Partitioning Tiff any M. Mintz, “Systematic Code Partitioning for the Disjoint- Memory Co-Processor Accelerated Execution Model” Ph.D. dissertation, University of South Carolina, Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 11

Additional Projects GPU and FPGA Acceleration of Data Mining GPU Acceleration of Logic Synthesis Ibrahim Savran, Jason D. Bakos, "GPU Acceleration of Near- Minimal Logic Minimization," 2010 Symposium on Application Accelerators in High Performance Computing (SAAHPC'10), July 13-15, Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 12

Additional Projects Multi-FPGA System Architectures 1.Jason D. Bakos, Charles L. Cathey, E. Allen Michalski, "Predictive Load Balancing for Interconnected FPGAs," 16th International Conference on Field Programmable Logic and Applications (FPL'06), Madrid, Spain, August 28-30, Charles L. Cathey, Jason D. Bakos, Duncan A. Buell, "A Reconfigurable Distributed Computing Fabric Exploiting Multilevel Parallelism," 14th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM'06), April 24-26, GPU Simulation 1.Patrick A. Moran, Jason D. Bakos, "A PTX Simulator for Performance Tuning CUDA Code," IEEE Trans. on Parallel and Distributed Systems, submitted. Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 13

Contact Information Jason D. Bakos –Office: 3A52 – – Heterogeneous and Reconfigurable Computing (HeRC) Lab: –Lab: 3D15 – Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 14

Our Group Heterogeneous and Reconfigurable Computing Group Zheming Jin Tiffany Mintz Krishna Nagar Jason BakosYan Zhang Heterogeneous Computing at USC | EPSCOR Clemson | 9/21/10 15