© 2010 The MITRE Corporation. All rights reserved. Session 2: Many Core Sharon Sacco / The MITRE Corporation HPEC 2010 Approved for Public Release: 10-3292.

Slides:



Advertisements
Similar presentations
Multi-core Computing Lecture 3 MADALGO Summer School 2012 Algorithms for Modern Parallel and Distributed Models Phillip B. Gibbons Intel Labs Pittsburgh.
Advertisements

System Simulation Of 1000-cores Heterogeneous SoCs Shivani Raghav Embedded System Laboratory (ESL) Ecole Polytechnique Federale de Lausanne (EPFL)
© 2010 IBM Corporation Welcome to Systor 2010 Yaron Wolfsthal, Mgr, Systems Technologies, Haifa Research Lab The 3rd Annual Haifa Experimental Systems.
Using GPUs to Enable Highly Reliable Embedded Storage University of Alabama at Birmingham 115A Campbell Hall 1300 University Blvd. Birmingham, AL
Peter Richtarik School of Mathematics Optimization with Big Data * in a billion dimensional space on a foggy day Extreme* Mountain Climbing =
DUKE CSNUFFT on Multicores HPEC-2008 Sept. 25, 2008NUFFTs on Multicores 1 Parallelization of NUFFT with Radial Data on Multicore Processors Nikos Pitsianis.
K-means clustering –An unsupervised and iterative clustering algorithm –Clusters N observations into K clusters –Observations assigned to cluster with.
IISWC 2007 Panel Benchmarking in the Web 2.0 Era Sudhanva Gurumurthi University of Virginia.
Embedded vs. PC Application Programming. Overview  The software design cycle  Designing differences  Code differences  Test differences.
Parallel Programming Henri Bal Rob van Nieuwpoort Vrije Universiteit Amsterdam Faculty of Sciences.
Carnegie Mellon μ-op Fission: Hyper-threading without the Hyper-headache Anthony Cartolano Robert Koutsoyannis Daniel S. McFarlin Carnegie Mellon University.
Document Number Here © 2006 The MITRE Corporation. All rights reserved. Holds and Diversions June 22, 2004.
Android is a trademark of Google Inc. Use of this trademark is subject to Google Permissions. Linux is the registered trademark of Linus Torvalds in the.
Copyright © 2010 Platform Computing Corporation. All Rights Reserved. Platform Computing Ken Hertzler VP Product Management.
DaMoN 2011 Paper Preview Organized by Stavros Harizopoulos and Qiong Luo Athens, Greece Jun 13, 2011.
© 2011 The MITRE Corporation. All rights reserved. Sharon Sacco / The MITRE Corporation HPEC 2011 Focus 3: GPU.
INTEL CONFIDENTIAL Why Parallel? Why Now? Introduction to Parallel Programming – Part 1.
Dr. Gheith Abandah, Chair Computer Engineering Department The University of Jordan 20/4/20091.
Implementation of Digital Front End Processing Algorithms with Portability Across Multiple Processing Platforms September 20-21, 2011 John Holland, Jeremy.
N Tropy: A Framework for Analyzing Massive Astrophysical Datasets Harnessing the Power of Parallel Grid Resources for Astrophysical Data Analysis Jeffrey.
High-Performance Packet Classification on GPU Author: Shijie Zhou, Shreyas G. Singapura and Viktor K. Prasanna Publisher: HPEC 2014 Presenter: Gang Chi.
Scalable Data Clustering with GPUs Andrew D. Pangborn Thesis Defense Rochester Institute of Technology Computer Engineering Department Friday, May 14 th.
PVTOL: A High-Level Signal Processing Library for Multicore Processors
High-Speed Parallel Processing of Protocol-Aware Signatures Jordi Ros-Giralt, James Ezick, Peter Szilagyi, Richard Lethin Unclassified, DISTRIBUTION STATEMENT.
© 2010 Mercury Computer Systems, Inc. Performance Scalability on Embedded Many-Core Processors Michael Champigny Research Scientist Advanced Computing.
HPEC-1 MIT Lincoln Laboratory Session 3: Cloud Computing Albert Reuther/ MIT Lincoln Laboratory HPEC Conference 16 September 2010.
Multicore In Real-Time Systems – Temporal Isolation Challenges Due To Shared Resources Ondřej Kotaba, Jan Nowotsch, Michael Paulitsch, Stefan.
Advisor: Dr. Aamir Shafi Co-Advisor: Mr. Ali Sajjad Member: Dr. Hafiz Farooq Member: Mr. Tahir Azim Optimizing N-body Simulations for Multi-core Compute.
Energy saving in multicore architectures Assoc. Prof. Adrian FLOREA, PhD Prof. Lucian VINTAN, PhD – Research.
Slide 1 MIT Lincoln Laboratory Toward Mega-Scale Computing with pMatlab Chansup Byun and Jeremy Kepner MIT Lincoln Laboratory Vipin Sachdeva and Kirk E.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
HPEC-1 MIT Lincoln Laboratory Session 1: GPU: Graphics Processing Units Miriam Leeser / Northeastern University HPEC Conference 15 September 2010.
Annapolis Micro Systems, Inc. 190 Admiral Cochrane Dr., Ste 130, Annapolis, MD Web: HQ Phone: (410) HQ Fax: (410)
©Wen-mei W. Hwu and David Kirk/NVIDIA Urbana, Illinois, August 2-5, 2010 VSCSE Summer School Proven Algorithmic Techniques for Many-core Processors Lecture.
Cousins HPEC 2002 Session 4: Emerging High Performance Software David Cousins Division Scientist High Performance Computing Dept. Newport,
System Architecture: Near, Medium, and Long-term Scalable Architectures Panel Discussion Presentation Sandia CSRI Workshop on Next-generation Scalable.
Haney - 1 HPEC 9/28/2004 MIT Lincoln Laboratory pMatlab Takes the HPCchallenge Ryan Haney, Hahn Kim, Andrew Funk, Jeremy Kepner, Charles Rader, Albert.
Summary Background –Why do we need parallel processing? Moore’s law. Applications. Introduction in algorithms and applications –Methodology to develop.
1 Modeling the Shallow Water Equations with C and MPI LA-UR Approved for public release; distribution is unlimited Bob Robey.
Approved for Public Release; Distribution Unlimited © 2006 The MITRE Corporation. All Rights Reserved. The SMS Table Kent V. Hollinger December.
Thermal-Aware Scheduling for Real-time Applications in Embedded Systems Adam Lewis, Soumik Ghosh, and N.-F. Tzeng (A) Approved for public release; distribution.
Dan Reed Scalable and Multicore Computing Strategist Managing Director, Data Center Futures Microsoft.
Slide-1 Multicore Theory MIT Lincoln Laboratory Theory of Multicore Algorithms Jeremy Kepner and Nadya Bliss MIT Lincoln Laboratory HPEC 2008 This work.
QR Decomposition: Demonstration of Distributed Computing on Wireless Sensor Networks By Sherine Abdelhak, Soumik Ghosh, Rabi Chaudhuri, Magdy Bayoumi (A)
Parallelism without Concurrency Charles E. Leiserson MIT.
Carnegie Mellon Program Generation with Spiral: Beyond Transforms This work was supported by DARPA DESA program, NSF-NGS/ITR, NSF-ACR, Mercury Inc., and.
Approved for Public Release; Distribution Unlimited. Case Number: , © 2006 The MITRE Corporation. All rights reserved Characterization Framework.
HPEC HN 9/23/2009 MIT Lincoln Laboratory RAPID: A Rapid Prototyping Methodology for Embedded Systems Huy Nguyen, Thomas Anderson, Stuart Chen, Ford.
PTCN-HPEC-02-1 AIR 25Sept02 MIT Lincoln Laboratory Resource Management for Digital Signal Processing via Distributed Parallel Computing Albert I. Reuther.
CRBLASTER: Benchmarking a Cosmic-Ray Rejection Application on the Tilera 64-core TILE64 Processor Kenneth Mighell National Optical Astronomy Observatory.
MIT Lincoln Laboratory 1 of 4 MAA 3/8/2016 Development of a Real-Time Parallel UHF SAR Image Processor Matthew Alexander, Michael Vai, Thomas Emberley,
1 Leveraging Multi-Core Processors in a CDMA-2000 SDR Base Station Steve Muir, John Chapin, Andrew Chiu, Victor Lum and Jeremy Nimmer Vanu, Inc.
Optimization of Embedded Linux systems without FPU Sergey Panasyuk, SUNYIT Scott Spetka, SUNYIT and ITT Corp Utica, NY High Performance Embedded.
Update on CHEP from the Computing Speaker Committee G. Carlino (INFN Napoli) on behalf of the CSC ICB, October
Electronic visualization laboratory, university of illinois at chicago Sort Last Parallel Rendering for Viewing Extremely Large Data Sets on Tile Displays.
Introduction to Data Analysis with R on HPC Texas Advanced Computing Center Feb
HPEC-1 SMHS 7/7/2016 MIT Lincoln Laboratory Focus 3: Cell Sharon Sacco / MIT Lincoln Laboratory HPEC Workshop 19 September 2007 This work is sponsored.
George Viamontes, Mohammed Amduka, Jon Russo, Matthew Craven, Thanhvu Nguyen Lockheed Martin Advanced Technology Laboratories 3 Executive Campus, 6th Floor.
Conclusions on CS3014 David Gregg Department of Computer Science
SmartCell: A Coarse-Grained Reconfigurable Architecture for High Performance and Low Power Embedded Computing Xinming Huang Depart. Of Electrical and Computer.
Many-core Software Development Platforms
Babak Falsafi Computer Architecture Lab (CALCM) Carnegie Mellon
Parallel Computing has been moving into University training for several years… IAC Membership indicates commitment to parallel computing in undergraduate.
Key to Scalable Parallelism - Regularity and Locality
Learn about MATLAB Engineers – not sales!
Restructuring the multi-resolution approximation for spatial data to reduce the memory footprint and to facilitate scalability Vinay Ramakrishnaiah Mentors:
Parallel Computation of 2D Morse-Smale Complexes
Alternative Processor Panel Results 2008
New Scheduling Algorithms: Improving Fairness and Quality of Service
©2018 The MITRE Corporation. ALL RIGHTS RESERVED
Presentation transcript:

© 2010 The MITRE Corporation. All rights reserved. Session 2: Many Core Sharon Sacco / The MITRE Corporation HPEC 2010 Approved for Public Release: Distribution is unlimited

© 2010 The MITRE Corporation. All rights reserved. ■Theme: Multicore processors and their impact on DoD HPEC Systems ■Panel Session: Multicore Meltdown? ■Most discussed processors –FPGA: 14 abstracts –STI Cell BE: 12 abstracts –GPU: 9 abstracts At HPEC 2007:  Multicore processors need sophisticated programming techniques  Keeping the cores busy is challenging  Getting high performance is tricky  Multicore processors need sophisticated programming techniques  Keeping the cores busy is challenging  Getting high performance is tricky

© 2010 The MITRE Corporation. All rights reserved. HPEC 2010 ■Theme: Custom clouds & general purpose GPUs: their impact on DoD applications ■Panel Session: ISR Clouds ■Most discussed processors: –GPU: 16 abstracts –Tilera: 4 abstracts –STI Cell BE: 4 abstracts –FPGA: 3 abstracts  Many Core processors need sophisticated programming techniques  Keeping the cores busy is even more challenging  Getting high performance is really tricky  Many Core processors need sophisticated programming techniques  Keeping the cores busy is even more challenging  Getting high performance is really tricky

© 2010 The MITRE Corporation. All rights reserved. Session 2: Agenda ■Invited Talk ■ Richard Schooler / Tilera ■Micro-op Fission: Hyper-threading Without the Hyper- headache ■Daniel McFarlin / Carnegie Mellon University ■Automatic Parallelization and Locality Optimization of Beamforming Algorithms ■Albert Hartono / Reservoir Labs ■Break (15 minutes)

© 2010 The MITRE Corporation. All rights reserved. Session 2: Agenda (cont.) ■Performance Scalability on Embedded Many-Core Processors ■ Michael Champigny / Mercury Computer Systems ■CRBLASTER: Benchmarking a Cosmic-Ray Rejections Application on the Tilera 64-core TILE64 Processor ■Kenneth Mighell / National Optical Astronomy Observatory ■Towards Mega-Scale Computing with pMatlab ■Chansup Byun / MIT Lincoln Laboratory