Babak Falsafi Computer Architecture Lab (CALCM) Carnegie Mellon

Slides:



Advertisements
Similar presentations
4. Shared Memory Parallel Architectures 4.4. Multicore Architectures
Advertisements

Parallel Programming Motivation and terminology – from ACM/IEEE 2013 curricula.
Speculative Sequential Consistency with Little Custom Storage Impetus Group Computer Architecture Lab (CALCM) Carnegie Mellon University
Latency considerations of depth-first GPU ray tracing
JavaTile: CMP-simulation with a twist Dan Greenfield Computer Architecture Group Internal Presentation, 16 th February 2007.
October 2003 What Does the Future Hold for Parallel Languages A Computer Architect’s Perspective Josep Torrellas University of Illinois
Instruction Level Parallelism (ILP) Colin Stevens.
University of Michigan Electrical Engineering and Computer Science 1 Parallelizing Sequential Applications on Commodity Hardware Using a Low-Cost Software.
Single-Chip Multiprocessors: the Rebirth of Parallel Architecture Guri Sohi University of Wisconsin.
Multithreading and Dataflow Architectures CPSC 321 Andreas Klappenecker.
Chapter Hardwired vs Microprogrammed Control Multithreading
Compiling Application-Specific Hardware Mihai Budiu Seth Copen Goldstein Carnegie Mellon University.
RISC. Rational Behind RISC Few of the complex instructions were used –data movement – 45% –ALU ops – 25% –branching – 30% Cheaper memory VLSI technology.
18-447: Computer Architecture Lecture 30B: Multiprocessors Prof. Onur Mutlu Carnegie Mellon University Spring 2013, 4/22/2013.
Ioana Burcea * Stephen Somogyi §, Andreas Moshovos*, Babak Falsafi § # Predictor Virtualization *University of Toronto Canada § Carnegie Mellon University.
(1) Scheduling for Multithreaded Chip Multiprocessors (Multithreaded CMPs)
SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.
10/27: Lecture Topics Survey results Current Architectural Trends Operating Systems Intro –What is an OS? –Issues in operating systems.
VTU – IISc Workshop Compiler, Architecture and HPC Research in Heterogeneous Multi-Core Era R. Govindarajan CSA & SERC, IISc
Hyper Threading (HT) and  OPs (Micro-Operations) Department of Computer Science Southern Illinois University Edwardsville Summer, 2015 Dr. Hiroshi Fujinoki.
Compiling Several Classes of Communication Patterns on a Multithreaded Architecture Gagan Agrawal Department of Computer and Information Sciences Ohio.
SYNAR Systems Networking and Architecture Group CMPT 886: Computer Architecture Primer Dr. Alexandra Fedorova School of Computing Science SFU.
Summary Background –Why do we need parallel processing? Moore’s law. Applications. Introduction in algorithms and applications –Methodology to develop.
Classic Model of Parallel Processing
A few issues on the design of future multicores André Seznec IRISA/INRIA.
1 Processor Architecture Jurij Silc, Borut Robic, Theo Ungerer.
SIMULTANEOUS MULTITHREADING Ting Liu Liu Ren Hua Zhong.
Fall-11: Early Adoption of NSF/TCPP PDC Curriculum at Texas Tech University and Beyond Yong Chen Yu Zhuang Noe Lopez-Benitez May 10 th, 2013.
E X C E E D I N G E X P E C T A T I O N S VLIW-RISC CSIS Parallel Architectures and Algorithms Dr. Hoganson Kennesaw State University Instruction.
Processor Level Parallelism. Improving the Pipeline Pipelined processor – Ideal speedup = num stages – Branches / conflicts mean limited returns after.
1 November 11, 2015 A Massively Parallel, Hybrid Dataflow/von Neumann Architecture Yoav Etsion November 11, 2015.
ISA's, Compilers, and Assembly
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
Solving the straggler problem with bounded staleness Jim Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth Gibson, Kimberly Keeton*,
SYNAR Systems Networking and Architecture Group CMPT 886: Computer Architecture Primer Dr. Alexandra Fedorova School of Computing Science SFU.
Hardware Trends CSE451 Andrew Whitaker. Motivation Hardware moves quickly OS code tends to stick around for a while “System building” extends way beyond.
Processor Level Parallelism 2. How We Got Here Developments in PC CPUs.
Computer Architecture: Multi-Core Processors: Why? Prof. Onur Mutlu Carnegie Mellon University.
“Processors” issues for LQCD January 2009 André Seznec IRISA/INRIA.
740: Computer Architecture Memory Consistency Prof. Onur Mutlu Carnegie Mellon University.
Elec/Comp 526 Spring 2015 High Performance Computer Architecture Instructor Peter Varman DH 2022 (Duncan Hall) rice.edux3990 Office Hours Tue/Thu.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
18-447: Computer Architecture Lecture 30B: Multiprocessors
Prof. Onur Mutlu Carnegie Mellon University
Computer Architecture: Parallel Processing Basics
ECE/CS 757: Advanced Computer Architecture II Midterm 2 Review
Simultaneous Multithreading
Hot Chips, Slow Wires, Leaky Transistors
Parallel Computing Lecture
Antonia Zhai, Christopher B. Colohan,
Parallel Computing has been moving into University training for several years… IAC Membership indicates commitment to parallel computing in undergraduate.
The University of Texas at Austin
Computer Architecture Lecture 4 17th May, 2006
Lecture 1: Parallel Architecture Intro
ISCA 2005 Panel Guri Sohi.
Computer Architecture: Multithreading (IV)
Coe818 Advanced Computer Architecture
Single-Chip Multiprocessors: the Rebirth of Parallel Architecture
Database Servers on Chip Multiprocessors: Limitations and Opportunities Nikos Hardavellas With Ippokratis Pandis, Ryan Johnson, Naju Mancheril, Anastassia.
Computer Evolution and Performance
Mattan Erez The University of Texas at Austin
Advanced Architecture +
CS 286 Computer Organization and Architecture
EE 193: Parallel Computing
The University of Adelaide, School of Computer Science
EE 155 / Comp 122 Parallel Computing
EN Software Carpentry Python – A Crash Course Esoteric Sections Parallelization
Prof. Onur Mutlu Carnegie Mellon University
Presentation transcript:

Babak Falsafi Computer Architecture Lab (CALCM) Carnegie Mellon ISCA 2005 panel on Chip multiprocessors are here. But, where are the threads? Babak Falsafi Computer Architecture Lab (CALCM) Carnegie Mellon

End of Single Thread Era? Multiple cores Multithreaded cores OoO Superscalar First Cores RISC Good news: 1 TIPS by 2010 Bad news: where is the “I” in TIPS?

1. Hand program them? 40-years, still not mainstream Limited to skilled folks Need to think parallel at all stack levels Algorithms, languages, systems, etc. Not your typical undergrad curriculum Decomposition first, PRAM, or dataflow? Is Transactional Memory the right solution to the wrong problem? Is this an architecture problem?

2. Automatically parallelize? Parallelizing compilers/languages: 30-year effort, modest results Works for data-parallel Thread-Level Speculation: 15-year effort, less than modest results Less than 10% speedup with 4 cores Hand tuning helps but doesn’t solve the problem

3. Use them for helpers/nannies? Performance: Precomputation, slipstream, microthreading,… E.g., better memory, branch performance Similar success/failure as TLS Functionality: Garbage collection, reliability, security How many nannies does a program need? Is this an effective use of real-estate?

Our Distinguished Panel Mr. Multiscalar Mr. Niagara Mr. Supercompiler Michael Wolfe Mr. Transaction Mr. Impact Mr. Stampede