Professor Ioana Banicescu CSE 8843

Slides:



Advertisements
Similar presentations
Prepared 7/28/2011 by T. O’Neil for 3460:677, Fall 2011, The University of Akron.
Advertisements

Potential for parallel computers/parallel programming
CIS December '99 Introduction to Parallel Architectures Dr. Laurence Boxer Niagara University.
11Sahalu JunaiduICS 573: High Performance Computing5.1 Analytical Modeling of Parallel Programs Sources of Overhead in Parallel Programs Performance Metrics.
Advanced Topics in Algorithms and Data Structures Classification of the PRAM model In the PRAM model, processors communicate by reading from and writing.
PRAM Models Advanced Algorithms & Data Structures Lecture Theme 13 Prof. Dr. Th. Ottmann Summer Semester 2006.
Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Advanced Topics in Algorithms and Data Structures An overview of the lecture 2 Models of parallel computation Characteristics of SIMD models Design issue.
CS533 Concepts of Operating Systems Class 20 Summary.
1 Lecture 8 Architecture Independent (MPI) Algorithm Design Parallel Computing Fall 2007.
CSE 300: Software Reliability Engineering Topics covered: Software Reliability Models.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 15: March 12, 2007 Interconnect 3: Richness.
Introduction to Systems Architecture Kieran Mathieson.
CS294-6 Reconfigurable Computing Day 10 September 24, 1998 Interconnect Richness.
Models of Parallel Computation Advanced Algorithms & Data Structures Lecture Theme 12 Prof. Dr. Th. Ottmann Summer Semester 2006.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Chapter 4 Performance. Times User CPU time – Time that the CPU is executing the program System CPU time – time the CPU is executing OS routines for the.
1 Interconnects Shared address space and message passing computers can be constructed by connecting processors and memory unit using a variety of interconnection.
CCA Common Component Architecture Manoj Krishnan Pacific Northwest National Laboratory MCMD Programming and Implementation Issues.
DLS on Star (Single-level tree) Networks Background: A simple network model for DLS is the star network with a master-worker platform. It consists of a.
Querying Large Databases Rukmini Kaushik. Purpose Research for efficient algorithms and software architectures of query engines.
LogP and BSP models. LogP model Common MPP organization: complete machine connected by a network. LogP attempts to capture the characteristics of such.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 February 3, 2005 Session 7.
SOFTWARE DESIGN. INTRODUCTION There are 3 distinct types of activities in design 1.External design 2.Architectural design 3.Detailed design Architectural.
Department of Computer Science MapReduce for the Cell B. E. Architecture Marc de Kruijf University of Wisconsin−Madison Advised by Professor Sankaralingam.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 6, 2003 Interconnect 3: Richness.
By: Nelson Webster. Algorithm Engineers Algorithm engineers study the effectiveness and efficiency of procedures of solving problems on a computer.
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
Numerical Algorithms Chapter 11.
EEE 431 Computational Methods in Electrodynamics
PERFORMANCE EVALUATIONS
OPERATING SYSTEMS CS 3502 Fall 2017
Ioannis E. Venetis Department of Computer Engineering and Informatics
COMPUTATIONAL MODELS.
Parallel Programming By J. H. Wang May 2, 2017.
Lecture 2: Parallel computational models
Multiprocessor Cache Coherency
Multi-Processing in High Performance Computer Architecture:
Steven Whitham Jeremy Woods
CS 583 Fall 2006 Analysis of Algorithms
IP – Based Design Methodology
Computer Architecture Introduction to Data-Parallel architectures
A Cloud System for Machine Learning Exploiting a Parallel Array DBMS
Design Space Exploration
Logical architecture refinement
ESE534: Computer Organization
ESE534: Computer Organization
Algorithm An algorithm is a finite set of steps required to solve a problem. An algorithm must have following properties: Input: An algorithm must have.
Performance Evaluation of the Parallel Fast Multipole Algorithm Using the Optimal Effectiveness Metric Ioana Banicescu and Mark Bilderback Department of.
CSE8380 Parallel and Distributed Processing Presentation
Numerical Algorithms Quiz questions
CS 584.
CS184a: Computer Architecture (Structures and Organization)
Professor Ioana Banicescu CSE 8843
PERFORMANCE MEASURES. COMPUTATIONAL MODELS Equal Duration Model:  It is assumed that a given task can be divided into n equal subtasks, each of which.
Copyright © Cengage Learning. All rights reserved.
Automatic optimization of parallel linear algebra software
Potential for parallel computers/parallel programming
Potential for parallel computers/parallel programming
CSE 373 Data Structures and Algorithms
Parallel Speedup.
Potential for parallel computers/parallel programming
Potential for parallel computers/parallel programming
COMPUTER ORGANIZATION AND ARCHITECTURE
What Are Performance Counters?
CS 584 Lecture 5 Assignment. Due NOW!!.
Lecture 20 Parallel Programming CSE /8/2019.
Computer Architecture
Presentation transcript:

Professor Ioana Banicescu CSE 8843 Computational Models Professor Ioana Banicescu CSE 8843

Architectural Models for Parallel Algorithm Design Different algorithms require different architectures to solve the same problem efficiently Need to design a new algorithm for each architecture To avoid the complexity and cost of software development

Universal Abstract Model for Parallel Architecture Parallel programs can be ported to specific computers Parallel programs can be written for abstract model Software development cost is reduced Performance can be predicted since: parameters (characteristics) of the abstract model vs. those of specific architectures are known

Universal Model Characteristics Sufficiently general to capture the most important properties of a class of parallel computers Programs designed for this abstract model must execute efficiently on parallel computers and (2) are conflicting goals. Observation: The abstract model must make assumptions about the connectivity (degree of connectivity) among processors: Linear arrays and meshes: low Hypercubes: high

Universal Model Characteristics (continued) If the abstract model assumes low connectivity, algorithms designed for it will be optimal for parallel architectures with these characteristics and sub- optimal for architectures with high connectivity (i.e. communication resources under-utilized) Similarly in the case, the abstract model assume high connectivity

Universal Model Characteristics (continued) Regardless of the connectivity of the parallel model, it yields sub-optimal performance for some architectures Argument against the existence of a universal abstract model: No such model is currently known

Today’s Methodology in Parallel Algorithm Design Design parallel algorithms in terms of basic data communication operations Only implementation of these operations must be optimized for different parallel computers Communication operations are a small set Concurrency vs. Data locality