COMPUTATIONAL MODELS.

Slides:



Advertisements
Similar presentations
Prepared 7/28/2011 by T. O’Neil for 3460:677, Fall 2011, The University of Akron.
Advertisements

Potential for parallel computers/parallel programming
CIS December '99 Introduction to Parallel Architectures Dr. Laurence Boxer Niagara University.
11Sahalu JunaiduICS 573: High Performance Computing5.1 Analytical Modeling of Parallel Programs Sources of Overhead in Parallel Programs Performance Metrics.
Advanced Topics in Algorithms and Data Structures Classification of the PRAM model In the PRAM model, processors communicate by reading from and writing.
PRAM Models Advanced Algorithms & Data Structures Lecture Theme 13 Prof. Dr. Th. Ottmann Summer Semester 2006.
Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Advanced Topics in Algorithms and Data Structures An overview of the lecture 2 Models of parallel computation Characteristics of SIMD models Design issue.
CS533 Concepts of Operating Systems Class 20 Summary.
1 Lecture 8 Architecture Independent (MPI) Algorithm Design Parallel Computing Fall 2007.
Advanced Topics in Algorithms and Data Structures 1 Lecture 4 : Accelerated Cascading and Parallel List Ranking We will first discuss a technique called.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 15: March 12, 2007 Interconnect 3: Richness.
Chapter 7 Performance Analysis. 2 Additional References Selim Akl, “Parallel Computation: Models and Methods”, Prentice Hall, 1997, Updated online version.
Parallel Routing Bruce, Chiu-Wing Sham. Overview Background Routing in parallel computers Routing in hypercube network –Bit-fixing routing algorithm –Randomized.
Introduction to Systems Architecture Kieran Mathieson.
CS294-6 Reconfigurable Computing Day 10 September 24, 1998 Interconnect Richness.
Models of Parallel Computation Advanced Algorithms & Data Structures Lecture Theme 12 Prof. Dr. Th. Ottmann Summer Semester 2006.
Chapter 4 Performance. Times User CPU time – Time that the CPU is executing the program System CPU time – time the CPU is executing OS routines for the.
1 Interconnects Shared address space and message passing computers can be constructed by connecting processors and memory unit using a variety of interconnection.
CCA Common Component Architecture Manoj Krishnan Pacific Northwest National Laboratory MCMD Programming and Implementation Issues.
DLS on Star (Single-level tree) Networks Background: A simple network model for DLS is the star network with a master-worker platform. It consists of a.
Querying Large Databases Rukmini Kaushik. Purpose Research for efficient algorithms and software architectures of query engines.
LogP and BSP models. LogP model Common MPP organization: complete machine connected by a network. LogP attempts to capture the characteristics of such.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 February 3, 2005 Session 7.
SOFTWARE DESIGN. INTRODUCTION There are 3 distinct types of activities in design 1.External design 2.Architectural design 3.Detailed design Architectural.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 6, 2003 Interconnect 3: Richness.
By: Nelson Webster. Algorithm Engineers Algorithm engineers study the effectiveness and efficiency of procedures of solving problems on a computer.
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
Numerical Algorithms Chapter 11.
EEE 431 Computational Methods in Electrodynamics
Auburn University
Potential for parallel computers/parallel programming
PERFORMANCE EVALUATIONS
OPERATING SYSTEMS CS 3502 Fall 2017
Distributed and Parallel Processing
Ioannis E. Venetis Department of Computer Engineering and Informatics
Computational Thinking, Problem-solving and Programming: General Principals IB Computer Science.
Parallel Programming By J. H. Wang May 2, 2017.
Lecture 2: Parallel computational models
Multiprocessor Cache Coherency
Multi-Processing in High Performance Computer Architecture:
Steven Whitham Jeremy Woods
CS 583 Fall 2006 Analysis of Algorithms
IP – Based Design Methodology
Computer Architecture Introduction to Data-Parallel architectures
A Cloud System for Machine Learning Exploiting a Parallel Array DBMS
Design Space Exploration
Logical architecture refinement
ESE534: Computer Organization
ESE534: Computer Organization
Algorithm An algorithm is a finite set of steps required to solve a problem. An algorithm must have following properties: Input: An algorithm must have.
Numerical Algorithms Quiz questions
CS 584.
Professor Ioana Banicescu CSE 8843
CS184a: Computer Architecture (Structures and Organization)
PERFORMANCE MEASURES. COMPUTATIONAL MODELS Equal Duration Model:  It is assumed that a given task can be divided into n equal subtasks, each of which.
Copyright © Cengage Learning. All rights reserved.
Automatic optimization of parallel linear algebra software
Potential for parallel computers/parallel programming
Potential for parallel computers/parallel programming
Parallel Speedup.
Potential for parallel computers/parallel programming
Potential for parallel computers/parallel programming
COMPUTER ORGANIZATION AND ARCHITECTURE
What Are Performance Counters?
CS 584 Lecture 5 Assignment. Due NOW!!.
Computer Architecture
Reinforcement Learning (2)
Presentation transcript:

COMPUTATIONAL MODELS

Architectural Models For Parallel Algorithm Design Different algorithms require different architectures to solve the some problem efficiently Need to design a new algorithm for each architecture? To avoid the complexity and cost of (software development) we need Universal abstract model for parallel architectures -parallel programs can be ported to specific computers -parallel programs can be written for the abstract model -software development cost is reduced -performance can be predicted since: parameters (characteristics) of the abstract model vs. those of specific architectures are known.

Universal model characteristics Sufficiently general to capture the most important properties of a large class of parallel computers Programs designed for this abstract model must execute efficiently on parallel computers. (1) and (2) are conflicting goals: Observation: the abstract model must make assumptions about the connectivity (degree of connectivity) among processors: {-linear arrays and meshes: low {-hypercubes : high (a) If the abstract model assumes low connectivity, Alg. Designed for it will be optimal for parallel architectures with those characteristics, and suboptimal for architectures with high connectivity (i.e. communication resources under-utilized)

(b) * similarly, in the case the abstract model assumes high connectivity… => regardless of the connectivity of the parallel model, it yields sub-optimal performance for some architectures. Argument against the existence of a universal abstract model. “No such model is currently known” => Today’s methodology in parallel algorithm design is: Design parallel algorithms in terms of basic data communication operations. *only implementation of those operations must be optimized for different parallel computers. *communication operations area a small set. Concurrency ----- vs. ------ data locality