DCS/2003/1 CENG 532 - Distributed Computing Systems Measures of Performance.

Slides:



Advertisements
Similar presentations
CSE 160 – Lecture 9 Speed-up, Amdahl’s Law, Gustafson’s Law, efficiency, basic performance metrics.
Advertisements

Prepared 7/28/2011 by T. O’Neil for 3460:677, Fall 2011, The University of Akron.
Distributed Systems CS
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
11Sahalu JunaiduICS 573: High Performance Computing5.1 Analytical Modeling of Parallel Programs Sources of Overhead in Parallel Programs Performance Metrics.
Example (1) Two computer systems have been tested using three benchmarks. Using the normalized ratio formula and the following tables below, find which.
CS 584. Logic The art of thinking and reasoning in strict accordance with the limitations and incapacities of the human misunderstanding. The basis of.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Parallel Programming with MPI and OpenMP Michael J. Quinn.
Arquitectura de Sistemas Paralelos e Distribuídos Paulo Marques Dep. Eng. Informática – Universidade de Coimbra Ago/ Quantitative.
Recap.
Lecture 5 Today’s Topics and Learning Objectives Quinn Chapter 7 Predict performance of parallel programs Understand barriers to higher performance.
Steve Lantz Computing and Information Science Parallel Performance Week 7 Lecture Notes.
Parallel Algorithms - Introduction Advanced Algorithms & Data Structures Lecture Theme 11 Prof. Dr. Th. Ottmann Summer Semester 2006.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
Computer Science 320 Measuring Speedup. What Is Running Time? T(N, K) says that the running time T is a function of the problem size N and the number.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 3 – Parallel Performance Theory - 1 Parallel Performance Theory - 1 Parallel Computing CIS 410/510 Department of Computer and Information Science.
Performance Evaluation of Parallel Processing. Why Performance?
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture.
“elbowing out” Processors used Speedup Efficiency timeexecution Parallel Processors timeexecution Sequential Efficiency   
Parallel ICA Algorithm and Modeling Hongtao Du March 25, 2004.
Performance Model & Tools Summary Hung-Hsun Su UPC Group, HCS lab 2/5/2004.
Amdahl's Law Validity of the single processor approach to achieving large scale computing capabilities Presented By: Mohinderpartap Salooja.
2015/10/14Part-I1 Introduction to Parallel Processing.
Performance Measurement n Assignment? n Timing #include double When() { struct timeval tp; gettimeofday(&tp, NULL); return((double)tp.tv_sec + (double)tp.tv_usec.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 February Session 6.
Performance Measurement. A Quantitative Basis for Design n Parallel programming is an optimization problem. n Must take into account several factors:
Compiled by Maria Ramila Jimenez
Parallel Processing Sharing the load. Inside a Processor Chip in Package Circuits Primarily Crystalline Silicon 1 mm – 25 mm on a side 100 million to.
Lecture 9 TTH 03:30AM-04:45PM Dr. Jianjun Hu CSCE569 Parallel Computing University of South Carolina Department of.
Scaling Area Under a Curve. Why do parallelism? Speedup – solve a problem faster. Accuracy – solve a problem better. Scaling – solve a bigger problem.
Lecture 9: 9/24/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.
From lecture slides for Computer Organization and Architecture: Designing for Performance, Eighth Edition, Prentice Hall, 2010 CS 211: Computer Architecture.
Computer Architecture
Lecture 8: 9/19/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
Parallel Programming with MPI and OpenMP
Classic Model of Parallel Processing
Performance Performance
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 May 2, 2006 Session 29.
Scaling Conway’s Game of Life. Why do parallelism? Speedup – solve a problem faster. Accuracy – solve a problem better. Scaling – solve a bigger problem.
INEL6067 Parallel Architectures Why build parallel machines? °To help build even bigger parallel machines °To help solve important problems °Speed – more.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Computer Science 320 Measuring Sizeup. Speedup vs Sizeup If we add more processors, we should be able to solve a problem of a given size faster If we.
CS203 – Advanced Computer Architecture Performance Evaluation.
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 February Session 7.
1 Potential for Parallel Computation Chapter 2 – Part 2 Jordan & Alaghband.
Classification of parallel computers Limitations of parallel processing.
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 February Session 11.
History of Computers and Performance David Monismith Jan. 14, 2015 Based on notes from Dr. Bill Siever and from the Patterson and Hennessy Text.
Distributed and Parallel Processing George Wells.
DCS/1 CENG Distributed Computing Systems Measures of Performance.
CS203 – Advanced Computer Architecture
Introduction to Parallel Processing
4- Performance Analysis of Parallel Programs
September 2 Performance Read 3.1 through 3.4 for Tuesday
Defining Performance Which airplane has the best performance?
Parallel Computers.
Chapter 3: Principles of Scalable Performance
You have a system that contains a special processor for doing floating-point operations. You have determined that 50% of your computations can use the.
Amdahl's law.
CS 584.
PERFORMANCE MEASURES. COMPUTATIONAL MODELS Equal Duration Model:  It is assumed that a given task can be divided into n equal subtasks, each of which.
Potential for parallel computers/parallel programming
Potential for parallel computers/parallel programming
Potential for parallel computers/parallel programming
Presentation transcript:

DCS/2003/1 CENG Distributed Computing Systems Measures of Performance

DCS/2003/2 Grosch’s Law-1960’s “ To sell a computer twıce as much, ıt must be four times as fast” It was Ok at the time, but soon it became meaningless After 1970, it was possible to make faster computers and sell even cheaper…. Ultimately the switching speeds rach a limit, the speed of the light on an integrated circuit…

DCS/2003/3 Von Neumann’s Bottleneck Serial single processor computer architectures followed John Von Neumann’s architecture of One processor, single control unit, single memory This is no more valid: Low cost parallel computers can easily deliver the performance of the fastest single processor computer…

DCS/2003/4 Amdahl’s Law; 1967 Let speedup (S) be ratio of serial time (one processor) to parallel time (N processors) S=T 1 /T N < 1/f Where f is the serial fraction of the problem, 1-f is the parallel fraction of the problem, then Tn= T 1 *f+T 1 (1-f)/N S=1/(f+(1-f)/N), thus s<1/f

DCS/2003/5 Amdahl’s Law; 1967 At f=0.10, Amdahl’ Law predicts, at best a tenfold speedup, which is very pessimistic This was soon broken, encouraged by Gordon Bell Prize!

DCS/2003/6 Gustafson-Barsis Law; 1988 The team of researchers of Sandia Labs (John Gustafson and Ed Barsis), using 1024 processor nCube/10, overthrew Amdahl’s Law, by achieving 1000 fold speedup with f to According to Amdahl’s Law, the speedup would have been from 125 to 250. The key point was that 1-f was not independent of N.

DCS/2003/7 Gustafson-Barsis Law; 1988 They interpreted the speedup formula, by scaling up the problem to fit the parallel machine: T 1 =f+(1-f)N T N =f+(1-f)=1, then the speedup can be computes as S=N-(N-1)f

DCS/2003/8 Extreme case analysis Assuming Amdahl’s Law, an upper and lower bound can be given for the speedup, under unrealistic assumptions: N/log 2 N <= S <= N where N is based on single processor logN is based on divide and conquer

DCS/2003/9 Inclusion of the communication time Some researchers (Gelenbe) suggests speedup to be approxımated by S=1/C ( N) where C(N) is some function of N For example, C(N) can be estimated as C (N) =A+Blog 2 N where A and B are constants determined by the communication mechanisms

DCS/2003/10 Benchmark Performance Benchmark ıs a program whose purpose is to measure a performance characteristic of a computer system, such as floating point speed, I/O speed, or for a restricted class of problems The benchmarks are arranged to be either Kernels of real applications, such as Linpacks, Livermore Loops, or Synthetic, approximating the behavior of the real problem, such as Whetstone and Wichmann…