1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.

Slides:



Advertisements
Similar presentations
Performance What differences do we see in performance? Almost all computers operate correctly (within reason) Most computers implement useful operations.
Advertisements

CS2100 Computer Organisation Performance (AY2014/2015) Semester 2.
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Evaluating Performance
100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
CS 6290 Evaluation & Metrics. Performance Two common measures –Latency (how long to do X) Also called response time and execution time –Throughput (how.
Computer Organization and Architecture 18 th March, 2008.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon., Sep 5, 2005 Lecture 2.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
Performance D. A. Patterson and J. L. Hennessey, Computer Organization & Design: The Hardware Software Interface, Morgan Kauffman, second edition 1998.
1 Lecture 11: Digital Design Today’s topics:  Evaluating a system  Intro to boolean functions.
Chapter 4 Assessing and Understanding Performance
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Sep 3, 2003 Lecture 2.
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
1 Lecture 10: FP, Performance Metrics Today’s topics:  IEEE 754 representations  FP arithmetic  Evaluating a system Reminder: assignment 4 due in a.
CIS429/529 Winter 07 - Performance - 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
1 Measuring Performance Chris Clack B261 Systems Architecture.
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Computer Organization and Design Performance Montek Singh Mon, April 4, 2011 Lecture 13.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 2: Computer Performance
Ch4b- 2 EE/CS/CPE Computer Organization  Seattle Pacific University Performance metrics I’m concerned with how long it takes to run my program.
Lecture 2b: Performance Metrics. Performance Metrics Measurable characteristics of a computer system: Count of an event Duration of a time interval Size.
Copyright 1995 by Coherence LTD., all rights reserved (Revised: Oct 97 by Rafi Lohev, Oct 99 by Yair Wiseman, Sep 04 Oren Kapah) IBM י ב מ 7-1 Measuring.
Memory/Storage Architecture Lab Computer Architecture Performance.
June 2005Computer Architecture, Background and MotivationSlide 1 Part I Background and Motivation.
1 CHAPTER 2 THE ROLE OF PERFORMANCE. 2 Performance Measure, Report, and Summarize Make intelligent choices Why is some hardware better than others for.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
From lecture slides for Computer Organization and Architecture: Designing for Performance, Eighth Edition, Prentice Hall, 2010 CS 211: Computer Architecture.
1 CS/COE0447 Computer Organization & Assembly Language CHAPTER 4 Assessing and Understanding Performance.
1 Seoul National University Performance. 2 Performance Example Seoul National University Sonata Boeing 727 Speed 100 km/h 1000km/h Seoul to Pusan 10 hours.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
CS /02 Semester II Help Session IIA Performance Measures Colin Tan S
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
4. Performance 4.1 Introduction 4.2 CPU Performance and Its Factors
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
L12 – Performance 1 Comp 411 Computer Performance He said, to speed things up we need to squeeze the clock Study
EGRE 426 Computer Organization and Design Chapter 4.
Computer Engineering Rabie A. Ramadan Lecture 2. Table of Contents 2 Architecture Development and Styles Performance Measures Amdahl’s Law.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
CSE 340 Computer Architecture Summer 2016 Understanding Performance.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
Performance. Moore's Law Moore's Law Related Curves.
Measuring Performance II and Logic Design
Lecture 2: Performance Evaluation
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
September 2 Performance Read 3.1 through 3.4 for Tuesday
ECE 4100/6100 Advanced Computer Architecture Lecture 1 Performance
Defining Performance Which airplane has the best performance?
CSCE 212 Chapter 4: Assessing and Understanding Performance
Performance Cycle time of a computer CPU speed speed = 1 / cycle time
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Parameters that affect it How to improve it and by how much
Performance Measurement and Analysis
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
Computer Organization and Design Chapter 4
Presentation transcript:

1

2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft or are averages of cited range of values. 4.1 Performance and Cost/performance

3 Figure 4.1 Performance improvement as a function of cost.

4 4.2 Defining Computer Performance Performance = 1 / Execution time Throughput: amount of work performed per unit time. It can be measured as the number of processes per unit time. Tournaround time: the average time from the moment that a job is submitted until the moment it is completed. It measures how long the average user has to wait for output. Response time: In an interactive systems, the time from when a user press an Enter or clicks a mouse until the system delivers a final response. To filter out variable factor (e.g., scheduling, interrupts, I/O delay) Performance = 1 / CPU Execution time

5 Figure 4.2 Pipeline analogy shows that imbalance between processing power and I/O capabilities leads to a performance bottleneck.

6 CPU execution time = Instructions × (Cycles per instruction ) × (seconds per cycle) = Instructions × CPI / (Clock rate) (CPI: cycles per instruction) Performance comparison (Performance of M1) / (Performance of M2) = (Execution time of M2) / (Execution time of M1)

7 4.3 Performance Enhancement and Amdahl’s Law Amdahl’s law s = 1 / (f+(1-f) / p) ≤ min (p, 1/f) f: time for instructions that cannot be parallelized. p: speed-up (by parallel computer or redesign CPU or algorithm) s: overall speedup Study Example 4.1

8 Figure 4.4 Amdahl’s law: speedup achieved if a fraction f of a task is unaffected and the remaining 1 – f part runs p times as fast.

9 Figure 4.5 Running times of six programs on three machines. 4.4 Performance Measurement vs Modeling

10 Table 4.2 Summary of SPEC CPU2000 benchmark suite characteristics. Benchmarks: real or synthetic programs that are selected for comparative evaluation of machine performance. SPEC: Standard Performance Evaluation Corporation

11 Figure 4.6 Example graphical depiction of SPEC benchmark results. Study Example 4.3

12 Table 4.3 Usage frequency, in percentage, for various instruction classes in four representative applications. Performance Estimation System’s peak performance: expressed in instructions per second. (MIPS, MFLOPS)

13 Example 4.4 CPI and IPS calculation Solution a.For M1, assume all instructions are class I instructions, Peak performance of M1 = 1 / (Avg. CPI × Clock time) = 600 / 2.0 = 300MIPS Notice: Units for Average CPI and clock time are second. For M2, assume all instructions are class N instructions, Peak performance of M2 = 1 / (Avg. CPI × Clock time) = 500 / 2.0 = 250MIPS

14 b. Average CPI for M1=5.0× × ×0.5=2.95 Average CPI for M2=4.0× × ×0.5=2.95 c. 1. Average CPI=2.5× × ×0.5=2.325 MIPS for option 1 = 600/2.325 = Average CPI=5.0× × ×0.5=2.75 MIPS for option 1 = 600/2.75 = MIPS for option 3 = 750/2.95=254. Conclusion: Option 1 has the greatest impact d. With larger cache, cache miss rate is reduced 2% (from 5% to 3%), that is all CPIs are reduced 10×2%=0.2ns (cache miss imposes 10 cycle penalty) Average CPI M1=( )×0.25+( )×0.25+( )×0.5=2.75 This option is comparable to option 2 in c. e. Average CPI for M1= 5.0×x+2.0×y+2.4×(1-x-y)=2.6x-0.4y+2.4 Average CPI for M2= 4.0×x+3.8×y+2.0×(1-x-y)=2x+1.8y+2 We need 600/(2.6x-0.4y+2.4) > 500/(2x+1.8y+2) => 2.56y > 0.2x That is, x/y < 12.8, M1 runs faster than M2 for the given task.

15 Example 4.5 MIPS rating can be misleading a.Runtime for the output of compiler 1= (600M+400M)/10 9 = 1.4s Runtime for the output of compiler 2 = (400M+400M)/10 9 = 1.2s Compiler 2 is faster. b. Code produced by compiler 2 is 1.4/1.2= 1.17 times as faster as that of compiler 1. c. Average CPI for compiler 1 = (600M×1+400M×2)/1000M=1.4 Average CPI for compiler 2 = (400M×1+400M×2)/800M=1.5 MIPS rating of compiler 1=1000/1.4=714 MIPS rating of compiler 2=1000/1.5=667 Compiler 1 is faster

16 Table 4.4 Measured or estimated execution times for three programs. 4.5 Reporting Computer Performance Wrong method (arithmetic mean) Speedup of Y over X=( )/3=6.7(1) Speedup of X over y=( )/3=3.3 (contradictory with (1)) Total time comparison: correct if they are run the same number of times. Geometric mean: Speedup of Y over X=(0.1×10.0×10.0) 1/3 =2.15(2) Speedup of X over y=(10.0×0.1×0.10) 1/3 =0.46 (consistent with (2))

17 Example 4.6 Table 4.3 Usage frequency, in percentage, for various instruction classes in four representative applications.

18 Answer: a. CPI for data compression application on M1=0.25× × ×1.2+0× × ×2.0 =2.31 CPI for data compression application on M2=2.54 CPI for nuclear reactor simulation application on M1=3.94 CPI for nuclear reactor simulation application on M2=2.89 b. Because the programs and clock rates are the same, speedup ratios is given by the ratio of CPIs. Data compression performance speed up (M2/M1)= 2.31/2.54 = 0.91 nuclear reactor simulation performance speed up (M2/M1)= 3.94/2.89 = c. Overall performance advantage of M2 over M1 is (0.91×1.36) 1/2 =1.11

19 Figure 4.7 Exponential growth of supercomputer performance [Bell92]. 4.6 The Quest for High Performance

20 Figure 4.8 Milestones in the Accelerated Strategic Computing Initiative (ASCI) program, sponsored by the U.S. Department of Energy, with extrapolation up to the PFLOPS level.

21 Problem 4.5

22 Problem 4.12