1 Measuring and Discussing Computer System Performance or “My computer is faster than your computer” Reading: 2.4, 3.1-3.5. Peer Instruction Lecture Materials.

Slides:



Advertisements
Similar presentations
Pipeline Summary Try to put everything together for pipelines Before going onto caches. Peer Instruction Lecture Materials for Computer Architecture by.
Advertisements

Performance What differences do we see in performance? Almost all computers operate correctly (within reason) Most computers implement useful operations.
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
1  1998 Morgan Kaufmann Publishers Chapter 2 Performance Text in blue is by N. Guydosh Updated 1/25/04*
Evaluating Performance
100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
Computer Organization and Architecture 18 th March, 2008.
Kevin Walsh CS 3410, Spring 2010 Computer Science Cornell University Performance See: P&H 1.4.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
Chapter 4 Assessing and Understanding Performance Bo Cheng.
Performance D. A. Patterson and J. L. Hennessey, Computer Organization & Design: The Hardware Software Interface, Morgan Kauffman, second edition 1998.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.5 Comparing and Summarizing Performance.
Computer Architecture Lecture 2 Instruction Set Principles.
Chapter 4 Assessing and Understanding Performance
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
CMSC 611: Advanced Computer Architecture Performance Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Computer Organization and Design Performance Montek Singh Mon, April 4, 2011 Lecture 13.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 2: Computer Performance
Copyright 1995 by Coherence LTD., all rights reserved (Revised: Oct 97 by Rafi Lohev, Oct 99 by Yair Wiseman, Sep 04 Oren Kapah) IBM י ב מ 7-1 Measuring.
Recap Technology trends Cost/performance Measuring and Reporting Performance What does it mean to say “computer X is faster than computer Y”? E.g. Machine.
1 CHAPTER 2 THE ROLE OF PERFORMANCE. 2 Performance Measure, Report, and Summarize Make intelligent choices Why is some hardware better than others for.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
Performance.
1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.
From lecture slides for Computer Organization and Architecture: Designing for Performance, Eighth Edition, Prentice Hall, 2010 CS 211: Computer Architecture.
1 CS/COE0447 Computer Organization & Assembly Language CHAPTER 4 Assessing and Understanding Performance.
Computer Architecture
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
L12 – Performance 1 Comp 411 Computer Performance He said, to speed things up we need to squeeze the clock Study
EGRE 426 Computer Organization and Design Chapter 4.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
Computer Architecture CSE 3322 Web Site crystal.uta.edu/~jpatters/cse3322 Send to Pramod Kumar, with the names and s.
Advanced Pipelining 7.1 – 7.5. Peer Instruction Lecture Materials for Computer Architecture by Dr. Leo Porter is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
Computer Organization CS345 David Monismith Based upon notes by Dr. Bill Siever and from the Patterson and Hennessy Text.
Performance. Moore's Law Moore's Law Related Curves.
Measuring Performance II and Logic Design
Lecture 2: Performance Evaluation
4- Performance Analysis of Parallel Programs
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
September 2 Performance Read 3.1 through 3.4 for Tuesday
ECE 4100/6100 Advanced Computer Architecture Lecture 1 Performance
Defining Performance Which airplane has the best performance?
CSCE 212 Chapter 4: Assessing and Understanding Performance
CMSC 611: Advanced Computer Architecture
Performance Cycle time of a computer CPU speed speed = 1 / cycle time
Performance of computer systems
CMSC 611: Advanced Computer Architecture
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
January 25 Did you get mail from Chun-Fa about assignment grades?
Performance.
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
Computer Organization and Design Chapter 4
Presentation transcript:

1 Measuring and Discussing Computer System Performance or “My computer is faster than your computer” Reading: 2.4, Peer Instruction Lecture Materials for Computer Architecture by Dr. Leo Porter is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Dr. Leo Porter Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License

2 Performance Metrics 1. Network Bandwidth (data/sec) 2. Network Latency (ms) 3. Frame Rate (frames/sec) 4. Throughput (ops/sec) Domains SelectionOnline WoW Crysis (FPS) Torrent Download Google Server Farm A4312 B4132 C2134 D2314 ENone of the above Match (Best) Performance Metric to Domain Jack’s car buying analogy, we care about many…. Prep with explanation of metrics and domains.

3 Measures of “Performance” Execution Time Frame Rate Throughput (operations/time) Responsiveness Performance / Cost Performance / Power Performance / Power^2

4 Recall our O(n) discussion Much of computer science focuses on execution time – and much of our class will as well. Ultimately, much of what we do we want fast (response time). So is time a reasonable metric? People sit in front of computers / iphones / etc. User time/ clock time/ CPU time CPU Time for this class – for now.

5 All Together Now CPU Execution Time Instruction Count CPIClock Cycle Time = XX

6 All Together Now CPU Execution Time Instruction Count CPIClock Cycle Time = XX instructions cycles/instruction seconds/cycle seconds

7 IC = 1 billion, 500 MHz processor, execution time of 3 seconds. What is the CPI for this program? CPU Execution Time Instruction Count CPIClock Cycle Time = XX Selectio n CPI A3 B15 C1.5 D15*10^9 ENone of the above 3 sec = 1*10^9 inst*CPI * 1sec/(5*10^8)cycles 1.5*10^9 cycles = 10^9insts*CPI 1.5 = CPI

8 Who Affects Performance? There are a number of people involved in processor / programming design Each of these elements of the performance equation can be impacted by different designer(s) Next slides will be about who can impact what. We’ll do speed voting (1 min ind, 1 min group) then discuss each slide. CPU Execution Time Instruction Count CPIClock Cycle Time = XX Individual only IC CT

9 Who Affects Performance? What can a programmer influence? CPU Execution Time Instruction Count CPIClock Cycle Time = XX 1 min ind / 1 min group SelectionImpacts AIC BIC, CPI CIC, CPI, and CT DIC and CT ENone of the above IC CT

10 Who Affects Performance? What can a compiler influence? CPU Execution Time Instruction Count CPIClock Cycle Time = XX SelectionImpacts AIC BIC, CPI CIC, CPI, and CT DCPI and CT ENone of the above IC CT 1 min ind / 1 min group

11 Who Affects Performance? What can an instruction set architect influence? CPU Execution Time Instruction Count CPIClock Cycle Time = XX SelectionImpacts AIC BIC, CPI CIC, CPI, and CT DCPI and CT ENone of the above IC CT 1 min ind / 1 min group

12 Who Affects Performance? What can an hardware designer influence? CPU Execution Time Instruction Count CPIClock Cycle Time = XX SelectionImpacts AIC BIC, CPI CIC, CPI, and CT DCPI and CT ENone of the above IC CT 1 min ind / 1 min group

13 Performance Variation CPU Execution Time Instruction Count CPIClock Cycle Time = XX Same Diff Same Diff Same Diff Sele ction Row(s) Correct A1 B1 and 3 C2 D2 and 3 ENone of the above ROW DIFF

14 Other Performance Metrics Time is useful – but how might we try to measure the “performance” of a machine -MIPS -MFLOPS

15 MIPS MIPS = Millions of Instructions Per Second = Instruction Count Execution Time * 10 6 = Clock rate CPI * 10 6 program-independent deceptive Just crank up clock rate and have it execute tons of noops. But we need to sell processors, what do we market?

16 Trying to market the “performance” of a processor. “Speed Demons” vs. “Brainiacs” If we can’t use something like raw processor speed (CT), if we want CPI - we need to look at performance on benchmarks Intel vs. Alpha, Intel wins… but ends up remarketing.

17 Benchmarks - Which Programs? peak throughput measures (simple programs)?

18 Benchmarks - Which Programs? peak throughput measures (simple programs)? synthetic benchmarks (whetstone, dhrystone,...)?

19 Benchmarks - Which Programs? peak throughput measures (simple programs)? synthetic benchmarks (whetstone, dhrystone,...)? Real applications

20 Benchmarks - Which Programs? peak throughput measures (simple programs)? synthetic benchmarks (whetstone, dhrystone,...)? Real applications SPEC (best of both worlds, but with problems of their own) -System Performance Evaluation Cooperative -Provides a common set of real applications along with strict guidelines for how to run them. -provides a relatively unbiased means to compare machines.

21 Danger in Benchmark-Specific Performance Measures measures compiler as much as architecture!

22 SPEC Performance on Pentium III and Pentium 4 Focus on clock rate relative to change in INT vs. FP performance. SSE2 on P3 was a FP stack, P4 had independent registers

23 Speedup Often want to compare performance of one machine against another Performance = 1 Execution Time Speedup (A over B) = Performance A Performance B Speedup (A over B) = ET B ET A

24 Amdahl’s Law Execution time after improvement = Execution Time Affected Amount of Improvement Execution Time Unaffected+

25 Amdahl’s Law and Parallelism Our program is 90% parallelizable (segment of code executable in parallel on multiple cores) and runs in 100 seconds with a single core. What is the execution time if you use 4 cores (assume no overhead for parallelization)? Selecti on Execution Time A25 seconds B32.5 seconds C50 seconds D92.5 seconds ENone of the above ISOMORPHIC Execution time after improvement = Execution Time Affected Amount of Improvement Execution Time Unaffected+

26 Amdahl’s Law and Parallelism Our program is 90% parallelizable (segment of code executable in parallel on multiple cores) and runs in 100 seconds with a single core. What is the execution time if you use 2 cores (assume no overhead for parallelization)? Selecti on Execution Time A55 seconds B50 seconds C100 seconds D95 seconds ENone of the above ISOMORPHIC Execution time after improvement = Execution Time Affected Amount of Improvement Execution Time Unaffected+

27 Amdahl’s Law So what does Amdalh’s Law mean at a high level? Selectio n “BEST” message from Amdahl’s Law AParallel programming is critical for improving performance BImproving serial code execution is ultimately the most important goal. CPerformance is strictly tied to the ability to determine which percentage of code is parallelizable. DThe impact of a performance improvement is limited by the percent of execution time affected by the improvement ENone of the above

28 Point out Phenom II x4 and x2 get same performance – but one has 4 cores the other 2. What does this tell us? (Note – with some slower speeds of the Phenom this isn’t the case – 4 cores help.)

29 Speedup vs. Sizeup Speedup runs into problems for parallelization because of diminishing returns and dominance of serial execution. What if time were a constant? Human perception (graphics) Earthquake prediction Weather prediction

30 Key Points Be careful how you specify “performance” Execution time = IC * CPI * CT Use real applications, if possible Use standards, if possible Make the common case fast