CS 6290 Evaluation & Metrics. Performance Two common measures –Latency (how long to do X) Also called response time and execution time –Throughput (how.

Slides:



Advertisements
Similar presentations
Fall 2011 Prof. Hyesoon Kim. Instructor: Hyesoon Kim (KACB 2344) Homepage – –T-square.
Advertisements

Performance Evaluation of Architectures Vittorio Zaccaria.
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
1  1998 Morgan Kaufmann Publishers Chapter 2 Performance Text in blue is by N. Guydosh Updated 1/25/04*
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
Computer Performance CS350 Term Project-Spring 2001 Elizabeth Cramer Bryan Driskell Yassaman Shayesteh.
2-1 ECE 361 ECE C61 Computer Architecture Lecture 2 – performance Prof. Alok N. Choudhary
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.5 Comparing and Summarizing Performance.
Computer ArchitectureFall 2007 © September 17, 2007 Karem Sakallah CS-447– Computer Architecture.
Chapter 4 Assessing and Understanding Performance
1 Lecture 10: FP, Performance Metrics Today’s topics:  IEEE 754 representations  FP arithmetic  Evaluating a system Reminder: assignment 4 due in a.
CIS429/529 Winter 07 - Performance - 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Performance & Benchmarking. What Matters? Which airplane has best performance:
Computer Organization and Design Performance Montek Singh Mon, April 4, 2011 Lecture 13.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 2b: Performance Metrics. Performance Metrics Measurable characteristics of a computer system: Count of an event Duration of a time interval Size.
Memory/Storage Architecture Lab Computer Architecture Performance.
Recap Technology trends Cost/performance Measuring and Reporting Performance What does it mean to say “computer X is faster than computer Y”? E.g. Machine.
PerformanceCS510 Computer ArchitecturesLecture Lecture 3 Benchmarks and Performance Metrics Lecture 3 Benchmarks and Performance Metrics.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.
1 CS/COE0447 Computer Organization & Assembly Language CHAPTER 4 Assessing and Understanding Performance.
Computer Architecture
1 Seoul National University Performance. 2 Performance Example Seoul National University Sonata Boeing 727 Speed 100 km/h 1000km/h Seoul to Pusan 10 hours.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
L12 – Performance 1 Comp 411 Computer Performance He said, to speed things up we need to squeeze the clock Study
EGRE 426 Computer Organization and Design Chapter 4.
Performance 9 ways to fool the public Old Chapter 4 New Chapter 1.4.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
CS203 – Advanced Computer Architecture Performance Evaluation.
Lecture 3. Performance Prof. Taeweon Suh Computer Science & Engineering Korea University COSE222, COMP212, CYDF210 Computer Architecture.
Performance 9 ways to fool the public #1 – Reporting Results.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
CpE 442 Introduction to Computer Architecture The Role of Performance
Measuring Performance II and Logic Design
CS203 – Advanced Computer Architecture
CSCI206 - Computer Organization & Programming
Lecture 2: Performance Today’s topics:
Lecture 2: Performance Evaluation
Lecture 3: MIPS Instruction Set
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
September 2 Performance Read 3.1 through 3.4 for Tuesday
ECE 4100/6100 Advanced Computer Architecture Lecture 1 Performance
Defining Performance Which airplane has the best performance?
CSCE 212 Chapter 4: Assessing and Understanding Performance
CS2100 Computer Organisation
CSCI206 - Computer Organization & Programming
Performance of computer systems
Lecture 3: MIPS Instruction Set
Performance of computer systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Parameters that affect it How to improve it and by how much
Computer Organization and Design Chapter 4
Presentation transcript:

CS 6290 Evaluation & Metrics

Performance Two common measures –Latency (how long to do X) Also called response time and execution time –Throughput (how often can it do X) Example of car assembly line –Takes 6 hours to make a car (latency is 6 hours) –A car leaves every 5 minutes (throughput is 12 cars per hour) –Overlap results in Throughput > 1/Latency

Measuring Performance Peak (MIPS, MFLOPS) –Often not useful unachievable in practice, or unsustainable

Measuring Performance Benchmarks –Real applications and application suites E.g., SPEC CPU2000, SPEC2006, TPC-C, TPC-H –Kernels “Representative” parts of real applications Easier and quicker to set up and run Often not really representative of the entire app –Toy programs, synthetic benchmarks, etc. Not very useful for reporting Sometimes used to test/stress specific functions/features

SPEC CPU (integer) “Representative” applications keeps growing with time!

SPEC CPU (floating point)

Price-Performance

TPC Benchmarks Measure transaction-processing throughput Benchmarks for different scenarios –TPC-C: warehouses and sales transactions –TPC-H: ad-hoc decision support –TPC-W: web-based business transactions Difficult to set up and run on a simulator –Requires full OS support, a working DBMS –Long simulations to get stable results

Throughput-Server Perf/Cost High performance Very expensive!

CPU Performance Equation (1) Hardware Technology, Organization Organization, ISA ISA, Compiler Technology A.K.A. The “iron law” of performance

Car Analogy Need to drive from Klaus to CRC –“Clock Speed” = 3500 RPM –“CPI” = 5250 rotations/km or 0.19 m/rot –“Insts” = 800m 800 m 1 rotation 0.19 m 1 minute 3500 rotations = 1.2 minutes

CPU Version Program takes 33 billion instructions to run CPU processes insts at 2 cycles per inst Clock speed of 3GHz = 22 seconds Sometimes clock cycle time given instead (ex. cycle = 333 ps) IPC sometimes used instead of CPI

CPU Performance Equation (2) For each kind of instruction How many instructions of this kind are there in the program How many cycles it takes to execute an instruction of this kind

CPU performance w/ different instructions Instruction Type FrequencyCPI Integer40%1.0 Branch20%4.0 Load20%2.0 Store10%3.0 Total Insts = 50B, Clock speed = 2 GHz

Comparing Performance “X is n times faster than Y” “Throughput of X is n times that of Y”

If Only it Were That Simple “X is n times faster than Y on A” But what about different applications (or even parts of the same application) –X is 10 times faster than Y on A, and 1.5 times on B, but Y is 2 times faster than X on C, and 3 times on D, and… So does X have better performance than Y? Which would you buy?

Summarizing Performance Arithmetic mean –Average execution time –Gives more weight to longer-running programs Weighted arithmetic mean –More important programs can be emphasized –But what do we use as weights? –Different weight will make different machines look better

Speedup Machine AMachine B Program 15 sec4 sec Program 23 sec6 sec What is the speedup of A compared to B on Program 1? What is the speedup of A compared to B on Program 2? What is the average speedup? What is the speedup of A compared to B on Sum(Program1, Program2) ?

Normalizing & the Geometric Mean Speedup of arithmeitc means != arithmetic mean of speedup Use geometric mean: Neat property of the geometric mean: Consistent whatever the reference machine Do not use the arithmetic mean for normalized execution times

CPI/IPC Often when making comparisons in comp- arch studies: –Program (or set of) is the same for two CPUs –The clock speed is the same for two CPUs So we can just directly compare CPI’s and often we use IPC’s

Average CPI vs. “Average” IPC Average CPI =(CPI 1 + CPI 2 + … + CPI n )/n A.M. of IPC =(IPC 1 + IPC 2 + … + IPC n )/n Must use Harmonic Mean to remain  to runtime Not Equal to A.M. of CPI!!!

Harmonic Mean H.M.(x 1,x 2,x 3,…,x n ) = n … + 1 x 1 x 2 x 3 x n What in the world is this? –Average of inverse relationships

A.M.(CPI) vs. H.M.(IPC) “Average” IPC = 1 A.M.(CPI) = 1 CPI 1 + CPI 2 + CPI 3 + … + CPI n n n n n = n CPI 1 + CPI 2 + CPI 3 + … + CPI n = n … + 1 =H.M.(IPC) IPC 1 IPC 2 IPC 3 IPC n

Amdahl’s Law (1) What if enhancement does not enhance everything? Caution: fraction of What?

Amdahl’s Law (2) Make the Common Case Fast VS Important: Principle of locality Approx. 90% of the time spent in 10% of the code

Amdahl’s Law (3) Diminishing Returns Green PhaseBlue Phase Total Execution Time Generation 1 GreenBlue Total Execution Time Generation 2 Generation 3 Blue Total Execution Time over Generation 2 over Generation 1

Yet Another Car Analogy From GT to Mall of Georgia (35mi) –you’ve got a “Turbo” for your car, but can only use on highway Spaghetti Junction to Mall of GA (23mi) –avg. speed of 60mph –avg. speed of 120mph with Turbo GT to Spaghetti junction (12 mi) –stuck in bad rush hour traffic avg. speed of 5 mph Turbo gives 100% speedup across 66% of the distance… … but only results in <10% reduction on total trip time (which is a <11% speedup)

Now Consider Price-Performance Without Turbo –Car costs $8,000 to manufacture –Selling price is $12,000  $4K profit per car –If we sell 10,000 cars, that’s $40M in profit With Turbo –Car costs extra $3,000 –Selling price is $16,000  $5K profit per car –But only a few gear heads buy the car: We only sell 400 cars and make $2M in profit

CPU Design is Similar What does it cost me to add some performance enhancement? How much effective performance do I get out of it? –100% speedup for small fraction of time wasn’t a big win for the car example How much more do I have to charge for it? –Extra development, testing, marketing costs How much more can I charge for it? –Does the market even care? How does the price change affect volume?