1 Recap. 2 Measuring Performance  A computer user: response time (execution time).  A computer center manager - throughput - the total amount of work.

Slides:



Advertisements
Similar presentations
Performance What differences do we see in performance? Almost all computers operate correctly (within reason) Most computers implement useful operations.
Advertisements

Computer Abstractions and Technology
ENEE350 Ankur Srivastava University of Maryland, College Park Based on Slides from Mary Jane Irwin ( )
810:142 Lecture 2: Performance Fall 2006 Chapter 4: Performance Adapted from Mary Jane Irwin at Penn State University for Computer Organization and Design,
ECE-C355 Computer Structures Winter 2008 Chapter 04: Understanding Performance Slides are adapted from Professor Mary Jane Irwin (
Read Section 1.4, Section 1.7 (pp )
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Princess Sumaya Univ. Computer Engineering Dept. Chapter 4:
Lecture 7: 9/17/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
CPU Performance Evaluation: Cycles Per Instruction (CPI)
EECC550 - Shaaban #1 Lec # 3 Spring Computer Performance Evaluation: Cycles Per Instruction (CPI) Most computers run synchronously utilizing.
inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 36 – Performance Every 6 months (Nov/June), the fastest supercomputers.
EECC550 - Shaaban #1 Lec # 3 Winter CPU Performance Evaluation: Cycles Per Instruction (CPI) Most computers run synchronously utilizing.
Computer Performance Evaluation: Cycles Per Instruction (CPI)
CS/ECE 3330 Computer Architecture Chapter 1 Performance / Power.
Assessing and Understanding Performance B. Ramamurthy Chapter 4.
Chapter 4 Assessing and Understanding Performance
Lecture 3: Computer Performance
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Chapter 1 Section 1.4 Dr. Iyad F. Jafar Evaluating Performance.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Where Has This Performance Improvement Come From? Technology –More transistors per chip –Faster logic Machine Organization/Implementation –Deeper pipelines.
CSE 340 Computer Architecture Summer 2014 Understanding Performance
Computer Performance Computer Engineering Department.
inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 38 – Performance Every 6 months (Nov/June), the fastest supercomputers.
BİL 221 Bilgisayar Yapısı Lab. – 1: Benchmarking.
CSCI-365 Computer Organization Lecture Note: Some slides and/or pictures in the following are adapted from: Computer Organization and Design, Patterson.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
Morgan Kaufmann Publishers
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
4. Performance 4.1 Introduction 4.2 CPU Performance and Its Factors
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
IT 251 Computer Organization and Architecture Performance Chia-Chi Teng.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
CSE 340 Computer Architecture Summer 2016 Understanding Performance.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
Measuring Performance and Benchmarks Instructor: Dr. Mike Turi Department of Computer Science and Computer Engineering Pacific Lutheran University Lecture.
Computer Architecture & Operations I
Morgan Kaufmann Publishers Technology Trends and Performance
Measuring Performance II and Logic Design
CSCI206 - Computer Organization & Programming
Lecture 2: Performance Evaluation
Computer Architecture & Operations I
CS161 – Design and Architecture of Computer Systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Assessing and Understanding Performance
Defining Performance Which airplane has the best performance?
Morgan Kaufmann Publishers
CSCE 212 Chapter 4: Assessing and Understanding Performance
Chapter 1 Computer Abstractions & Technology Performance Evaluation
CSCI206 - Computer Organization & Programming
CMSC 611: Advanced Computer Architecture
Performance of computer systems
Performance ICS 233 Computer Architecture and Assembly Language
August 30, 2000 Prof. John Kubiatowicz
Performance of computer systems
CMSC 611: Advanced Computer Architecture
COMS 361 Computer Organization
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Benchmarks Programs specifically chosen to measure performance
CS161 – Design and Architecture of Computer Systems
Presentation transcript:

1 Recap

2 Measuring Performance  A computer user: response time (execution time).  A computer center manager - throughput - the total amount of work done in a period of time.  CPU time : a very good and fair measure of performance.  CPU time can also be divided into user CPU time (program) and system CPU time (OS).

3 Aspects of CPU Execution Time CPU Time = Instruction count x CPI x Clock cycle Instruction Count I ClockCycle C CPI Depends on: CPU Organization Technology Depends on: Program Used Compiler ISA CPU Organization Depends on: Program Used Compiler ISA

4 Factors Affecting CPU Performance CPU time= Seconds= Instructions x Cycles x Seconds Program Program Instruction Cycle CPU time= Seconds= Instructions x Cycles x Seconds Program Program Instruction Cycle CPI C Clock Cycle C Instruction I Count I Program Compiler Organization Technology Instruction Set Architecture (ISA) X X X X X X X X X

5 Example: tradeoff between C and CPI Assume stores can execute in 1 cycle by slowing clock 15% Should this be implemented? OpFrequencyCycle Count ALU ops43%1 Loads21%1 Stores12%2 Branches24%2

6 Simple Example Old CPI = 0.43 x x x x 2 = 1.36 New CPI = 0.43 x x x x 2 = 1.24 Speedup = old time/new time –= {I x old CPI x C}/{I x new CPI x 1.15 C} –= 1.36 / (1.24 x 1.15) = 0.95 Answer: Don’t make the change

7 Some Caveats Inter-dependence of I, CPI, and C: Improvement In One May Impact Another –increasing pipeline depth tends to increase clock speed but may increase CPI –Change in ISA to reduce instruction count may require a design with slower clock => May Not Improve Performance –CPI depends on instruction mix => Smaller Instruction Count May Not Improve Performance

8 Code Size & Performance

9 Benchmarks and Benchmarking In lack of a universal task pick some programs that represent common tasks Use representative programs to compare performance of systems: CAUTIONS: –Comparisons are as good as the benchmarks are in representing your real workload. –Many parameters affect measured performance

10 Example: We must use the same compiler Compiler “enhancements” and performance  1998 Morgan Kaufmann Publishers

11 Benchmark Suites A Suite Is a Collection of Representative Benchmarks From Different Application Domains Weakness of Any One Benchmark Likely to Be Compensated By Another Standard Performance Evaluation Corporation (SPEC) –Most Popular Benchmark Suite –Suite Consists of Kernels, Small Fragments, Large Applications –SPEC2006: CINT2006, CFP2006 – Benchmark suites for servers –SPECSFS: measures performance of File servers –SPECWeb: measurers performance of Web servers

12 SPEC CPU2006 Programs Benchmark Language Descriptions 400.PerlbenchC Programming Language 401.bzip2 C Compression 403.GccCC Compiler 429.mcf C Combinatorial Optimization 445.gobmk C Artificial Intelligence: Go 456.HmmerC Search Gene Sequence 458.sjeng C Artificial Intelligence: chess 462.libquantum CPhysics / Quantum Computing 464.h264refCVideo Compression 471.omnetpp C++Discrete Event Simulation 473.astar C++Path-finding Algorithms 483.xalancbmk C++XML Processing CINT2006 (Integer) Source:

13 SPEC CPU2006 Programs Benchmark Language Descriptions 410.BwavesFortran Fluid Dynamics 416.GamessFortran Quantum Chemistry 433.MilcC Physics / Quantum Chromodynamics 434.ZeusmpFortran Physics / CFD 435.GromacsC, FortranBiochemistry / Molecular Dynamics 436.cactusADMC, FortranPhysics / General 437.leslie3dFortranFluid Dynamics 444.NamdC++Biology / Molecular Dynamics 447.dealIIC++ Finite Element Analysis 450.SoplexC++ Linear Programming, Optimization 453.PovrayC++ Image Ray-tracing 454.CalculixC, FortranStructural Mechanics 459.GemsFDTDFortran Computational Electromagnetics 465.TontoFortranQuantum Chemistry 470.LbmCFluid Dynamics 481.WrfC, FortranWeather 482.sphinx3CSpeech CFP2006 (Floating Point) Source:

14 Top 20 SPEC CPU2006 Results (As of August 2007) # MHz Processor int peak int baseMHz Processor fp peak fp base 13000Core 2 Duo E POWER POWER Core 2 Duo E Xeon Dual-Core Itanium Xeon X Dual-Core Itanium Core 2 Duo E Core 2 Duo E Core 2 Duo E Xeon Core 2 Quad Q Opteron Xeon X Core 2 Duo E Xeon Opteron Xeon X Xeon Xeon X Xeon X Core Core 2 Quad Q Core 2 Quad Q Xeon X Core 2 Duo X Core 2 Extreme X Xeon Core 2 Quad Q Core 2 Duo T Dual-Core Itanium Xeon E Xeon Xeon Xeon E Xeon Opteron Xeon X Xeon X Source: Top 20 SPECfp2006 Top 20 SPECint2006

15 Performance Evaluation Using Benchmarks “For better or worse, benchmarks shape a field” Good products created when we have: –Good benchmarks –Good ways to summarize performance Given sales depend in big part on performance relative to competition, there is big investment in improving products as reported by performance summary If benchmarks inadequate, then choose between improving product for real programs vs. improving product to get more sales; Sales almost always wins!

16 How to Summarize Performance