11 Measuring performance Kosarev Nikolay MIPT Feb, 2010.

Slides:



Advertisements
Similar presentations
1 Lecture 2: Metrics to Evaluate Performance Topics: Benchmark suites, Performance equation, Summarizing performance with AM, GM, HM Video 1: Using AM.
Advertisements

CMSC 611: Advanced Computer Architecture Performance Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Evaluating Performance
100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
Computer Performance CS350 Term Project-Spring 2001 Elizabeth Cramer Bryan Driskell Yassaman Shayesteh.
Lecture 2c: Benchmarks. Benchmarking Benchmark is a program that is run on a computer to measure its performance and compare it with other machines Best.
Chapter 4 M. Keshtgary Spring 91 Type of Workloads.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
Chapter 4 Assessing and Understanding Performance Bo Cheng.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.5 Comparing and Summarizing Performance.
Assessing and Understanding Performance B. Ramamurthy Chapter 4.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.1 Introduction.
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Sep 3, 2003 Lecture 2.
1 Lecture 10: FP, Performance Metrics Today’s topics:  IEEE 754 representations  FP arithmetic  Evaluating a system Reminder: assignment 4 due in a.
CIS429/529 Winter 07 - Performance - 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two.
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
Using Standard Industry Benchmarks Chapter 7 CSE807.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Performance & Benchmarking. What Matters? Which airplane has best performance:
1 Computer Performance: Metrics, Measurement, & Evaluation.
Where Has This Performance Improvement Come From? Technology –More transistors per chip –Faster logic Machine Organization/Implementation –Deeper pipelines.
Lecture 2: Computer Performance
Computer Performance Computer Engineering Department.
Lecture 2b: Performance Metrics. Performance Metrics Measurable characteristics of a computer system: Count of an event Duration of a time interval Size.
Copyright 1995 by Coherence LTD., all rights reserved (Revised: Oct 97 by Rafi Lohev, Oct 99 by Yair Wiseman, Sep 04 Oren Kapah) IBM י ב מ 7-1 Measuring.
BİL 221 Bilgisayar Yapısı Lab. – 1: Benchmarking.
Memory/Storage Architecture Lab Computer Architecture Performance.
Recap Technology trends Cost/performance Measuring and Reporting Performance What does it mean to say “computer X is faster than computer Y”? E.g. Machine.
1 CHAPTER 2 THE ROLE OF PERFORMANCE. 2 Performance Measure, Report, and Summarize Make intelligent choices Why is some hardware better than others for.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
PerformanceCS510 Computer ArchitecturesLecture Lecture 3 Benchmarks and Performance Metrics Lecture 3 Benchmarks and Performance Metrics.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
Performance.
Lecture 9: 9/24/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.
From lecture slides for Computer Organization and Architecture: Designing for Performance, Eighth Edition, Prentice Hall, 2010 CS 211: Computer Architecture.
1 CS/COE0447 Computer Organization & Assembly Language CHAPTER 4 Assessing and Understanding Performance.
Computer Architecture
1 Seoul National University Performance. 2 Performance Example Seoul National University Sonata Boeing 727 Speed 100 km/h 1000km/h Seoul to Pusan 10 hours.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
Cost and Performance.
CSE2021: Computer Organization Instructor: Dr. Amir Asif Department of Computer Science York University Handout # 2: Measuring Performance Topics: 1. Performance:
Performance Performance
4. Performance 4.1 Introduction 4.2 CPU Performance and Its Factors
1 Lecture: Metrics to Evaluate Performance Topics: Benchmark suites, Performance equation, Summarizing performance with AM, GM, HM  Video 1: Using AM.
Lecture 5: 9/10/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
Performance Analysis Topics Measuring performance of systems Reasoning about performance Amdahl’s law Systems I.
EGRE 426 Computer Organization and Design Chapter 4.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
CS203 – Advanced Computer Architecture Performance Evaluation.
Lecture 3. Performance Prof. Taeweon Suh Computer Science & Engineering Korea University COSE222, COMP212, CYDF210 Computer Architecture.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
CS203 – Advanced Computer Architecture
Lecture 2: Performance Evaluation
CS161 – Design and Architecture of Computer Systems
4- Performance Analysis of Parallel Programs
Execution time Execution Time (processor-related) = IC x CPI x T
CSCE 212 Chapter 4: Assessing and Understanding Performance
Computer Architecture
CMSC 611: Advanced Computer Architecture
Performance of computer systems
Performance of computer systems
CMSC 611: Advanced Computer Architecture
Performance of computer systems
Execution time Execution Time (processor-related) = IC x CPI x T
Computer Organization and Design Chapter 4
Presentation transcript:

11 Measuring performance Kosarev Nikolay MIPT Feb, 2010

22 Agenda Performance measures Benchmarks Summarizing results

33 Performance measures Time to perform an individual operation The first metric. Used if most instructions take the same execution time. Instruction mix Idea is to categorize all instructions into classes by cycles required to execute an instruction. Average instruction execution time is calculated (IPC if measured in cycles). Gibson instruction mix [1970]. Proposed weights for a set of predefined instruction classes (based on programs running on IBM 704 and 650) Depends on the program executed, instruction set. Could be optimized by compiler. Ignores major performance impacts (memory hierarchy etc.)

44 Performance measures (cont.) MIPS (millions of instructions per second) Depends on instruction set (the heart of the differences between RISC and CISC). Relative MIPS. DEC VAX-11/780 (1 MIPS computer, reference machine). Relative MIPS of machine M for predefined benchmark: MFLOPS (millions of floating-point operations per second) Metric for supercomputers, tries but not corrects the primary MIPS shortcoming

55 Performance measures (cont.) Execution time Ultimate measure of performance for a given application, consistent across systems. Total execution time (elapsed time). Includes system-overhead effects (I/O operation, memory paging, time-sharing load, etc). CPU time. Time spent for execution of application only by microprocessor. Better to report both measures for the end user.

66 Benchmarks Program kernels Small programs extracted from real applications. E.g. Livermore Fortran Kernels (LFK) [1986]. Don’t stress memory hierarchy in a realistic fashion, ignore operating system. Toy programs Real applications but too small to characterize programs that are likely to be executed by the users of a system. E.g. quicksort. Synthetic benchmarks Artificial programs, try to match profile and behavior of real application. E.g. Whetstone [1976], Dhrystone [1984]. Ignore interactions between instructions (due to new ordering) that lead to pipeline stalls, change of memory locality.

77 Benchmarks (cont.) SPEC SPEC (Standard Performance Evaluation Corporation) Benchmark suites consist of real programs modified to be portable and to minimize the effect of I/O activities on performance 5 SPEC generations: SPEC89, SPEC92, SPEC95, SPEC2000 and SPEC2006 (used to measure desktop and server CPU performance) Benchmarks organized in two suites: CINT and CFP 2 derived metrics: SPECratio and SPECrate SPECSFS, SPECWeb (file server and web server benchmarks) measure performance of I/O activities (from disk or network traffic) as well as the CPU

88 Benchmarks (cont.)

99 SPECratio is a speed metric How fast a computer can complete single task Execution time normalized to a reference computer. Formula: It measures how many times faster than a reference machine one system can perform a task Reference machine used for SPEC CPU2000/SPEC CPU2006 is Sun UltraSPARC II system at 296MHz Choice of the reference computer is irrelevant in performance comparisons.

10 Benchmarks (cont.) SPECrate is a throughput metric Measures how many tasks the system completes within an arbitrary time interval Measured elapsed time from when all copies of one benchmark are launched simultaneously until the last copy finishes Each benchmark measured independently User is free to choose # of benchmark copies to run in order to maximize performance Formula Reference factor – normalization factor; benchmark duration is normalized to standard job length (benchmark with the longest SPEC reference time). Unit time – used to convert to unit of time more appropriate for work (e.g. week)