Recap Technology trends Cost/performance Measuring and Reporting Performance What does it mean to say “computer X is faster than computer Y”? E.g. Machine.

Slides:



Advertisements
Similar presentations
CS2100 Computer Organisation Performance (AY2014/2015) Semester 2.
Advertisements

Computer Abstractions and Technology
Recap Measuring and reporting performance Quantitative principles Performance vs Cost/Performance.
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.5 Comparing and Summarizing Performance.
CS/ECE 3330 Computer Architecture Chapter 1 Performance / Power.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Chapter 4 Assessing and Understanding Performance
Gordon Moore Gordon Moore, cofounder of Intel 1965: 2 x trans. per chip/year After 1970: 2 x trans. per chip/1.5year 摩爾定律.
Lecture 3: Computer Performance
1 Lecture 10: FP, Performance Metrics Today’s topics:  IEEE 754 representations  FP arithmetic  Evaluating a system Reminder: assignment 4 due in a.
CIS429/529 Winter 07 - Performance - 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two.
Rung-Bin Lin Chapter 1: Fundamental of Computer Design1-1 Chapter 1. Fundamentals of Computer Design Introduction –Performance Improvement due to (1).
Using Standard Industry Benchmarks Chapter 7 CSE807.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Performance & Benchmarking. What Matters? Which airplane has best performance:
1 Computer Performance: Metrics, Measurement, & Evaluation.
Computer Performance Computer Engineering Department.
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
Chapter 1 - The Computer Revolution Chapter 1 — Computer Abstractions and Technology — 1  Progress in computer technology  Underpinned by Moore’s Law.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
Chapter 1 — Computer Abstractions and Technology — 1 Understanding Performance Algorithm Determines number of operations executed Programming language,
Advanced Computer Architecture Fundamental of Computer Design Instruction Set Principles and Examples Pipelining:Basic and Intermediate Concepts Memory.
1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.
1 CS/COE0447 Computer Organization & Assembly Language CHAPTER 4 Assessing and Understanding Performance.
Computer Architecture
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Cost and Performance.
CET Gannod1 Chapter 1 Fundamentals of Computer Design.
Morgan Kaufmann Publishers
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
4. Performance 4.1 Introduction 4.2 CPU Performance and Its Factors
Performance Analysis Topics Measuring performance of systems Reasoning about performance Amdahl’s law Systems I.
EGRE 426 Computer Organization and Design Chapter 4.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
CS203 – Advanced Computer Architecture Performance Evaluation.
Chapter 1 Performance & Technology Trends. Outline What is computer architecture? Performance What is performance: latency (response time), throughput.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
Measuring Performance II and Logic Design
CS203 – Advanced Computer Architecture
Lecture 2: Performance Evaluation
CS161 – Design and Architecture of Computer Systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Morgan Kaufmann Publishers Computer Abstractions and Technology
ECE 4100/6100 Advanced Computer Architecture Lecture 1 Performance
Defining Performance Which airplane has the best performance?
Morgan Kaufmann Publishers
CSCE 212 Chapter 4: Assessing and Understanding Performance
CS2100 Computer Organisation
Performance of computer systems
Performance of computer systems
Performance of computer systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CS161 – Design and Architecture of Computer Systems
Computer Organization and Design Chapter 4
Presentation transcript:

Recap Technology trends Cost/performance

Measuring and Reporting Performance What does it mean to say “computer X is faster than computer Y”? E.g. Machine A executes a program in 10s; Machine B executes the same program in 15s. Which is true: 1)A is 50% faster than B? 2)A is 33% faster than B?

Performance H&P’s definition: “X is n times faster than Y” means Performance is reciprocal of time:

Example Answer: 1) A is 50% faster than B E.g. Machine A executes a program in 10s; Machine B executes the same program in 15s. Which is true: 1)A is 50% faster than B? 2)A is 33% faster than B?

Performance Response time? Throughput?

Measuring Performance Focus on execution time of real programs Measuring execution time?  Wall clock time (elapsed time)  CPU time (excludes I/O and other processes) oUser CPU time oSystem CPU time iota:~$ time gcc -g tmpcnv.s -o tmpcnv real 0m3.352s user 0m0.367s sys 0m0.468s

Choosing Programs to Measure Performance Real Programs –Compilers, text-processing, CAD tools, etc. Modified applications –Scripted or modified for portability Kernels –Attempt to extract key sections from real programs (Livermore loops, Linpack) Toy Benchmarks –Short examples (e.g. Sieve of Eratosthenes) Synthetic Benchmarks –Whetstone, Dhrystone

Benchmarking H&P: car magazines are more scientific about reporting performance than many CS journals!

Benchmark Suites Collections of benchmarks –E.g. SPEC CPU2000 (INT and FP) 25 real FORTRAN/C/C++ programs, modified for portability –Specific graphics benchmarks

Server Benchmarks SPEC also has server benchmarks –File server –Web server TPC: Transaction Processing Council –Various transaction processing benchmarks

Embedded Benchmarks Much less well developed –Tend to use Dhrystone! EEMBC –Recent development –34 benchmarks (mainly kernels) in five application areas

Summarising Performance Measurements Complex area –Weighted arithmetic mean –Geometric mean –Normalised results –…

1.6 Quantitative Principles Make the common case fast! –E.g. addition: focus on “normal” addition, not overflow situations Amdahl’s Law –Quantifies improvements gained by focussing on one aspect of a design

Amdahl’s Law

Example We are considering an enhancement that is 10 times faster than the original, but is only used 40% of the time.

CPU Performance CPU time related to clock speed: –Period (e.g. 1ns) –Rate (e.g. 1GHz) Also interested in Cycles Per Instruction (CPI)

Three Equal Factors Clock rate (technology) CPI (architecture) Instruction count (architecture and compiler)

Measuring IC & CPI Many modern processors include hardware counters for instructions and clock cycles Simulations can give even more detail –Time consuming, but can be very accurate

Another Principle: Locality Locality of Reference –“90/10 Rule” Also applies to data Two aspects: –Temporal locality –Spatial locality

Taking Advantage of Parallelism Key principle for improving performance Examples: –System level: parallel processing, disk arrays, etc. –Processor level: pipelining –Digital design: caches, ALU adders, etc.

1.7 Putting It All Together: Performance & Price/Performance Measure performance and performance/cost for three categories –Desktop (SPEC INT and FP) –TP Servers (TPC-C) –Embedded Processors (EEMBC)

Desktop Integer: –Performance/cost tracks performance FP: –Not as closely related –Pentium 4 much better than Pentium III AMD Athlon very good value for money

Servers Twelve systems –Six top performers –Six best price-performance Multiprocessors –3 P3’s – 280 P3’s Cost: –$131,000 – $15 million

Embedded Processors Difficult to assess –Benchmarks very new –Designs very application-specific –Power a major constraint –Cost difficult to quantify (are support chips required?)

Embedded Processors Range: –500MHz AMD K6 ($78) and IBM PowerPC ($94) used for network switches, etc. –167MHz NEC VR 5432 ($25) popular in colour laser printers –180MHz NEC VR 4122 ($33) popular in PDAs (low power)

1.8 Another View: Power Consumption and Efficiency Embedded processors from previous example: power ranged from 700mW to 9600mW Fig. 1.27: Performance/watt –NEC VR 4122 huge leader

1.9 Fallacies and Pitfalls Fallacy: Relative performance of two similar processors can be judged by clock rate or by a single benchmark –Factors such as pipeline structure and memory system have major impact –E.g. Pentium III vs. Pentium 4 (Fig. 1.28)

1.7GHz P4 –vs– 1.0GHz P3

Fallacies and Pitfalls Fallacy: Benchmarks remain valid indefinitely –Optimisations change –Perhaps deliberately! –Even real programs are affected by changes in technology –E.g. gcc : increasing percentage is “system time” –SPEC has adapted considerably

Fallacies and Pitfalls Pitfall: Comparing hand-coded assembly and compiled high-level language performance –E.g. embedded processor benchmarks –Hand-coded is 5 – 87 times faster!