ENGS 116 Lecture 21 Performance and Quantitative Principles Vincent H. Berk September 26 th, 2008 Reading for today: Chapter 1.1 - 1.4, Amdahl article.

Slides:



Advertisements
Similar presentations
Performance What differences do we see in performance? Almost all computers operate correctly (within reason) Most computers implement useful operations.
Advertisements

Computer Abstractions and Technology
Performance Evaluation of Architectures Vittorio Zaccaria.
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Evaluating Performance
ECE 4100/6100 Advanced Computer Architecture Lecture 3 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia Institute.
2-1 ECE 361 ECE C61 Computer Architecture Lecture 2 – performance Prof. Alok N. Choudhary
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon., Sep 5, 2005 Lecture 2.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
Computer Performance Evaluation: Cycles Per Instruction (CPI)
CIS429.S00: Lec2- 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important quantitative.
ECE 232 L4 perform.1 Adapted from Patterson 97 ©UCBCopyright 1998 Morgan Kaufmann Publishers ECE 232 Hardware Organization and Design Lecture 4 Performance,
Computer Architecture Lecture 2 Instruction Set Principles.
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Sep 3, 2003 Lecture 2.
1 Lecture 10: FP, Performance Metrics Today’s topics:  IEEE 754 representations  FP arithmetic  Evaluating a system Reminder: assignment 4 due in a.
CIS429/529 Winter 07 - Performance - 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
1 Measuring Performance Chris Clack B261 Systems Architecture.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Lecture 1: Course Introduction, Technology Trends, Performance Professor Alvin R. Lebeck Computer Science 220 Fall 2001.
Where Has This Performance Improvement Come From? Technology –More transistors per chip –Faster logic Machine Organization/Implementation –Deeper pipelines.
Lecture 2: Computer Performance
BİL 221 Bilgisayar Yapısı Lab. – 1: Benchmarking.
Memory/Storage Architecture Lab Computer Architecture Performance.
Operation Frequency No. of Clock cycles ALU ops % 1 Loads 25% 2
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
PerformanceCS510 Computer ArchitecturesLecture Lecture 3 Benchmarks and Performance Metrics Lecture 3 Benchmarks and Performance Metrics.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
Digital System Architecture 1 28 ต.ค ต.ค ต.ค ต.ค ต.ค. 58 Lecture 2a Computer Performance and Cost Pradondet Nilagupta.
Computer Organization and Architecture Tutorial 1 Kenneth Lee.
Computer Architecture
Lecture 8: 9/19/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
1 Seoul National University Performance. 2 Performance Example Seoul National University Sonata Boeing 727 Speed 100 km/h 1000km/h Seoul to Pusan 10 hours.
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
Computer Architecture CPSC 350
CS252/Patterson Lec 1.1 1/17/01 CMPUT429/CMPE382 Winter 2001 Topic2: Technology Trend and Cost/Performance (Adapted from David A. Patterson’s CS252 lecture.
Performance Enhancement. Performance Enhancement Calculations: Amdahl's Law The performance enhancement possible due to a given design improvement is.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
Computer Engineering Rabie A. Ramadan Lecture 2. Table of Contents 2 Architecture Development and Styles Performance Measures Amdahl’s Law.
Performance 9 ways to fool the public Old Chapter 4 New Chapter 1.4.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Computer Architecture CSE 3322 Web Site crystal.uta.edu/~jpatters/cse3322 Send to Pramod Kumar, with the names and s.
Performance COE 301 / ICS 233 Computer Organization Prof. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum.
CSE 340 Computer Architecture Summer 2016 Understanding Performance.
Performance 9 ways to fool the public #1 – Reporting Results.
CpE 442 Introduction to Computer Architecture The Role of Performance
Measuring Performance II and Logic Design
Lecture 2: Performance Evaluation
CS161 – Design and Architecture of Computer Systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
ECE 4100/6100 Advanced Computer Architecture Lecture 1 Performance
Performance Performance The CPU Performance Equation:
Defining Performance Which airplane has the best performance?
Morgan Kaufmann Publishers
CSCE 212 Chapter 4: Assessing and Understanding Performance
Computer Architecture CSCE 350
Performance COE 301 Computer Organization
Lecture 3: MIPS Instruction Set
August 30, 2000 Prof. John Kubiatowicz
Parameters that affect it How to improve it and by how much
CS2100 Computer Organisation
Presentation transcript:

ENGS 116 Lecture 21 Performance and Quantitative Principles Vincent H. Berk September 26 th, 2008 Reading for today: Chapter , Amdahl article Reading for Monday: Chapter 1.5 – 1.11, Mazor article Homework for Wednesday: 1.1, 1.3, 1.6, 1.7, 1.13

ENGS 116 Lecture 22 Review  Task of Computer Designers –Determine which attributes are important for a new machine –Design a machine to maximize performance without violating cost/power/functionality constraints  3 Components of “Architecture” –Instruction set design –Organization –Hardware

ENGS 116 Lecture 23 Benchmarking Games  Different configurations used to run the same workload on two systems.  Compiler customized to optimize the workload.  Workload arbitrarily picked to skew results.  Test specification written to be biased toward one machine.

ENGS 116 Lecture 24 Design benchmarks for:  Industrial and design  Consumer Electronics  Networking, routers  Office applications  Telecommunications  Weapon systems

ENGS 116 Lecture 25 Execution time  Weighted arithmetic mean: sum over execution time of all programs run, times their relative frequencies  Normalized execution time: take a reference machine, set it to 1, then compute normalized execution times for others based on this machine  Geometric mean of normalized execution time (reference computer becomes irrelevant, ratios can arbitrarily be compared)

ENGS 116 Lecture 26 Amdahl’s Law Execution time after improvement = Make the common case fast

ENGS 116 Lecture 27 Speedup due to enhancement E: Suppose that enhancement E accelerates a fraction F of the task by a factor S, and the remainder of the task is unaffected: ExTime (E) = Speedup (E) = Amdahl’s Law Speedup(E)= ExTime w/o E ExTime w/ E  Performance w/ E Performance w/o E

ENGS 116 Lecture 28 Amdahl’s Law

ENGS 116 Lecture 29 Amdahl’s Law Example:Floating point instructions improved to run 2X, but only 10% of actual instructions are FP

ENGS 116 Lecture 210 Corollary: Make The Common Case Fast  All instructions require an instruction fetch, only a fraction require a data fetch/store. –Optimize instruction access over data access  Programs exhibit locality. Spatial Locality Temporal Locality  Access to small memories is faster. –Provide a storage hierarchy such that the most frequent accesses are to the smallest (closest) memories. Disk/Tape Memory CacheRegisters

ENGS 116 Lecture 211 Metrics of Performance Compiler Programming Language Application Datapath Control TransistorsWiresPins ISA Function Units Millions of instructions per second: MIPS Millions of FP operations per second: MFLOPS Cycles per second (clock rate)‏ Megabytes per second Answers per month Operations per second

ENGS 116 Lecture 212 Marketing Metrics  Machines with different instruction sets?  Programs with different instruction mixes? – Dynamic frequency of instructions  Uncorrelated with performance  Machine dependent  Often not where time is spent

ENGS 116 Lecture 213 Aspects of CPU Performance Instr. Count CPI Clock Rate Program Compiler Instruction Set Organization Technology

ENGS 116 Lecture 214 Aspects of CPU Performance Instr. Count CPI Clock Rate ProgramX CompilerX(X) Instruction SetXX OrganizationXX TechnologyX

ENGS 116 Lecture 215 Average Cycles per Instruction CPI= (CPU Time  Clock Rate) / Instruction Count= Cycles / Instruction Count CPU time = Cycle Time  Instruction Frequency Invest resources where time is spent! Cycles Per Instruction

ENGS 116 Lecture 216 Example: Calculating CPI Base Machine (Reg / Reg)‏ Op FreqCycles CPI (i) (% Time)‏ ALU50% 1.5(33%)‏ Load20% 2.4(27%)‏ Store10% 2.2(13%)‏ Branch 20% 2.4 (27%)‏ 1.5 Typical Mix

ENGS 116 Lecture 217 Example Want to add register / memory operations - One source operand in memory - One source operand in register - Cycle count of 2 Side effect: Branch cycle count will increase to 3. What fraction of the loads must be eliminated for this to pay off? Base Machine (Reg / Reg)‏ OpFreqCycles ALU50% 1 Load20% 2 Store10% 2 Branch20% 2

ENGS 116 Lecture 218 Example Solution Exec Time = Instruction Count  CPI  Clock OpFreqCyclesCPIFreqCyclesCPI ALU Load Store Branch Reg/Mem

ENGS 116 Lecture 219 Example Solution Exec Time = Instruction Count  CPI  Clock OpFreqCyclesCPIFreqCyclesCPI ALU – X 1.5 – X Load – X 2.4 – 2X Store Branch Reg/MemX 22X – X (1.7 – X) /(1 – X)‏ CPI New must be normalized to new instruction frequency

ENGS 116 Lecture 220 Example Solution Exec Time = Instruction Count  CPI  Clock OpFreqCyclesCPIFreqCyclesCPI ALU – X 1.5 – X Load – X 2.4 – 2X Store Branch Reg/MemX 22X – X (1.7 – X) / (1 – X)‏ Instr Cnt Old  CPI Old  Clock Old = Instr Cnt New  CPI New  Clock New

ENGS 116 Lecture 221 Example Solution Exec Time = Instruction Count  CPI  Clock OpFreqCyclesCPIFreqCyclesCPI ALU – X 1.5 – X Load – X 2.4 – 2X Store Branch Reg/MemX 22X – X (1.7 – X) / (1 – X)‏ Instr Cnt Old  CPI Old  Clock Old = Instr Cnt New  CPI New  Clock New 1.00  1.5= (1 – X)  (1.7 – X) / (1 – X)‏ 1.5= 1.7 – X 0.2= X ALL loads must be eliminated for this to be a win!