CPEN 315 - Digital System Design Assessing and Understanding CPU Performance © Logic and Computer Design Fundamentals, 4 rd Ed., Mano Prentice Hall © Computer.

Slides:



Advertisements
Similar presentations
CS1104: Computer Organisation School of Computing National University of Singapore.
Advertisements

CS2100 Computer Organisation Performance (AY2014/2015) Semester 2.
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
1  1998 Morgan Kaufmann Publishers Chapter 2 Performance Text in blue is by N. Guydosh Updated 1/25/04*
100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
Computer Organization and Architecture (AT70.01) Comp. Sc. and Inf. Mgmt. Asian Institute of Technology Instructor: Dr. Sumanta Guha Slide Sources: Patterson.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
1 Introduction Rapidly changing field: –vacuum tube -> transistor -> IC -> VLSI (see section 1.4) –doubling every 1.5 years: memory capacity processor.
Chapter 4 Assessing and Understanding Performance Bo Cheng.
1 CSE SUNY New Paltz Chapter 2 Performance and Its Measurement.
Chapter 4 Assessing and Understanding Performance
Performance D. A. Patterson and J. L. Hennessey, Computer Organization & Design: The Hardware Software Interface, Morgan Kauffman, second edition 1998.
Computer ArchitectureFall 2007 © September 17, 2007 Karem Sakallah CS-447– Computer Architecture.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
1  1998 Morgan Kaufmann Publishers and UCB Performance CEG3420 Computer Design Lecture 3.
Computer ArchitectureFall 2007 © September 19, 2007 Karem Sakallah CS-447– Computer Architecture.
Assessing and Understanding Performance B. Ramamurthy Chapter 4.
Chapter 4 Assessing and Understanding Performance
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
Lecture 3: Computer Performance
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
1 ECE3055 Computer Architecture and Operating Systems Lecture 2 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia.
Computer Organization and Design Performance Montek Singh Mon, April 4, 2011 Lecture 13.
1 Computer Performance: Metrics, Measurement, & Evaluation.
1 CHAPTER 2 THE ROLE OF PERFORMANCE. 2 Performance Measure, Report, and Summarize Make intelligent choices Why is some hardware better than others for.
순천향대학교 정보기술공학부 이 상 정 1 4. Accessing and Understanding Performance.
B0111 Performance Anxiety ENGR xD52 Eric VanWyk Fall 2012.
1 CPS4150 Chapter 4 Assessing and Understanding Performance.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
1 Acknowledgements Class notes based upon Patterson & Hennessy: Book & Lecture Notes Patterson’s 1997 course notes (U.C. Berkeley CS 152, 1997) Tom Fountain.
Performance.
1 CS465 Performance Revisited (Chapter 1) Be able to compare performance of simple system configurations and understand the performance implications of.
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
1 COMS 361 Computer Organization Title: Performance Date: 10/02/2004 Lecture Number: 3.
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
4. Performance 4.1 Introduction 4.2 CPU Performance and Its Factors
1  1998 Morgan Kaufmann Publishers Lectures for 2nd Edition Note: these lectures are often supplemented with other materials and also problems from the.
Lecture 5: 9/10/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
Chapter 4. Measure, Report, and Summarize Make intelligent choices See through the marketing hype Understanding underlying organizational aspects Why.
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
L12 – Performance 1 Comp 411 Computer Performance He said, to speed things up we need to squeeze the clock Study
EGRE 426 Computer Organization and Design Chapter 4.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
COD Ch. 1 Introduction + The Role of Performance.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
Computer Organization
Computer Architecture & Operations I
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
September 2 Performance Read 3.1 through 3.4 for Tuesday
Defining Performance Which airplane has the best performance?
Computer Architecture & Operations I
Prof. Hsien-Hsin Sean Lee
CSCE 212 Chapter 4: Assessing and Understanding Performance
CS2100 Computer Organisation
Computer Performance He said, to speed things up we need to squeeze the clock.
CMSC 611: Advanced Computer Architecture
CMSC 611: Advanced Computer Architecture
Performances of Computer Systems
Computer Performance Read Chapter 4
Performance.
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
Computer Organization and Design Chapter 4
CS2100 Computer Organisation
Presentation transcript:

CPEN Digital System Design Assessing and Understanding CPU Performance © Logic and Computer Design Fundamentals, 4 rd Ed., Mano Prentice Hall © Computer Organization & Design, The Hardware/Software Interface - Third Edition

Overview  Defining performance –Response time –Throughput –Relative performance  Measuring performance  CPU performance and its factors –Improving performance –Performance equation

Defining Performance Airplane Example Which of these airplanes has the best performance? Which of these airplanes has the best passenger throughput? AirplanePassengersRange (mi) Speed (mph) Boeing Boeing BAC/Sud Concorde Douglas DC

 Response Time (or execution time) –“the time between the start and the completion of a task” –How long does it take for a job to run? –How long must I wait for the database query?  Throughput –“the total amount of work done in a given time” –How many jobs can the machine run at once? –How much work is getting done?  If we upgrade a machine with a new processor, what do we improve?  If we add an additional processor to a system, what do we improve? Computer Performance

Performance - Execution Time  Elapsed Time (response or execution time) –Total time to complete a task  counts everything (other processes running, disk and memory accesses, I/O, etc.)  CPU time –doesn't count waiting for I/O or time spent running other programs –can be broken up into system time and user time  User CPU time –time spent in a user program  System CPU time –time spent in operating system performing tasks on behalf of the program.

CPU Performance  For some program running on machine X, Performance X = 1 / Execution time X  "X is n times faster than Y" Performance X / Performance Y = n We will use “n times faster”, which means both increased performance and decreased execution time.

CPU Performance - continued Problem 1: machine A runs a program in 20 seconds machine B runs the same program in 25 seconds How much faster is A than B? Problem 2: Computer C’s performance is 4 times better than the performance of computer B, which runs a given application in 28 seconds. How long will computer C take to run that application?

Clock Cycles  Instead of reporting execution time in seconds, we often use cycles  Clock ticks indicate when to start activities: cycle time = seconds per cycle  clock rate (frequency) = cycles per second (1 Hz = 1 cycle/sec) A 200 MHz. clock has a _____ cycle time.

The following equation is commonly used for expressing a computer's performance ability: The CISC approach attempts to minimize the number of instructions per program, sacrificing the number of cycles per instruction. RISC does the opposite, reducing the cycles per instruction at the cost of the number of instructions per program. Performance

How to improve performance  So, to improve performance (everything else being equal) one can either ________ the # of required cycles for a program, or ________ the clock cycle time or, said another way, ________ the clock rate.

Performance - Word problem!  Our favorite program runs in 10 seconds on computer A, which has a 4 GHz clock. We are trying to help a computer designer build a new machine B, that will run this program in 6 seconds. The designer can use new (or perhaps more expensive) technology to substantially increase the clock rate, but has informed us that this increase will affect the rest of the CPU design, causing machine B to require 1.2 times as many clock cycles as machine A for the same program.  What clock rate should we tell the designer to target?

 Could we assume that # of cycles = # of instructions? How many cycles are required for a program? This assumption is incorrect; different instructions take different amounts of time depending on what they do. 1st instruction2nd instruction3rd instruction4th 5th6th...

Different instructions take different number of cycles!  Note that “lw” is longer than “add” and “beq”.  lw~800 ps, R-type~600 ps, beq~500ps [Fig. 6.2]  CPI = clock cycles per instruction

CPI - Cycles per Instruction  Number of clock cycles to execute an instruction  Average CPI for a program though can be computed… CPI ave = CPU clock cycles / instruction count  Average CPI is useful for comparing two implementations of the same ISA

Performance Question…  Performance is determined by execution time  Do any of the other variables equal performance? –# of cycles to execute program? –# of instructions in program? –# of cycles per second? –average # of cycles per instruction? –average # of instructions per second?  Common pitfall: thinking only one of the above variables is indicative of performance.

Performance  In terms of instructions:

Performance CPI example 1  Suppose we have two implementations of the same instruction set architecture (ISA). For some program, Machine A has a clock rate of 4 GHz and a CPI of 2.0. Machine B has a clock rate of 2 GHz and a CPI of 1.2.  What machine is faster for this program and by how much?

Two computers, C1 and C2, have the following metrics when running the same program. Which computer is faster? Performance CPI example 2 Computer C1Computer C2 Instruction Count 10 billion8 billion Clock Rate4 GHz CPI1.01.1

Performance Remember…  Performance is specific to a particular program(s) –Total execution time is a consistent summary of performance  For a given architecture performance increases come from: –increases in clock rate (without adverse CPI effects) –improvements in processor organizations that lower CPI –compiler enhancements that lower CPI and/or instruction count

What is the connection of the general digital system to the content of this chapter?  Each digital system has an architecture  Although is may not have instructions to be executed, it can still be described by using RTL and possibly one or more ASMs.  The system may have instructions but may be different from those for a CPU.  It may have one or more datapaths.  It is likely to have a control unit.  Thus, the total spectrum of digital systems have a wide range of architectural possibilities.