COMS 361 Computer Organization

Slides:



Advertisements
Similar presentations
CS1104: Computer Organisation School of Computing National University of Singapore.
Advertisements

COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
Reducing Leakage Power in Peripheral Circuits of L2 Caches Houman Homayoun and Alex Veidenbaum Dept. of Computer Science, UC Irvine {hhomayou,
Computer Abstractions and Technology
ENEE350 Ankur Srivastava University of Maryland, College Park Based on Slides from Mary Jane Irwin ( )
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Princess Sumaya Univ. Computer Engineering Dept. Chapter 4:
Performance COE 308 Computer Architecture Prof. Muhamed Mudawar Computer Engineering Department King Fahd University of Petroleum and Minerals.
Performance Analysis of Multiprocessor Architectures
Computer Organization and Architecture (AT70.01) Comp. Sc. and Inf. Mgmt. Asian Institute of Technology Instructor: Dr. Sumanta Guha Slide Sources: Patterson.
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
CPU Performance Evaluation: Cycles Per Instruction (CPI)
Performance ICS 233 Computer Architecture and Assembly Language Dr. Aiman El-Maleh College of Computer Sciences and Engineering King Fahd University of.
CS61C L3 Performance (1) Staley, Fall 2005 © UCB Titleless TA Aaron Staley inst.eecs./~cs61c-tc inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures.
Chapter 4 Assessing and Understanding Performance Bo Cheng.
EECC550 - Shaaban #1 Lec # 3 Spring Computer Performance Evaluation: Cycles Per Instruction (CPI) Most computers run synchronously utilizing.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
1 CSE SUNY New Paltz Chapter 2 Performance and Its Measurement.
1 Recap. 2 Measuring Performance  A computer user: response time (execution time).  A computer center manager - throughput - the total amount of work.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.5 Comparing and Summarizing Performance.
Computer Performance Evaluation: Cycles Per Instruction (CPI)
Computer ArchitectureFall 2007 © September 17, 2007 Karem Sakallah CS-447– Computer Architecture.
Performance Comparison of Niagara, Xeon, and Itanium2 Daekyeong Moon
Computer Architecture Lecture 2 Instruction Set Principles.
Chapter 4 Assessing and Understanding Performance
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
Benchmarks Programs specifically chosen to measure performance Must reflect typical workload of the user Benchmark types Real applications Small benchmarks.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Where Has This Performance Improvement Come From? Technology –More transistors per chip –Faster logic Machine Organization/Implementation –Deeper pipelines.
Benchmarks Prepared By : Arafat El-madhoun Supervised By:eng. Mohammad temraz.
CDA 3101 Fall 2013 Introduction to Computer Organization Benchmarks 30 August 2013.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
ACMSE’04, ALDepartment of Electrical and Computer Engineering - UAH Execution Characteristics of SPEC CPU2000 Benchmarks: Intel C++ vs. Microsoft VC++
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Lecture2: Performance Metrics Computer Architecture By Dr.Hadi Hassan 1/3/2016Dr. Hadi Hassan Computer Architecture 1.
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
Performance COE 301 Computer Organization Dr. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum and Minerals.
Chapter 1 — Computer Abstractions and Technology — 1 Uniprocessor Performance Constrained by power, instruction-level parallelism, memory latency.
ECE/CS 552: Benchmarks, Means and Amdahl’s Law © Prof. Mikko Lipasti Lecture notes based in part on slides created by Mark Hill, David Wood, Guri Sohi,
Computer Architecture CSE 3322 Web Site crystal.uta.edu/~jpatters/cse3322 Send to Pramod Kumar, with the names and s.
Performance COE 301 / ICS 233 Computer Organization Prof. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
Measuring Performance and Benchmarks Instructor: Dr. Mike Turi Department of Computer Science and Computer Engineering Pacific Lutheran University Lecture.
Computer Architecture & Operations I
Measuring Performance II and Logic Design
Lecture 2: Performance Evaluation
Computer Architecture & Operations I
CS161 – Design and Architecture of Computer Systems
Assessing and Understanding Performance
Uniprocessor Performance
Morgan Kaufmann Publishers
CSCE 212 Chapter 4: Assessing and Understanding Performance
Performance COE 301 Computer Organization
Performance of computer systems
COMS 361 Computer Organization
Performance ICS 233 Computer Architecture and Assembly Language
Performance of computer systems
Benchmarks Programs specifically chosen to measure performance
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
CS161 – Design and Architecture of Computer Systems
Computer Organization and Design Chapter 4
Presentation transcript:

COMS 361 Computer Organization Title: Performance Date: 9/09/2004 Lecture Number: 5

Announcements Homework 2 Due 9/14/04

Review Orders of magnitude Performance The big and small of it Relationship between Execution time and clock rate Execution time and the number of instructions Instruction and number of clocks

Outline MIPS, MOPS, FLOPS SPEC Benchmarks

MIPS MEASURE Millions of Instructions Per Second Instruction execution rate Inverse relationship to the execution time The higher the MIPS rating the better

MIPS MEASURE MIPS Problems Contains instruction count, not what the instructions do Small, simple instructions versus large, complex instructions Cannot compare machines with different ISA’s Different programs executed on the same computer can have drastically different MIPS rating

Exercise 2-15 Compare the MIPS rating of two machines, one with a floating point processor (MFP) and the other without (MNFP) Clock rate is 1000MHz for each machine Use a program with this instruction mix floating-point multiply: 10% floating-point add: 15% floating-point divide: 5% integer instructions: 70%

Exercise 2-15 MFP CPI for each instruction class floating-point multiply: 6 floating-point add: 4 floating-point divide: 20 integer instructions: 2 MNFP emulates floating point with integer instructions floating-point multiply: 30 integer instructions floating-point add: 20 integer instructions floating-point divide: 50 integer instructions

Exercise 2-15 Find CPI for MFP

Fallacies and Pitfalls Fallacy Machine performance increases proportionally to the size of the improvement Do NOT expect that doubling the clock rate will double the performance!

Fallacies and Pitfalls Amdahl’s law (law of diminishing returns) Performance enhancements are limited by the amount the enhanced feature is used Execution Time After Improvement = Execution Time Unaffected + (Execution Time Affected / Amount of Improvement ) No sense speeding up instructions that are rarely used Make the common case fast

Benchmarks "A standard of measurement or evaluation" Standard programs to measure performance Which programs? Real applications Difficult to tailor CPU design to score better Better designs will likely be better for most applications Your application! Performance report Performance should be repeatable Include all pertinent information OS, compilers, flags, number of processors

SPEC Benchmarks System Performance Evaluation Corp. (SPEC) Formed in 1988 by some workstation vendors There was a desperate need of realistic, standardized performance tests Fair and useful set of metrics to differentiate candidate systems Balance between requiring strict compliance and allowing vendors to demonstrate their advantages An ounce of honest data was (is) worth more than a pound of marketing hype Ported to a wide variety of platforms of its membership

SPEC Benchmarks Now has over 60 member companies Standardized suite of source code based upon existing applications

SPEC Benchmarks The Open Systems Group (OSG) CPU JAVA SDM SFS WEB CPU benchmarks JAVA Java client and server-side benchmarks SDM Multi-user UN*X commands benchmarks SFS File server benchmarks WEB Web server benchmarks

SPEC Benchmarks The High-Performance Group (HPG) High-performance computing applications Symmetric multiprocessor systems Workstation clusters Distributed memory parallel systems Traditional vector and vector parallel supercomputers

SPEC Benchmarks The Graphics Performance Characterization Group (GPC) SPECapc Benchmarks for graphics-intensive applications SPEC SPECopc OpenGL Performance Characterization determining the performance of the OpenGL application programming interface

SPEC CPU2000 Compute intensive performance Processor Memory Compiler Performance is more than just the processor CINT2000 compute-intensive integer performance CFP2000 compute-intensive floating point performance

CINT2000 Gzip: Data compression utility vpr: FPGA circuit placement and routing gcc: C compiler mcf : Minimum cost network flow solver crafty: Chess program parser Natural language processing eon: Ray tracing perlbmk:Perl gap: Computational group theory vortex: Object Oriented Database bzip2 : Data compression utility twolf: Place and route simulator

CFP2000 wupwise: Quantum chromodynamics swim: Shallow water modeling mgrid: Multi-grid solver in 3D potential field applu: Parabolic/elliptic partial differential equations mesa: 3D Graphics library galgel: Fluid dynamics art: Neural network simulation equake: Finite element simulation facerec Computer vision ammp: Computational chemistry lucas: Number theory

CFP2000 See for yourself fma3d: Finite element crash simulation sixtrack: Particle accelerator model apsi: Temperature, wind, velocity and distribution of pollutants See for yourself push here