UTCS Lecture 1 1 CS 352: Computer Systems Architecture Lecture 1: What is Computer Architecture? January 22, 2007 Doug Burger Computer Architecture and.

Slides:



Advertisements
Similar presentations
Computer Abstractions and Technology
Advertisements

Microprocessors. Von Neumann architecture Data and instructions in single read/write memory Contents of memory addressable by location, independent of.
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
EEM 486 EEM 486: Computer Architecture Lecture 1 Course Introduction and the Five Components of a Computer.
Processor Technology and Architecture
COMP3221: Microprocessors and Embedded Systems Lecture 2: Instruction Set Architecture (ISA) Lecturer: Hui Wu Session.
Chapter 4 Processor Technology and Architecture. Chapter goals Describe CPU instruction and execution cycles Explain how primitive CPU instructions are.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
CS / Schlesinger Lec1.1 1/20/99©UCB Spring 1999 Computer Architecture Lecture 1 Introduction and Five Components of a Computer Spring, 1999 Arie Schlesinger.
1 CSE SUNY New Paltz Chapter 1 Introduction CSE-45432Introduction to Computer Architecture Dr. Izadi.
CIS 314 : Computer Organization Lecture 1 – Introduction.
COMP25212 SYSTEM ARCHITECTURE Antoniu Pop Jan/Feb 2015COMP25212 Lecture 1.
COM181 Computer Hardware Ian McCrumRoom 5B18,
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
CENG311 Computer Architecture Kayhan Erciyes. CS231 Assembly language and Digital Circuits Instructor:Kayhan Erciyes Office:
Chapter 2 - Computer Organization CPU organization –Basic Elements and Principles –Parallelism Memory –Storage Hierarchy I/O –Fast survey of devices Character.
1 Layers of Computer Science, ISA and uArch Alexander Titov 20 September 2014.
Basic Microcomputer Design. Inside the CPU Registers – storage locations Control Unit (CU) – coordinates the sequencing of steps involved in executing.
E0001 Computers in Engineering1 The System Unit & Memory.
3 1 3 C H A P T E R Hardware: Input, Processing, and Output Devices.
Physics 413 Chapter 1 Computer Architecture What is a Digital Computer ? A computer is essentially a fast electronic calculating machine. What is a program.
Introduction CSE 410, Spring 2008 Computer Systems
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
1 Embedded Systems Computer Architecture. Embedded Systems2 Memory Hierarchy Registers Cache RAM Disk L2 Cache Speed (faster) Cost (cheaper per-byte)
1 Recap (from Previous Lecture). 2 Computer Architecture Computer Architecture involves 3 inter- related components – Instruction set architecture (ISA):
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Lecture 1 1 CS 352H: Computer Systems Architecture Lecture 1: What is Computer Architecture and why should I care? Professor Emmett Witchel University.
Lecture 1 1 Computer Systems Architecture Lecture 1: What is Computer Architecture?
COMP25212: System Architecture Lecturers Alasdair Rawsthorne Daniel Goodman
3/15/2002CSE Final Remarks Concluding Remarks SOAP.
Computer Organization and Design Computer Abstractions and Technology
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Computer Organization & Assembly Language © by DR. M. Amer.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Computer Architecture CPSC 350
Ted Pedersen – CS 3011 – Chapter 10 1 A brief history of computer architectures CISC – complex instruction set computing –Intel x86, VAX –Evolved from.
Morgan Kaufmann Publishers
Pipelining and Parallelism Mark Staveley
12/13/ _01 1 Computer Organization EEC-213 Computer Organization Electrical and Computer Engineering.
CS352H: Computer Systems Architecture Lecture 1: What Is Computer Architecture? August 27, 2009 University of Texas Computer Science CS352H Fall 2009 Don.
Computer and Information Sciences College / Computer Science Department CS 206 D Computer Organization and Assembly Language.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
University of Texas at Austin CS310H – Computer Organization and Programming Spring 2008 Don Fussell CS 310H: Computer Organization and Programming (Honors)
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO CS 219 Computer Organization.
Chapter 2 Turning Data into Something You Can Use
Performance Analysis Topics Measuring performance of systems Reasoning about performance Amdahl’s law Systems I.
Lecture 1: Introduction CprE 585 Advanced Computer Architecture, Fall 2004 Zhao Zhang.
Lecture # 10 Processors Microcomputer Processors.
New-School Machine Structures Parallel Requests Assigned to computer e.g., Search “Katz” Parallel Threads Assigned to core e.g., Lookup, Ads Parallel Instructions.
Computer Organization IS F242. Course Objective It aims at understanding and appreciating the computing system’s functional components, their characteristics,
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
Computer Architecture Furkan Rabee
Measuring Performance II and Logic Design
CS 310: Computer Organization and Programming
CSE 410, Spring 2006 Computer Systems
Morgan Kaufmann Publishers
Computer Architecture CSCE 350
CS775: Computer Architecture
BIC 10503: COMPUTER ARCHITECTURE
Morgan Kaufmann Publishers Computer Organization and Assembly Language
Computer Evolution and Performance
Lecture 3: MIPS Instruction Set
COMS 361 Computer Organization
Performance of computer systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Presentation transcript:

UTCS Lecture 1 1 CS 352: Computer Systems Architecture Lecture 1: What is Computer Architecture? January 22, 2007 Doug Burger Computer Architecture and Technology Laboratory University of Texas at Austin

UTCS Lecture 1 2 Goals Understand the “how” and “why” of computer system organziation –Instruction Set Architecture –System Organization (processor, memory, I/O) –Microarchitecture Learn methods of measuring and improving performance –Metrics –Benchmarks –Performance methods Pipelining, ILP, prediction Learn to think and program concurrently

UTCS Lecture 1 3 Logistics LecturesM/W 9:00-10:15 am, GEO InstructorProf. Doug Burger TADong Li GradingFinal Exam125% Midterm Exam215% each Homework~725% Project120% TextsPatterson & Hennessy, Computer Organization and Design (Third Edition) Course Readings (handed out in class)

UTCS Lecture 1 4 CS352 Online URL: Blackboard! Other stuff off of my home page (Course materials, research info) Computer Architecture Seminar Series:

UTCS Lecture 1 5 Specification Program ISA (Instruction Set Architecture) microArchitecture Logic Transistors Physics/Chemistry compute the fibonacci sequence for(i=2; i<100; i++) { a[i] = a[i-1]+a[i-2];} load r1, a[i]; add r2, r2, r1; registers A B S F G D S G S D

UTCS Lecture 1 6 CS352 Topics Technology Trends Instruction set architectures Pipelining Modern pipelined architectures –Dynamic ILP machines –Static ILP machines Cache memory systems Virtual memory Multiprocessors Computer system implementation

UTCS Lecture 1 7 What is Computer Architecture? Technology Applications Computer Architect Interfaces Machine Organization Measurement & Evaluation ISAAPI Link I/O Chan Regs IR

UTCS Lecture 1 8 Technology Constraints Yearly improvement –Semiconductor technology 60% more devices per chip (doubles every 18 months) 15% faster devices (doubles every 5 years) Slower wires –Magnetic Disks 60% increase in density –Circuit boards 5% increase in wire density –Cables no change x more devices since x faster devices 2002

UTCS Lecture 1 9 Changing Technology leads to Changing Architecture 1970s (CISC mainframes) –multi-chip CPUs –semiconductor memory very expensive –microcoded control –complex instruction sets (good code density) 1980s (RISC micros) –single-chip CPUs, on-chip RAM feasible –simple, hard-wired control –simple instruction sets –small on-chip caches 1990s (fast clocks) –lots of transistors –complex control to exploit instruction-level parallelism 2000s (???) –even more transistors –slow wires –BIG SHIFT Here!!! Parallelism is focus Power now critical Open debate

UTCS Lecture 1 10 Changing Technology leads to Changing Architecture 1970s (CISC mainframes) –multi-chip CPUs –semiconductor memory very expensive –microcoded control –complex instruction sets (good code density) 1980s (RISC micros) –single-chip CPUs, on-chip RAM feasible –simple, hard-wired control –simple instruction sets –small on-chip caches 1990s (fast clocks) –lots of transistors –complex control to exploit instruction-level parallelism 2000s (???) –even more transistors –slow wires –BIG SHIFT COMING!!! Parallelism is focus Power now critical Open debate

UTCS Lecture 1 11 Courtesy Intel

UTCS Lecture 1 12 Courtesy Troubador

UTCS Lecture 1 13 Courtesy Troubador

UTCS Lecture 1 14 Intel The first microprocessor 2,300 transistors 108 KHz 10  m process

UTCS Lecture 1 15 Intel Pentium IV “State of the art” 42 million transistors 2GHz 0.13  m process Could fit ~15, s on this chip!

UTCS Lecture 1 16 Application Constraints Applications drive machine ‘balance’ –Numerical simulations floating-point performance main memory bandwidth –Transaction processing I/Os per second integer CPU performance –Decision support I/O bandwidth –Embedded control I/O timing, power –Media processing low-precision ‘pixel’ arithmetic

UTCS Lecture 1 17 Interface Design A good interface –lasts through several generations of implementations IBM 360 and x86 ISAs, DOS APIs –is simple - ‘economy of mechanism’ Interfaces are visible, Implementations generally aren’t 3 Types of Interfaces –Between Layers API, ISA –Between Modules Network protocol (Ethernet), I/O channel or bus (SCSI or PCI) –Standard Representations ASCII, IEEE floating-point

UTCS Lecture 1 18 Instruction-Set Architecture Software impact –support OS functions restartable instructions memory relocation and protection –a good compiler target simple orthogonal –dense Hardware impact –admits efficient implementation across generations –admits parallel implementation no ‘serial’ bottlenecks Abstraction without interpretation OPR1R2R3imm OP R1M1 im2 R2M2R3M3 im2... Hardware/Software Interface

UTCS Lecture 1 19 System-Level Organization Design at the level of processors, memories, and interconnect. More important to application performance than CPU design Feeds and speeds –constrained by IC pin count, module pin count, and signaling rates System balance –for a particular application Driven by –performance/cost goals –available components (cost/perf) –technology constraints P SW 800MHz 4-way Issue 16Bytes x 200MHz I/O MMMM Disk Net Display

UTCS Lecture 1 20 Microarchitecture Register-transfer-level (RTL) design Implement instruction set Exploit capabilities of technology –locality and concurrency Iterative process –generate proposed architecture –estimate cost –measure performance Current emphasis is on overcoming sequential nature of programs –deep pipelining –multiple issue –dynamic scheduling –branch prediction/speculation Regs Instr. Cache IR PC B A C

UTCS Lecture 1 21 Performance Measurement and Evaluation CPU execution time –by instruction or sequence floating point integer branch performance Cache bandwidth Main memory bandwidth I/O performance –bandwidth –seeks –pixels or polygons per second Relative importance depends on applications P $ M Many Dimensions to Performance

UTCS Lecture 1 22 Evaluation Tools Benchmarks, traces, & mixes –macrobenchmarks & suites application execution time –microbenchmarks measure one aspect of performance –traces replay recorded accesses cache, branch, register Simulation at many levels –ISA, cycle accurate, RTL, gate, circuit trade fidelity for simulation rate Area and delay estimation Analysis –e.g., queuing theory MOVE39% BR20% LOAD20% STORE10% ALU11% LD 5EA3 ST 31FF …. LD 1EA2 ….

UTCS Lecture 1 23 Next Time Evaluation of Systems –Performance Amdahl’s Law, CPI –Cost Computer system elements –Transistors and wires Reading assignment –P&H Chapter 1,