Lecture 1 1 CS 352H: Computer Systems Architecture Lecture 1: What is Computer Architecture and why should I care? Professor Emmett Witchel University.

Slides:



Advertisements
Similar presentations
Computer Abstractions and Technology
Advertisements

Chapter1 Fundamental of Computer Design Dr. Bernard Chen Ph.D. University of Central Arkansas.
Chapter 5 Computing Components. The (META) BIG IDEA Cool, idea but maybe too big DATA – Must be stored somewhere in a storage device PROCESSING – Data.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
EEM 486 EEM 486: Computer Architecture Lecture 1 Course Introduction and the Five Components of a Computer.
Room: E-3-31 Phone: Dr Masri Ayob TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Computer Performance.
COMP3221: Microprocessors and Embedded Systems Lecture 2: Instruction Set Architecture (ISA) Lecturer: Hui Wu Session.
Chapter 1. Introduction This course is all about how computers work But what do we mean by a computer? –Different types: desktop, servers, embedded devices.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
CIS 314 : Computer Organization Lecture 1 – Introduction.
COMP25212 SYSTEM ARCHITECTURE Antoniu Pop Jan/Feb 2015COMP25212 Lecture 1.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  logistics  why computer organization is important  modern trends.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
Computer Organization and Assembly language
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Computer performance.
+ CS 325: CS Hardware and Software Organization and Architecture Introduction.
Chapter1 Fundamental of Computer Design Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
1 From Processes to Threads. 2 Processes, Threads and Processors Hardware can interpret N instruction streams at once  Uniprocessor, N==1  Dual-core,
The Computer Systems By : Prabir Nandi Computer Instructor KV Lumding.
Computer Science 1000 Terminology. The Language of Computer Science field is notorious for cryptic terms WYSIWYG GPU flops even recognizable terms may.
 Design model for a computer  Named after John von Neuman  Instructions that tell the computer what to do are stored in memory  Stored program Memory.
Introduction CSE 410, Spring 2008 Computer Systems
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  Why computer organization is important  Logistics  Modern trends.
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
CS/ECE 3330 Computer Architecture Kim Hazelwood Fall 2009.
1 Computer Architecture Research Overview Rajeev Balasubramonian School of Computing, University of Utah
1 Recap (from Previous Lecture). 2 Computer Architecture Computer Architecture involves 3 inter- related components – Instruction set architecture (ISA):
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 5810/6810: Hennessy.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Lecture 1 1 Computer Systems Architecture Lecture 1: What is Computer Architecture?
UTCS Lecture 1 1 CS 352: Computer Systems Architecture Lecture 1: What is Computer Architecture? January 22, 2007 Doug Burger Computer Architecture and.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
COMP25212: System Architecture Lecturers Alasdair Rawsthorne Daniel Goodman
Computer Organization and Design Computer Abstractions and Technology
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Computer Organization & Assembly Language © by DR. M. Amer.
Introduction.  This course is all about how computers work  But what do we mean by a computer?  Different types: desktop, servers, embedded devices.
Department of Industrial Engineering Sharif University of Technology Session# 6.
CS352H: Computer Systems Architecture Lecture 1: What Is Computer Architecture? August 27, 2009 University of Texas Computer Science CS352H Fall 2009 Don.
University of Texas at Austin CS310H – Computer Organization and Programming Spring 2008 Don Fussell CS 310H: Computer Organization and Programming (Honors)
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO CS 219 Computer Organization.
Computer Organization Yasser F. O. Mohammad 1. 2 Lecture 1: Introduction Today’s topics:  Why computer organization is important  Logistics  Modern.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Lecture # 10 Processors Microcomputer Processors.
CS203 – Advanced Computer Architecture Performance Evaluation.
New-School Machine Structures Parallel Requests Assigned to computer e.g., Search “Katz” Parallel Threads Assigned to core e.g., Lookup, Ads Parallel Instructions.
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
Hardware Architecture
William Stallings Computer Organization and Architecture 6th Edition
CS203 – Advanced Computer Architecture
Lecture 1: CS/ECE 3810 Introduction
CS 310: Computer Organization and Programming
Morgan Kaufmann Publishers
Architecture & Organization 1
Unit 2 Computer Systems HND in Computing and Systems Development
Lecture 1: CS/ECE 3810 Introduction
Multicultural Social Community Development Institute ( MSCDI)
Morgan Kaufmann Publishers
Architecture & Organization 1
BIC 10503: COMPUTER ARCHITECTURE
CS/EE 6810: Computer Architecture
Chapter 1 Introduction.
Computer Evolution and Performance
Overview Prof. Eric Rotenberg
The University of Adelaide, School of Computer Science
Utsunomiya University
Presentation transcript:

Lecture 1 1 CS 352H: Computer Systems Architecture Lecture 1: What is Computer Architecture and why should I care? Professor Emmett Witchel University of Texas at Austin

Lecture 1 2 Goals Understand the “how” and “why” of computer system organization –Instruction Set Architecture –System Organization (processor, memory, I/O) –Microarchitecture –Virtualization Learn methods of evaluating performance –Metrics & benchmarks Learn how to make systems go fast –Pipelining, caching –Parallelism (ILP, DLP, TLP) –Application specific architectures (graphics, signal proc.) Preview of where architecture is heading

Lecture 1 3 Logistics LecturesT/Th 12:30-2:00pm, PAI 3.14 InstructorProf. Emmett Witchel, W 1:15-2:15 TAShalini Sahoo MW 11:30-1:00pm PAI 5.38 Desk1 Gradingsee web page TextsHennessy & Patterson, Computer Organization and Design (Fourth Edition) Including CD Revised Fourth Edition preferred, not required

Lecture 1 4 CS352H Online URL: I will occasionally you via blackboard and by your registered address. I expect this channel to be reliable and timely. discussion group: via blackboard login at courses.utexas.edu General, Homeworks, Project Computer Architecture Seminar Series:

Lecture 1 5 Assignment for Next Tuesday Turn in student survey forms, if you want Read the Moore paper (see webpage) –Write a review of 1/2-1 page (see syllabus) –Review should include Summary of content of paper Your observations on the most interesting/important aspects Your observations on its relevance today –Be prepared to discuss on Tuesday in class

Discussion Are you interested in taking this course? One question about computer science One question about computer architecture CS352H Fall 2007 Lecture 1 6

7 Specification Program ISA (Instruction Set Architecture) microArchitecture Logic Transistors Physics/Chemistry compute the fibonacci sequence for(i=2; i<100; i++) { a[i] = a[i-1]+a[i-2];} load r1, a[i]; add r2, r2, r1; registers A B S F G D S G S D Arch vs. µarch

Lecture 1 8 CS352H Topics Technology Trends Instruction set architectures Pipelining Modern pipelined architectures –Dynamic ILP machines –Static ILP machines Cache memory systems Virtual memory Multiprocessors Computer system implementation

Making This Class Work For You Plus and minus grades Clickers CS352H Fall 2007 Lecture 1 9

10 What is Computer Architecture? Technology Applications Computer Architect Interfaces Machine Organization Measurement & Evaluation ISAAPI Link I/O Chan Regs IR

Lecture 1 11 Technology Constraints Yearly improvement –Semiconductor technology 60% more devices per chip (doubles every 18 months) 15% faster devices (doubles every 5 years) Slower wires –Magnetic Disks 60% increase in density –Circuit boards 5% increase in wire density –Cables no change >100x more devices since x faster devices nm130nm 1000nm 800nm 350nm 250nm

Lecture 1 12 Changing Technology leads to Changing Architecture 1970s –multi-chip CPUs –semiconductor memory very expensive –microcoded control –complex instruction sets (good code density) 1980s –single-chip CPUs, on-chip RAM feasible –simple, hard-wired control –simple instruction sets –small on-chip caches 1990s –lots of transistors –complex control to exploit instruction-level parallelism 2000s –even more transistors –Power wall –Transition to CMPs –Multi-level caches 2010s –Embedded vs. Desktop vs. Data center (cloud) –New storage (PCM, flash) –Simpler cores and lots of them –Optimizing for power

Lecture 1 13 Intel The first microprocessor 2,300 transistors 108 KHz 10  m process

Lecture 1 14 Some Recent Chips! Intel Pentium IV 42 million transistors 4GHz 0.13  m process Could fit ~15, s on this chip! NVidia - GeForce million transistors 400MHz 0.13  m process Intel Itanium II (Montecito) 1.7 billion transistors 1.6 GHz 90nm process IBM Cell 8 vector processors + 1 PPC 4 GHz 90nm process Intel’s net revenue was around $35 billion a year for most of the aughts R&D about $5 billion a year

CS352H Fall 2007 Lecture 1 15 Any Architecture You Want (as long as it is x86)

Lecture 1 16 Application Constraints Applications drive machine ‘balance’ –Numerical simulations floating-point performance main memory bandwidth –Transaction processing I/Os per second integer CPU performance –Decision support I/O bandwidth –Embedded control I/O timing, power –Media processing low-precision ‘pixel’ arithmetic

Lecture 1 17 Application-Driven Architectures General purpose - good performance on “all” programs –x86 family, ARM, powerPC, etc. Application specificity can focus on: –Types of concurrency available –Domain of deployment (server, handheld, desktop) Today - overview of graphics processors –Interface (instruction set architecture - ISA) –Processor organization –Concurrent elements

Apple’s iPad/iPhone4 Powered by A4 Chip A4 is modified ARM Cortex run at 1GHz –Integrated processor, graphics, memory controller Among other claims, ARM says the processors gets a near "25 percent processing power boost, even at same processor speed, from the use of a new instruction pipelining system." –We will cover pipelining in this class. Claim: 10 hours of 1024x768 video at 25W Let’s look at the Freescale i.MX51 CS352H Fall 2007 Lecture 1 18

Performance: Latency and Throughput Latency: time to complete an operation Throughput: work completed per unit time Consider plumbing –Low latency: turn on faucet and water comes out –High bandwidth: lots of water (e.g., to fill a pool) What is “High speed Internet?” –Low latency: needed to interactive gaming –High bandwidth: needed for downloading large files –Marketing departments like to conflate latency and bandwidth…

Relationship between Latency and Throughput Latency and bandwidth only loosely coupled –Henry Ford: assembly lines increase bandwidth without reducing latency My factory takes 1 day to make a Model-T ford. –But I can start building a new car every 10 minutes –At 24 hrs/day, I can make 24 * 6 = 144 cars per day –A special order for 1 green car, still takes 1 day –Throughput is increased, but latency is not. Latency reduction is difficult Often, one can buy bandwidth –E.g., more memory chips, more disks, more computers –Big server farms (e.g., google) are high bandwidth

What is cloud computing? Cloud computing is where dynamically scalable and often virtualized resources are provided as a service over the Internet (thanks, wikipedia!) Infrastructure as a service (IaaS) –Amazon’s EC2 (elastic compute cloud) Platform as a service (PaaS) –Google gears –Microsoft azure Software as a service (SaaS) –gmail –facebook –flickr

Thanks, James Hamilton, amazon

Lecture 1 23 Graphics has dedicated chip in PCs CPU Memory Input/Output Glue Chip (“South Bridge”) Graphics Processor Memory Controller Chip (“North Bridge”) Memory Disk, Keyboard, PCIe, etc. 582 Million transistors 681 Million transistors (GeForce 8800, 90nm) (AGP, PCIe) (Intel “Kentsfield” quad core, QX6700, 65nm, two dies, 8MB L2$)

Lecture 1 24 GPU/CPU Performance comparison GFLOPS G80 = GeForce 8800 GTX G71 = GeForce 7900 GTX G70 = GeForce 7800 GTX NV40 = GeForce 6800 Ultra NV35 = GeForce FX 5950 Ultra NV30 = GeForce FX 5800 Source: NVIDIA (except CELL and Core2 Quad) * IBM Cell ~200 GFlops Core 2 Quad 3GHz, 96 GFLOPS *

CS352H Fall 2007 Lecture 1 25 Why a dedicated processing chip? 1) Specialization – becoming less important with time 2) Parallelism – becoming more important Graphics processors are the only highly-parallel processors in every desktop machine. 128 “processors” * GHz You can program them!

Lecture 1 26 Graphics requires programmability void normalmapped(float2 normalMapTexCoord : TEXCOORD0, … out float4 color : COLOR, uniform float ambient, …) { float3 normalTex, …; normalTex = tex2D(normalMap, normalMapTexCoord).xyz; … diffuse = saturate(dot(normal, normLightDir); … color = Kd * (ambient + diffuse ) + Ks * pow(specular, specularExponent; } Every application does something a bit different. Example Cg “shader” program (invoked like a “callback” function):

Lecture 1 27 GeForce 8800

Lecture 1 28 Next Time Performance evaluation Basic computer organization How chips are made Start in on instruction set review/overview Always check web page for assignments