Lecture 1: CS/ECE 3810 Introduction

Slides:



Advertisements
Similar presentations
Computer Abstractions and Technology
Advertisements

TU/e Processor Design 5Z0321 Processor Design 5Z032 Computer Systems Overview Chapter 1 Henk Corporaal Eindhoven University of Technology 2011.
Lecture 2: Modern Trends 1. 2 Microprocessor Performance Only 7% improvement in memory performance every year! 50% improvement in microprocessor performance.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
Room: E-3-31 Phone: Dr Masri Ayob TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Computer Performance.
1 School of Computing Science Simon Fraser University CMPT 300: Operating Systems I Dr. Mohamed Hefeeda.
1 Lecture 2: System Metrics and Pipelining Today’s topics: (Sections 1.6, 1.7, 1.9, A.1)  Quantitative principles of computer design  Measuring cost.
Computer Organization: Introduction Spring 2006 Jen-Chang Liu ( )
Computer Architecture Instructor: Wen-Hung Liao Office: 大仁樓三樓 Office hours: TBA Course web page:
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  logistics  why computer organization is important  modern trends.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
Computer performance.
Introduction Course Overview and Basic understanding of Computer Architecture.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
Introduction CSE 410, Spring 2008 Computer Systems
Last Time Performance Analysis It’s all relative
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  Why computer organization is important  Logistics  Modern trends.
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
1 Cpt_S 260: Lecture 1: Introduction This week’s topics:  logistics  information technology trends (new applications)  why computer architecture? 
1 CS/EE 6810: Computer Architecture Class format:  Most lectures on YouTube *BEFORE* class  Use class time for discussions, clarifications, problem-solving,
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
(1) ECE 3056: Architecture, Concurrency and Energy in Computation Lecture Notes by MKP and Sudhakar Yalamanchili Sudhakar Yalamanchili (Some small modifications.
Advanced Computer Architecture Fundamental of Computer Design Instruction Set Principles and Examples Pipelining:Basic and Intermediate Concepts Memory.
Computer System Design Lecture 1 Wannarat Suntiamorntut.
by Computer System Design Lecture 1 Wannarat Suntiamorntut
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
Computer Organization Yasser F. O. Mohammad 1. 2 Lecture 1: Introduction Today’s topics:  Why computer organization is important  Logistics  Modern.
CHAPTER 2 Instruction Set Architecture 3/21/
CC311 Computer Architecture Chapter 1 Computer Abstraction & Technology.
By Wannarat Computer System Design Lecture 1 Wannarat Suntiamorntut.
Introduction CSE 410, Spring 2005 Computer Systems
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
Computer Architecture & Operations I
William Stallings Computer Organization and Architecture 6th Edition
Welcome to CSE 502 Introduction.
Lecture 2: Performance Today’s topics:
Chapter 1: Introduction
Lecture 3: MIPS Instruction Set
ECE 3055: Computer Architecture and Operating Systems
Lecture 1: CS/ECE 3810 Introduction
Instruction Set Architecture
CIT 668: System Architecture
Stateless Combinational Logic and State Circuits
CSE 410, Spring 2006 Computer Systems
Morgan Kaufmann Publishers
ECE 154A Introduction to Computer Architecture
COSC 3406: Computer Organization
INTRODUCTION TO MICROPROCESSORS
Architecture & Organization 1
CS203 – Advanced Computer Architecture
INTRODUCTION TO MICROPROCESSORS
INTRODUCTION TO MICROPROCESSORS
Computer Architecture CSCE 350
CS775: Computer Architecture
Morgan Kaufmann Publishers
Lecture 2: Performance Today’s topics: Technology wrap-up
Architecture & Organization 1
Lecture 1: Parallel Architecture Intro
T Computer Architecture, Autumn 2005
CS/EE 6810: Computer Architecture
Chapter 1 Introduction.
Computer Evolution and Performance
Lecture 3: MIPS Instruction Set
COMS 361 Computer Organization
The University of Adelaide, School of Computer Science
Utsunomiya University
Lecture 1 Class Overview
Presentation transcript:

Lecture 1: CS/ECE 3810 Introduction Today’s topics: Why computer organization is important Logistics Modern trends

Why Computer Organization Image credits: uber, extremetech, anandtech

Why Computer Organization Image credits: gizmodo

Why Computer Organization Embarrassing if you are a BS in CS/CE and can’t make sense of the following terms: DRAM, pipelining, cache hierarchies, I/O, virtual memory, … Embarrassing if you are a BS in CS/CE and can’t decide which processor to buy: 3 GHz Xeon or 2.5 GHz Athlon (helps us reason about performance/power), … Obvious first step for chip designers, compiler/OS writers Will knowledge of the hardware help you write better and more secure programs?

Must a Programmer Care About Hardware? Must know how to reason about program performance and energy and security Memory management: if we understand how/where data is placed, we can help ensure that relevant data is nearby Thread management: if we understand how threads interact, we can write smarter multi-threaded programs  Why do we care about multi-threaded programs?

Example 200x speedup for matrix vector multiplication Data level parallelism: 3.8x Loop unrolling and out-of-order execution: 2.3x Cache blocking: 2.5x Thread level parallelism: 14x Further, can use accelerators to get an additional 100x.

Key Topics Moore’s Law, power wall Use of abstractions Assembly language Computer arithmetic Pipelining Using predictions Memory hierarchies Reliability and Security

Logistics See class web-page http://www.cs.utah.edu/~rajeev/cs3810 TAs: Anirban, Surya, Avani, Scott; Office hours: TBA Most communication on Canvas; email me directly to set up office hours, or meet me right after class Textbook: Computer Organization – HW/SW Interface, Patterson and Hennessy, 5th edition

Course Organization 30% midterm, 40% final, 30% assignments ~10 assignments – you may skip one; assignments due at the start of class (upload on Canvas) Co-operation policy: you may discuss – you may not see someone else’s written matter when writing your solution Exams are open-book and open-notes Print slides just before class Screencast YouTube videos

Microprocessor Performance Source: H&P Textbook 50% improvement every year!! What contributes to this improvement?

Power Consumption Trends Dyn power a activity x capacitance x voltage2 x frequency Voltage and frequency are somewhat constant now, while capacitance per transistor is decreasing and number of transistors (activity) is increasing Leakage power is also rising (function of #trans and voltage) Source: H&P Textbook

Important Trends Running out of ideas to improve single thread performance Power wall makes it harder to add complex features Power wall makes it harder to increase frequency

Important Trends Historical contributions to performance: Better processes (faster devices) ~20% Better circuits/pipelines ~15% Better organization/architecture ~15% In the future, bullet-2 will help little and bullet-1 will eventually disappear! Pentium P-Pro P-II P-III P-4 Itanium Montecito Year 1993 95 97 99 2000 2002 2005 Transistors 3.1M 5.5M 7.5M 9.5M 42M 300M 1720M Clock Speed 60M 200M 300M 500M 1500M 800M 1800M Moore’s Law in action At this point, adding transistors to a core yields little benefit

What Does This Mean to a Programmer? Today, one can expect only a 20% annual improvement; the improvement is even lower if the program is not multi-threaded A program needs many threads The threads need efficient synchronization and communication Data placement in the memory hierarchy is important Accelerators should be used when possible

Challenges for Hardware Designers Find efficient ways to improve single-thread performance and energy improve data sharing boost programmer productivity manage the memory system build accelerators for important kernels provide security

The HW/SW Interface a[i] = b[i] + c; Application software Compiler lw $15, 0($2) add $16, $15, $14 add $17, $15, $13 lw $18, 0($12) lw $19, 0($17) add $20, $18, $19 sw $20, 0($16) Systems software (OS, compiler) Assembler Hardware 000000101100000 110100000100010 …

Computer Components Input/output devices Secondary storage: non-volatile, slower, cheaper Primary storage: volatile, faster, costlier CPU/processor (datapath and control)

Wafers and Dies Source: H&P Textbook

Manufacturing Process Silicon wafers undergo many processing steps so that different parts of the wafer behave as insulators, conductors, and transistors (switches) Multiple metal layers on the silicon enable connections between transistors The wafer is chopped into many dies – the size of the die determines yield and cost

Processor Technology Trends Shrinking of transistor sizes: 250nm (1997)  130nm (2002)  70nm (2008)  35nm (2014) Transistor density increases by 35% per year and die size increases by 10-20% per year… functionality improvements! Transistor speed improves linearly with size (complex equation involving voltages, resistances, capacitances) Wire delays do not scale down at the same rate as transistor delays

Memory and I/O Technology Trends DRAM density increases by 40-60% per year, latency has reduced by 33% in 10 years (the memory wall!), bandwidth improves twice as fast as latency decreases Disk density improves by 100% every year, latency improvement similar to DRAM Networks: primary focus on bandwidth; 10Mb  100Mb in 10 years; 100Mb  1Gb in 5 years

Next Class Topics: Performance, MIPS instruction set architecture (Chapter 2) Visit the class web-page http://www.cs.utah.edu/~rajeev/cs3810

Title Bullet