Caltech CS184 Spring2005 -- DeHon 1 CS184b: Computer Architecture (Abstractions and Optimizations) Day 1: March 28, 2005 Architecture Intro.

Slides:



Advertisements
Similar presentations
Slide 1Michael Flynn EE382 Winter/99 EE382 Processor Design Stanford University Winter Quarter Instructor: Michael Flynn Teaching Assistant:
Advertisements

ENGS 116 Lecture 101 ILP: Software Approaches Vincent H. Berk October 12 th Reading for today: , 4.1 Reading for Friday: 4.2 – 4.6 Homework #2:
CPE 731 Advanced Computer Architecture ILP: Part V – Multiple Issue Dr. Gheith Abandah Adapted from the slides of Prof. David Patterson, University of.
POLITECNICO DI MILANO Parallelism in wonderland: are you ready to see how deep the rabbit hole goes? ILP: VLIW Architectures Marco D. Santambrogio:
Rung-Bin Lin Chapter 4: Exploiting Instruction-Level Parallelism with Software Approaches4-1 Chapter 4 Exploiting Instruction-Level Parallelism with Software.
EDA (CS286.5b) Day 10 Scheduling (Intro Branch-and-Bound)
CS2422 Assembly Language & System Programming September 19, 2006.
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Packet-Switched vs. Time-Multiplexed FPGA Overlay Networks Kapre et. al RC Reading Group – 3/29/2006 Presenter: Ilya Tabakh.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon, Dec 5, 2005 Topic: Intro to Multiprocessors and Thread-Level Parallelism.
ECE669 L11: Static Routing Architectures March 4, 2004 ECE 669 Parallel Computer Architecture Lecture 11 Static Routing Architectures.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day1:
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
1 Interfacing Processors and Peripherals I/O Design affected by many factors (expandability, resilience) Performance: — access latency — throughput — connection.
Chapter 13 Embedded Systems
1  1998 Morgan Kaufmann Publishers Lectures for 2nd Edition Note: these lectures are often supplemented with other materials and also problems from the.
Memory Management 1 CS502 Spring 2006 Memory Management CS-502 Spring 2006.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
CS-3013 & CS-502, Summer 2006 Memory Management1 CS-3013 & CS-502 Summer 2006.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 22: April 4, 2007 Interconnect 7: Time Multiplexed Interconnect.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
RISC. Rational Behind RISC Few of the complex instructions were used –data movement – 45% –ALU ops – 25% –branching – 30% Cheaper memory VLSI technology.
CS 300 – Lecture 2 Intro to Computer Architecture / Assembly Language History.
CS294-6 Reconfigurable Computing Day 3 September 1, 1998 Requirements for Computing Devices.
COMP25212 SYSTEM ARCHITECTURE Antoniu Pop Jan/Feb 2015COMP25212 Lecture 1.
Penn ESE Spring DeHon 1 FUTURE Timing seemed good However, only student to give feedback marked confusing (2 of 5 on clarity) and too fast.
Basics of Operating Systems March 4, 2001 Adapted from Operating Systems Lecture Notes, Copyright 1997 Martin C. Rinard.
1 Instant replay  The semester was split into roughly four parts. —The 1st quarter covered instruction set architectures—the connection between software.
Caltech CS184 Spring DeHon 1 CS184b: Computer Architecture (Abstractions and Optimizations) Day 1: March 31, 2003 Architecture Intro.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
Caltech CS184 Spring DeHon 1 CS184b: Computer Architecture (Abstractions and Optimizations) Day 2: March 30, 2005 Instruction Set Architecture.
Introduction CSE 410, Spring 2008 Computer Systems
CBSSS 2002: DeHon Architecture as Interface André DeHon Friday, June 21, 2002.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day3:
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  Why computer organization is important  Logistics  Modern trends.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day12:
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
Introduction and Overview Summer 2014 COMP 2130 Introduction to Computer Systems Computing Science Thompson Rivers University.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day15:
COMP25212: System Architecture Lecturers Alasdair Rawsthorne Daniel Goodman
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day14:
Super computers Parallel Processing By Lecturer: Aisha Dawood.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day5:
The Cosmic Cube Charles L. Seitz Presented By: Jason D. Robey 2 APR 03.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day16:
CS 460/660 Compiler Construction. Class 01 2 Why Study Compilers? Compilers are important – –Responsible for many aspects of system performance Compilers.
ECEG-3202 Computer Architecture and Organization Chapter 7 Reduced Instruction Set Computers.
Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance Presented by: Peyman Nov 2007.
(Superficial!) Review of Uniprocessor Architecture Parallel Architectures and Related concepts CS 433 Laxmikant Kale University of Illinois at Urbana-Champaign.
CALTECH cs184c Spring DeHon CS184c: Computer Architecture [Parallel and Multithreaded] Day 1: April 3, 2001 Overview and Message Passing.
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO CS 219 Computer Organization.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
CALTECH cs184c Spring DeHon CS184c: Computer Architecture [Parallel and Multithreaded] Day 9: May 3, 2001 Distributed Shared Memory.
Computer Organization Yasser F. O. Mohammad 1. 2 Lecture 1: Introduction Today’s topics:  Why computer organization is important  Logistics  Modern.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day11:
1 Adapted from UC Berkeley CS252 S01 Lecture 17: Reducing Cache Miss Penalty and Reducing Cache Hit Time Hardware prefetching and stream buffer, software.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 15: March 13, 2013 High Level Synthesis II Dataflow Graph Sharing.
CSCI/CMPE 4334 Operating Systems Review: Exam 1 1.
Spring 2003CSE P5481 WaveScalar and the WaveCache Steven Swanson Ken Michelson Mark Oskin Tom Anderson Susan Eggers University of Washington.
Introduction CSE 410, Spring 2005 Computer Systems
CS703 - Advanced Operating Systems By Mr. Farhan Zaidi.
CPE 731 Advanced Computer Architecture ILP: Part V – Multiple Issue
CS184a: Computer Architecture (Structure and Organization)
CS203 – Advanced Computer Architecture
CC423: Advanced Computer Architecture ILP: Part V – Multiple Issue
COMS 361 Computer Organization
Virtual Memory: Working Sets
Dr. Clincy Professor of CS
Presentation transcript:

Caltech CS184 Spring DeHon 1 CS184b: Computer Architecture (Abstractions and Optimizations) Day 1: March 28, 2005 Architecture Intro

Caltech CS184 Spring DeHon 2 Today This Quarter What is Architecture? –Why? Project Overivew

Caltech CS184 Spring DeHon 3 CS184 Sequence A - structure and organization –raw components, building blocks –design space B – architectural abstractions and optimization –emphasis on abstractions and optimizations including quantification –single and multiple threads

Caltech CS184 Spring DeHon 4 Topics this Quarter (1 of 2) “Architecture” Instruction-Set Architecture (ISA) –including pipeline parallelism Instruction-Level Parallelism (ILP) Memory Architecture and Optimization –Caching and Virtual Memory Binary Translation

Caltech CS184 Spring DeHon 5 Topics (2 of 2) Dataflow Multithreaded Message Passing Shared Memory Vector/SIMD Multiprocessor Interface/Interconnect Defect and Fault Tolerance

Caltech CS184 Spring DeHon 6 Material Lots of material – will go fast …probably going to hit exposure over details

Caltech CS184 Spring DeHon 7 Lectures Same scheme as last term –Schedule MWF –Accommodate holes as necessary –Currently have 25 lectures on queue

Caltech CS184 Spring DeHon 8 Reading Will rely on much more than last term Will use textbook (Hennessy and Patterson) –chapters 1-6 this term Lectures more to complement text than completely overlap –going to cover some pretty rich topics …can’t do it in hours of lecture Classic papers

Caltech CS184 Spring DeHon 9 Assignments Pull from text Help drive working familiarity with conventional architecture techniques and optimizations

Caltech CS184 Spring DeHon 10 Logistics Four assignments on single threaded architectures –Due Monday 9am (out prev. Mon. class) –Still want electronic no handwriting/hand drawing Project –Weekly milestones, starting next week –Sometimes will have both

Caltech CS184 Spring DeHon 11 Themes for Quarter Recurring –“cached” answers and change –merit analysis (cost/performance) –dominant/bottleneck resource requirements –structure/common case

Caltech CS184 Spring DeHon 12 Themes for Quarter New/new focus –measurement –abstractions/semantics –abstractions 0, 1, infinity –dynamic data/event handling (vs. static) –predictability (avg. vs. worst case)

Caltech CS184 Spring DeHon 13 “Architecture” What? Why?

Caltech CS184 Spring DeHon 14 “Architecture” “attributes of a system as seen by the programmer” “conceptual structure and functional behavior” Defines the visible interface between the hardware and software Defines the semantics of the program (machine code)

Caltech CS184 Spring DeHon 15 Architecture distinguished from Implementation IA32 architecture vs. –80486DX2, AMD K5, Pentium-II-700, P6 VAX architectures vs. –11/750, 11/780, uVax-II PowerPC vs. –PPC 601, 604, 630 … Alpha vs. –EV4, 21164, 21264, … Admits to many different implementations of single architecture

Caltech CS184 Spring DeHon 16 Example Distinction: Memory Implementation Abstraction: large-flat memory Implementation: –multiple-levels of caches, varying sizes –virtual memory, with data residing on disk –relocation of physical memory placement One simple abstraction –hides details of implementation/timing Many implementations –varying costs, performance, technology

Caltech CS184 Spring DeHon 17 Why ? What’s the value of this distinction? Why do we have it? What does it cost?

Caltech CS184 Spring DeHon 18 Value? Effort Economics Software Distribution

Caltech CS184 Spring DeHon 19 Software Crisis Mid 1960’s –Could build new machines at reasonable pace –Could not develop software for new machines fast enough

Caltech CS184 Spring DeHon 20 Historical Anecdotes Zuse from The Computer, My Life Brooks from Software Pioneers

Caltech CS184 Spring DeHon 21 Value: Effort Reduce/minimize effort necessary to exploit new/different technology Number of programmers is small Rate of new machine/technology advance is large Key enabler to riding the technology curve

Caltech CS184 Spring DeHon 22 Value: Economics Preserve software investment –both uniquely developed and commercial Lower barrier to acceptance of new machine –all your old code runs…just faster! Offer range of scaling: –need more power  buy different/better/newer machine –have less money  buy the cheaper machine –little/no software effort to support

Caltech CS184 Spring DeHon 23 Architecture Benefits ISA addressed the “software crisis” –Bottleneck to exploiting new machines was the need to write new software suites for them Preserve investment in software –Programmer education Permitted innovation in hardware –Use more/less hardware –Allow customers buy as much machine as they need –New substrates: TTL, ECL, NMOS, CMOS…

Caltech CS184 Spring DeHon 24 Value: Software Distribution Vendor not want to sell source –“give away” their techniques/technology/IP in a way which can be co-opted/reused –[pragmatic argument, not fundamental]

Caltech CS184 Spring DeHon 25 Pragmatic: Binary vs. Source Compatibility For various software engineering reasons (failures?) –source notoriously bad/problematic to port to new machine –entire application not all packaged up in one place must find compatible libraries, compiler, compiler options, header files… different (newer) compilers give different results

Caltech CS184 Spring DeHon 26 Pragmatic: Binary vs. Source Compatibility For various software engineering reasons (failures?) People generally more comfortable with binary compatibility ABI/Binary architectural definition smaller/tighter and more well defined? André: Shouldn’t have to be this way…but that’s where we are today

Caltech CS184 Spring DeHon 27 Fixed Points Architecture requires we “fix” the interface Trick is picking what to expose in the interface and fix, and what to hide What are the “fixed points?” –how you describe the computation –primitive operations the machine understands –primitive data types –interface to memory, I/O –interface to system routines?

Caltech CS184 Spring DeHon 28 Abstract Away? Specific sizes –what fits in on-chip memory –available memory (to some extent) –number of peripherals –where 0, 1, infinity comes in Timing –individual operations –resources (e.g. memory)

Caltech CS184 Spring DeHon 29 Architectural Scalability Depends on robustness of fixed-points –address space –number of registers? –operations available right level of abstraction? –Adequate primitives e.g. atomic ops –sequential assumptions –single memory? –timing assumptions e.g. branch delay, architectural cycles per op?

Caltech CS184 Spring DeHon 30 Change: Future like the past? VM/JIT compilation Binary Translation More advanced compiler technology and algorithms Architectural convergence? –Single Threaded ISA Maturity?

Caltech CS184 Spring DeHon 31 Conventional, Single- Threaded Abstraction Single, large, flat memory sequential, control-flow execution instruction-by-instruction sequential execution atomic instructions single-thread “owns” entire machine –isolation byte addressability unbounded memory, call depth

Caltech CS184 Spring DeHon 32 Embodiment C+OS-API –C+unix-API, C+Windows-API Compile to: –ISA+OS-ABI e.g. x86+linux-ABI Wrap up in standard, executable definition –e.g. a.out

Caltech CS184 Spring DeHon 33 Abstractions Model for fist half of course How support? How optimize? Remarkable –How far implementation can diverge

Caltech CS184 Spring DeHon 34 Project

Caltech CS184 Spring DeHon 35 Project: Graph Machine Network Look at Inter-PCE network For: –ConceptNet –SMVM Answer: –When should it be packet-switched vs. time-multiplexed? –Characterized cost/benefits of each

Caltech CS184 Spring DeHon 36 Project Design Overlay network for FPGA Write VHDL Map to Xilinx Component (Virtex2) Get Area, Timing

Caltech CS184 Spring DeHon 37 Project Steps 1.Get familiar with VHDL, build fast SRL queues 2.Build switching primitives 3.Assemble target switches 4.Assemble network, characterize size/density tradeoffs 5.Route/Simulate Traffic on designs and assess route-time (utilization) 6.Defect and Fault support 7.Custom implementation estimation

Caltech CS184 Spring DeHon 38 Intuitive Tradeoff Benefit of Time-Multiplexing? Cost of Time-Multiplexing? Benefit of Packet Switching? Cost of Packet Switching?

Caltech CS184 Spring DeHon 39 Intuitive Tradeoff Benefit of Time-Multiplexing? –Minimum end-to-end latency –No added decision latency at runtime –Offline route  high quality route  use wires efficiently Cost of Time-Multiplexing? –Route task must be static Cannot exploit low activity –Need memory bit per switch per time step Lots of memory if need large number of time steps…

Caltech CS184 Spring DeHon 40 Intuitive Tradeoff Benefit of Packet Switching? –No area proportional to time steps –Route only active connections –Avoids slow, off-line routing Cost of Packet Switching? –Online decision making Maybe won’t use wires as well Potentially slower routing? –Slower clock, more clocks across net –Data will be blocked in network Adds latency Requires packet queues

Caltech CS184 Spring DeHon 41 Packet Switch Motivations SMVM: –Long offline routing time limits applicability –Route memory exceeds compute memory for large matricies ConceptNet: –Evidence of low activity for keyword retrieval … could be important to exploit

Caltech CS184 Spring DeHon 42 Example ConceptNet retrieval –Visits 84K nodes across all time steps –150K nodes –8 steps  1.2M node visits –Activity less than 7%

Caltech CS184 Spring DeHon 43 Dishoom/ConceptNet Estimates Tstep  2 9 /N z +1500/N z +48+4(N z -1) Pushing all nodes, all edges; Bandwidth (T load ) dominates.

Caltech CS184 Spring DeHon 44 Question For what activity factor does Packet Switching beat Time Multiplexed Routing? –To what extent is this also a function of total time steps? Activity Time Steps TM Packet

Caltech CS184 Spring DeHon 45 Wrapup

Caltech CS184 Spring DeHon 46 Big Ideas Architectural abstraction –define the fixed points –stable abstraction to programmer –admit to variety of implementation –ease adoption/exploitation of new hardware –reduce human effort