Processor Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University See P&H Chapter 2.16-20, 4.1-4.

Slides:



Advertisements
Similar presentations
Chapter 2 Instructions: Language of the Computer
Advertisements

Chapter 2.
331 W08.1Spring :332:331 Computer Architecture and Assembly Language Spring 2006 Week 8: Datapath Design [Adapted from Dave Patterson’s UCB CS152.
Kevin Walsh CS 3410, Spring 2010 Computer Science Cornell University A Processor See: P&H Chapter ,
CS 3410, Spring 2014 Computer Science Cornell University See P&H Chapter: , 1.4, Appendix A.
CS61C L25 CPU Design : Designing a Single-Cycle CPU (1) Garcia, Fall 2006 © UCB T-Mobile’s Wi-Fi / Cell phone  T-mobile just announced a new phone that.
The Processor 2 Andreas Klappenecker CPSC321 Computer Architecture.
Lecture 5 Sept 14 Goals: Chapter 2 continued MIPS assembly language instruction formats translating c into MIPS - examples.
Inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 25 CPU design (of a single-cycle CPU) Intel is prototyping circuits that.
Prof. Hakim Weatherspoon CS 3410, Spring 2015 Computer Science Cornell University See P&H Chapter: , 1.6, Appendix B.
Processor I CPSC 321 Andreas Klappenecker. Midterm 1 Thursday, October 7, during the regular class time Covers all material up to that point History MIPS.
Prof. Hakim Weatherspoon CS 3410, Spring 2015 Computer Science Cornell University See P&H Chapter: , , Appendix B.
Hakim Weatherspoon CS 3410, Spring 2010 Computer Science Cornell University A Processor See: P&H Chapter ,
Prof. Hakim Weatherspoon CS 3410, Spring 2015 Computer Science Cornell University See P&H Appendix B.8 (register files) and B.9.
CS3350B Computer Architecture Winter 2015 Lecture 5.6: Single-Cycle CPU: Datapath Control (Part 1) Marc Moreno Maza [Adapted.
COSC 3430 L08 Basic MIPS Architecture.1 COSC 3430 Computer Architecture Lecture 08 Processors Single cycle Datapath PH 3: Sections
Prof. Hakim Weatherspoon CS 3410, Spring 2015 Computer Science Cornell University See P&H Chapter: , 1.6, Appendix B.
Memory Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University.
CPU Performance Pipelined CPU Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University See P&H Chapters 1.4 and 4.5.
Hakim Weatherspoon CS 3410, Spring 2012 Computer Science Cornell University CPU Performance Pipelined CPU See P&H Chapters 1.4 and 4.5.
Module : Algorithmic state machines. Machine language Machine language is built up from discrete statements or instructions. On the processing architecture,
MIPS Pipeline Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University See P&H Chapter 4.6.
Computer Organization CS224 Fall 2012 Lesson 22. The Big Picture  The Five Classic Components of a Computer  Chapter 4 Topic: Processor Design Control.
EEM 486: Computer Architecture Designing a Single Cycle Datapath.
CPS3340 COMPUTER ARCHITECTURE Fall Semester, /08/2013 Lecture 10: MIPS Instructor: Ashraf Yaseen DEPARTMENT OF MATH & COMPUTER SCIENCE CENTRAL STATE.
CS2100 Computer Organisation The Processor: Datapath (AY2015/6) Semester 1.
Computer Architecture and Design – ECEN 350 Part 6 [Some slides adapted from A. Sprintson, M. Irwin, D. Paterson and others]
CHAPTER 6 Instruction Set Architecture 12/7/
Datapath and Control Unit Design
1 A single-cycle MIPS processor  An instruction set architecture is an interface that defines the hardware operations which are available to software.
Computer Organization CS224 Fall 2012 Lessons 7 and 8.
Hakim Weatherspoon CS 3410, Spring 2012 Computer Science Cornell University MIPS Pipeline See P&H Chapter 4.6.
MIPS Instructions Instructions: Language of the Machine
Hakim Weatherspoon CS 3410, Spring 2012 Computer Science Cornell University MIPS Pipeline See P&H Chapter 4.6.
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO Session 7, 8 Instruction Set Architecture.
MIPS Instruction Set Architecture Prof. Sirer CS 316 Cornell University.
Hakim Weatherspoon CS 3410, Spring 2012 Computer Science Cornell University A Processor See: P&H Chapter ,
COM181 Computer Hardware Lecture 6: The MIPs CPU.
Processor Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University See P&H Chapter ,
CPU Performance Pipelined CPU Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University See P&H Chapters 1.4 and 4.5.
CS 61C: Great Ideas in Computer Architecture MIPS Datapath 1 Instructors: Nicholas Weaver & Vladimir Stojanovic
Computer Architecture Lecture 6.  Our implementation of the MIPS is simplified memory-reference instructions: lw, sw arithmetic-logical instructions:
Electrical and Computer Engineering University of Cyprus
COMPUTER ARCHITECTURE & OPERATIONS I
IT 251 Computer Organization and Architecture
Prof. Sirer CS 316 Cornell University
The University of Adelaide, School of Computer Science
Instructions - Type and Format
Han Wang CS3410, Spring 2012 Computer Science Cornell University
CSCI206 - Computer Organization & Programming
Computer Architecture & Operations I
The University of Adelaide, School of Computer Science
ECE232: Hardware Organization and Design
Hakim Weatherspoon CS 3410 Computer Science Cornell University
Rocky K. C. Chang 6 November 2017
The Processor Lecture 3.1: Introduction & Logic Design Conventions
Guest Lecturer TA: Shreyas Chand
COMS 361 Computer Organization
Computer Architecture
Prof. Sirer CS 316 Cornell University
UCSD ECE 111 Prof. Farinaz Koushanfar Fall 2018
MIPS Instruction Set Architecture
Hakim Weatherspoon CS 3410 Computer Science Cornell University
The RISC-V Processor Hakim Weatherspoon CS 3410 Computer Science
MIPS Pipeline Hakim Weatherspoon CS 3410, Spring 2013 Computer Science
The RISC-V Processor Hakim Weatherspoon CS 3410 Computer Science
Instruction Set Architecture
CS 3410, Spring 2014 Computer Science Cornell University
Processor: Datapath and Control
Presentation transcript:

Processor Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University See P&H Chapter , 4.1-4

Big Picture: Building a Processor PC imm memory target offset cmp control =? new pc memory d in d out addr register file inst extend +4 A Single cycle processor alu

Goal for Today Understanding the basics of a processor We now have enough building blocks to build machines that can perform non-trivial computational tasks Putting it all together: Arithmetic Logic Unit (ALU)—Lab0 & 1, Lecture 2 & 3 Register File—Lecture 4 and 5 Memory—Lecture 5 –SRAM: cache –DRAM: main memory Instruction-types Instruction Datapaths

MIPS Register File PC imm memory target offset cmp control =? new pc memory d in d out addr register file inst extend +4 A Single cycle processor alu

MIPS Register file MIPS register file 32 registers, 32-bits each (with r0 wired to zero) Write port indexed via R W –Writes occur on falling edge but only if WE is high Read ports indexed via R A, R B Dual-Read-Port Single-Write-Port 32 x 32 Register File QAQA QBQB DWDW RWRW RARA RBRB WE

MIPS Register file MIPS register file 32 registers, 32-bits each (with r0 wired to zero) Write port indexed via R W –Writes occur on falling edge but only if WE is high Read ports indexed via R A, R B A B W RWRW RARA RBRB WE r1 r2 … r31

MIPS Register file Registers Numbered from 0 to 31. Each register can be referred by number or name. $0, $1, $2, $3 … $31 Or, by convention, each register has a name. –$16 - $23  $s0 - $s7 –$8 - $15  $t0 - $t7 –$0 is always $zero. –Patterson and Hennessy p121.

MIPS Memory PC imm memory target offset cmp control =? new pc memory d in d out addr register file inst extend +4 A Single cycle processor alu

MIPS Memory 32-bit address 32-bit data (but byte addressed) Enable + 2 bit memory control (mc) 00: read word (4 byte aligned) 01: write byte 10: write halfword (2 byte aligned) 11: write word (4 byte aligned) memory 32 addr 2 mc 32 E D in D out

Putting it all together: Basic Processor PC imm memory target offset cmp control =? new pc memory d in d out addr register file inst extend +4 A Single cycle processor alu

Putting it all together: Basic Processor Let’s build a MIPS CPU …but using (modified) Harvard architecture CPU Registers Data Memory data, address, control ALU Control Program Memory

Takeaway A processor executes instructions Processor has some internal state in storage elements (registers) A memory holds instructions and data (modified) Harvard architecture: separate insts and data von Neumann architecture: combined inst and data A bus connects the two We now have enough building blocks to build machines that can perform non-trivial computational tasks

Next Goal How to program and execute instructions on a MIPS processor?

Levels of Interpretation: Instructions Programs written in a High Level Language C, Java, Python, Ruby, … Loops, control flow, variables for (i = 0; i < 10; i++) printf(“go cucs”); main:addi r2, r0, 10 addi r1, r0, 0 loop:slt r3, r1, r Need translation to a lower- level computer understandable format Assembly is human readable machine language Processors operate on Machine Language ALU, Control, Register File, … Machine Implementation op=addi r0 r2 10 op=reg r1 r2 r3 func=slt

Levels of Interpretation: Instructions High Level Language C, Java, Python, Ruby, … Loops, control flow, variables for (i = 0; i < 10; i++) printf(“go cucs”); main:addi r2, r0, 10 addi r1, r0, 0 loop:slt r3, r1, r Assembly Language No symbols (except labels) One operation per statement Machine Langauge Binary-encoded assembly Labels become addresses ALU, Control, Register File, … Machine Implementation op=addi r0 r2 10 op=reg r1 r2 r3 func=slt

Instruction Usage Instructions are stored in memory, encoded in binary A basic processor fetches decodes executes one instruction at a time pc adder cur inst decode regs execute addrdata op=addi r0 r2 10

Instruction Types Arithmetic add, subtract, shift left, shift right, multiply, divide Memory load value from memory to a register store value to memory from a register Control flow unconditional jumps conditional jumps (branches) jump and link (subroutine call) Many other instructions are possible vector add/sub/mul/div, string operations manipulate coprocessor I/O

Instruction Set Architecture The types of operations permissible in machine language define the ISA MIPS: load/store, arithmetic, control flow, … VAX: load/store, arithmetic, control flow, strings, … Cray: vector operations, … Two classes of ISAs Reduced Instruction Set Computers (RISC) Complex Instruction Set Computers (CISC) We’ll study the MIPS ISA in this course

Instruction Set Architecture Instruction Set Architecture (ISA) Different CPU architecture specifies different set of instructions. Intel x86, IBM PowerPC, Sun Sparc, MIPS, etc. MIPS ≈ 200 instructions, 32 bits each, 3 formats –mostly orthogonal all operands in registers –almost all are 32 bits each, can be used interchangeably ≈ 1 addressing mode: Mem[reg + imm] x86 = Complex Instruction Set Computer (ClSC) > 1000 instructions, 1 to 15 bytes each operands in special registers, general purpose registers, memory, on stack, … –can be 1, 2, 4, 8 bytes, signed or unsigned 10s of addressing modes –e.g. Mem[segment + reg + reg*scale + offset]

Instructions Load/store architecture Data must be in registers to be operated on Keeps hardware simple Emphasis on efficient implementation Integer data types: byte: 8 bits half-words: 16 bits words: 32 bits MIPS supports signed and unsigned data types

MIPS instruction formats All MIPS instructions are 32 bits long, has 3 formats R-type I-type J-type oprsrtrdshamtfunc 6 bits5 bits 6 bits oprsrtimmediate 6 bits5 bits 16 bits opimmediate (target address) 6 bits 26 bits

MIPS Design Principles Simplicity favors regularity 32 bit instructions Smaller is faster Small register file Make the common case fast Include support for constants Good design demands good compromises Support for different type of interpretations/classes

Takeaway A MIPS processor and ISA (instruction set architecture) is an example a Reduced Instruction Set Computers (RISC) where simplicity is key, thus enabling us to build it!!

Next Goal How are instructions executed? What is the general datapath to execute an instruction?

Five Stages of MIPS Datapath 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB A Single cycle processor

Five Stages of MIPS datapath Basic CPU execution loop 1.Instruction Fetch 2.Instruction Decode 3.Execution (ALU) 4.Memory Access 5.Register Writeback Instruction types/format Arithmetic/Register:addu $s0, $s2, $s3 Arithmetic/Immediate:slti $s0, $s2, 4 Memory:lw $s0, 20($s3) Control/Jump:j 0xdeadbeef

Stages of datapath (1/5) Stage 1: Instruction Fetch Fetch 32-bit instruction from memory. (Instruction cache or memory) Increment PC accordingly. –+4, byte addressing –+N PC Prog. Mem +4 inst

Stages of datapath (1/5) 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB A Single cycle processor

Stages of datapath (2/5) Stage 2: Instruction Decode Gather data from the instruction Read opcode to determine instruction type and field length Read in data from register file –E.g. for addu, read two registers. –E.g. for addi, read one registers. –E.g. for jal, read no registers. 555 control Reg. File

Stages of datapath (2/5) All MIPS instructions are 32 bits long, has 3 formats R-type I-type J-type oprsrtrdshamtfunc 6 bits5 bits 6 bits oprsrtimmediate 6 bits5 bits 16 bits opimmediate (target address) 6 bits 26 bits

Stages of datapath (2/5) 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB A Single cycle processor

Stages of datapath (3/5) Stage 3: Execution (ALU) Useful work is done here (+, -, *, /), shift, logic operation, comparison (slt). Load/Store? –lw $t2, 32($t3) –Compute the address of the memory. ALU

Stages of datapath (3/5) 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB A Single cycle processor

Stages of datapath (4/5) Stage 4: Memory access Used by load and store instructions only. Other instructions will skip this stage. Data Mem If sw Data to store from reg to mem R/W If lw Data from memory Target addr from ALU

Stages of datapath (4/5) 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB A Single cycle processor

Stages of datapath (5/5) Stage 5: For instructions that need to write value to register. Examples: arithmetic, logic, shift, etc, load. Store, branches, jump?? Reg. File PC WriteBack from ALU or Memory New instruction address If branch or jump

Stages of datapath (5/5) 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB

Full Datapath 5 ALU 55 control Reg. File PC Prog. Mem inst +4 Data Mem Fetch DecodeExecute MemoryWB

Takeaway The datapath for a MIPS processor has five stages: 1.Instruction Fetch 2.Instruction Decode 3.Execution (ALU) 4.Memory Access 5.Register Writeback This five stage datapath is used to execute all MIPS instructions

Administrivia Make sure to go to your Lab Section this week Find project partners this week (for upcoming project1 next week) Lab2 due in class this week (it is not homework) Design Doc for Lab1 already past due, Monday, Feb 4th Completed Lab1 due next week, Monday, Feb 11th Work alone Homework1 is due Wednesday Due yesterday Work alone Academic Integrity BUT, use your resources Lab Section, Piazza.com, Office Hours, Homework Help Session, Class notes, book, Sections, CSUGLab Second&Third C Primer: Thursday & Monday, B14 Hollister, 6-8pm

Administrivia Check online syllabus/schedule Slides and Reading for lectures Office Hours Homework and Programming Assignments Prelims (in evenings): Tuesday, February 26 th Thursday, March 28 th Thursday, April 25 th If you have a prelim conflict Admin, Molly Trufant, with class number and date of conflict Schedule is subject to change

Collaboration, Late, Re-grading Policies “Black Board” Collaboration Policy Can discuss approach together on a “black board” Leave and write up solution independently Do not copy solutions Late Policy Each person has a total of four “slip days” Max of two slip days for any individual assignment Slip days deducted first for any late assignment, cannot selectively apply slip days For projects, slip days are deducted from all partners 25% deducted per day late after slip days are exhausted Regrade policy Submit written request to lead TA, and lead TA will pick a different grader Submit another written request, lead TA will regrade directly Submit yet another written request for professor to regrade.

Next Goal Specific datapaths MIPS Instructions

MIPS Instruction Types Arithmetic/Logical R-type: result and two source registers, shift amount I-type: 16-bit immediate with sign/zero extension Memory Access load/store between registers and memory word, half-word and byte operations Control flow conditional branches: pc-relative addresses jumps: fixed offsets, register absolute

MIPS instruction formats All MIPS instructions are 32 bits long, has 3 formats R-type I-type J-type oprsrtrdshamtfunc 6 bits5 bits 6 bits oprsrtimmediate 6 bits5 bits 16 bits opimmediate (target address) 6 bits 26 bits

Arithmetic Instructions oprsrtrd-func 6 bits5 bits 6 bits opfuncmnemonicdescription 0x00x21ADDU rd, rs, rtR[rd] = R[rs] + R[rt] 0x00x23SUBU rd, rs, rtR[rd] = R[rs] – R[rt] 0x00x25OR rd, rs, rtR[rd] = R[rs] | R[rt] 0x00x26XOR rd, rs, rt R[rd] = R[rs]  R[rt] 0x00x27NOR rd, rs rtR[rd] = ~ ( R[rs] | R[rt] ) R-Type ex: r4 = r8  r6 # XOR r4, r8, r6

Arithmetic and Logic 5 ALU 55 Reg. File PC Prog. Mem inst +4 control Fetch DecodeExecute MemoryWB skip r8 r6 xor r4 xor

Instruction fetch + decode + ALU = Babbage’s engine+ speed+ reliability– hand crank

Arithmetic Instructions: Shift op-rtrdshamtfunc 6 bits5 bits 6 bits opfuncmnemonicdescription 0x0 SLL rd, rt, shamtR[rd] = R[rt] << shamt 0x00x2SRL rd, rt, shamtR[rd] = R[rt] >>> shamt (zero ext.) 0x00x3SRA rd, rt, shamtR[rd] = R[rt] >> shamt (sign ext.) ex: r8 = r4 * 64 # SLL r8, r4, 6 r8 = r4 << 6 R-Type

Shift 5 ALU 55 Reg. File PC Prog. Mem inst +4 shamt control r4 shamt = 6 r8 sll

opmnemonicdescription 0x9ADDIU rd, rs, immR[rd] = R[rs] + imm 0xcANDI rd, rs, immR[rd] = R[rs] & imm 0xdORI rd, rs, immR[rd] = R[rs] | imm Arithmetic Instructions: Immediates oprsrdimmediate 6 bits5 bits 16 bits I-Type ex: r5 += -1ex: r5 += opmnemonicdescription 0x9ADDIU rd, rs, immR[rd] = R[rs] + sign_extend(imm) 0xcANDI rd, rs, immR[rd] = R[rs] & zero_extend(imm) 0xdORI rd, rs, immR[rd] = R[rs] | zero_extend(imm) ex: r5 = r5 + 5# ADDIU r5, r5, 5 r5 += 5 What if immediate is negative?

Immediates 5 imm 55 extend +4 shamt Reg. File PC Prog. Mem ALU inst control r5 addiu

Immediates 5 imm 55 extend +4 shamt Reg. File PC Prog. Mem ALU inst control r5 addiu

Arithmetic Instructions: Immediates opmnemonicdescription 0xFLUI rd, immR[rd] = imm << 16 op-rdimmediate 6 bits5 bits 16 bits I-Type ex: LUI r5, 0xdead ORI r5, r5 0xbeef What does r5 = ? r5 = 0xdeadbeef ex: r5 = 0x50000# LUI r5, 5

Immediates 5 imm 55 extend +4 shamt Reg. File PC Prog. Mem ALU inst 16 control r5 liu 5

MIPS Instruction Types Arithmetic/Logical R-type: result and two source registers, shift amount I-type: 16-bit immediate with sign/zero extension Memory Access load/store between registers and memory word, half-word and byte operations Control flow conditional branches: pc-relative addresses jumps: fixed offsets, register absolute

Memory Instructions opmnemonicdescription 0x20LB rd, offset(rs)R[rd] = sign_ext(Mem[offset+R[rs]]) 0x24LBU rd, offset(rs)R[rd] = zero_ext(Mem[offset+R[rs]]) 0x21LH rd, offset(rs)R[rd] = sign_ext(Mem[offset+R[rs]]) 0x25LHU rd, offset(rs)R[rd] = zero_ext(Mem[offset+R[rs]]) 0x23LW rd, offset(rs)R[rd] = Mem[offset+R[rs]] 0x28SB rd, offset(rs)Mem[offset+R[rs]] = R[rd] 0x29SH rd, offset(rs)Mem[offset+R[rs]] = R[rd] 0x2bSW rd, offset(rs)Mem[offset+R[rs]] = R[rd] oprsrdoffset 6 bits5 bits 16 bits base + offset addressing I-Type signed offsets ex: = Mem[4+r5] = r1# SW r1, 4(r5)

Memory Operations Data Mem addr ext +4 5 imm 55 control Reg. File PC Prog. Mem ALU inst sw r5+4 4 Write Enable r5 r1

MIPS Instruction Types Arithmetic/Logical R-type: result and two source registers, shift amount I-type: 16-bit immediate with sign/zero extension Memory Access load/store between registers and memory word, half-word and byte operations Control flow conditional branches: pc-relative addresses jumps: fixed offsets, register absolute Next Time

Summary We have all that it takes to build a processor! Arithmetic Logic Unit (ALU)—Lab0 & 1, Lecture 2 & 3 Register File—Lecture 4 and 5 Memory—Lecture 5 –SRAM: cache –DRAM: main memory A MIPS processor and ISA (instruction set architecture) is an example a Reduced Instruction Set Computers (RISC) where simplicity is key, thus enabling us to build it!! We know the data path for the MIPS ISA register and memory instructions.

Next Time Datapath for Branches and Jumps CPU Performance Pipelined CPU