7/14/2000 Page 1 Design of the IRAM FPU Ioannis Mavroidis IRAM retreat July 12-14, 2000.

Slides:



Advertisements
Similar presentations
Asanovic/Devadas Spring VLIW/EPIC: Statically Scheduled ILP Krste Asanovic Laboratory for Computer Science Massachusetts Institute of Technology.
Advertisements

CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture VLIW Steve Ko Computer Sciences and Engineering University at Buffalo.
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
Comp Sci Floating Point Arithmetic 1 Ch. 10 Floating Point Unit.
Instruction Set Issues MIPS easy –Instructions are only committed at MEM  WB transition Other architectures are more difficult –Instructions may update.
FPU structure. Assumptions (to shorten execution trace) – 2 instructions dispatched in order per cycle – execution begins in same cycle as dispatch –
Computer Architecture and Design Fall 2009 Indraneil Gokhale.
Processor Overview Features Designed for consumer and wireless products RISC Processor with Harvard Architecture Vector Floating Point coprocessor Branch.
THE MIPS R10000 SUPERSCALAR MICROPROCESSOR Kenneth C. Yeager IEEE Micro in April 1996 Presented by Nitin Gupta.
Superscalar Organization Prof. Mikko H. Lipasti University of Wisconsin-Madison Lecture notes based on notes by John P. Shen Updated by Mikko Lipasti.
Spring 2003CSE P5481 Reorder Buffer Implementation (Pentium Pro) Hardware data structures retirement register file (RRF) (~ IBM 360/91 physical registers)
Register Renaming & Value Prediction. Overview ► Need for Post-RISC ► Register Renaming vs. Allocation Strategies ► How to compile for Post-RISC machines.
Convey Computer Status Steve Wallach swallach”at”conveycomputer.com.
1 CS294 Project VIRAM-1 Verification Retreat – Winter 2001 Sam Williams.
Review CPSC 321 Andreas Klappenecker Announcements Tuesday, November 30, midterm exam.
1  2004 Morgan Kaufmann Publishers Chapter Six. 2  2004 Morgan Kaufmann Publishers Pipelining The laundry analogy.
Joe Gebis Computer Science Division University of California, Berkeley IRAM CAD Status and Plan.
VIRAM-1 Architecture Update and Status Christoforos E. Kozyrakis IRAM Retreat January 2000.
12/13/99 Page 1 IRAM Network Interface Ioannis Mavroidis IRAM retreat January 12-14, 2000.
The PowerPC Architecture  IBM, Motorola, and Apple Alliance  Based on the IBM POWER Architecture ­Facilitate parallel execution ­Scale well with advancing.
Retrospective on the VIRAM-1 Design Decisions Christoforos E. Kozyrakis IRAM Retreat January 9, 2001.
1 IRAM Testing / Verification Sam Williams UC Berkeley
PowerPC 601 Stephen Tam. To be tackled today Architecture Execution Units Fixed-Point (Integer) Unit Floating-Point Unit Branch Processing Unit Cache.
Ryota Shioya, Masahiro Goshimay and Hideki Ando Micro 47 Presented by Kihyuk Sung.
CS402 PPP # 2 MIPS BASIC INFORMATION By George Koutsogiannakis 1.
Computer Architecture - Superscalar Processors Exceptions and Interrupts  When an exception (overflow, page-fault) occurs there are several instructions.
architectural overview
Educational Computer Architecture Experimentation Tool Dr. Abdelhafid Bouhraoua.
8/16/2015\course\cpeg323-08F\Topics1b.ppt1 A Review of Processor Design Flow.
SUPERSCALAR EXECUTION. two-way superscalar The DLW-2 has two ALUs, so it’s able to execute two arithmetic instructions in parallel (hence the term two-way.
Emotion Engine A look at the microprocessor at the center of the PlayStation2 gaming console Charles Aldrich.
POWERPC ARCHITECTURE Term Paper Presentation by by Umut Yazkurt CMPE 511 Fall Fall
SPREE RTL Generator RTL Simulator RTL CAD Flow 3. Area 4. Frequency 5. Power Correctness1. 2. Cycle count SPREE Benchmarks Verilog Results 3. Architecture.
The MIPS R10000 Superscalar Microprocessor Kenneth C. Yeager Nishanth Haranahalli February 11, 2004.
Implementing Click IP Router Kernel on VLIW Architectures Kanyu Mark Cao and Xiaodong Jin Many thanks to Scott Weber and Kees Vissers for the help on this.
|Processors designed for low power |Architectural state is correct at basic block granularity rather than instruction granularity 2.
Dynamic Pipelines. Interstage Buffers Superscalar Pipeline Stages In Program Order In Program Order Out of Order.
Morgan Kaufmann Publishers
Lecture 6. VFP & NEON in ARM
Lecture 2: Computer Architecture: A Science ofTradeoffs.
Transmeta’s New Processor Another way to design CPU By Wu Cheng
Spring 2003CSE P5481 Precise Interrupts Precise interrupts preserve the model that instructions execute in program-generated order, one at a time If an.
1  1998 Morgan Kaufmann Publishers Where we are headed Performance issues (Chapter 2) vocabulary and motivation A specific instruction set architecture.
Reconfigurable architectures ESE 566. Outline Static and Dynamic Configurable Systems –Static SPYDER, RENCO –Dynamic FIREFLY, BIOWATCH PipeRench: Reconfigurable.
EKT303/4 Superscalar vs Super-pipelined.
Modern processor design
Slide 1 Computers for the Post-PC Era David Patterson University of California at Berkeley UC Berkeley IRAM Group UC Berkeley.
Out-of-Order OpenRISC 2 semesters project Semester B: OR1200 ISA Extension Final B Presentation By: Vova Menis-Lurie Sonia Gershkovich Advisor: Mony Orbach.
15-740/ Computer Architecture Lecture 12: Issues in OoO Execution Prof. Onur Mutlu Carnegie Mellon University Fall 2011, 10/7/2011.
SHAKTI PROCESSORS RAHUL BODDUNA RISE LAB, IIT MADRAS
Floating Point Operations
1/21 Cell Processor Systems Seminar Diana Palsetia (11/21/2006)
Exceptions and Interrupts “Unexpected” events requiring change in flow of control – Different ISAs use the terms differently Exception – Arises within.
Dynamic Scheduling Why go out of style?
CSCI206 - Computer Organization & Programming
Rough Schedule 1:30-2:15 IRAM overview 2:15-3:00 ISTORE overview break
PowerPC 604 Superscalar Microprocessor
Single Clock Datapath With Control
Pipelining: Advanced ILP
Array Processor.
Superscalar Processors & VLIW Processors
Superscalar Pipelines Part 2
CSCI206 - Computer Organization & Programming
6.375 Final Project.
Overview Prof. Eric Rotenberg
CS 252 Spring 2000 Jeff Herman John Loo Xiaoyi Tang
CSC3050 – Computer Architecture
Sam Williams IRAM Summer Retreat 2000
Introduction to Computer Systems Engineering
CMSC 611: Advanced Computer Architecture
Presentation transcript:

7/14/2000 Page 1 Design of the IRAM FPU Ioannis Mavroidis IRAM retreat July 12-14, 2000

7/14/2000 Page 2 Features Executes MIPS IV ISA single-precision FP instructions Thirty-two 32-bit Floating Point Registers Two 32-bit Control Registers One 3-cycle (division takes 10 cycles) fully pipelined, nearly full IEEE-754 compliant, execution unit (from Albert Support for partial out-of-order execution (with the use of a reorder buffer) and precise exceptions 6-stage pipeline (R-X-X-X-CDB-WB)

7/14/2000 Page 3 Interface with Scalar Core The FPU is attached to the Scalar Core as a loosely-coupled coprocessor. Scalar core dispatches FP instructions to FPU using an interface that splits instructions into 5 classes.

7/14/2000 Page 4 FPU Architecture

7/14/2000 Page 5 Current Functionality Executes all instructions (except division). A few modifications are needed to adapt to recent interface changes with the Scalar Core. Also some things about exceptions and division are still missing. Functionality verification. –Used random test generator that generates/kills instructions at random and compares results from Verilog simulator against results from an ISA simulator (written in Perl).

7/14/2000 Page 6 Future Work Integrate execution datapath from MIT –Can not compile it without the IBM libraries! Integrate it with MIPS Scalar Core Run FPU testsuite provided by MIPS Integrate with vector unit and run tests that use both coprocessors Synthesize, place and route