The Arrival of the 64bit CPUs - Itanium1 นายชนินท์วงษ์ใหญ่รหัส 43650076 นายสุนัยสุขเอนกรหัส 43650340.

Slides:



Advertisements
Similar presentations
PIPELINE AND VECTOR PROCESSING
Advertisements

Instruction Set Design
CPE 731 Advanced Computer Architecture ILP: Part V – Multiple Issue Dr. Gheith Abandah Adapted from the slides of Prof. David Patterson, University of.
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
ITCS 3181 Logic and Computer Systems 2015 B. Wilkinson slides3.ppt Modification date: March 16, Addressing Modes The methods used in machine instructions.
Fall EE 333 Lillevik 333f06-l20 University of Portland School of Engineering Computer Organization Lecture 20 Pipelining: “bucket brigade” MIPS.
Microprocessors General Features To be Examined For Each Chip Jan 24 th, 2002.
Computer Organization and Architecture
Computer Organization and Architecture
1 Microprocessor-based Systems Course 4 - Microprocessors.
Chapter XI Reduced Instruction Set Computing (RISC) CS 147 Li-Chuan Fang.
Computer Organization and Architecture The CPU Structure.
Chapter 12 Three System Examples The Architecture of Computer Hardware and Systems Software: An Information Technology Approach 3rd Edition, Irv Englander.
Assembly Language for Intel-Based Computers Chapter 2: IA-32 Processor Architecture Kip Irvine.
State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.
Chapter 12 CPU Structure and Function. Example Register Organizations.
NYU DARPA DIS kick-off September 24, Comparing IA-64 and HPL-PD NYU.
ARM programmer’s model and assembler Embedded Systems Programming.
1 RISC Machines l RISC system »instruction –standard, fixed instruction format –single-cycle execution of most instructions –memory access is available.
Microprocessor Systems Design I Instructor: Dr. Michael Geiger Spring 2012 Lecture 2: 80386DX Internal Architecture & Data Organization.
Chapter 21 IA-64 Architecture (Think Intel Itanium)
Intel IA-64 Architecture Chehun Kim Glenn Ramos. Contents *Pipelining - Stages of pipelining *Microprogramming *Interconnection Structures.
COMP381 by M. Hamdi 1 Commercial Superscalar and VLIW Processors.
Chapter 15 IA-64 Architecture or (EPIC – Extremely Parallel Instruction Computing)
CH12 CPU Structure and Function
IA-64 ISA A Summary JinLin Yang Phil Varner Shuoqi Li.
Lecture#14. Last Lecture Summary Memory Address, size What memory stores OS, Application programs, Data, Instructions Types of Memory Non Volatile and.
Instruction Set Architecture
Homework Problems 1. M1 runs the program P in 1.4 * 9 * ns or ns M2 runs the program P in 1.6*9800*10ns or ns Hence M2 is faster by.
Comparing High-End Computer Architectures for Business Applications Presentation: 493 Track: HP-UX Dr. Frank Baetke HP.
Intel Pentium II Processor Brent Perry Pat Reagan Brian Davis Umesh Vemuri.
Anshul Kumar, CSE IITD CS718 : VLIW - Software Driven ILP Example Architectures 6th Apr, 2006.
RISC By Ryan Aldana. Agenda Brief Overview of RISC and CISC Features of RISC Instruction Pipeline Register Windowing and renaming Data Conflicts Branch.
Spring 2003CSE P5481 VLIW Processors VLIW (“very long instruction word”) processors instructions are scheduled by the compiler a fixed number of operations.
Introducing The IA-64 Architecture - Kalyan Gopavarapu - Kalyan Gopavarapu.
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
Computer Architecture EKT 422
Chapter 11 Instruction Sets: Addressing Modes and Formats Gabriel Baron Sydney Chow.
Next Generation ISA Itanium / IA-64. Operating Environments IA-32 Protected Mode/Real Mode/Virtual Mode - if supported by the OS IA-64 Instruction Set.
Processor Structure and Function Chapter8:. CPU Structure  CPU must:  Fetch instructions –Read instruction from memory  Interpret instructions –Instruction.
Embedded Computer Architectures Hennessy & Patterson Chapter 4 Exploiting ILP with Software Approaches Gerard Smit (Zilverling 4102),
What is a program? A sequence of steps
Introduction to Intel IA-32 and IA-64 Instruction Set Architectures.
Unit II Intel IA-64 and Itanium Processor By N.R.Rejin Paul Lecturer/VIT/CSE CS2354 Advanced Computer Architecture.
Computer Architecture
Itanium® 2 Processor Architecture
CS 352H: Computer Systems Architecture
William Stallings Computer Organization and Architecture 6th Edition
Immediate Addressing Mode
A Closer Look at Instruction Set Architectures
CPE 731 Advanced Computer Architecture ILP: Part V – Multiple Issue
Henk Corporaal TUEindhoven 2009
Henk Corporaal TUEindhoven 2011
Sampoorani, Sivakumar and Joshua
CC423: Advanced Computer Architecture ILP: Part V – Multiple Issue
CSC3050 – Computer Architecture
Chapter 11 Processor Structure and function
IA-64 Vincent D. Capaccio.
Computer Organization
Presentation transcript:

The Arrival of the 64bit CPUs - Itanium1 นายชนินท์วงษ์ใหญ่รหัส นายสุนัยสุขเอนกรหัส

The Arrival of the 64bit CPUs - Itanium2 EPIC (Explicitly Parallel Instruction Computing) IA - 64 compatibility with the IA-32 instruction set. CPU speed 800 MHz, 6.4 GFLOPS 3 Level Cache  On-Die Lv1 128 Kb, Lv2 256 Kb  Lv3 4 Mb 2 64 Address space 25.4 Million Transistors What is the Intel® Itanium™ Architecture?

The Arrival of the 64bit CPUs - Itanium3

4

5 Data Types Integer: 1, 2, 4 and 8 byte(s) Floating-point single, double and double-extended formats Pointers: 8 bytes

The Arrival of the 64bit CPUs - Itanium6 Intel® Itanium™ Instruction Format [(qp)] mnemonic[.comp1][.comp2] dests = srcs Simple Instruction add r1 = r2, r3 Predicated instruction (p4)add r1 = r2, r3 Instruction with immediate add r1 = r2, r3, 1 Instruction with completer cmp.eq p3 = r2, r4

The Arrival of the 64bit CPUs - Itanium7 Memory Organization Single space means that both data and instructions share the same memory range. Uniform means that there are no address regions with predefined functionality. Linear means that the address space contains no segments; all 2 64 bytes are consecutive Support 2 byte order: Little-endian and Big-endian defines a single, uniform, linear address space of 2 64 bytes

The Arrival of the 64bit CPUs - Itanium8 Instruction Level Parallelism(ILP) Enabling the compiler/assembly writer to explicitly indicate parallelism. Providing a three-instruction-wide word, called a bundle, that facilitates parallel processing of instructions. Providing a large number of registers, enabling using different registers for different variables and avoiding register contention.

The Arrival of the 64bit CPUs - Itanium9 Instruction Groups An instruction group is a set of instructions which do not have read- after-write or write-after-write dependencies between them and may execute in parallel. An instruction group must contain at least one instruction; the number of instructions in an instruction group is not limited. Instruction groups are indicated in the code by cycle breaks. An instruction group may also end dynamically during run-time by a taken branch.

The Arrival of the 64bit CPUs - Itanium10 Instruction Bundles Instruction groups are composed of instructions contained in bundles. Each bundle contains three instructions, and a template field, which are set during code generation, by a compiler, or the assembler. Template allows the processor to dispatch all three instructions in parallel. Bundles are aligned at 16-byte boundaries.

The Arrival of the 64bit CPUs - Itanium11 Registers 128 General registers 128 Floating-point registers 64 Predicate registers 8 Branch registers 128 Application registers Instruction Pointer (IP) register

The Arrival of the 64bit CPUs - Itanium12 Register Validity enable propagating validity/invalidity of a speculative load result. Each general register has an a corresponding NaT (Not a Thing) Bit. Floating-point registers use a special instance of pseudo- zero, called NaTVal.

The Arrival of the 64bit CPUs - Itanium13 Branching in the Intel® Itanium™ Architecture Relative direct branches, using 21-bit displacement that is appended to the instruction pointer of the bundle containing the branch. Indirect branches, using 64-bit addresses in the branch registers.

The Arrival of the 64bit CPUs - Itanium14 Predication Allowing the processor to execute all possible branch paths in parallel. Instruction 1 Instruction 2 Instruction 3(branch) Instruction 4 (P1)Instruction 7 (P2) Instruction 5 (P1)Instruction 8(P2)Instruction 6 (P1)Instruction 9 (P2)

The Arrival of the 64bit CPUs - Itanium15 Predication The compiler rearrange instruction in this order, paring instruction 4 and 7, 5 and 8 and 6 and 9 for parallel execution. Instruction 1Instruction 2Instruction 3(branch) Instruction 4 (P1)Instruction 7 (P2)Instruction 5 (P1) Instruction 8(P2)Instruction 6 (P1)Instruction 9 (P2) 128 bit long instruction word

The Arrival of the 64bit CPUs - Itanium16 Reduced Memory Access Costs Hiding memory latency. This enables the processor to bring the data in time, and avoid stalling the processor. Memory latency is hidden through the use of:  Data speculation - the execution of an operation before its data dependency is resolved.  Control speculation - the execution of an instruction before its control dependencyis resolved.

The Arrival of the 64bit CPUs - Itanium17 Hiding Memory Latencies speculative loads, error/exception detection is deferred until final result is actually required:  If no error/exception is detected the latency is hidden.  If an error/exception is detected then memory accesses and dependent instructions must be redone by an exception handler.

The Arrival of the 64bit CPUs - Itanium18 Speculative loading Fetch data before the program needs it, even beyond a branch that hasn't executed. Instruction 1 Instruction 2 Instruction 3(branch) Instruction 4Instruction 7 Instruction 5Instruction 8(Load data) Instruction 6 Speculative Check Speculative loading Instruction 9 (Use data)

The Arrival of the 64bit CPUs - Itanium19 Floating Point and Multimedia support for single, double, and double-extended IEEE formats. support for multimedia, or data- parallel applications:  integer data and SIMD computations, similar to the MMX™ technology.  floating-point data and SIMD-FP computations, similar to IA-32 Streaming SIMD Extensions.

The Arrival of the 64bit CPUs - Itanium20 Itanium™ Architecture Floating-point Features 128 floating-point registers A multiply and accumulate instruction (fma), with four different floating-point registers for operands (f=a * b + c). This instruction enables performing a multiply and add in the same number of cycles as one add or multiply instruction. Load and store to and from memory. You can also load from memory into two floating-point registers. Data transfer between floating-point and general registers. Multiple status fields register, enables speculation on floating-point operations. Quick conversion from integer to floating-point and vice-versa. Rotating floating-point registers.

The Arrival of the 64bit CPUs - Itanium21 Multimedia Support Integer multimedia is provided by defining a set of instructions which treat the general registers 8x8, 4x16, or 2x32 bit elements, and by providing specific instructions for operating on these data elements. support is semantically compatible with the MMX™ Technology.

The Arrival of the 64bit CPUs - Itanium22 Chipset AL460GX (support 4 CPU), BS460GX (support 2 CPU) 400 MHz Bus speed DDR-SDRAM, RAMBUS only FSB slot support Itanium CPU Board 10 PCI Slot

The Arrival of the 64bit CPUs - Itanium23

The Arrival of the 64bit CPUs - Itanium24 Reference m/02proc3.htm#Itanium icles/64bit/index.html design/ia-64

The Arrival of the 64bit CPUs - Itanium25 Any Question ?