Microarchitecture.

Slides:

Advertisements

Similar presentations

Advertisements

Computer Organization and Architecture

Chapter 14 Instruction Level Parallelism and Superscalar Processors

Intel Pentium 4 ENCM Jonathan Bienert Tyson Marchuk.

Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.

EECS 470 Superscalar Architectures and the Pentium 4 Lecture 12.

Intel Architecture. Changes in architecture Software architecture: –Front end (Feature changes such as adding more graphics, changing the background colors,

Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.

Multi-core architectures. Single-core computer Single-core CPU chip.

By Michael Butler, Leslie Barnes, Debjit Das Sarma, Bob Gelinas This paper appears in: Micro, IEEE March/April 2011 (vol. 31 no. 2) pp 마이크로 프로세서.

Parallel Processing - introduction  Traditionally, the computer has been viewed as a sequential machine. This view of the computer has never been entirely.

Nicolas Tjioe CSE 520 Wednesday 11/12/2008 Hyper-Threading in NetBurst Microarchitecture David Koufaty Deborah T. Marr Intel Published by the IEEE Computer.

Dynamic Pipelines. Interstage Buffers Superscalar Pipeline Stages In Program Order In Program Order Out of Order.

© Wen-mei Hwu and S. J. Patel, 2005 ECE 511, University of Illinois Lecture 4: Microarchitecture: Overview and General Trends.

Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University

Lecture on Central Process Unit (CPU)

To Compute: To Do Math. Information is collected by tallying data as it travels across circuits. The key part of integrated circuits are transistors.transistors.

MAHARANA PRATAP COLLEGE OF TECHNOLOGY SEMINAR ON- COMPUTER PROCESSOR SUBJECT CODE: CS-307 Branch-CSE Sem- 3 rd SUBMITTED TO SUBMITTED BY.

1 ECE 734 Final Project Presentation Fall 2000 By Manoj Geo Varghese MMX Technology: An Optimization Outlook.

Pentium 4 Deeply pipelined processor supporting multiple issue with speculation and multi-threading 2004 version: 31 clock cycles from fetch to retire,

William Stallings Computer Organization and Architecture 6th Edition

Introduction to Computing Systems

Guide to Operating Systems, 5th Edition

GCSE Computing - The CPU

Computer Organization and Architecture Lecture 1 : Introduction

OPERATING SYSTEM CONCEPT AND PRACTISE

Auburn University COMP8330/7330/7336 Advanced Parallel and Distributed Computing Parallel Hardware Dr. Xiao Qin Auburn.

Protection in Virtual Mode

Chapter 10: Computer systems (1)

ECE354 Embedded Systems Introduction C Andras Moritz.

Distributed Processors

CS6401- OPERATING SYSTEMS L T P C

William Stallings Computer Organization and Architecture 8th Edition

Parallel Processing - introduction

Computer Structure Multi-Threading

nZDC: A compiler technique for near-Zero silent Data Corruption

The University of Adelaide, School of Computer Science

Assembly Language for Intel-Based Computers, 5th Edition

Guide to Operating Systems, 5th Edition

Architecture & Organization 1

Chapter 14 Instruction Level Parallelism and Superscalar Processors

Phnom Penh International University (PPIU)

INTRODUCTION TO MICROPROCESSORS

Flow Path Model of Superscalars

Introduction to Pentium Processor

Instruction Level Parallelism and Superscalar Processors

Number Representations and Basic Processor Architecture

The Microarchitecture of the Pentium 4 processor

Architecture & Organization 1

Levels of Parallelism within a Single Processor

Functional Units.

Chapter 17 Parallel Processing

EE 445S Real-Time Digital Signal Processing Lab Spring 2014

Chapter 1 Introduction.

Introduction to Operating Systems

Introduction to Operating Systems

* From AMD 1996 Publication #18522 Revision E

Computer Evolution and Performance

What is Computer Architecture?

Levels of Parallelism within a Single Processor

What is Computer Architecture?

ARM ORGANISATION.

Course Outline for Computer Architecture

GCSE Computing - The CPU

Chapter 1: Introduction

CSE 502: Computer Architecture

Presentation transcript:

Microarchitecture

Microarchitecture, also called computer organization and sometimes abbreviated as µarch or uarch, is the way a given instruction set architecture (ISA) is implemented in a particular processor. The microarchitecture of a machine is usually represented as (more or less detailed) diagrams that describe the interconnections of the various microarchitectural elements of the machine, which may be everything from single gates and registers, to complete arithmetic logic units (ALUs) and even larger elements

A microarchitecture organized around a single bus

The Microarchitecture of the Pentium 4 Processor(NetBurst) The Intel® NetBurst microarchitecture of Intel’s new flagship Pentium® 4 processor. The Pentium 4 processor provides a substantial performance gain for many key application areas where the end user can truly appreciate the difference. For example, it allows a much better user experience in areas such as Internet audio and streaming video, image processing, video content creation, speech recognition, 3D applications and games, multi-media, and multi-tasking user environments. The Pentium 4 processor enables real time MPEG2 video encoding and near real-time MPEG4 encoding, allowing efficient video editing and video conferencing.

NetBurst Microarchitecture

The in-order front end is the part of the machine that fetches the instructions to be executed next in the program and prepares them to be used later in the machine pipeline. The out-of-order execution logic has several buffers that it uses to smooth and re-order the flow of instructions to optimize performance as they go down the pipeline and get scheduled for execution. Out-of-order execution allows the execution resources such as the ALUs and the cache to be kept as busy as possible executing independent instructions that are ready to execute. The retirement logic is what reorders the instructions, executed in an out-of-order manner, back to the original program order Integer and Floating-Point Execution Units :The execution units are where the instructions are actually executed. This section includes the register files that store the integer and floating-point data operand values that the instructions need to execute. Memory Subsystem :This includes the L2 cache and the system bus. The L2 cache stores both instructions and data that cannot fit in the Execution Trace Cache and the L1 data cache. The external system bus is connected to the backside of the second-level cache and is used to access main memory when the L2 cache has a cache miss, and to access the system I/O resources.

Next generation processors architecture :. Cortex-A15 Processor Next generation processors architecture : *Cortex-A15 Processor *Unified Processing Architecture

( Cortex-A15 Processor ARM Cortex-A15 processor is the proven high-performance engine for highly connected devices, delivering unprecedented flexibility and processing capability. The processor is designed with advanced power reduction techniques as well as reliability features which make the Cortex-A15 ideal for networking infrastructure and other enterprise applications. The Cortex-A15 processor is supported by a suite of optimized IP targeted at high-end solutions bringing premium performance in a mobile power envelope. ARM Mali high-end graphics processors support advanced graphics performance, with the Mali-V500 and Mali-DP500 processors providing display and video capabilities respectively. The ARM interconnect and memory controller IP addresses the critical challenge of efficiently moving and storing data between multiple Cortex-A15 processors, high-performance media processors and dynamic memories to optimize the system performance and power consumption of the SoC.

The Harmony Unified Processing Architecture are described as consisting of “the Multi-Thread Virtual Pipeline parallel computing core (MVP), an independent instruction set architecture, an optimizing compiler, and the Agile Switch dynamic load balancer.” The chip uses both SMP (Symmetric Multi-Processing) and SMT (Simultaneous Multi-Threading).

Thank You