Intel Microprocessor Features Pentium  IIPentium  IIIPentium  4 MPR Issue June 1997April 2000Dec 2001 Clock Speed 266 MHz1GHz2GHz Pipeline Stages 12/14.

Slides:



Advertisements
Similar presentations
Digital System Clocking: High-Performance and Low-Power Aspects Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic Wiley-Interscience.
Advertisements

Pass Transistor Logic. Agenda  Introduction  VLSI Design methodologies  Review of MOS Transistor Theory  Inverter – Nucleus of Digital Integrated.
1 Chapter Design For Testability The Scan-Path Technique The testing problems with sequential circuit can be overcome by two properties: 1.The.
L06 – Clocks Spring /18/05 Clocking.
Copyright © 2001 Stephen A. Edwards All rights reserved Review of Digital Logic Prof. Stephen A. Edwards.
Introduction to Microprocessors and Microcomputers.
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Sequential Definitions  Use two level sensitive latches of opposite type to build one master-slave flipflop that changes state on a clock edge (when the.
CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 17: Dynamic Sequential Circuits And Timing Issues [Adapted from Rabaey’s Digital Integrated Circuits,
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
Chapter 11 Timing Issues in Digital Systems Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August 20, 2004; Revised - July.
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 Complex Digital Circuits Design Lecture 2: Timing Issues; [Adapted from Rabaey’s Digital Integrated.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
A Novel Clock Distribution and Dynamic De-skewing Methodology Arjun Kapoor – University of Colorado at Boulder Nikhil Jayakumar – Texas A&M University,
Lecture 8: Clock Distribution, PLL & DLL
Digital System Clocking:
Programmable logic and FPGA
EE314 Basic EE II Silicon Technology [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
HIGH-SPEED VLSI TESTING WITH SLOW TEST EQUIPMENT Vishwani D. Agrawal Agere Systems Processor Architectures and Compilers Research Murray Hill, NJ
Laboratory of Reliable Computing Department of Electrical Engineering National Tsing Hua University Hsinchu, Taiwan Delay Defect Characteristics and Testing.
Jieyi Long and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. Jieyi Long and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. Automated Design.
Informationsteknologi Friday, October 19, 2007Computer Architecture I - Class 81 Today’s class Digital Logic.
Spring 07, Feb 22 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Power Aware Microprocessors Vishwani D. Agrawal.
Chapter #6: Sequential Logic Design 6.2 Timing Methodologies
Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved The Digital Logic Level.
Computer Organization and Assembly language
Design for Testability
ECE 301 – Digital Electronics Flip-Flops and Registers (Lecture #15)
Transmission gate latches. MSL with unprotected input (Gerosa et al. 1994), Copyright © 1994 IEEE.
CS 61C L01 Introduction (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia CS61C www page www-inst.eecs.berkeley.edu/~cs61c/
Dept. of Communications and Tokyo Institute of Technology
Digital Integrated Circuits© Prentice Hall 1995 Sequential Logic SEQUENTIAL LOGIC.
© Digital Integrated Circuits 2nd Sequential Circuits Digital Integrated Circuits A Design Perspective Designing Sequential Logic Circuits Jan M. Rabaey.
Digital System Clocking: High-Performance and Low-Power Aspects Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic Wiley-Interscience.
Evolution in Complexity Evolution in Transistor Count.
Hardware Design of High Speed Switch Fabric IC. Overall Architecture.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 2 CMOS.
Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego
A 30-GS/sec Track and Hold Amplifier in 0.13-µm CMOS Technology
System Arch 2008 (Fire Tom Wada) /10/9 Field Programmable Gate Array.
1 Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction Rakesh Kumar, Keith I. Farkas, Norman P. Jouppi, Parthasarathy.
1 Moore’s Law in Microprocessors Pentium® proc P Year Transistors.
Digital System Clocking: High-Performance and Low-Power Aspects Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic Wiley-Interscience.
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTEMS
EE4800 CMOS Digital IC Design & Analysis
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Architecture of Microprocessor
Clock in Digital Systems. Combinational logic circuit A combinational logic circuit is one whose outputs depend only on its current inputs
Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University
Interconnect/Via.
A high speed serializer ASIC for ATLAS Liquid Argon calorimeter upgrade Tiankuan Liu On behalf of the ATLAS Liquid Argon Calorimeter Group Department of.
Capability of processor determine the capability of the computer system. Therefore, processor is the key element or heart of a computer system. Other.
11/15/05ELEC / Lecture 191 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 19: Timing Issues; Introduction to Datapath.
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
INTRODUCTION TO MICROPROCESSORS
Architecture & Organization 1
INTRODUCTION TO MICROPROCESSORS
INTRODUCTION TO MICROPROCESSORS
عمارة الحاسب.
Architecture & Organization 1
BIC 10503: COMPUTER ARCHITECTURE
Digital System Clocking:
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
A High Performance SoC: PkunityTM
Chapter 1 Introduction.
COMS 361 Computer Organization
Presentation transcript:

Intel Microprocessor Features Pentium  IIPentium  IIIPentium  4 MPR Issue June 1997April 2000Dec 2001 Clock Speed 266 MHz1GHz2GHz Pipeline Stages 12/14 22/24 Transistors 7.5M24M42M Cache (I/D/L2) 16k/16K/-16K/16K/256K12K/8K/256K Die Size 203mm 2 106mm 2 217mm 2 IC Process 0.28  m, 4M0.18  m, 6M Max Power 27W23W67W

Clock distribution network with deskewing circuit (Geannopoulos and Dai 1998), Copyright © 1998 IEEE

Delay shift register (Geannopoulos and Dai 1998), Copyright © 1998 IEEE

Phase detector (Geannopoulos and Dai 1998), Copyright © 1998 IEEE

Clock distribution topology (Rusu and Tam 2000), Copyright © 2000 IEEE

Deskew buffer architecture (Rusu and Tam 2000), Copyright © 2000 IEEE

Digitally controlled delay line (Rusu and Tam 2000), Copyright © 2000 IEEE

Simulated regional clock-grid skew (Rusu and Tam 2000), Copyright © 2000 IEEE

Measured regional clock skew (Rusu and Tam 2000), Copyright © 2000 IEEE

Core and I/O clock generation (Kurd et al. 2001), Copyright © 2001 IEEE

Logical diagram of core clock distribution (Kurd et al. 2001), Copyright © 2001 IEEE

Example of local clock buffers generating various frequency, phase and types of clocks (Kurd et al. 2001), Copyright © 2001 IEEE

UltraSPARC Family Characteristics UltraSPARC-IUltraSPARC-IIUltraSPARC-III Year ArchitectureSPARC V9, 4-issue Die size17.7x17.8mm x12.5mm 2 15x15.5mm 2 # of transistors5.2M5.4M23M Clock Frequency167MHz330MHz1GHz Supply voltage3.3V2.5V1.6V Process 0.5  m CMOS0.35  m CMOS0.15  m CMOS Metal layers4 (Al)5 (Al)7 (Al) Power consumption<30W <80W

Clock distribution delay in UltraSPARC-III (Heald et al. 2000), Copyright © 2000 IEEE

Semidynamic flip-flop (Klass 1998), Copyright © 1998 IEEE

(a)Logic embedding in a semidynamic flip-flop; (b)Two-input XOR function. (Klass, 1998), Copyright © 1998 IEEE

Dynamic versions of semidynamic flip-flop: single-ended; Differential. (Klass 1998), Copyright © 1998 IEEE

UltraSPARC-III flip-flop (Heald et al. 2000), Copyright © 2000 IEEE

Alpha Microprocessor Features # transistors Die Size [mm 2 ] 16.8x x x x18.8 Process 0.75  m0.5  m0.35  m0.18  m Supply [V] Power [W] Clk. Freq. [MHz] Gates/Cycle

Alpha microprocessor final clock driver location: (a) 21064, (b) 21164, (c) (a)(b)(c)

21064 clock skew (Gronowski et al. 1998), Copyright © 1998 IEEE

21164 clock skew (Gronowski et al. 1998), Copyright © 1998 IEEE

21264 clock hierarchy (Gronowski et al. 1998), Copyright © 1998 IEEE

21264 clock skew (Gronowski et al. 1998), Copyright © 1998 IEEE

21364 major clock domains (Xanthopoulos et al. 2001), Copyright © 2001

21364, NCLK clock skew (Xanthopoulos et al. 2001), Copyright © 2001 IEEE

21064 modified TSPC latches (Gronowski et al. 1998), Copyright © 1998

21164: (a) phase-A latch, (b) phase-B latch (Gronowski et al. 1998), Copyright © 1998 IEEE (a)(b)

Embedding of logic into a latch: (a) TSPC latch, one level of logic; (b) latch, two levels of logic. (Gronowski et al. 1998), Copyright © 1998 IEEE (a)(b)

21264 flip-flop (Gronowski et al. 1998), Copyright © 1998 IEEE

Critical-path and race analysis for clock buffering and conditioning (Gronowski et al. 1998), Copyright © 1998 IEEE

Hazard-free level-sensitive polarity-hold latch

General LSSD configuration

LSSD shift register latch

LSSD double-latch design

LSSD SRL with multiplexer used in the IBM S/390 G4 processor (Sigal et al. 1997), reproduced by permission

Static multiplexer version of the SRL used in the IBM S/390 G4 (Sigal et al. 1997), reproduced by permission

A clocked storage element is used in the non-timing-critical timing macros of the IBM S/390 G4 processor (Sigal et al. 1997), reproduced by permission

The clock-generation element used to detect problems created with fast paths: IBM S/390 G4 processor (Sigal et al. 1997), reproduced by permission

The experimental IBM PowerPC processor (Silberman et al. 1998), reproduced by permission

The PowerPC 603 MSL (Gerosa et al. 1994), Copyright © 1994 IEEE

The PowerPC 603 local clock regenerator (Gerosa et al. 1994), Copyright © 1994 IEEE

IBM Power4 TM 64-bit processor used in IBM eServer p690. The microprocessor consists of 174 million transistors and runs at 1.3 GHz, contains two microprocessor cores, and an on- chip memory subsystem. It is fabricated in state-of-the-art IBM 0.18  CMOS SOI technology with seven levels of copper wiring (Warnock et al. 2002), reproduced by permission

Standard transmission-gate MSL with LSSD capability (Warnock et al. 2002), reproduced by permission

Scannable split latch with LSSD capability used in the IBM Power4 TM. Designers were allowed to tune transmission gate size and specify input and output gates (Warnock et al. 2002), reproduced by permission

Library MSL with integrated front-end logic gate choices (Warnock et al. 2002), reproduced by permission

Split-latch designs with integrated front- and back-end logic-gate choices (Warnock et al. 2002), reproduced by permission