Computer Architecture Part II-D: Survey of Processor Architecture.

Slides:



Advertisements
Similar presentations
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Advertisements

The Central Processing Unit: What Goes on Inside the Computer.
IT Chapter 2 Part B CPU. The CPU is contained on a single integrated circuit called the microprocessor. Often referred to as the brains of a computer.
The Evolution of RISC A Three Party Rivalry By Jenny Mitchell CS147 Fall 2003 Dr. Lee.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Room: E-3-31 Phone: Dr Masri Ayob TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Computer Performance.
CHAPTER3: Processor Types and Specifications PART1
1 Chapter 4 The Central Processing Unit and Memory.
7-Aug-15 (1) CSC Computer Organization Lecture 6: A Historical Perspective of Pentium IA-32.
The AMD and Intel Architectures COMP Jamie Curtis.
Week 2 – Lesson 2.  Processor and chipset are located on motherboard  Components determine power and features of system  Major manufacturers: Intel,
PC Maintenance: Preparing for A+ Certification Chapter 5: CPUs.
1 Microprocessor speeds Measure of system clock speed –How many electronic pulses the clock produces per second (clock frequency) –Usually expressed in.
Computer Organization and Assembly language
2-1 Motherboard. 2-2 Section Objectives  Define the purpose of the major components on a motherboard including the CPU, chipset, and expansion slots.
Understanding The CPU Ryan cable Jeremy Wall Jason Basara.
A+ Guide to Managing and Maintaining your PC, 6e Chapter 5 Processors and Chipsets.
How a Computer Processes Data Hardware. Major Components Involved: Central Processing Unit Types of Memory Motherboards Auxiliary Storage Devices.
Computer performance.
Computer Architecture CST 250 INTEL PENTIUM PROCESSOR Prepared by:Omar Hirzallah.
Continuation of Primary System Components Powered by DeSiaMore1.
Information and Communication Technology Fundamentals Credits Hours: 2+1 Instructor: Ayesha Bint Saleem.
Microprocessors Chapter 3.
Microprocessors Chapter 2. External Data Bus Make sure that you read the analogy explaining exactly what the external data bus is binary system - a numbering.
Writer:-Rashedul Hasan Editor:- Jasim Uddin
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Guide to Operating Systems, 4th ed.
 Design model for a computer  Named after John von Neuman  Instructions that tell the computer what to do are stored in memory  Stored program Memory.
Current Computer Architecture Trends CE 140 A1/A2 29 August 2003.
The Microprocessor An integrated circuit with millions of transistors interconnected with very small aluminum wires. Controls and directs activities of.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Company LOGO High Performance Processors Miguel J. González Blanco Miguel A. Padilla Puig Felix Rivera Rivas.
Copyright © 2007 Heathkit Company, Inc. All Rights Reserved PC Fundamentals Presentation 27 – A Brief History of the Microprocessor.
Led the WWII research group that broke the code for the Enigma machine proposed a simple abstract universal machine model for defining computability devised.
Pre-Pentium Intel Processors /
A+ Guide to Managing and Maintaining your PC, 6e Chapter 5 Processors and Chipsets (v0.9)
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
History of Microprocessor MPIntroductionData BusAddress Bus
Computer Maintenance Unit Subtitle: CPU’s UNT in partnership with TEA, Copyright ©. All rights reserved1.
Chapter 5B Modern CPUs.
IBM/Motorola/Apple PowerPC
THE BRIEF HISTORY OF 8085 MICROPROCESSOR & THEIR APPLICATIONS
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
PC Internal Components Lesson 4.  Intel is perhaps the most recognizable microprocessor manufacturer. List some others.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
A Look Inside The Processor
Lecture # 10 Processors Microcomputer Processors.
Lecture 3 Dr. Muhammad Ayaz Computer Organization and Assembly Language. (CSC-210)
Types of RAM (Random Access Memory) Information Technology.
Hardware Architecture
CPU (Central Processing Unit). The CPU is the brain of the computer. Sometimes referred to simply as the processor or central processor, the CPU is where.
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
William Stallings Computer Organization and Architecture 6th Edition
Computer Maintenance Unit Subtitle: CPU’s Trade & Industrial Education
CPU Central Processing Unit
Microprocessor Generations
Lesson 8 CPUs Used in Personal Computers.
Visit for more Learning Resources
McGraw-Hill Technology Education
HISTORY OF MICROPROCESSORS
Phnom Penh International University (PPIU)
HISTORY OF MICROPROCESSORS
Microprocessors Chapter 4.
BIC 10503: COMPUTER ARCHITECTURE
Lesson 8 CPUs Used in Personal Computers.
Lecture 3 (Microprocessor)
Intel CPU for Desktop PC: Past, Present, Future
Presentation transcript:

Computer Architecture Part II-D: Survey of Processor Architecture

Microprocessors in the Market What’s the difference?

Areas of Development Below are technologies which can be improved in CPU design:  System bus speed  Internal and external clock frequency  Casing  Cooling system  Instruction set  Material used for the die End result: enhance speed of the CPU and the system in general

The System Bus Conduit for moving data between the processor and other system components CPU Caches System Bus Memory I/O Devices: Controllers Adapters Disks Displays Keyboards Networks Bus

System Bus Speeds Intel Pentium Core 2 Quad/Duo have CPU clocks of 2.66/3 GHz with system bus speeds of 1066/1333 MHz AMD: 2 nd Generation Opteron (dual core) processor has clock speed of 1.8 GHz with a 1000 MHz system bus

Split Clock Frequency Internal clock frequency  Speed of data processing inside the CPU External clock frequency  Speed of data transfer to and from the CPU via the system bus Intel 486DX2 25/50 was first to use clock doubling to implement split clock system

The GHz Race in CPU Frequency June 1999: API (Alpha Processor Inc.) demonstrated a 1 GHz chip March 2000: AMD released Athlon 1 GHz; within days Intel released 1 GHz Pentium III 2002: AMD, Intel uses 0.13 micron technology  Athlon XP (June)  Pentium GHz (May), mobile Pentium 4 2 GHz (June) 2004  Pentium 4: 3.6 GHz, 800 MHz system bus  AMD: 3200+, 2.2 GHz, 400 MHz : Same as bit CPUs  now concentrating on 64-bit 2005  Pentium 4: 3.73 – 3.8 GHz, 800/1066 MHz system bus  AMD: Same as 2004

Is Moore’s Law Dead? Intel’s vision of a 10 GHz CPU cannot be realized due to heat problems Some have pushed speed limits through high-end cooling systems Both Intel and AMD no longer concentrating on speed as performance driver SIA says “Moore’s Law is still going strong after 40 years”

Micron Technology A micron is 1 millionth of a meter  Human hair strand about 100 microns Objective: thinner wires  Allow CPU to operate at lower voltage  Results in CPU generating less heat and operating at higher speeds Currently, processors are in the range of microns (65 nm) Intel’s Roadmap: 45  15 nm

Micron Technology Through the Years

Thinner Wires = Increased Transistors 8086/ , , DX/386SX 250, SX/486DX 486DX2/486DX4 1,200,000 Pentium, Cyrix AMD K5, MMX 3,100,000 AMD K6 8,800,000 Athlon 1.4 GHz 37,000,000 Pertium 4 42,000,000

The Switch to Copper Aluminum limits making chips smaller Copper is a good choice because it  is a better conductor  consumes less energy, and  takes up less space than aluminum Copper allowed processors to boost speeds to the GHz range IBM pioneered the use of copper on September 1, 1998 (IBM Power PC 740/750)

PC on a Chip Integrates a number of key components into one chip Result: The chip replaces dozen or so separate chips (memory, FPU, graphics, video, etc.) Applications: PDAs, cellphones, set-top boxes, embedded processors, etc.

Impact of PC-on-a-Chip Smaller and quieter desktops Battery of devices lasts longer because of the low power drain Proliferation of information appliances

CPU Receptacle ZIF  Zero Insertion Force socket - type of socket designed for easy insertion of chips that have high density of pins  Socket 7 - popular implementation of ZIF

CPU Receptacle Slot 1  Consists of receptacle on the motherboard that holds an Intel Single Edge Contact (SEC) cartridge  Cartridge may contain up to two CPUs and an L2 cache (runs at half the speed of CPU) and plugs into 242-pin receptacle  Started with Pentium II

CPU Receptacle A Pentium II mounted on Slot 1

CPU Receptacle Slot 2  An enhanced Slot 1  Uses 330-pin SEC  Holds up to four CPUs  L2 cache runs at full processor speed  First used in Intel's Pentium II Xeon

CPU Receptacle AMD’s Slot A  Receptacle on motherboard for K7 CPU  Physically similar to Slot 1, but has different electrical requirements

Casing: FC-PGA (Flip-Chip) Traditional Wiring Flip-Chip (IBM)

Advantages of FC-PGA Greater # of I/O pins available Shorter electrical connections Better manufacturing efficiency

Casing: FC-LGA Bottom view of LGA/BGA-based CPULGA Socket 775

Advantages of FC-LGA Lower voltage used (less distance traveled, reduced signal loss) Less heat dissipation

Cache Works as buffer between CPU and memory Two types:  Internal  External

Levels of Cache Level 1 Level 2 Level 3 L1 L2 L3

Cache Placement Intel used to have external L2 cache Pentium Pro  Internal but CPU and L2 cache are separate  Result: larger chip that requires a larger socket

Overclocking Going beyond recommended clock frequency settings 3 method of overclocking  System bus frequency  CPU frequency multiplier  Change both of the above Some CPUs have locked frequencies

Overclocking: How to... Done through BIOS program Older systems require motherboard jumpers Some motherboards (e.g. ASUS TX97) contain jumper codes

Overclocking Issues Heat! Can main memory cope? Will the software still work?

Cooling Systems CPUs get hotter as they get faster Developed to keep the CPU from overheating Sophisticated cooling systems allow more reliable CPU operation

Liquid Nitrogen: Extremely Cool! CPU: Pentium 4 (Northwood) Date: Christmas 2003

The CPU Gets Watered Down

Multimedia Processing Multimedia applications require geometric transformation  Re-computation of location and size of an image to determine new position  Deals with FP FPU handles all real number computations Drawing landscapes (e.g. games) involves lots of computations and CPU may not handle it as fast as the player could react

Ways of Handling Multimedia Speed up the CPU Improve the CPU’s FPU by adding more pipelines Use high-end 3D graphics cards Add new multimedia instructions

Multimedia Innovations in CPUs MMX 3DNow! SSE

MMX Introduced 1995 in the Pentium processor Had 57 new instructions for 3D graphics Introduced SIMD (Single Instruction Multiple Data) instructions: technique that processes more than one integer simultaneously Problems:  Only works with integers  CPU can only work with either MMX or FPU, not both simultaneously because they share registers

3DNow! Introduced summer of 1998 in the AMD K6-2 Characteristics  Supports SIMD instructions  Improved handling of numbers Successful!  Integrated in Windows, games, and drivers  Does not use the same registers

SSE Introduced in Pentium III (Katmai) 500 MHz as Intel’s response to 3DNow! Characteristics  8 new 128-bit registers (can hold four 32-bit #s)  Has Streaming SIMD Extensions  50 new instructions enabling simultaneous advanced calculations of more FP with a single instruction  New Media Instructions designed for coding and decoding MPEGs

Problems with SSE Pipelines can only handle two 32-bit numbers at a time To take advantage of 128-bit registers, FPU pipeline should have been doubled (would have pushed back release date of Katmai) Potentially, it could have enhanced 3D graphics since registers can handle four 32-bit numbers at a time

SSE Enhancements SSE2  Started in Pentium 4  Has 144 new instructions (since SSE)  Data width now 64 bits SSE3  13 additional SIMD instructions (since SSE2)  New instructions primarily designed to improve thread synchronization and specific application areas such as media and gaming Supplemental SSE3 (Core 2)  SSE4

Other CPU Innovations Data width  Internal: How many bits can the CPU process simultaneously?  External: How many bits can the CPU receive simultaneously for processing Superscalar architecture Superpipelined architecture Superscalar processing

Intel Corporation Produced biggest impact on microprocessor technology Main line of business is CPU but also has other hardware products (e.g. motherboards)

Short History of Intel 1968: Birth of Intel  Started in memory business  First product was 64-bit memory 1970s: Increase in market share Early 1980s: Japanese eats up memory market with KB chips 1984: Business slowing down  “Get us out of memory!” 1986: Exited from memory due to success of 80386

Intel Processor Time Line 1971: 4004 Intel’s first microprocessor (108 KHz, 4 bit bus width) 1978: 8086 First 16-bit CPU from Intel 1979: 8088 Reengineered CPU to fit existing 8-bit hardware 1982: bit processor Optimized Instruction handling 1985: 386 First 32-bit CPU (32-bit system bus) 1988: 386SX Cheaper version of the 386DX 1989: 486 Built in math co-processor L1 cache on-chip 2

Intel Processor Time Line 486SX Discount chip No math co-processor 486DX4 Triple the clock speed From 25 MHz to 75 MHz 33 MHz to 100 MHz 1993: Pentium Classic Superscalar (5x 486DX-33 MHz) Width of system bus: 64 bit Speed of system bus: 60 to 66 MHz Initially produced a lot of heat Jan 8, 1997: Pentium MMX New set of instructions for multimedia 32 KB L1 cache Nov 1, 1995: Pentium Pro RISC Processor 32 bit processing L2 cache is built in May 7, 1997: Pentium II (Klamath) 512 KB L2 L1 cache of 32 KB 3

Intel Processor Time Line Jan 26, 1998: Deschutes 333 MHz 0.25 micron technology 1Q 1998: Celeron (Covington) Pentium II without the L2 cache 1998: Celeron (Mendocino) 333 MHz 128 KB L2 internal cache July 26, 1998: Pentium II Xeon 450 MHz Custom SRAM Different L2 caches: 512, 1/2 MB Can have Xeons in one server 1999: Pentim III (Katmai) Enhanced MMX2 graphics instructions 1999: Pentium III Xeon (Tanner) 2001: Itanium (formerly Merced) 64-bit CPU 0.18 micron technology > 25 million transistors 2000: Pentium 4 7 th Generation 0.18 micron technology Core (2005)

Current Intel CPU Innovations Hyperthreading Multi-core  Core  Core 2 (64-bit architecture)

Intel’s First 64-Bit Chip (Server): Itanium Was known as IA-64 (but IA-32 compatible) EPIC (Explicitly Parallel Instruction Computing) processor  Enables up to 20 operations/clock cycle  Employs branch prediction and speculation Three levels of cache: 2 MB / 4 MB L3 cache, 96K L2 cache, and 32K L1 cache 128 integer registers, 128 FP registers

Itanium 2 Available from GHz Internal L3 cache (1.5 MB, 3 MB, 4 MB, 6 MB, or 9 MB) System bus: 400/533/667 MHz, 128-bits wide 0.13 microns, 592 million transistors Next version (“Montecito”) has 1.72 billion transistors, 26 MB on- die cache, 90 nm

Current Intel CPU Lineup Mobile  Centrino (Core and Core 2) Desktop  Core 2 Extreme  Core 2 (now used in Apple Mac Mini) Servers and workstations  Xeon (now used in Apple Mac Pro)  Itanium 2

AMD (Advanced Micro Devices) Incorporated in May 1969 Challenging Intel even before Pentium-class processors Offered their own technology and cannot be considered as producing clones Achieved increased market sales starting with K6 and K6-II

AMD Series (From Pentium Class) K5  Similar to the classic Pentiums  16 KB L1 cache and no MMX  Not very impressive but much cheaper than similar Pentium models K6  Technology brought in from NexGen; put AMD back in business  32 KB L1 cache & MMX  Pentium compatible but performed better than MMX

AMD Series (From Pentium Class) K6-II: Chomper  0.25 micron, system bus speed of 100 MHz  Introduced 3DNow!  Also MMX-compatible; really challenged the Pentium II and led to low-cost Celeron K6-III: Sharptooth  Three levels of cache: L1 and L2 are in CPU; L3 is on motherboard up to 1 MB; 133 system bus  Was not as successful as the K6-II K7: Athlon

Last of AMD’s 32-Bit Processors Athlon XP  Intel played catch-up to the Athlon XP on many occasions, but now stagnant in 32-bit computing  Model has a 2.2 GHz CPU, 3 FP pipelines, 128 KB of L1 cache, 512 KB L2 cache, system bus speed of 400 MHz, and 0.13 micron technology Sempron  Counterpart of Intel Celeron  Model has 2 GHz CPU, 754-pins, 90 nm technology, 128 KB L2 cache

AMD’s 64-Bit Chips Varieties:  Athlon 64 (desktop)  Turion 64 (mobile)  Opteron (servers or workstations) Provides seamless transition to 64-bit System bus runs at processor speed through on-chip memory controller Lead the Itanium 2 on many benchmarks AMD formed a partnership with Sun

Current AMD 64-Bit CPU Innovations HyperTransport Dual core Direct Connect Architecture

Transmeta’s Crusoe Processor Transmeta’s founders include David Ditzel, Linus Torvalds, and Paul Allen released “Crusoe” in January 2000 Architectural achievements  Only 25% the number of transistors compared to current Pentiums  Needs only 1 or 2 watts of power for 400 MHz or 700 MHz chips running at full speed  Much less heat dissipated but can compete with same category Intel and AMD chips

How Crusoe Pulled It Off Efficient instruction set bears no resemblance to x86 Takes advantage of latest and best in hardware design Software layer (code morphing software) in flash ROM translates x86 commands

Current Transmeta Processors Crusoe TM5900  667 MHz – 1 GHz CPU speed  128 KB L1, 512 KB L2  133 MHz system bus  0.13 microns Efficeon TM8800  Up to 1.7 GHz  128 KB L1 instruction cache 64 KB L1 data cache, 1 MB L2  400 MHz system bus

The PowerPC Microprocessor Originally designed by Apple, IBM, and Motorola Based on IBM POWER architecture used in IBM RS/6000 (RISC based) Provides seamless transition to 64-bit The PowerPC G5 is used in Apple iMac G5 2.7 GHz CPU speed, 1.35 GHz system bus, 512 KB on-chip L2 cache

Sun UltraSparc IV+ 2 nd generation dual core processor design (1368 pins FC- LGA) 64-bit CPU, 90 nm, 295 million transistors CPU speeds of 1.95 / 2.1 GHz 2 MB L2 cache, 32 MB off-chip On-chip memory controller CMT (Chip Multi-Threading) with 2 threads per processor 14-stage non-stalling pipeline 4-way superscalar Runs Solaris, Linux, FreeBSD, and other UNIX versions

Sun UltraSparc T1 Available in 4-, 6- or 8-core 64 bits, 90 nm 4-way multithreaded core 14-stage non-stalling pipeline 4 integrated memory controllers 16 KB instruction, 8 KB data L1 cache per core, 3 MB unified L2 cache Available in 1 and 1.2 GHz Low power (72 – 79 watts)

Multiprocessor Systems Combines two or more CPUs of the same brand and model Allows systems to scale up Forms an N-way system

Future Trends In Dec. 1997, the Semiconductor Industry Association (SIA) provided details about future requirements of microprocessors. Attempts to continue the pace predicted by Moore’s Law

1999 SIA Roadmap for Microprocessors

International Technology Roadmap for Semiconductors