An Analysis of SIMD Instructions in the Pentium III Microprocessor By Alexander J. Aved 05 DEC 2000 CS689 Ball State University Muncie, Indiana.

Slides:



Advertisements
Similar presentations
Chapter 3: Software Basics Section I: Preparing to Use Technology.
Advertisements

Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
Streaming SIMD Extension (SSE)
Intel Pentium 4 ENCM Jonathan Bienert Tyson Marchuk.
POLITECNICO DI MILANO Parallelism in wonderland: are you ready to see how deep the rabbit hole goes? ILP: VLIW Architectures Marco D. Santambrogio:
Pentium microprocessors CAS 133 – Basic Computer Skills/MS Office CIS 120 – Computer Concepts I Russ Erdman.
1 Comparison of Intel Microprocessor 8086, 386, 486, Pentium II by Hong Li Rivier College, CS699A Professional Seminar Fall 1999.
Pentium 4 and IA-32 ISA ELEC 5200/6200 Computer Architecture and Design, Fall 2006 Lectured by Dr. V. Agrawal Lectured by Dr. V. Agrawal Kyungseok Kim.
1 Microprocessor-based Systems Course 4 - Microprocessors.
Room: E-3-31 Phone: Dr Masri Ayob TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Computer Performance.
1 Lecture 6 Performance Measurement and Improvement.
Chapter 12 Three System Examples The Architecture of Computer Hardware and Systems Software: An Information Technology Approach 3rd Edition, Irv Englander.
EECS 470 Superscalar Architectures and the Pentium 4 Lecture 12.
CS 300 – Lecture 23 Intro to Computer Architecture / Assembly Language Virtual Memory Pipelining.
High Performance Computing Introduction to classes of computing SISD MISD SIMD MIMD Conclusion.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.1 Introduction.
CS / Schlesinger Lec1.1 1/20/99©UCB Spring 1999 Computer Architecture Lecture 1 Introduction and Five Components of a Computer Spring, 1999 Arie Schlesinger.
Copyright © 2006, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners Intel® Core™ Duo Processor.
Computer performance.
Feb. 2011Computer Architecture, Advanced ArchitecturesSlide 1 Part VII Advanced Architectures.
NATIONAL POLYTECHNIC INSTITUTE COMPUTING RESEARCH CENTER IPN-CICMICROSE Lab Design and implementation of a Multimedia Extension for a RISC Processor Eduardo.
Assembly Language for Intel-Based Computers, 4 th Edition Chapter 2: IA-32 Processor Architecture (c) Pearson Education, All rights reserved. You.
Semiconductor Memory 1970 Fairchild Size of a single core –i.e. 1 bit of magnetic core storage Holds 256 bits Non-destructive read Much faster than core.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Computer Organization & Assembly Language
XP Practical PC, 3e Chapter 16 1 Looking “Under the Hood”
CS 330 Project Presentation Asli Dolucan Senior, Fall 2001 Wells College.
The Pentium Processor.
The Pentium Processor Chapter 3 S. Dandamudi To be used with S. Dandamudi, “Introduction to Assembly Language Programming,” Second Edition, Springer,
Introduction CSE 410, Spring 2008 Computer Systems
Telecommunications and Signal Processing Seminar Ravi Bhargava * Lizy K. John * Brian L. Evans Ramesh Radhakrishnan * The University of Texas at.
Multi-core architectures. Single-core computer Single-core CPU chip.
Led the WWII research group that broke the code for the Enigma machine proposed a simple abstract universal machine model for defining computability devised.
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
Intel Pentium II Processor Brent Perry Pat Reagan Brian Davis Umesh Vemuri.
History of Microprocessor MPIntroductionData BusAddress Bus
ACMSE’04, ALDepartment of Electrical and Computer Engineering - UAH Execution Characteristics of SPEC CPU2000 Benchmarks: Intel C++ vs. Microsoft VC++
Computer Organization and Design Computer Abstractions and Technology
Evolution of Microprocessors Microprocessor A microprocessor incorporates most of all the functions of a computer’s central processing unit on a single.
Introduction to MMX, XMM, SSE and SSE2 Technology
Pipelining and Parallelism Mark Staveley
Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University
Computer performance issues* Pipelines, Parallelism. Process and Threads.
Xinsong1 Multimedia Extension Technology survey Xinsong Yang Electrical and Computer Engineering 734 Final Project 5/10/2002.
The Pentium Series CS 585: Computer Architecture Summer 2002 Tim Barto.
Processor Performance & Parallelism Yashwant Malaiya Colorado State University With some PH stuff.
Computer Architecture Organization and Architecture
Introduction CSE 410, Spring 2005 Computer Systems
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
Chapter Overview General Concepts IA-32 Processor Architecture
William Stallings Computer Organization and Architecture 6th Edition
Microarchitecture.
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
INTRODUCTION TO MICROPROCESSORS
Phnom Penh International University (PPIU)
INTRODUCTION TO MICROPROCESSORS
INTRODUCTION TO MICROPROCESSORS
Vector Processing => Multimedia
Multi-Core Computing Osama Awwad Department of Computer Science
CS775: Computer Architecture
MMX Multi Media eXtensions
Special Instructions for Graphics and Multi-Media
BIC 10503: COMPUTER ARCHITECTURE
Some Real Machines Intel® M80C186 CHMOS High Integration 16-bit Microprocessor Intel® Itanium™ 64-bit Microprocessor (IA-64)
STUDY AND IMPLEMENTATION
Coe818 Advanced Computer Architecture
Overview Prof. Eric Rotenberg
Presentation transcript:

An Analysis of SIMD Instructions in the Pentium III Microprocessor By Alexander J. Aved 05 DEC 2000 CS689 Ball State University Muncie, Indiana

TOC Problem Description Problem Statement Research Objectives Literature Review Importance Research Design Data Analysis Schedule Facilities Deliverables Summary

Problem Description The Intel Pentium III microprocessor has 70 new instructions, many of which are SIMD instructions, among other enhancements over the Pentium II. The cost of producing a new microprocessor is very high, and if the benefits of using the latest processor are marginal, Intel will need a way to get people to buy it.

Problem Statement In my research I will determine if the Pentium III’s SIMD instructions offer a performance increase that is noticeable to the typical home computer user, or if the functionality provided by the SIMD instructions are of little benefit to the typical home computer user.

Problem Statement (Cont) Sub-problems What software is included in a home computer? Which software benefits from SIMD instructions?

Problem Statement (Cont) Delimitations Limit software to pre-installed software from computer vender I will choose 15 computers to analyze (10 desktops and 5 laptops), $2,000-$2,500 price range All will be configured with Windows 98 Second Edition

Research Objectives The objective of my research is to determine if the SIMD instructions in the Pentium III microprocessor actually facilitate an increase in performance that is relevant to the average home computer user, or are the performance gains provided by the SIMD instructions insubstantial.

Literature Review Overview of Intel 32-bit ISA Overview of SIMD instructions in Pentium III Some applications of SIMD instructions Some “non” applications of SIMD instructions Current research trends in this area

Literature Review (Cont) 32-bit Dynamic Execution Architecture Out-of-order execution Superscalar Pipelined Branch Prediction

Literature Review (Cont) SIMD Instructions in the Pentium III MMX – Multi-Media eXtensions (57 Instructions, 1997) Mapped 64-bit MMX registers onto 80-bit floating- point registers Streaming SIMD Extensions (SSE) 8 new 128-bit registers added I use the term SIMD to include SSE and MMX instructions

Literature Review (Cont) Applications of SIMD DSP Fast Fourier- and Inverse Discrete Cosine transformations (used in MPEG and JPEG, respectively) Many more (fluid dynamics, weather modeling, VLSI circuit routing, etc.)

Literature Review (Cont) Non-Applications/problems with SIMD Instructions A high degree of data parallelism is needed CPU-intensive applications Integer SIMD/floating point problem OS support needed (context switch)

Literature Review (Cont) Current trends in this area Current trends may be grouped into three major categories 1)Make SIMD machines behave like MIMD machines 2)Scientific applications of SIMD (fluid dynamics, computer vision, etc.) 3)Investigation of SIMD instructions in non-SIMD microprocessors with respect DSP and image compression My research is a totally new approach

Importance The goal of this research is to show that the SIMD instructions in the Pentium III, by themselves, do not significantly enhance the computing experience of home computer users. With the results of this research someone in the market for a home computer can make a more educated decision as to whether or not they need a computer with a microprocessor that includes SSE in its ISA.

Research Design The software will be categorized by type: Word Processing/Page Layout (PageMaker) Data Entry (MS Money) Data Lookup (an encyclopedia) Spreadsheet (Excel) Web Browsers (Netscape) Educational/Tutorial Media Players (Real Audio player) Other/Miscelaneous

Research Design (Cont) The software will be subcategorized based upon inherent data parallelism, dependence on connection speed to other computers, or if the CPU is a bottleneck for the performance of the software. Of the software that has data that exhibits significant parallelism to warrant the use of SIMD instructions, the VTune Performance Analyzer will be used to determine if the product actually uses any SIMD instructions.

TOC Problem Description Problem Statement Research Objectives Literature Review Importance Research Design Data Analysis Schedule Facilities Deliverables Summary

Data Analysis This research is a qualitative analysis of the benefits of the SIMD instructions to home computer users. My data analysis section will consist of a table showing which software products have data that exhibit significant parallelism to benefit from SIMD instructions, the software that currently uses SIMD instructions and the software that would has no need for SIMD instructions due to the asynchronous nature of its data.

Schedule Planned project duration: 11 months Month 1: obtain computer equipment and software Months 2 through 9: perform analysis as previously described Month 10: create final report Month 11: present findings

Facilities An Internet connection (56.6K would be adequate) VTune Version 4.0 (from Intel) Fifteen computers (10 desktop, 5 laptop) (including software)

Deliverables Research Report

Summary I plan to show that the Intel Pentium III microprocessor incorporates SIMD instructions that are very useful for scientific applications, but are of limited use in the home computing environment. References available at: Presentation available at:

Questions? Questions?