Last Time Performance Analysis It’s all relative

Slides:



Advertisements
Similar presentations
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
Advertisements

Computer Structure Power Management Lihu Rappoport and Adi Yoaz Thanks to Efi Rotem for many of the foils.
Computer Abstractions and Technology
Power Reduction Techniques For Microprocessor Systems
1 xkcd/619. Multicore & Parallel Processing Guest Lecture: Kevin Walsh CS 3410, Spring 2011 Computer Science Cornell University.
Chapter1 Fundamental of Computer Design Dr. Bernard Chen Ph.D. University of Central Arkansas.
1 ECE 570– Advanced Computer Architecture Dr. Patrick Chiang Winter 2013 Tues/Thurs 2-4PMPM.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Kevin Walsh CS 3410, Spring 2010 Computer Science Cornell University Multicore & Parallel Processing P&H Chapter ,
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
Lec 2 Aug 31 review of lec 1 continue Ch 1 course overview performance measures Ch 1 exercises quiz 1.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Power-Aware Computing 101 CS 771 – Optimizing Compilers Fall 2005 – Lecture 22.
1 The Problem of Power Consumption in Servers L. Minas and B. Ellison Intel-Lab In Dr. Dobb’s Journal, May 2009 Prepared and presented by Yan Cai Fall.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 Computer Abstractions and Technology Part II.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  Why computer organization is important  Logistics  Modern trends.
Chapter 1 - The Computer Revolution Chapter 1 — Computer Abstractions and Technology — 1  Progress in computer technology  Underpinned by Moore’s Law.
Lecture 1: Performance EEN 312: Processors: Hardware, Software, and Interfacing Department of Electrical and Computer Engineering Spring 2013, Dr. Rozier.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
The University of Adelaide, School of Computer Science
MS108 Computer System I Lecture 2 Metrics Prof. Xiaoyao Liang 2014/2/28 1.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Chapter 1 — Computer Abstractions and Technology — 1 Understanding Performance Algorithm Determines number of operations executed Programming language,
Power-Aware Compilation CS 671 April 22, CS 671 – Spring Why Worry about Power Dissipation? Environment Thermal issues: affect cooling, packaging,
Chapter 1 Performance & Technology Trends Read Sections 1.5, 1.6, and 1.8.
Chapter 1 Technology Trends and Performance. Chapter 1 — Computer Abstractions and Technology — 2 Technology Trends Electronics technology continues to.
Morgan Kaufmann Publishers
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
CPS3340 COMPUTER ARCHITECTURE Fall Semester, /03/2013 Lecture 3: Computer Performance Instructor: Ashraf Yaseen DEPARTMENT OF MATH & COMPUTER SCIENCE.
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO Session 3 Computer Evolution.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Chapter 1 — Computer Abstractions and Technology — 1 Uniprocessor Performance Constrained by power, instruction-level parallelism, memory latency.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
CS203 – Advanced Computer Architecture
Performance COE 301 / ICS 233 Computer Organization Prof. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum.
Chapter 1 Performance & Technology Trends. Outline What is computer architecture? Performance What is performance: latency (response time), throughput.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface ARM Edition Chapter 1 Computer Abstractions and Technology.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Computer Architecture & Operations I
Morgan Kaufmann Publishers Technology Trends and Performance
CS203 – Advanced Computer Architecture
Computer Architecture & Operations I
CS161 – Design and Architecture of Computer Systems
Morgan Kaufmann Publishers Computer Abstractions and Technology
Green cloud computing 2 Cs 595 Lecture 15.
Chapter1 Fundamental of Computer Design
Uniprocessor Performance
Morgan Kaufmann Publishers
COSC 3406: Computer Organization
Architecture & Organization 1
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers
Architecture & Organization 1
Morgan Kaufmann Publishers Computer Abstractions and Technology
Adaptive Single-Chip Multiprocessing
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
The University of Adelaide, School of Computer Science
Morgan Kaufmann Publishers Computer Abstractions and Technology
The University of Adelaide, School of Computer Science
Utsunomiya University
CS161 – Design and Architecture of Computer Systems
Presentation transcript:

CS/ECE 3330 Computer Architecture Chapter 1 Power / Parallelism

Last Time Performance Analysis It’s all relative Make sure the units cancel out! What is a Hz? Amdahl’s Law Benchmarking

Why Worry about Power Dissipation? Thermal issues: affect cooling, packaging, reliability, timing Battery life Environment

Power Trends “The Power Wall”

Power Dissipation Has Peaked Must design with strict power envelopes 130W servers, 65W desktop, 10-30W laptops, 1W mobile

How Hot Does it Get?

Cooling Issues http://www.youtube.com/watch?v=nYhEpHEPqcc

Intel vs. Duracell No Moore’s Law in batteries: 2-3%/year growth 16x 14x Processor (MIPS) 12x Improvement (compared to year 0) Hard Disk (capacity) 10x 8x 6x Memory (capacity) 4x 2x Battery (energy stored) 1x 0 1 2 3 4 5 6 Time (years) No Moore’s Law in batteries: 2-3%/year growth

Environment Environment Protection Agency (EPA): computers consume 10% of commercial electricity consumption Includes peripherals, possibly also manufacturing Data center growth was cited as a contribution to the 2000/2001 California Energy Crisis Equivalent power (with only 30% efficiency) for AC CFCs used for refrigeration Lap burn Fan noise Chlorofluorocarbons (CFCs) cited as the cause of ozone depletion

Power Matters at Scale… [J. Koomey (LBL), 2007] Eric Schmidt, CEO of Google: "What matters most to the computer designers at Google is not speed, but power - low power, because data centers can consume as much electricity as a city."

But Remember Amdahl’s Law

Power vs. Energy

Power vs. Energy Power consumption in watts Determines battery life in hours Sets packaging limits Energy efficiency in joules Rate at which power is consumed over time Energy = power * delay (joules = watts * seconds) Lower energy number means less power to perform a computation at same frequency

Another Fallacy: Low Power at Idle Morgan Kaufmann Publishers April 23, 2017 Another Fallacy: Low Power at Idle X4 power benchmark At 100% load: 295W At 50% load: 246W (83%) At 10% load: 180W (61%) Google data center Mostly operates at 10% – 50% load At 100% load less than 1% of the time Consider designing processors to make power proportional to load Chapter 1 — Computer Abstractions and Technology

Capacitive Power Dissipation Capacitance: Function of wire length, transistor size Supply Voltage: Has been dropping with successive fab generations Power ~ C V2 f Frequency switched: Clock frequency + likelihood of change

Morgan Kaufmann Publishers April 23, 2017 Reducing Power Suppose a new CPU has 75% of capacitive load of old CPU 25% voltage and 25% frequency reduction The power wall We can’t reduce voltage further We can’t remove more heat How else can we improve performance? Chapter 1 — Computer Abstractions and Technology

Uniprocessor Performance Morgan Kaufmann Publishers April 23, 2017 Uniprocessor Performance Constrained by power, instruction-level parallelism, memory latency Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers April 23, 2017 Multiprocessors Multicore microprocessors More than one processor per chip Multiprocessors and clusters – another course Requires explicitly parallel programming Compare with instruction-level parallelism Hardware executes multiple instructions at once Hidden from the programmer Hard to do Programming for performance Load balancing Optimizing communication and synchronization Traditional multiprocessors and clusters are covered in detail in another course Chapter 1 — Computer Abstractions and Technology

Multicore Architecture Examples Morgan Kaufmann Publishers 23 April, 2017 Multicore Architecture Examples 2 × quad-core Intel Xeon e5345 (Clovertown) 2 × quad-core AMD Opteron X4 2356 (Barcelona) Chapter 7 — Multicores, Multiprocessors, and Clusters

Multicore Architecture Examples Morgan Kaufmann Publishers 23 April, 2017 Multicore Architecture Examples 2 × oct-core Sun UltraSPARC T2 5140 (Niagara 2) 2 × oct-core IBM Cell QS20 Chapter 7 — Multicores, Multiprocessors, and Clusters

Key Points Power has become a limiting factor Power vs energy P = C * (V^2) * F One solution: Multicore processors Different scale than “old” parallel processors More detail in Chapter 7