Anshul Gandhi 347, CS building

Slides:



Advertisements
Similar presentations
Computer Structure Power Management Lihu Rappoport and Adi Yoaz Thanks to Efi Rotem for many of the foils.
Advertisements

Computer Abstractions and Technology
FAWN: Fast Array of Wimpy Nodes Developed By D. G. Andersen, J. Franklin, M. Kaminsky, A. Phanishayee, L. Tan, V. Vasudevan Presented by Peter O. Oliha.
FAWN: Fast Array of Wimpy Nodes A technical paper presentation in fulfillment of the requirements of CIS 570 – Advanced Computer Systems – Fall 2013 Scott.
Lecture 2: Modern Trends 1. 2 Microprocessor Performance Only 7% improvement in memory performance every year! 50% improvement in microprocessor performance.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
Energy-efficient Cluster Computing with FAWN: Workloads and Implications Vijay Vasudevan, David Andersen, Michael Kaminsky*, Lawrence Tan, Jason Franklin,
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  logistics  why computer organization is important  modern trends.
1 The Problem of Power Consumption in Servers L. Minas and B. Ellison Intel-Lab In Dr. Dobb’s Journal, May 2009 Prepared and presented by Yan Cai Fall.
Energy Model for Multiprocess Applications Texas Tech University.
CSE 691: Energy-Efficient Computing Lecture 3 SLEEP: full-system Anshul Gandhi 1307, CS building
1 VLSI and Computer Architecture Trends ECE 25 Fall 2012.
Folklore Confirmed: Compiling for Speed = Compiling for Energy Tomofumi Yuki INRIA, Rennes Sanjay Rajopadhye Colorado State University 1.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
Last Time Performance Analysis It’s all relative
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
CSE 691: Energy-Efficient Computing Lecture 6 SHARING: distributed vs. local Anshul Gandhi 1307, CS building
Lecture 1: What is a Computer? Lecture for CPSC 2105 Computer Organization by Edward Bosworth, Ph.D.
1 Lecture 1: CS/ECE 3810 Introduction Today’s topics:  Why computer organization is important  Logistics  Modern trends.
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
CSE 691: Energy-Efficient Computing Lecture 7 SMARTS: custom-made systems Anshul Gandhi 1307, CS building
1 Computer Architecture Research Overview Rajeev Balasubramonian School of Computing, University of Utah
1 CS/EE 6810: Computer Architecture Class format:  Most lectures on YouTube *BEFORE* class  Use class time for discussions, clarifications, problem-solving,
CSE 691: Energy-Efficient Computing Lecture 1: Intro and Logistics Anshul Gandhi 1307, CS building
Computational Sprinting on a Real System: Preliminary Results Arun Raghavan *, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas F. Wenisch +, Milo M. K.
Impact of the convergence of three worlds: High-Performance Computing, Databases, and Analytics Stefan Manegold
Storage Systems CSE 598d, Spring 2007 Lecture ?: Rules of thumb in data engineering Paper by Jim Gray and Prashant Shenoy Feb 15, 2007.
1 Lecture 2: Metrics to Evaluate Systems Topics: Metrics: power, reliability, cost, benchmark suites, performance equation, summarizing performance with.
Computer Organization Yasser F. O. Mohammad 1. 2 Lecture 1: Introduction Today’s topics:  Why computer organization is important  Logistics  Modern.
E-MOS: Efficient Energy Management Policies in Operating Systems
CSE 591: Energy-Efficient Computing Lecture 3 SPEED: processor Anshul Gandhi 347, CS building
CSE 591: Energy-Efficient Computing Lecture 1: Intro and Logistics Anshul Gandhi 347, New CS building
CSE 591: Energy-Efficient Computing Lecture 4 SLEEP: full-system Anshul Gandhi 347, CS building
CSE 591: Energy-Efficient Computing Lecture 8 SOURCE: renewables Anshul Gandhi 347, CS building
CISC 879 : Advanced Parallel Programming Vaibhav Naidu Dept. of Computer & Information Sciences University of Delaware Dark Silicon and End of Multicore.
Computer Performance. Hard Drive - HDD Stores your files, programs, and information. If it gets full, you can’t save any more. Measured in bytes (KB,
CS203 – Advanced Computer Architecture
History of Computers and Performance David Monismith Jan. 14, 2015 Based on notes from Dr. Bill Siever and from the Patterson and Hennessy Text.
CSE 591: Energy-Efficient Computing Lecture 13 SLEEP: sensors
CS120 Purchasing a Computer
High Performance Computer Architecture:
CS203 – Advanced Computer Architecture
Lecture 2: Performance Today’s topics:
Lynn Choi School of Electrical Engineering
Anshul Gandhi 347, CS building
مقدمة في علوم الحاسب Lecture 5
Lecture 1: CS/ECE 3810 Introduction
CSE 591: Energy-Efficient Computing Lecture 17 SCALING: survey
Resource Aware Scheduler – Initial Results
CSE 591: Energy-Efficient Computing Lecture 20 SPEED: disks
Morgan Kaufmann Publishers
CSE 591: Energy-Efficient Computing Lecture 21 review
CSE 591: Energy-Efficient Computing Lecture 15 SCALING: storage
Lecture 1: CS/ECE 3810 Introduction
CSE 591: Energy-Efficient Computing Lecture 19 SPEED: memory
Hui Chen, Shinan Wang and Weisong Shi Wayne State University
Lecture 2: Performance Today’s topics: Technology wrap-up
CSE 591: Energy-Efficient Computing Lecture 12 SLEEP: memory
CSE 591: Energy-Efficient Computing Lecture 14 SCALING: setup time
CSE 591: Energy-Efficient Computing Lecture 9 SLEEP: processor
CS 140 Lecture Notes: Technology and Operating Systems
Parallel Processing Sharing the load.
Computer Architecture
CS/EE 6810: Computer Architecture
CS 140 Lecture Notes: Technology and Operating Systems
CSE 591: Energy-Efficient Computing Lecture 18 SPEED: power
By Brandon, Ben, and Lee Parallel Computing.
Instructor: Joel Grodstein
The University of Adelaide, School of Computer Science
Presentation transcript:

Anshul Gandhi 347, CS building anshul@cs.stonybrook.edu CSE 591: Energy-Efficient Computing Lecture 7 SMARTS: custom-made systems Anshul Gandhi 347, CS building anshul@cs.stonybrook.edu

Benchmark competitions

Benchmark competitions

fawn paper

Data-intensive workloads Seek-bound small random examples? problems? Scan-bound large sequential examples? problems? Examples: random data accesses as in a user and large file reads. Problems: limited b/w due to disk seeks and low bandwidth due to lots of data and fast CPUs.

FAWN CPUs are much faster than data delivery Makes sense to “balance” CPU and data delivery Use weaker CPUs Use faster data delivery (Flash vs. HDD)

Why FAWN? Memory wall (??) Increased CPU power consumption DVFS is limited Modern CPUs operate close to Vmin Constant leakage current Peak power and data center density

FAWN results

FAWN results #Nodes dictated by required GB space and QPS performance

FAWN results

processor scaling trends

Moore’s Law Moore’s law (observation) # transistors/chip ↑ 2X/2yr (how?) Frequency ↑ as transistor size ↓ (max 9GHz) Leakage current/power ↑ as transistor size ↓ Heat ↑ as frequency ↑

Moore’s Law

Dennard Scaling Dennard scaling (observation) Transistor power (V+I) ↓ as transistor size ↓ + Moore’s law = perf/watt ↑ 2X/2yr (how?) Koomey’s law Dennard scaling broke down recently due to leakage current (though Moore’s continued) Dark Silicon So we did multicore! Moore’s law continued for a while though Dennard scaling broke down. This led to multicore. TurboBoost

heteromates paper

Dark silicon Cannot power on all of the CPU Result of: Success of Moore’s law Failure of Dennard scaling Turbo boost

Main ideas Heterogeneous cores For different performance requirements Energy considerations (battery vs. plugged) Thermal considerations Dark silicon

Main ideas

Main ideas

Results