Computational Sprinting on a Hardware/Software Testbed Arun Raghavan *, Laurel Emurian *, Lei Shao #, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas.

Slides:



Advertisements
Similar presentations
HardBound: Architectural Support for Spatial Safety of the C Programming Language Joe Devietti *, Colin Blundell, Milo Martin, Steve Zdancewic * University.
Advertisements

Win8 on Intel Programming Course Win8 for developers, in detail Cédric Andreolli Intel.
Win8 on Intel Programming Course Desktop : WPF Cédric Andreolli Intel Software
Win8 on Intel Programming Course Desktop : Sensors Cédric Andreolli Intel Software
Computer Structure Power Management Lihu Rappoport and Adi Yoaz Thanks to Efi Rotem for many of the foils.
Win8 on Intel Programming Course Win8 and Intel Paul Guermonprez Intel Software
EngageNY.org ©2012 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Introduction Companion slides for
Internet of Things with Intel Edison Presentation Paul Guermonprez Intel Software
Object-Oriented Reengineering Patterns and Techniques Prof. O. Nierstrasz Prof. S. Ducasse T.
12. Summary, Trends, Research. © O. Nierstrasz PS — Summary, Trends, Research Roadmap  Summary: —Trends in programming paradigms  Research:...
Shared Counters and Parallelism Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit.
ESE Einführung in Software Engineering N. XXX Prof. O. Nierstrasz Fall Semester 2009.
ESE Einführung in Software Engineering X. CHAPTER Prof. O. Nierstrasz Wintersemester 2005 / 2006.
Metamodeling Seminar X. CHAPTER Prof. O. Nierstrasz Spring Semester 2008.
ESE Einführung in Software Engineering X. CHAPTER Prof. O. Nierstrasz Wintersemester 2005 / 2006.
N. XXX Prof. O. Nierstrasz Thanks to Jens Palsberg and Tony Hosking for their kind permission to reuse and adapt the CS132 and CS502 lecture notes.
Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit Concurrent Skip Lists.
Internet of Things with Intel Edison Led sensor lab
© S. Demeyer, S. Ducasse, O. Nierstrasz Chapter.1 MakeMoney Corp. C*O of MakeMoney Corp. Our Vision  We invest in software  We do not know software 
OORPT Object-Oriented Reengineering Patterns and Techniques X. CHAPTER Prof. O. Nierstrasz.
CP — Concurrent Programming X. CHAPTER Prof. O. Nierstrasz Wintersemester 2005 / 2006.
12. eToys. © O. Nierstrasz PS — eToys 12.2 Denotational Semantics Overview:  … References:  …
SWOT Analysis Strengths Weaknesses SWOT Opportunities Threats.
Collecting Images & Clippings Chapter 2.3 in Sketching User Experiences: The Workbook.
Win8 on Intel Programming Course Modern UI : Sensors Cédric Andreolli Intel Software.
Win8 on Intel Programming Course The challenge Paul Guermonprez Intel Software
Multicore Programming
© Copyright Showeet.com I NSERT YOUR TITLE HERE. © Copyright Showeet.com Insert Your Title Here 2 Master text styles –Second level Third level –Fourth.
Leveraging the National Cyberinfrastructure for Top Down Mass Spectrometry Richard LeDuc.
The Animated Sequence Chapter 5.1 in Sketching User Experiences: The Workbook.
Numerical Methods Part: Simpson Rule For Integration.
The PHEA Educational Technology Initiative. Project Partners PHEA Foundations – Ford, Carnegie, Kresge, MacArthur South African Institute for Distance.
Numerical Methods Fast Fourier Transform Part: Informal Development of Fast Fourier Transform
Internet of Things with Intel Edison CylonJS Pierre Collet Intel Software
Numerical Methods Multi Dimensional Direct Search Methods - Theory
Numerical Methods Continuous Fourier Series Part: Continuous Fourier Series
Computational Sprinting on a Real System: Preliminary Results Arun Raghavan *, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas F. Wenisch +, Milo M. K.
11/17/ Introduction to Partial Differential Equations Transforming Numerical.
Sketching Vocabulary Chapter 3.4 in Sketching User Experiences: The Workbook Drawing objects, people, and their activities.
©2013 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
The PHEA Educational Technology Initiative. Project Partners PHEA Foundations – Ford, Carnegie, Kresge, MacArthur South African Institute for Distance.
Win8 on Intel Programming Course Paul Guermonprez Intel Software
State of the Ward in 2007 Version 1.0 A Fifth Sunday Lesson Given in the Sterling Park Ward, Ashburn, VA Stake by D. Calvin Andrus, Bishop
Design of Everyday Things Part 2: Useful Designs? Lecture /slide deck produced by Saul Greenberg, University of Calgary, Canada Images from:
SWOT Analysis Matrix S W O T SWOT Analysis.
On Welfare Principles 5th Sunday Lesson Sterling Park Ward Ashburn, Virginia Stake Taught by: D. Calvin Andrus, Bishop and the Welfare Committee 29 March.
Numerical Methods Multidimensional Gradient Methods in Optimization- Theory
Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit Concurrent Skip Lists.
1/30/ Elliptic Partial Differential Equations – Lieberman Method – Part 1 of 2 Elliptic Partial Differential.
© 2014 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 Unported License.
Computational Sprinting Arun Raghavan *, Yixin Luo +, Anuj Chandawalla +, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas F. Wenisch +, Milo M. K. Martin.
2/13/ Elliptic Partial Differential Equations - Introduction Transforming.
Images of pesticides By: Leslie London, University of Cape Town This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 2.5.
The Relative Power of Synchronization Operations Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit.
Numerical Methods Multidimensional Gradient Methods in Optimization- Example
CS203 – Advanced Computer Architecture
EngageNY.org ©2012 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
© 2015 Core Knowledge Foundation. This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 Unported License.
PHPBelgium – Belgian PHP community PHPBelgium event MVC = Make Venerated Code? Patrick Allaert.
1 Licence This presentation document is one used by Ryan Cartwright of Equitas IT Solutions [1] and is Copyright © 2008 Ryan P. Cartwright / Equitas IT.
CS203 – Advanced Computer Architecture
Sketching Vocabulary Chapter 3.4 in Sketching User Experiences: The Workbook Drawing objects, people, and their activities.
Agenda Video pre-presentations Digital sketches & photo traces
Numerical Methods Multi Dimensional Direct Search Methods - Example
Pre and post workshop assessments
FOTW Worksheet Slides Christopher Penn, Financial Aid Podcast Student Loan Network.
Elliptic Partial Differential Equations – Direct Method
Elliptic Partial Differential Equations – Gauss-Seidel Method
Numerical Methods Newton’s Method for One -Dimensional Optimization - Example
Presentation transcript:

Computational Sprinting on a Hardware/Software Testbed Arun Raghavan *, Laurel Emurian *, Lei Shao #, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas F. Wenisch +, Milo M. K. Martin * University of Pennsylvania, Computer and Information Science * University of Michigan, Electrical Eng. and Computer Science + University of Michigan, Mechanical Engineering #

This work licensed under the Creative Commons Attribution-Share Alike 3.0 United States License You are free: to Share — to copy, distribute, display, and perform the work to Remix — to make derivative works Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same, similar or a compatible license. For any reuse or distribution, you must make clear to others the license terms of this work. The best way to do this is with a link to: Any of the above conditions can be waived if you get permission from the copyright holder. Apart from the remix rights granted under this license, nothing in this license impairs or restricts the author's moral rights. 2

Overview 3 Computational sprinting [HPCA’12] Targets responsiveness in thermally constrained environments Far exceed sustainable power for short bursts of computation Simulation based feasibility study This work: what can we learn with today’s hardware? Engineer hardware/software testbed for sprinting Reduce heat venting capacity Sustain only lowest power mode Can sprint on today’s system Longer with phase-change material Sprinting improves energy-efficiency Even for sustained computations

4 T max power temperature Computational Sprinting Using Dark Silicon [HPCA’12]

5 T max power temperature Effect of thermal capacitance Computational Sprinting Using Dark Silicon [HPCA’12]

6 T max power temperature Effect of thermal capacitance Computational Sprinting Using Dark Silicon [HPCA’12]

7 T max power temperature Effect of thermal capacitance Computational Sprinting Using Dark Silicon [HPCA’12]

8 T max power temperature State of the art: Turbo Boost 2.0 exceeds sustainable power with DVFS (~25%) Our goal: 10x Effect of thermal capacitance Computational Sprinting Using Dark Silicon [HPCA’12]

Evaluating Sprinting Simulation-based feasibility study [HPCA’12] Thermal models: buffer heat using thermal capacitance Electrical models: stabilize voltage with gradual core activation Architectural models: Large responsiveness improvements Little dynamic energy overheads Next steps: understanding sprinting on a real system Build a real chip? Sprint on today’s mobile chips? 9 Our approach: study sprinting on hardware available today

This Work: Testbed for Computational Sprinting How long can the testbed sprint? How to select sprint intensity? How can we extend sprint duration? How does sprinting impact energy? 10

Designing a testbed for sprinting 11

sprinting sustainable Quad-core Intel i With heatsink and fan: 95W 12 Remove heatsink, slow fan; 10W thermal design (TDP) CoresFreq.Power Normalized Power Peak Speedup 1 core1.6 GHz10 W1x 4 cores1.6 GHz20 W~2x4x 4 cores3.2 GHz50 W~5x8x 3 operating modes:

Sprinting Performance sobel disparity segment kmeans feature texture Cores + Frequency (3.2GHz): 6.3x speedup Cores only (1.6 GHz): 3.5x speedup Max 4 core, 3.2 GHz Max 4 core, 1.6 GHz Baseline (no sprint) 3.2GHz 1.6GHz 13

How long can the testbed sprint? 14

Testbed Thermal Response Power (W) time (s) Temp (°C) time (s) T max sustained

Testbed Thermal Response Power (W) time (s) Temp (°C) time (s) 16 5x 3s sustained sprint (3.2 GHz) sustained sprint (3.2 GHz) 20g copper  Δ 25 o C, ~188J  50W Heat spreader

Testbed Thermal Response Power (W) time (s) Temp (°C) time (s) 17 5x 3s s sustained sprint (3.2 GHz) sprint (1.6 GHz) sustained sprint (3.2 GHz) sprint (1.6 GHz) 2x 20g copper  Δ 25 o C, ~188J  50W Heat spreader

What if computation doesn’t complete during the sprint? 18

Truncated Sprint Performance 19 computation length normalized speedup Little benefit Near-peak performance for short tasks

Truncated Sprint Performance computation length normalized speedup 20 Little benefit Near-peak performance for short tasks Lower peak performance; benefits longer tasks Best sprint intensity depends on task size How to sprint when task size is unknown? Sprint pacing Max intensity sprint for half thermal capacitance Cores-only sprinting for other half

Truncated Sprint Performance computation length normalized speedup 21 Best sprint intensity depends on task size How to sprint when task size is unknown? Sprint pacing Max intensity sprint for half thermal capacitance Cores-only sprinting for other half

Truncated Sprint Performance 22 Best sprint intensity depends on task size How to sprint when task size is unknown? Sprint pacing Max intensity sprint for half thermal capacitance Cores-only sprinting for other half computation length normalized speedup

Increasing sprint duration 23

Two Ways of Adding Thermal Capacitance Specific heat capacity: introduce thermal mass Latent heat: absorb heat to change phase (e.g. melting) 24 temperature time (s) Phase-change absorbs heat while holding temperature constant Baseline sprinting More specific heat  takes longer to heat 20g copper, Δ 25 o C  ~188J 1g of wax  ~200J

25 Computational Sprinting on a Hardware- Software Testbed Extending Sprints with Phase Change Material 4g of wax, 1g of aluminum foam Copper shim

Impact of Phase Change temperature (°C) 26 air time (s)

temperature (°C) 27 Small extension from heat capacity of encasement… foam air time (s) Impact of Phase Change

temperature (°C) 28 Small extension from heat capacity of encasement… …6x increase in sprint duration with phase change foam wax air time (s) Impact of Phase Change

temperature (°C) 29 Small extension from heat capacity of encasement… …6x increase in sprint duration with phase change due to phase change foam air water time (s) Impact of Phase Change wax

30 Phase Change Material in Action Time lapse 15x

How does sprinting impact energy? 31

Energy Impact of Sprinting sobel disparity segment kmeans feature texture GHz 1.6GHz Race-to-idle: 7% energy savings! Sprint 3.2GHz Sprint 1.6GHz Idle

If sprinting is more energy efficient, why not sprint all the time… 33

Sprint-and-Rest seconds power (W) seconds temperature ( o C) seconds cumulative work sprint-and-rest 34 35% faster 20W sprint, 5W rest < 10W average

Conclusions Testbed confirms sprinting improves responsiveness Sprint pacing can extend benefits of sprinting Exploiting phase change allows longer sprints Sprinting can save energy in thermally limited systems 35

36