Evaluating the Impact of Job Scheduling and Power Management on Processor Lifetime for Chip Multiprocessors (SIGMETRICS 2009) Authors: Ayse K. Coskun,

Slides:



Advertisements
Similar presentations
Real Time Scheduling.
Advertisements

Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures Pree Thiengburanathum Advanced computer architecture Oct 24,
Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors Abhishek Bhattacharjee Margaret Martonosi.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
VARIUS: A Model of Process Variation and Resulting Timing Errors for Microarchitects Sarangi et al Prateeksha Satyamoorthy CS
Scheduling Algorithms for Unpredictably Heterogeneous CMP Architectures J. Winter and D. Albonesi, Cornell University International Conference on Dependable.
Ensuring Robustness via Early- Stage Formal Verification Multicore Power Management: Anita Lungu *, Pradip Bose **, Daniel Sorin *, Steven German **, Geert.
Introduction and Background  Power: A Critical Dimension for Embedded Systems  Dynamic power dominates; static /leakage power increases faster  Common.
International Symposium on Low Power Electronics and Design Dynamic Workload Characterization for Power Efficient Scheduling on CMP Systems 1 Gaurav Dhiman,
Techniques for Multicore Thermal Management Field Cady, Bin Fu and Kai Ren.
A Cyber-Physical Systems Approach to Energy Management in Data Centers Presented by Chen He Adopted form the paper authors.
Neither More Nor Less: Optimizing Thread-level Parallelism for GPGPUs Onur Kayıran, Adwait Jog, Mahmut Kandemir, Chita R. Das.
IT Systems Multiprocessor System EN230-1 Justin Champion C208 –
Institute of Networking and Multimedia, National Taiwan University, Jun-14, 2014.
SBSE Course 3. EA applications to SE Analysis Design Implementation Testing Reference: Evolutionary Computing in Search-Based Software Engineering Leo.
On Modeling the Lifetime Reliability of Homogeneous Manycore Systems Lin Huang and Qiang Xu CUhk REliable computing laboratory (CURE) The Chinese University.
Lifetime Reliability-Aware Task Allocation and Scheduling for MPSoC Platforms Lin Huang, Feng Yuan and Qiang Xu Reliable Computing Laboratory Department.
Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,
Submission doc.: IEEE /1452r0 November 2014 Leif Wilhelmsson, EricssonSlide 1 Frequency selective scheduling in OFDMA Date: Authors:
ECE 510 Brendan Crowley Paper Review October 31, 2006.
University of Michigan Electrical Engineering and Computer Science 1 Online Timing Analysis for Wearout Detection Jason Blome, Shuguang Feng, Shantanu.
Energy Model for Multiprocess Applications Texas Tech University.
Principles of Time Scales
Elements of the Heuristic Approach
Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group.
University of Karlsruhe, System Architecture Group Balancing Power Consumption in Multiprocessor Systems Andreas Merkel Frank Bellosa System Architecture.
The Impact of Performance Asymmetry in Multicore Architectures Saisanthosh Ravi Michael Konrad Balakrishnan Rajwar Upton Lai UW-Madison and, Intel Corp.
Authors: Tong Li, Dan Baumberger, David A. Koufaty, and Scott Hahn [Systems Technology Lab, Intel Corporation] Source: 2007 ACM/IEEE conference on Supercomputing.
A Bandwidth-aware Memory-subsystem Resource Management using Non-invasive Resource Profilers for Large CMP Systems Dimitris Kaseridis, Jeffery Stuecheli,
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
Low-Power Wireless Sensor Networks
Dynamic Slack Reclamation with Procrastination Scheduling in Real- Time Embedded Systems Paper by Ravindra R. Jejurikar and Rajesh Gupta Presentation by.
Temperature Aware Load Balancing For Parallel Applications Osman Sarood Parallel Programming Lab (PPL) University of Illinois Urbana Champaign.
1 CS/EE 6810: Computer Architecture Class format:  Most lectures on YouTube *BEFORE* class  Use class time for discussions, clarifications, problem-solving,
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
1 Reducing Queue Lock Pessimism in Multiprocessor Schedulability Analysis Yang Chang, Robert Davis and Andy Wellings Real-time Systems Research Group University.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
Progress Report 2014/02/12. Previous in IPDPS’14 Energy-efficient task scheduling on per- core DVFS architecture ◦ Batch mode  Tasks with arrival time.
1 Process Scheduling in Multiprocessor and Multithreaded Systems Matt Davis CS5354/7/2003.
Thermal-aware Issues in Computers IMPACT Lab. Part A Overview of Thermal-related Technologies.
Prepare by : Ihab shahtout.  Overview  To give an overview of fixed priority schedule  Scheduling and Fixed Priority Scheduling.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Dynamic Voltage Frequency Scaling for Multi-tasking Systems Using Online Learning Gaurav DhimanTajana Simunic Rosing Department of Computer Science and.
Qiang XU CUhk REliable computing laboratory (CURE)
Green Computing Metrics: Power, Temperature, CO2, … Computing system: Many-cores, Clusters, Grids and Clouds Algorithm and model: task scheduling, CFD.
DTM and Reliability High temperature greatly degrades reliability
Thermal-aware Phase-based Tuning of Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable Computing This work was supported.
Outline Models Design of experiments Current Scheduler Completely Fair Scheduler(CFS) ◦ Since Linux ◦ /kernel/sched.c ◦ Maintain balance (fairness)
Computer Science and Engineering Power-Performance Considerations of Parallel Computing on Chip Multiprocessors Jian Li and Jose F. Martinez ACM Transactions.
University of Rostock Institute of Applied Microelectronics and Computer Engineering Monitoring and Control of Temperature in Networks-on- Chip Tim Wegner,
Exploiting Unbalanced Thread Scheduling for Energy and Performance on a CMP of SMT Processors Authors: Matthew DeVuyst, Rakesh Kumar, and Dean M. Tullsen.
Shouqing Hao Institute of Computing Technology, Chinese Academy of Sciences Processes Scheduling on Heterogeneous Multi-core Architecture.
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Jamie Unger-Fink John David Eriksen.  Allocation and Scheduling Problem  Better MPSoC optimization tool needed  IP and CP alone not good enough  Communication.
ECE692 Course Project Proposal Cache-aware power management for multi-core real-time systems Xing Fu Khairul Kabir 16 September 2009.
Best detection scheme achieves 100% hit detection with
Taniya Siddiqua, Paul Lee University of Virginia, Charlottesville.
S OFTWARE T HERMAL M ANAGEMENT OF DRAM M EMORY FOR M ULTICORE S YSTEMS (SIGMETRICS 2008) Authors: Jiang Lin, Hongzhong Zheng, Zichun Zhu, Eugene Gorbatov,
-1- UC San Diego / VLSI CAD Laboratory Optimal Reliability-Constrained Overdrive Frequency Selection in Multicore Systems Andrew B. Kahng and Siddhartha.
CS203 – Advanced Computer Architecture
The CRISP Performance Model for Dynamic Voltage and Frequency Scaling in a GPGPU Rajib Nath, Dean Tullsen 1 Micro 2015.
CS203 – Advanced Computer Architecture
Temperature and Power Management
“Temperature-Aware Task Scheduling for Multicore Processors”
Ching-Chi Lin Institute of Information Science, Academia Sinica
Morgan Kaufmann Publishers
Chapter 6: CPU Scheduling
Jacob R. Lorch and Alan Jay Smith University of California, Berkeley
Presentation transcript:

Evaluating the Impact of Job Scheduling and Power Management on Processor Lifetime for Chip Multiprocessors (SIGMETRICS 2009) Authors: Ayse K. Coskun, Richard Strong, Dean M. Tullsen, and Tajana Simunic Rosing Presenter: Daniel Cole

Overview Effect of Thermal Management on a chip multi-processor’s lifetime (MTTF) using simulations Focusing on Thermal Reliability, critical factors: ◦ Asymmetric thermal characteristics of the cores, inner cores have very different properties ◦ Frequency of job migration can inhibit sleep states and cause thermal cycling Provides polices that can decrease the failure rate by a factor of 2 with a performance cost of < 4%

Reliability High temperature does cause failures, but some failures, such as those caused by fatigue, do not occur because of high temperature per se, but rather by thermal cycling (a common materials problem)

Models Power and Thermal essentially use existing applications/models Temperature Induced Reliability: ◦ Electromigration and Time dependent dielectric breakdown (TDDB) are of the form: C_1*e^(-C_2/T) ◦ Thermal Cycling: failure rate ≈ C*(∆T)^(-q)*f  ∆T = temperature cycling range  f = frequency of thermal cycles ◦ Failure rates are combined using an existing sum-of-failure rates model ◦ Average MTTF is computed using the average failure rate throughout the simulation ◦ System dependent constants are estimated by using the fact that the three forms of temperature induced reliability contribute equal weight to the overall failure rate at nominal temperatures

Side Note Thermal Gradients: Temperature differences between adjacent locations on the die Not included because although they can induce hard errors, they primarily cause device latency (increase in timing errors)

Reliability Aware Scheduling Stop_Go: Core Gating (at thermal threshold) Thread migration: ◦ Migration: send jobs on cores exceeding the thermal threshold to cooler cores (swap if cool core has a job) ◦ Balance: Jobs with highest instructions per second assigned to currently coolest core (every scheduling interval) ◦ Balance_Location: Highest instructions per second to outer cores ◦ Heuristics performed poorly (too much movement) DVFS: ◦ Threshold ◦ Location: fixed 85% max on 4 inner cores, 100% on 4 corners, rest 95% ◦ Performance: Scale down memory-bound tasks all the time ◦ Performance + Threshold Turn off idle cores Combination

Floor Plan

Balance Location Job Assignments

Full Utilization

Partial Utilization

Initial Idle Core Locations Paper claims “it is critical to combine a conservative migration technique…with DVFS techniques.” (end of 6.3) However, it only uses dvfs_perf_t as an example of how bad initial job allocation hurts DVFS for MTTF According to previous results, dvfs_perf_t is not best DVFS at handling MTTF, in fact taken alone, location_dvfs always beats it

Turning Idle Cores Off

Points Is location_dvfs enough? Does balance_loc + location_dvfs really offer enough improvement to be more than noise? (overall) Algorithmic model for MTTF using thermal cycling (single core, multicore) Algorithmic model for temperature gradients inducing device latencies in multicore processors (not considered in paper’s model) Hottest cores are determined mostly by location, not jobs