Alex Shye, Berkin Ozisikyilmaz, Arindam Mallik, Gokhan Memik, Peter A. Dinda, Robert P. Dick, and Alok N. Choudhary Northwestern University, EECS International.

Slides:



Advertisements
Similar presentations
Display Power Management Policies in Practice Stephen P. Tarzia Peter A. Dinda Robert P. Dick Gokhan Memik Presented by: Andrew Hahn.
Advertisements

Arindam Mallik Jack Cosgrove Robert P. Dick Gokhan Memik Peter Dinda Northwestern University Department of Electrical Engineering and Computer Science.
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
David Chu--UC Berkeley Amol Deshpande--University of Maryland Joseph M. Hellerstein--UC Berkeley Intel Research Berkeley Wei Hong--Arched Rock Corp. Approximate.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Resource Management of Highly Configurable Tasks April 26, 2004 Jeffery P. HansenSourav Ghosh Raj RajkumarJohn P. Lehoczky Carnegie Mellon University.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
The User In Experimental Computer Systems Research Peter A. Dinda Gokhan Memik, Robert Dick Bin Lin, Arindam Mallik, Ashish Gupta, Sam Rossoff Department.
Performance Counter Based Architecture Level Power Modeling ( ) MethodologyResults Motivation & Goals Processor power is increasing.
Traffic Sensitive Active Queue Management - Mark Claypool, Robert Kinicki, Abhishek Kumar Dept. of Computer Science Worcester Polytechnic Institute Presenter.
User- and Process-Driven Dynamic Voltage and Frequency Scaling Bin Lin Arindam Mallik Peter Dinda Gokhan Memik Robert Dick Empathic Systems Project Department.
The User In Experimental Computer Systems Research Peter A. Dinda Gokhan Memik, Robert Dick Bin Lin, Arindam Mallik, Ashish Gupta, Sam Rossoff Department.
Power Reduction Through Measurement and Modeling of Users and CPUs Process-driven Voltage Scaling (PDVS) User-driven Frequency Scaling (UDFS) Bin Lin,
Bhojan Anand‡, Karthik Thirugnanam†, Jeena Sebastian‡, Pravein G. Kannan‡, Akhihebbal L. Ananda‡, Mun Choon Chan‡ and Rajesh Krishna Balan† ‡ National.
Diversification and Portfolio Analysis Investments and Portfolio Management MB 72.
Adaptive Video Coding to Reduce Energy on General Purpose Processors Daniel Grobe Sachs, Sarita Adve, Douglas L. Jones University of Illinois at Urbana-Champaign.
University of Karlsruhe, System Architecture Group Balancing Power Consumption in Multiprocessor Systems Andreas Merkel Frank Bellosa System Architecture.
Into the Wild: Studying Real User Activity Patterns to Guide Power Optimizations for Mobile Architectures Alex Shye, Benjamin Scholbrock, and Gokhan Memik.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
ASK Labs Company We help machines make decisions.
Neighborhood Operations
Scaling and Packing on a Chip Multiprocessor Vincent W. Freeh Tyler K. Bletsch Freeman L. Rawson, III Austin Research Laboratory.
Crowdsourcing Predictors of Behavioral Outcomes. Abstract Generating models from large data sets—and deter¬mining which subsets of data to mine—is becoming.
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
Portfolio Management-Learning Objective
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter 7.
Correlation. In this lesson you will cover: How to measure and interpret correlation About the effects of scaling data on correlation.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
International Symposium on Low Power Electronics and Design NoC Frequency Scaling with Flexible- Pipeline Routers Pingqiang Zhou, Jieming Yin, Antonia.
Integrating Fine-Grained Application Adaptation with Global Adaptation for Saving Energy Vibhore Vardhan, Daniel G. Sachs, Wanghong Yuan, Albert F. Harris,
Computer Science Department University of Pittsburgh 1 Evaluating a DVS Scheme for Real-Time Embedded Systems Ruibin Xu, Daniel Mossé and Rami Melhem.
1 Overview 1.Motivation (Kevin) 1.5 hrs 2.Thermal issues (Kevin) 3.Power modeling (David) Thermal management (David) hrs 5.Optimal DTM (Lev).5 hrs.
Profile Driven Component Placement for Cluster-based Online Services Christopher Stewart (University of Rochester) Kai Shen (University of Rochester) Sandhya.
EmNet: Satisfying The Individual User Through Empathic Home Networks J. Scott Miller, John R. Lange & Peter A. Dinda Department of Electrical Engineering.
Games are Up for DVFS Yan Gu Samarjit Chakraborty Wei Tsang Ooi Department of Computer Science National University of Singapore.
Towards reducing total energy consumption while constraining core temperatures Osman Sarood and Laxmikant Kale Parallel Programming Lab (PPL) University.
Temperature Aware Load Balancing For Parallel Applications Osman Sarood Parallel Programming Lab (PPL) University of Illinois Urbana Champaign.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Abhilash Thekkilakattil, Radu Dobrin, Sasikumar Punnekkat Mälardalen Real-time Research Center, Mälardalen University Västerås, Sweden Towards Preemption.
Power-Aware Scheduling of Virtual Machines in DVFS-enabled Clusters
RIDA: A Robust Information-Driven Data Compression Architecture for Irregular Wireless Sensor Networks Nirupama Bulusu (joint work with Thanh Dang, Wu-chi.
Computer Science Adaptive, Transparent Frequency and Voltage Scaling of Communication Phases in MPI Programs Min Yeol Lim Computer Science Department Sep.
Power-Aware Parallel Job Scheduling
Active Sampling for Accelerated Learning of Performance Models Piyush Shivam, Shivnath Babu, Jeff Chase Duke University.
Dynamic Voltage Frequency Scaling for Multi-tasking Systems Using Online Learning Gaurav DhimanTajana Simunic Rosing Department of Computer Science and.
Preventing Private Information Inference Attacks on Social Networks.
Lev Finkelstein ISCA/Thermal Workshop 6/ Overview 1.Motivation (Kevin) 2.Thermal issues (Kevin) 3.Power modeling (David) 4.Thermal management (David)
1 Hidra: History Based Dynamic Resource Allocation For Server Clusters Jayanth Gummaraju 1 and Yoshio Turner 2 1 Stanford University, CA, USA 2 Hewlett-Packard.
Multimedia Computing and Networking Jan Reduced Energy Decoding of MPEG Streams Malena Mesarina, HP Labs/UCLA CS Dept Yoshio Turner, HP Labs.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
An Integrated GPU Power and Performance Model (ISCA’10, June 19–23, 2010, Saint-Malo, France. International Symposium on Computer Architecture)
CHEE825 Fall 2005J. McLellan1 Nonlinear Empirical Models.
July 23, BSA, a Fast and Accurate Spike Train Encoding Scheme Benjamin Schrauwen.
Best detection scheme achieves 100% hit detection with
Accurate WiFi Packet Delivery Rate Estimation and Applications Owais Khan and Lili Qiu. The University of Texas at Austin 1 Infocom 2016, San Francisco.
Chapter 11 – Neural Nets © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
Popularity Prediction of Facebook Videos for Higher Quality Streaming
Overview Motivation (Kevin) Thermal issues (Kevin)
Pavlos Petoumenos, Hugh Leather, Björn Franke
Temperature and Power Management
Jacob R. Lorch Microsoft Research
Empathic Computer Architectures and Systems
Frequency Governors for Cloud Database OLTP Workloads
International Symposium on Microarchitecture. New York, NY.
Alex Shye, Yan Pan, Ben Scholbrock, J. Scott Miller,
Empathic Computer Architectures and Systems
Learning and Leveraging the Relationship between Architecture-Level Measurements and Individual User Satisfaction Alex Shye, Berkin Ozisikyilmaz, Arindam.
Power improvement in the multitasking environment
8-3 RRAM Based Convolutional Neural Networks for High Accuracy Pattern Recognition and Online Learning Tasks Z. Dong, Z. Zhou, Z.F. Li, C. Liu, Y.N. Jiang,
Volume 66, Issue 6, Pages (June 2010)
Presentation transcript:

Alex Shye, Berkin Ozisikyilmaz, Arindam Mallik, Gokhan Memik, Peter A. Dinda, Robert P. Dick, and Alok N. Choudhary Northwestern University, EECS International Symposium on Computer Architecture, June Beijing, China.

Findings/Contributions 1. User satisfaction is correlated to CPU performance 2. User satisfaction is non-linear, application-dependent, and user- dependent 1. We can use hardware performance counters to learn and leverage user satisfaction to optimize power consumption while maintaining satisfaction Claim: Any optimization ultimately exists to satisfy the end user Claim: Current architectures largely ignore the individual user

2 Architectural trade-offs exposed to the user 1 User-centric applications 3 Optimization opportunity User variation = optimization potential

User Satisfaction Your favorite metric (IPS, throughput, etc.) ??

Performance Level ??

Leverage knowledge for optimization Leverage knowledge for optimization Learn relationship between user satisfaction and hardware performance Learn relationship between user satisfaction and hardware performance

 Hardware performance counters are supported on all modern processors  Low overhead  Non-intrusive  WinPAPI interface; 100Hz  For each HPC:  Maximum  Minimum  Standard deviation  Range  Average

 IBM Thinkpad T43p  Pentium M with Intel Speedstep  Supports 6 Frequencies (2.2Ghz Mhz)  Two user studies:  20 users each  First to learn about user satisfaction  Second to show we can leverage user satisfaction  Three multimedia/interactive applications:  Java game: A first-person-shooter tank game  Shockwave: A 3D shockwave animation  Video: DVD-quality MPEG video

 Goal:  Learn relationship between HPCs and user satisfaction  How:  Randomly change performance/frequency  Collect HPCs  Ask the user for their satisfaction rating!

 Compare each set of HPC values with user satisfaction ratings  Collected 360 satisfaction levels (20 users, 6 frequencies, 3 applications)  45 metrics per satisfaction level  Pearson’s Product Moment Correlation Coefficient ( r )  -1: negative linear correlation, 1: positive linear correlation  Strong correlation: 21 of 45 metrics over.7 r value

 Combine all user data  Fit into a neural network  Inputs: HPCs and user ID  Output: User satisfaction  Observe relative importance factor  User more than two times more important than the second- most important factor  User satisfaction is highly user-specific ! HPCs User ID User Satisfaction

 User satisfaction is often non-linear  User satisfaction is application-specific  Most importantly, user satisfaction is user-specific

 Observations:  User satisfaction is non-linear  User satisfaction is application dependent  User satisfaction is user dependent  All three represent optimization potential!  Based on observations, we construct Individualized DVFS ( i DVFS)  Dynamic voltage and frequency scaling (DVFS) effective for improving power consumption  Common DVFS schemes (i.e., Windows XP DVFS, Linux ondemand governor) are based on CPU-utilization

User-aware performance prediction model Predictive user-aware Dynamic Frequency Scaling Building correlation network based on counters stats and user feedback Learning/Modeling Stage Runtime Power Management Hardware counter states User Satisfaction Feedback

 Train per-user and per-application  Small training set!  Two modifications to neural network training ▪ Limit inputs (used two highest correlation HPCs) ▪ BTAC_M-average and TOT_CYC-average ▪ Repeated trainings using most accurate NN HPCsUser Satisfaction

 ρ: user satisfaction tradeoff threshold  α f : per frequency threshold  M: maximum user satisfaction  Greedy approach  Make prediction every 500ms  If within user satisfaction within α f ρ of M twice in a row, decrease frequency  If not, increase frequency and is α f decreased to prevent ping-ponging between frequency

 Goal:  Evaluate i DVFS with real users  How:  Users randomly use application with i DVFS and with Windows XP DVFS  Afterwards, users asked to rate each one  Frequency logs maintained through experiments ▪ Replayed through National Instruments DAQ for system power

 i DVFS can scale frequency effectively based upon user satisfaction  In this case, we slightly decrease power compared to Windows DVFS

 i DVFS significantly improves power consumption  Here, CPU utilization not equal to user satisfaction

 No change in user satisfaction, significant power savings

 Same user satisfaction, same power savings  Red: Users gave high ratings to lower frequencies  Dashed Black: Neural network bad

 Lowered user satisfaction, improved power  Blue: Gave constant ratings during training

 Slight increase in ESP  Benefits in energy reduction outweigh loss in user satisfaction with ESP

 We explore user satisfaction relative to actual hardware performance  Show correlation from HPCs to user satisfaction for interactive applications  Show that user satisfaction is generally non-linear, application-, and user-specific  Demonstrate an example for leveraging user satisfaction to improve power consumption over 25%

 Questions?  For more information, please visit: 