A Prediction-based Approach to Distributed Interactive Applications Peter A. Dinda Department of Computer Science Northwestern University

Slides:

Advertisements

Similar presentations

Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.

Advertisements

A Prediction-based Real-time Scheduling Advisor Peter A. Dinda Carnegie Mellon University.

An Evaluation of Linear Models for Host Load Prediction Peter A. Dinda David R. O’Hallaron Carnegie Mellon University.

CSE Computer Networks Prof. Aaron Striegel Department of Computer Science & Engineering University of Notre Dame Lecture 20 – March 25, 2010.

All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.

The Case For Prediction-based Best-effort Real-time Peter A. Dinda Bruce Lowekamp Loukas F. Kallivokas David R. O’Hallaron Carnegie Mellon University.

1 Virtual Machine Resource Monitoring and Networking of Virtual Machines Ananth I. Sundararaj Department of Computer Science Northwestern University July.

Host Load Trace Replay Peter A. Dinda Thesis Seminar 11/23/98.

Automatic Run-time Adaptation in Virtual Execution Environments Ananth I. Sundararaj Advisor: Peter A. Dinda Prescience Lab Department of Computer Science.

Virtualized Audio as a Distributed Interactive Application Peter A. Dinda Northwestern University Access Grid Retreat, 1/30/01.

Dynamic Mapping of Activation Trees Thesis Proposal January 29, 1998 Peter A. Dinda Committee David O’Hallaron (chair) Thomas Gross Peter Steenkiste Jaspal.

Responsive Interactive Applications by Dynamic Mapping of Activation Trees February 20, 1998 Peter A. Dinda School of Computer.

Understanding and Predicting Host Load Peter A. Dinda Carnegie Mellon University

OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.

DDDDRRaw: A Prototype Toolkit for Distributed Real-Time Rendering on Commodity Clusters Thu D. Nguyen and Christopher Peery Department of Computer Science.

1 Yi Qiao Jason Skicewicz Peter A. Dinda Prescience Laboratory Department of Computer Science Northwestern University Evanston, IL An Empirical Study.

An Introduction to the Prescience Lab Peter A. Dinda Prescience Lab Department of Computer Science Northwestern University

Resource Signal Prediction and Its Application to Real-time Scheduling Advisors or How to Tame Variability in Distributed Systems Peter A. Dinda Carnegie.

Recent Results in Resource Signal Measurement, Dissemination, and Prediction App Transport Network Data Link Physical App Transport Network Data Link Physical.

Performance Evaluation

A Prediction-based Approach to Distributed Interactive Applications Peter A. Dinda Jason Skicewicz Dong Lu Prescience Lab Department of Computer Science.

1 The Mathematics of Signal Processing - an Innovative Approach Peter Driessen Faculty of Engineering University of Victoria.

Online Prediction of the Running Time Of Tasks Peter A. Dinda Department of Computer Science Northwestern University

Exploiting Packet Header Redundancy for Zero Cost Dissemination of Dynamic Resource Information Peter A. Dinda Prescience Lab Department of Computer Science.

An Optimization Problem in Adaptive Virtual Environments Ananth I. Sundararaj Manan Sanghi Jack R. Lange Peter A. Dinda Prescience Lab Department of Computer.

1 Dong Lu, Peter A. Dinda Prescience Laboratory Computer Science Department Northwestern University Virtualized.

Characterizing and Predicting TCP Throughput on the Wide Area Network Dong Lu, Yi Qiao, Peter Dinda, Fabian Bustamante Department of Computer Science Northwestern.

Multi-resolution Resource Behavior Queries Using Wavelets Jason Skicewicz Peter A. Dinda Jennifer M. Schopf Northwestern University.

Dv: A toolkit for building remote interactive visualization services David O’Hallaron School of Computer Science Carnegie Mellon University Martin Aeschlimann,

Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.

OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.

A Prediction-based Real-time Scheduling Advisor Peter A. Dinda Prescience Lab Department of Computer Science Northwestern University

Load Analysis and Prediction for Responsive Interactive Applications Peter A. Dinda David R. O’Hallaron Carnegie Mellon University.

Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Peter Dinda Department of Computer Science Northwestern University.

The Running Time Advisor A Resource Signal-based Approach to Predicting Task Running Time and Its Applications Peter A. Dinda Carnegie Mellon University.

Realistic CPU Workloads Through Host Load Trace Playback Peter A. Dinda David R. O’Hallaron Carnegie Mellon University.

Digital Sound and Video Chapter 10, Exploring the Digital Domain.

STRATEGIES INVOLVED IN REMOTE COMPUTATION

University of Zagreb MMVE 2012 workshop1 Towards Reinterpretation of Interaction Complexity for Load Prediction in Cloud-based MMORPGs Mirko Sužnjević,

Integrating Fine-Grained Application Adaptation with Global Adaptation for Saving Energy Vibhore Vardhan, Daniel G. Sachs, Wanghong Yuan, Albert F. Harris,

Y. Kotani · F. Ino · K. Hagihara Springer Science + Business Media B.V Reporter: 李長霖.

National Institute of Mental Health

Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.

Quality of Service Karrie Karahalios Spring 2007.

MULTIMEDIA TECHNOLOGY SMM 3001 MEDIA - VIDEO. In this chapter How digital video differs from conventional analog video How digital video differs from.

Suzhen Lin, A. Sai Sudhir, G. Manimaran Real-time Computing & Networking Laboratory Department of Electrical and Computer Engineering Iowa State University,

Copyright © 2011, Performance Evaluation of a Green Scheduling Algorithm for Energy Savings in Cloud Computing Truong Vinh Truong Duy; Sato,

*Partially funded by the Austrian Grid Project (BMBWK GZ 4003/2-VI/4c/2004) Making the Best of Your Data - Offloading Visualization Tasks onto the Grid.

Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp

Network-Aware Scheduling for Data-Parallel Jobs: Plan When You Can

Egocentric Context-Aware Programming in Ad Hoc Mobile Environments Christine Julien Gruia-Catalin Roman Mobile Computing Laboratory Department of Computer.

1 BBN Technologies Quality Objects (QuO): Adaptive Management and Control Middleware for End-to-End QoS Craig Rodrigues, Joseph P. Loyall, Richard E. Schantz.

CHAPTER 4 THE VISUALIZATION PIPELINE. CONTENTS The focus is on presenting the structure of a complete visualization application, both from a conceptual.

An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.

OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.

Marcelo R.N. Mendes. What is FINCoS? A set of tools for data generation, load submission, and performance measurement of CEP systems; Main Characteristics:

Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.

An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.

By Nitin Bahadur Gokul Nadathur Department of Computer Sciences University of Wisconsin-Madison Spring 2000.

Network Weather Service. Introduction “NWS provides accurate forecasts of dynamically changing performance characteristics from a distributed set of metacomputing.

Modelling & Simulation of Semiconductor Devices Lecture 1 & 2 Introduction to Modelling & Simulation.

OPERATING SYSTEMS CS 3502 Fall 2017

A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.

Partially Predictable

Taxonomy of network applications

Department of Computer Science Northwestern University

Partially Predictable

Smita Vijayakumar Qian Zhu Gagan Agrawal

User-driven Scheduling Of Interactive Virtual Machines

An Optimization Problem in Adaptive Virtual Environments

Presentation transcript:

A Prediction-based Approach to Distributed Interactive Applications Peter A. Dinda Department of Computer Science Northwestern University

2 Context and Question Shared, Unreserved Environments Distributed Interactive Applications Me How an distributed interactive application running on shared, unreserved computing environment provide consistent responsiveness?

3 Why Is This Interesting? Interactive resource demands exploding Tools and toys increasingly are physical simulations New kinds of resource-intensive applications Responsiveness tied to peak demand People provision according to peak demand 90% of the time CPU or network link is unused Opportunity to use the resources smarter Build more powerful, more interesting apps Shared resource pools, resource markets, The Grid… Resource reservations unlikely History argues against it, partial reservation, …

4 Approach Soft real-time model Responsiveness -> deadline Advisory, no guarantees Adaptation mechanisms Exploit DOF available in environment Prediction of resource supply and demand Control the mechanisms to benefit the application Avoid synchronization Rigorous statistical and systems approach to prediction

5 Outline Distributed interactive applications Image editing, scientific visualization, virtualized audio Real-time scheduling advisors Running time advisor Resource signals RPS system Current work

6 Image Editing Photoshop, but for art and commercial photography resolutions 30 frame/sec pixelwise 6x6cm on megapixel display ->4.3 billion ops/sec and 120 MB/s of bandwidth

7 QuakeViz: Distributed Interactive Visualization of Massive Remote Earthquake Datasets Goal Interactive manipulation of massive remote datasets from arbitrary clients Sample 2 host visualization of Northridge Earthquake

8 DV Framework For Distributed Interactive Visualization Large datasets (e.g., earthquake simulations) Distributed VTK visualization pipelines Active frames Encapsulate data, computation, path through pipeline Launched from server by user interaction Annotated with deadline Dynamically chose on which host each pipeline stage will execute and what quality settings to use

9 Example DV Pipeline for QuakeViz Active Frame n+2 ? Active Frame n+1 ? Active Frame n ? Simulation Output interpolation isosurface extraction isosurface extraction scene synthesis scene synthesis interpolation morphology reconstruction morphology reconstruction local display and user rendering reading ROI resolution contours Logical View interpolation isosurface extraction isosurface extraction scene synthesis scene synthesis Physical View deadline Which one?How complex?

10 Virtualized Audio (VA) System

11 The Forward Problem - Auralization Auralization Algorithms sound source positions room geometry/properties sound source signals Listener positions Listener signals Listener wearing Headphones (or HSS scheme) In general, all inputs are a function of time Auralization filtering must proceed in real-time Changes require that the filters be recomputed quickly

12 Exploiting a Remote Supercomputer or the Grid Which one? How complex?

13 A Universal Problem Task ? Which host should the application send the task to so that its running time is appropriate? Known resource requirements What will the running time be if I...

14 Real-time Scheduling Advisor Real-time for distributed interactive applications Assumptions Sequential tasks initiated by user actions Aperiodic arrivals Resilient deadlines (soft real-time) Compute-bound tasks Known computational requirements Best-effort semantics Recommend host where deadline is likely to be met Predict running time on that host No guarantees

15 Running Time Advisor Predicted Running Time Task ? Application notifies advisor of task’s computational requirements (nominal time) Advisor predicts running time on each host Application assigns task to most appropriate host nominal time

16 Real-time Scheduling Advisor Predicted Running Time Task deadline nominal time ? deadline Application notifies advisor of task’s computational requirements (nominal time) and its deadline Advisor acquires predicted task running times for all hosts Advisor recommends one of the hosts where the deadline can be met

17 Variability and Prediction t High Resource Availability Variability Low Prediction Error Variability Characterization of variability Exchange high resource availability variability for low prediction error variability and a characterization of that variability t resource t error ACF t Prediction

18 Task deadline nominal time ? Confidence Intervals to Characterize Variability Predicted Running Time deadline Application specifies confidence level (e.g., 95%) Running time advisor predicts running times as a confidence interval (CI) Real-time scheduling advisor chooses host where CI is less than deadline CI captures variability to the extent the application is interested in it “3 to 5 seconds with 95% confidence” 95% confidence

19 Confidence Intervals And Predictor Quality Bad Predictor No obvious choice Good Predictor Two good choices Predicted Running Time Good predictors provide smaller CIs Smaller CIs simplify scheduling decisions Predicted Running Time deadline

20 Overview of Research Results Predicting CIs is feasible Host load prediction using AR(16) models Running time estimation using host load predictions Predicting CIs is practical RPS Toolkit (inc. in CMU Remos, BBN QuO) Extremely low-overhead online system Predicting CIs is useful Performance of real-time scheduling advisor Statistically rigorous analysis and evaluation Measured performance of real system

21 Experimental Setup Environment –Alphastation 255s, Digital Unix 4.0 –Workload: host load trace playback –Prediction system on each host Tasks –Nominal time ~ U(0.1,10) seconds –Interarrival time ~ U(5,15) seconds Methodology –Predict CIs / Host recommendations –Run task and measure

22 Predicting CIs is Feasible 3000 randomized tasks Near-perfect CIs on typical hosts

23 Predicting CIs is Practical - RPS System 1-2 ms latency from measurement to prediction 2KB/sec transfer rate <2% of CPU At Appropriate Rate

24 Predicting CIs is Useful - Real-time Scheduling Advisor tasks Predicted CI < Deadline Random Host Host With Lowest Load

25 Predicting CIs is Useful - Real-time Scheduling Advisor tasks Predicted CI < Deadline Random Host Host With Lowest Load

26 Resource Signals Characteristics Easily measured, time-varying scalar quantities Strongly correlated with resource availability Periodically sampled (discrete-time signal) Examples Host load (Digital Unix 5 second load average) Network flow bandwidth and latency Leverage existing statistical signal analysis and prediction techniques Currently: Linear Time Series Analysis and Wavelets

27 RPS Toolkit Extensible toolkit for implementing resource signal prediction systems Easy “buy-in” for users C++ and sockets (no threads) Prebuilt prediction components Libraries (sensors, time series, communication) Users have bought in Incorporated in CMU Remos, BBN QuO A number of research users

28 Prototype System RPS components can be composed in other ways

29 Limitations Compute-intensive apps only Host load based Network prediction not solved Not even limits are known Poor scheduler models Poor integration of resource supply predictions Programmer supplies resource demand Application resource demand prediction is nascent and needed

30 The Holy Grail

31 Current work Wavelet-based techniques Scalable information dissemination Signal compression Network prediction Sampling theory and non-periodic sampling Nonlinear predictive models Better scheduler models Relational approach to information Proposal for Grid Forum Information Services Group Application prediction Activation trees

32 Conclusion Prediction-based approach to responsive distributed interactive applications Peter Dinda, Jason Skicewicz, Dong Lu

33 Wave Propagation Approach Captures all properties except absorption absorption adds 1st partial terms LTI simplification  2 p/  2 t =  2 p/  2 x +  2 p/  2 y +  2 p/  2 z

34 LTI Simplification Consider the system as LTI - Linear and Time-Invariant We can characterize an LTI system by its impulse response h(t) In particular, for this system there is an impulse response from each sound source i to each listener j: h(i,j,t) Then for sound sources s i (t), the output m j (t) listener j hears is m j (t) =  i  h(i,j,t) * s i (t), where * is the convolution operator

35 Design Space Can the gap between the resources and the application can be spanned? yes!

36 Linear Time Series Models (2000 sample fits, largest models in study, 30 secs ahead) Pole-zero / state-space models capture autocorrelation parsimoniously

37 Host Load Traces DEC Unix 5 second exponential average Full bandwidth captured (1 Hz sample rate) Long durations

38 Results for Host Load Host load exhibits complex behavior Strong autocorrelation, self-similarity, epochal behavior Host load is predictable 1 to 30 second timeframe Simple linear models are sufficient Recommend AR(16) or better Predictions are useful Can compute effective CIs from them Extensive statistically rigorous randomized study