1 KPC-Toolbox Demonstration Eddy Zheng Zhang, Giuliano Casale, Evgenia Smirni Computer Science Department College of William & Mary.

Slides:



Advertisements
Similar presentations
Autocorrelation Functions and ARIMA Modelling
Advertisements

Bound Analysis of Closed Queueing Networks with Workload Burstiness Giuliano Casale Ningfang Mi Evgenia Smirni College.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Chapter 6 (p153) Predicting Future Performance Criterion-Related Validation – Kind of relation between X and Y (regression) – Degree of relation (validity.
Ch11 Curve Fitting Dr. Deshi Ye
A Mathematica ® based regression analysis program Analisys … A Curve Fitting Application.
Chapter 2: Lasso for linear models
Forecasting JY Le Boudec 1. Contents 1.What is forecasting ? 2.Linear Regression 3.Avoiding Overfitting 4.Differencing 5.ARMA models 6.Sparse ARMA models.
Simulation Where real stuff starts. ToC 1.What, transience, stationarity 2.How, discrete event, recurrence 3.Accuracy of output 4.Monte Carlo 5.Random.
Non-Linear Statistical Static Timing Analysis for Non-Gaussian Variation Sources Lerong Cheng 1, Jinjun Xiong 2, and Prof. Lei He 1 1 EE Department, UCLA.
1 Giuliano Casale, Eddy Z. Zhang, Evgenia Smirni {casale, eddy, Speaker: Giuliano Casale Numerical Methods for Structured Markov Chains,
Statistical analysis and modeling of neural data Lecture 6 Bijan Pesaran 24 Sept, 2007.
Introduction to Econometrics The Statistical Analysis of Economic (and related) Data.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Performance Evaluation
Copyright © 2005 Department of Computer Science CPSC 641 Winter LAN Traffic Measurements Some of the first network traffic measurement papers were.
1 MF-852 Financial Econometrics Lecture 6 Linear Regression I Roy J. Epstein Fall 2003.
01/22/09ICDCS20061 Load Unbalancing to Improve Performance under Autocorrelated Traffic Ningfang Mi College of William and Mary Joint work with Qi Zhang.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Business Statistics - QBM117 Least squares regression.
Relationships Among Variables
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
1 Chapters 9 Self-SimilarTraffic. Chapter 9 – Self-Similar Traffic 2 Introduction- Motivation Validity of the queuing models we have studied depends on.
Panel Topic: After Long Range Dependency (LRD) discoveries, what are the lessons learned so far to provide QoS for Internet advanced applications David.
Exploring the direct rebound effect: systematic relationships between model robustness and coefficient estimates Lee Stapleton, Steve Sorrell, Tim Schwanen.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Introduction to regression 3D. Interpretation, interpolation, and extrapolation.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Syntellect Outbound Communicator. Copyright© 2008 Syntellect Inc. All Rights Reserved 2 Agenda Syntellect Outbound Communicator Overview What is Predictive.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
PROCESS MODELLING AND MODEL ANALYSIS © CAPE Centre, The University of Queensland Hungarian Academy of Sciences Statistical Model Calibration and Validation.
Analysis of sapflow measurements of Larch trees within the inner alpine dry Inn- valley PhD student: Marco Leo Advanced Statistics WS 2010/11.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Injecting Realistic Burstiness to.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.
Chapter 3 System Performance and Models Introduction A system is the part of the real world under study. Composed of a set of entities interacting.
Analyzing wireless sensor network data under suppression and failure in transmission Alan E. Gelfand Institute of Statistics and Decision Sciences Duke.
STA302: Regression Analysis. Statistics Objective: To draw reasonable conclusions from noisy numerical data Entry point: Study relationships between variables.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VI System-level Performance Models for the Web (Book, Chapter 8)
AMH001 (acmse03.ppt - 03/7/03) REMOTE++: A Script for Automatic Remote Distribution of Programs on Windows Computers Ashley Hopkins Department of Computer.
© Geodise Project, University of Southampton, Integrating Data Management into Engineering Applications Zhuoan Jiao, Jasmin.
Dr. Anis Koubâa CS433 Modeling and Simulation
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
CISC Machine Learning for Solving Systems Problems Microarchitecture Design Space Exploration Lecture 4 John Cavazos Dept of Computer & Information.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Unit 3 Section : Regression  Regression – statistical method used to describe the nature of the relationship between variables.  Positive.
Correlogram - ACF. Modeling for Forecast Forecast Data The Base Model Linear Trend Logistic Growth Others Models Look for a best approximation of the.
December 1997 Circuit Analysis Examples 걼 Hairpin Edge Coupled Filter 걼 Bipolar Amplifier 걼 Statistical Analysis and Design Centering 걼 Frequency Doubler.
Analysis of financial data Anders Lundquist Spring 2010.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Economics 173 Business Statistics Lecture 28 © Fall 2001, Professor J. Petry
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
1 Autocorrelation in Time Series data KNN Ch. 12 (pp )
OPERATING SYSTEMS CS 3502 Fall 2017
Correlation and its Applications
Chapter 10 Verification and Validation of Simulation Models
S519: Evaluation of Information Systems
Linear Regression.
Building a Database on S3
Tabulations and Statistics
Prediction of new observations
Introduction to Econometrics
Presented by Chun Zhang 2/14/2003
7.1 Draw Scatter Plots & Best-Fitting Lines
REMOTE++: A Tool for Automatic Remote
CH2 Time series.
Uncertainty Propagation
Presentation transcript:

1 KPC-Toolbox Demonstration Eddy Zheng Zhang, Giuliano Casale, Evgenia Smirni Computer Science Department College of William & Mary

2 What is KPC-Toolbox for?  KPC-Toolbox: MATLAB toolbox Workload Traces  Markovian Arrival Process (MAP)  Why MAP? Very versatile High variabilitytemporal dependence Time Series High variability & temporal dependence in Time Series Easily incorporated into queuing models  Friendly Interface Departure from previous Markovian fitting tools Fit the automatically (no manual tuning)

3 User Interface  Requirement: Matlab installed  Input A trace of inter-event times Or a file that already stores the statistics of the trace  E.g., a file stores the moments, autocorrelations and etc  Help Information Type “ help FunctionName ”,  E.g., “ help map_kpcfit ” Website Keeps Up-To-Date Tool version 

4 A Simple Example of MAP‏  Two state jumps b a c d D1 = D0 = -b-d -a-c Time: a b c d I1 I2 I3 Background Jumps Jumps With Arrivals Arrivals:

5 Challenges  How large is the MAP? MAP(n): determine n?  Which trace descriptors are important? Literature: Moments of interval times, lag-1 autocorrelation long range dependent But, for long range dependent traces? temporal dependence  Need temporal dependence descriptors  MAP Parameterization Construct MAP(n) with matrices D0 and D1 (2n 2 – n entries)

6 Example: Important Trace Statistics 1 2 First, second, third moment and lag-1 autocorrelation accurately fit The queuing prediction ability is not satisfactory! Seagate Web Server Trace Queue Prediction, 80% Utilization Fit With MAP(2)

7 Example: Higher Order Statistics Matter Much Better Results! Queuing Prediction, 80% Utilization ……… Fit with MAP(16) A grid of joint moments and a sequence of autocorrelations fitted, E[X i X i+k X i+k+h ] Seagate Web Server Trace

8  Higher Order Correlations V.S. Moments Correlations capture sequence in the time series Correlations are very important  Summary: first three moments Matching up to the first three moments is sufficient higher order correlations Matching higher order correlations with priority Fitting Guidelines Ref: "KPC-Toolbox: Simple Yet Effective Trace Fitting Using Markovian Arrival Processes", G. Casale, E.Z. Zhang, E. Smirni, to appear in QEST ’ 08

9 Challenge (1): Determine MAP Size  Definition: k ACF coefficient  lag-k ACF coefficient  MAP(n) Property: n Linear Recursive Relationship of n consecutive ACF coeffs  BIC Size Selection: Linear regression model on estimated ACF coeffs BIC value assesses goodness of model size MAP(8) MAP(16) MAP(32)

10 Challenge (2): Trace Descriptor Matching  Kronecker Product Composition (KPC)  KPC Properties: Composition of Statistics Moments are composed from moments of small MAPs  MAP Parameterization by KPC to Match Mean and SCV Exactly Higher order correlations as Close as Possible

11 KPC Tool Overview Trace Extract Statistics Moments ACF Correlations …… Size Selection MAP(2) …… J = log 2 N MAP(2)s MAP(N) Size of MAP N Optimization KPC This work is supported by NSF grants ITR and CNS

12 Thank you!

13  What are higher order correlations? Joint moments of a sequence of inter-arrival times in the time series  Which higher order correlations to fit in KPC? E[X i X i+j X i+j+k ], where i can be arbitrary without loss of generality, and [j,k] chose from a grid of values  E.g., [ ] × [ ] = {[10,10], [10,100], [10,10000], …} Appendix