Simulation Evaluation of Hybrid SRPT Policies

Slides:

Advertisements

Similar presentations

Quantifying the Properties of SRPT Scheduling Mingwei Gong and Carey Williamson Department of Computer Science University of Calgary.

Advertisements

Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies Dong Lu* + Peter Dinda* Yi Qiao* Huanyuan Sheng* *Northwestern.

Ningning HuCarnegie Mellon University1 Optimizing Network Performance In Replicated Hosting Peter Steenkiste (CMU) with Ningning Hu (CMU), Oliver Spatscheck.

Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.

1 Size-Based Scheduling Policies with Inaccurate Scheduling Information Dong Lu *, Huanyuan Sheng +, Peter A. Dinda * * Prescience Lab, Dept. of Computer.

Computer Science Generating Streaming Access Workload for Performance Evaluation Shudong Jin 3nd Year Ph.D. Student (Advisor: Azer Bestavros)

Copyright © 2005 Department of Computer Science 1 Solving the TCP-incast Problem with Application-Level Scheduling Maxim Podlesny, University of Waterloo.

Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.

What’s the Problem Web Server 1 Web Server N Web system played an essential role in Proving and Retrieve information. Cause Overloaded Status and Longer.

PERSISTENT DROPPING: An Efficient Control of Traffic Aggregates Hani JamjoomKang G. Shin Electrical Engineering & Computer Science UNIVERSITY OF MICHIGAN,

Web Server Request Scheduling Mingwei Gong Department of Computer Science University of Calgary November 16, 2004.

July 2003SPECTS Network-Level Impacts on User-Level Web Performance Carey Williamson Nayden Markatchev University of Calgary.

The War Between Mice and Elephants Presented By Eric Wang Liang Guo and Ibrahim Matta Boston University ICNP

Maryam Elahi Fairness in Speed Scaling Design Joint work with: Carey Williamson and Philipp Woelfel.

1 Mor Harchol-Balter Carnegie Mellon University School of Computer Science.

Copyright © 2005 Department of Computer Science CPSC 641 Winter WAN Traffic Measurements There have been several studies of wide area network traffic.

October 14, 2002MASCOTS Workload Characterization in Web Caching Hierarchies Guangwei Bai Carey Williamson Department of Computer Science University.

1 Internet Protocols and Network Performance Issues Carey Williamson iCORE Professor Department of Computer Science University of Calgary.

Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?

OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.

Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies Dong Lu* + Peter Dinda* Yi Qiao* Huanyuan Sheng* *Northwestern.

Network Traffic Measurement and Modeling CSCI 780, Fall 2005.

Looking at the Server-side of P2P Systems Yi Qiao, Dong Lu, Fabian E. Bustamante and Peter A. Dinda Department of Computer Science Northwestern University.

1 Queueing Theory H Plan: –Introduce basics of Queueing Theory –Define notation and terminology used –Discuss properties of queuing models –Show examples.

1 Connection Scheduling in Web Servers Mor Harchol-Balter School of Computer Science Carnegie Mellon

Performance Evaluation

1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.

Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.

Computer Networking Lecture 17 – Queue Management As usual: Thanks to Srini Seshan and Dave Anderson.

1 CS 501 Spring 2005 CS 501: Software Engineering Lecture 22 Performance of Computer Systems.

Carnegie Mellon University Computer Science Department 1 CLASSIFYING SCHEDULING POLICIES WITH RESPECT TO HIGHER MOMENTS OF CONDITIONAL RESPONSE TIME Adam.

Wide Web Load Balancing Algorithm Design Yingfang Zhang.

OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.

1 WAN Measurements Carey Williamson Department of Computer Science University of Calgary.

Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.

Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.

Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.

Computer Architecture and Operating Systems CS 3230: Operating System Section Lecture OS-3 CPU Scheduling Department of Computer Science and Software Engineering.

1 Mor Harchol-Balter Carnegie Mellon University Computer Science Heavy Tails: Performance Models & Scheduling Disciplines.

Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.

1 Chapters 8 Overview of Queuing Analysis. Chapter 8 Overview of Queuing Analysis 2 Projected vs. Actual Response Time.

Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.

Carnegie Mellon University Computer Science Department 1 OPEN VERSUS CLOSED: A CAUTIONARY TALE Bianca Schroeder Adam Wierman Mor Harchol-Balter Computer.

Transport Layer3-1 TCP throughput r What’s the average throughout of TCP as a function of window size and RTT? m Ignore slow start r Let W be the window.

Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,

Scheduling. Main Points Scheduling policy: what to do next, when there are multiple threads ready to run – Or multiple packets to send, or web requests.

Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.

Analysis of SRPT Scheduling: Investigating Unfairness Nikhil Bansal (Joint work with Mor Harchol-Balter)

CATNIP – Context Aware Transport/Network Internet Protocol Carey Williamson Qian Wu Department of Computer Science University of Calgary.

MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.

1 CS 501 Spring 2003 CS 501: Software Engineering Lecture 23 Performance of Computer Systems.

1 Queuing Delay and Queuing Analysis. RECALL: Delays in Packet Switched (e.g. IP) Networks End-to-end delay (simplified) = End-to-end delay (simplified)

Development of a QoE Model Himadeepa Karlapudi 03/07/03.

1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.

CPU Scheduling Operating Systems CS 550. Last Time Deadlock Detection and Recovery Methods to handle deadlock – Ignore it! – Detect and Recover – Avoidance.

Looking at the Server-side of P2P Systems

B.Ramamurthy Appendix A

ECF: an MPTCP Scheduler to Manage Heterogeneous Paths

Queueing Theory Carey Williamson Department of Computer Science

Autoscaling Effects in Speed Scaling Systems

CPSC 641: WAN Measurement Carey Williamson

Computer Systems Performance Evaluation

Autoscaling Effects in Speed Scaling Systems

Javad Ghaderi, Tianxiong Ji and R. Srikant

Computer Systems Performance Evaluation

Size-Based Scheduling Policies with Inaccurate Scheduling Information

Carey Williamson Department of Computer Science University of Calgary

Carey Williamson Department of Computer Science University of Calgary

CSE 550 Computer Network Design

FAIRNESS IN QUEUES Adam Wierman Carnegie Mellon University cmu

Presentation transcript:

Simulation Evaluation of Hybrid SRPT Policies 4/16/2017 Simulation Evaluation of Hybrid SRPT Policies Mingwei Gong and Carey Williamson Department of Computer Science University of Calgary April 19, 2004

Introduction Web: large-scale, client-server system 4/16/2017 Introduction Web: large-scale, client-server system WWW: World Wide Wait! User-perceived Web response time involves: Transmission time, propagation delay in network Queueing delays at busy routers in the Internet Delays caused by TCP protocol effects (e.g., handshake, slow start, packet loss, retransmits) Queueing delays at the Web server itself, which may be servicing 100’s or 1000’s of concurrent requests Our focus in this work: Web request scheduling WORMS 2004

Example Scheduling Policies 4/16/2017 Example Scheduling Policies FCFS: First Come First Serve typical policy for single shared resource (“unfair”) e.g., drive-thru restaurant; playoff tickets PS: Processor Sharing time-sharing a resource amongst J jobs each job gets 1/J of the resources (equal, “fair”) e.g., CPU; VM; multi-tasking; Apache Web server SRPT: Shortest Remaining Processing Time pre-emptive version of Shortest Job First (SJF) give full resources to job that will complete quickest e.g., ??? (express lanes in grocery store)(almost) WORMS 2004

Research Methodology Trace-driven simulation Web server simulator 4/16/2017 Research Methodology Trace-driven simulation Input workload is empirical/synthetic trace Web server simulator Empirical trace (1 million requests, World Cup 1998) Synthetic traces (WebTraff) Probe-based sampling methodology Based on PASTA: Poisson Arrivals See Time Averages Any scheduling policy, any arrival process, any service time distribution. The research on SRPT began in the 1960’s. Previous work can be classified into three groups: First, theoretical work proved that SRPT is optimal in terms of mean response time and mean slowdown compared with other scheduling policies. Secondly, in practical work, SRPT was implemented in the Apache Web server The experimental results are consistent with theoretical work. Both works show that SRPT out-performs PS with respect to mean performance However, The SRPT policy is rarely used in practice, the main concern is the fear of penalizing the large jobs. WORMS 2004

Simulation Assumptions 4/16/2017 Simulation Assumptions User requests are for static Web content Server knows response size in advance Network bandwidth is the bottleneck All clients are in the same LAN environment Ignores variations in network bandwidth and propagation delay Fluid flow approximation: service time = response size Ignores packetization issues Ignores TCP protocol effects Ignores network effects (These are consistent with SRPT literature) WORMS 2004

Performance Metrics Slowdown: 4/16/2017 Performance Metrics Slowdown: The slowdown of a job is its observed response time divided by the ideal response time if it were the only job in the system Lower is better We consider mean slowdown as well as the variance of slowdown (complete distribution) WORMS 2004

Empirical Web Server Workload 4/16/2017 Empirical Web Server Workload 1998 WorldCup: Internet Traffic Archive: http://ita.ee.lbl.gov/ Item Value Trace Duration 861 sec Total Requests 1,000,000 Unique Documents 5,549 Total Transferred Bytes 3.3 GB Smallest Transfer Size (bytes) 4 Largest Transfer Size (bytes) 2,891,887 Median Transfer Size (bytes) 889 Mean Transfer Size (bytes) 3,498 Standard Deviation (bytes) 18,815 WORMS 2004

Probe-based Sampling Algorithm 4/16/2017 Probe-based Sampling Algorithm The algorithm is based on PASTA (Poisson Arrivals See Time Average) principle. S Slowdown (1 sample) S Repeat N times S WORMS 2004

Probe-based Sampling Algorithm 4/16/2017 Probe-based Sampling Algorithm For scheduling policy S =(PS, SRPT, FCFS, LRPT, …) do For load level U = (0.50, 0.80, 0.95) do For probe job size J = (1B, 1KB, 10KB, 1MB...) do For trial I = (1,2,3… N) do Insert probe job at randomly chosen point; Simulate Web server scheduling policy; Compute and record slowdown value observed; end of I; Plot marginal distribution of slowdown results; end of J; end of U; end of S; WORMS 2004

Slowdown Profile Plot “asymptotic convergence” “crossover region” (mystery hump) 8 PS x 1 1-p y Slowdown SRPT 1 8 Job Size WORMS 2004

Notation Details Number of jobs in the system: J Number of threads for a single server: K Number of servers in the system: M Probe jobs: 1KB, 10KB, 100KB, 1MB... Number of probes: 3000 All simulation results are for 95% load WORMS 2004

Single Server Scenario (M = 1) PS: Processor Sharing SRPT: Shortest Remaining Processing Time FSP: Fair Sojourn Protocol FSP computes the times at which jobs would complete under PS and then orders the jobs in terms of earliest PS completion times. FSP then devotes full service to the uncompleted job with the earliest PS completion time. “FSP response time dominates PS” (i.e., is never worse) E. Friedman and S. Henderson: “Fairness and Efficiency in Web Server Protocols”, Proc. ACM SIGMETRICS 2003. WORMS 2004

Mean Slowdown (M = 1) WORMS 2004

Variance of Slowdown (M = 1) WORMS 2004

A Hybrid SRPT/PS Policy for a Single Server (M = 1) Threshold-based policy, with threshold T T-SRPT Determining whether the system is "busy" or not depends on number of jobs (J) in the system. If J <= T Then use PS Else use SRPT Special cases: T = 0 is SRPT, T = is PS 8 WORMS 2004

Mean Slowdown for T-SRPT WORMS 2004

Variance of Slowdown for T-SRPT WORMS 2004

A Generalized SRPT Policy for a Multi-threaded Single Server K-SRPT Multi-threaded version of SRPT that allows up to K jobs (the K smallest RPT ones) to be in service concurrently (like PS), though with the same fixed aggregate service rate. Additional jobs (if any) in the system wait in the queue. Also preemptive, like SRPT. Let s = min (J, K) If J <= K Then J jobs each receive 1/s Else K jobs each receive 1/s (while J-K wait) Special cases: K = 1 is SRPT, K = is PS 8 WORMS 2004

Mean Slowdown for K-SRPT WORMS 2004

Variance of Slowdown for K-SRPT WORMS 2004

Multi-Server Scenario M-SRPT: Let s = M If J <= M Then J jobs each receive 1/s (M-J idle servers) Else M jobs each receive 1/s (while J-M wait) M-PS: Let s = max(M, J) Each job receives a service rate of 1/s M-FSP: Let s = M If J <= M Then J jobs (under PS) each receive 1/s Else M jobs (under PS) each receive 1/s WORMS 2004

Mean Slowdown for M-SRPT WORMS 2004

Variance of Slowdown for M-SRPT WORMS 2004

Mean Slowdown for M-PS WORMS 2004

Variance of Slowdown for M-PS WORMS 2004

Mean Slowdown for M-FSP WORMS 2004

Variance of Slowdown for M-FSP WORMS 2004

Summary Slowdown profile plots for several policies For the largest jobs, FSP better than SRPT and PS For small jobs, FSP is sometimes worse than SRPT Multi-threaded server results T-SRPT and K-SRPT provide a smooth transition between SRPT and PS, implying smoother tradeoff in fairness between small jobs and large jobs Multi-server results With more servers, mean slowdown worsens, but variance of slowdown often improves FSP does not response time dominate PS for M > 1 WORMS 2004

Thank You! Questions? For more information 4/16/2017 Thank You! Questions? For more information Email: {gongm,carey}@cpsc.ucalgary.ca WORMS 2004