1 Size-Based Scheduling Policies with Inaccurate Scheduling Information Dong Lu *, Huanyuan Sheng +, Peter A. Dinda * * Prescience Lab, Dept. of Computer.

Slides:



Advertisements
Similar presentations
Parallel File System Simulator In order to test the Parallel File System (PFS) scheduling algorithms in a light-weighted approach, we have developed the.
Advertisements

Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies Dong Lu* + Peter Dinda* Yi Qiao* Huanyuan Sheng* *Northwestern.
1 Scoped and Approximate Queries in a Relational Grid Information Service Dong Lu, Peter A. Dinda, Jason A. Skicewicz Prescience Lab, Dept. of Computer.
Winter 2004 UCSC CMPE252B1 CMPE 257: Wireless and Mobile Networking SET 3f: Medium Access Control Protocols.
Nondeterministic Queries in a Relational Grid Information Service Peter A. Dinda Dong Lu Prescience Lab Department of Computer Science Northwestern University.
EPIDEMIC DENSITY ADAPTIVE DATA DISSEMINATION EXPLOITING OPPOSITE LANE IN VANETS Irem Nizamoglu Computer Science & Engineering.
1 Size-Based Scheduling Policies with Inaccurate Scheduling Information Dong Lu *, Huanyuan Sheng +, Peter A. Dinda * * Prescience Lab, Dept. of Computer.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
Simulation Evaluation of Hybrid SRPT Policies
Web Server Request Scheduling Mingwei Gong Department of Computer Science University of Calgary November 16, 2004.
Efficient and Flexible Parallel Retrieval using Priority Encoded Transmission(2004) CMPT 886 Represented By: Lilong Shi.
Maryam Elahi Fairness in Speed Scaling Design Joint work with: Carey Williamson and Philipp Woelfel.
Automatic Run-time Adaptation in Virtual Execution Environments Ananth I. Sundararaj Advisor: Peter A. Dinda Prescience Lab Department of Computer Science.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
1 Modeling and Taming Parallel TCP on the Wide Area Network Dong Lu,Yi Qiao Peter Dinda, Fabian Bustamante Department of Computer Science Northwestern.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
1 Components of a Scalable Distributed Relational Information Service Dong Lu June 14, 2005.
Efficient Hop ID based Routing for Sparse Ad Hoc Networks Yao Zhao 1, Bo Li 2, Qian Zhang 2, Yan Chen 1, Wenwu Zhu 3 1 Lab for Internet & Security Technology,
Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies Dong Lu* + Peter Dinda* Yi Qiao* Huanyuan Sheng* *Northwestern.
1 Yi Qiao Jason Skicewicz Peter A. Dinda Prescience Laboratory Department of Computer Science Northwestern University Evanston, IL An Empirical Study.
Looking at the Server-side of P2P Systems Yi Qiao, Dong Lu, Fabian E. Bustamante and Peter A. Dinda Department of Computer Science Northwestern University.
1 Dong Lu, Peter A. Dinda Prescience Laboratory Department of Computer Science Northwestern University Evanston, IL GridG: Synthesizing Realistic.
1 Connection Scheduling in Web Servers Mor Harchol-Balter School of Computer Science Carnegie Mellon
Recent Results in Resource Signal Measurement, Dissemination, and Prediction App Transport Network Data Link Physical App Transport Network Data Link Physical.
Internet Cache Pollution Attacks and Countermeasures Yan Gao, Leiwen Deng, Aleksandar Kuzmanovic, and Yan Chen Electrical Engineering and Computer Science.
Hardness of Approximation and Greedy Algorithms for the Adaptation Problem in Virtual Environments Ananth I. Sundararaj, Manan Sanghi, John R. Lange and.
Online Prediction of the Running Time Of Tasks Peter A. Dinda Department of Computer Science Northwestern University
1 Dong Lu, Peter A. Dinda Prescience Laboratory Computer Science Department Northwestern University Virtualized.
Characterizing and Predicting TCP Throughput on the Wide Area Network Dong Lu, Yi Qiao, Peter Dinda, Fabian Bustamante Department of Computer Science Northwestern.
A Prediction-based Real-time Scheduling Advisor Peter A. Dinda Prescience Lab Department of Computer Science Northwestern University
Peer-to-peer file-sharing over mobile ad hoc networks Gang Ding and Bharat Bhargava Department of Computer Sciences Purdue University Pervasive Computing.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
TECHNIQUES FOR OPTIMIZING THE QUERY PERFORMANCE OF DISTRIBUTED XML DATABASE - NAHID NEGAR.
Department of Computer Science Southern Illinois University Edwardsville Dr. Hiroshi Fujinoki and Kiran Gollamudi {hfujino,
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
Multi-level Hashing for Peer-to-Peer System in Wireless Ad Hoc Environment Dewan Tanvir Ahmed and Shervin Shirmohammadi Distributed & Collaborative Virtual.
Freshness-Aware Scheduling of Continuous Queries in the Dynamic Web Mohamed A. Sharaf Alexandros Labrinidis Panos K. Chrysanthis Kirk Pruhs Advanced Data.
Fuzzy Trust Recommendation Based on Collaborative Filtering for Mobile Ad-hoc Networks Junhai Luo 1,2, Xue Liu 1, Yi Zhang 3,Danxia Ye 2,Zhong Xu 1 1 McGill.
Computer Science Informed Content Delivery Across Adaptive Overlay Networks Overlay networks have emerged as a powerful and highly flexible method for.
1 BitHoc: BitTorrent for wireless ad hoc networks Jointly with: Chadi Barakat Jayeoung Choi Anwar Al Hamra Thierry Turletti EPI PLANETE 28/02/2008 MAESTRO/PLANETE.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
Meta Scheduling Sathish Vadhiyar Sources/Credits/Taken from: Papers listed in “References” slide.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Department of Electrical and Computer Engineering University of Massachusetts, Amherst Xin Huang and Tilman Wolf A Methodology.
1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan.
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
A Case for a Mobility Based Admission Control Policy Shahram Ghandeharizadeh 1, Tooraj Helmi 1, Shyam Kapadia 1, Bhaskar Krishnamachari 1,2 1 Computer.
1 The Effect of Heavy-Tailed Job Size Distributions on System Design Mor Harchol-Balter MIT Laboratory for Computer Science.
Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.
Copyright © 2011, Performance Evaluation of a Green Scheduling Algorithm for Energy Savings in Cloud Computing Truong Vinh Truong Duy; Sato,
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
What is Web Information retrieval from web Search Engine Web Crawler Web crawler policies Conclusion How does a web crawler work Synchronization Algorithms.
Analysis of SRPT Scheduling: Investigating Unfairness Nikhil Bansal (Joint work with Mor Harchol-Balter)
Saran Jenjaturong, Chalermek Intanagonwiwat Department of Computer Engineering Chulalongkorn University Bangkok, Thailand IEEE CROWNCOM 2008 acceptance.
1 Data Overhead Impact of Multipath Routing for Multicast in Wireless Mesh Networks Yi Zheng, Uyen Trang Nguyen and Hoang Lan Nguyen Department of Computer.
1 Mor Harchol-Balter Carnegie Mellon with Nikhil Bansal with Bianca Schroeder with Mukesh Agrawal.
Courtesy Piggybacking: Supporting Differentiated Services in Multihop Mobile Ad Hoc Networks Wei LiuXiang Chen Yuguang Fang WING Dept. of ECE University.
Chapter 4 CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
MicroGrid Update & A Synthetic Grid Resource Generator Xin Liu, Yang-suk Kee, Andrew Chien Department of Computer Science and Engineering Center for Networked.
Scheduling Jobs Across Geo-distributed Datacenters Chien-Chun Hung, Leana Golubchik, Minlan Yu Department of Computer Science University of Southern California.
Accelerating Peer-to-Peer Networks for Video Streaming
Looking at the Server-side of P2P Systems
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
ECF: an MPTCP Scheduler to Manage Heterogeneous Paths
User-driven Scheduling Of Interactive Virtual Machines
Multiple-resource Request Scheduling. for Differentiated QoS
Modeling and Taming Parallel TCP on the Wide Area Network
Uniprocessor scheduling
Size-Based Scheduling Policies with Inaccurate Scheduling Information
Presentation transcript:

1 Size-Based Scheduling Policies with Inaccurate Scheduling Information Dong Lu *, Huanyuan Sheng +, Peter A. Dinda * * Prescience Lab, Dept. of Computer Science + Dept. of Industrial Engineering & Management Science Northwestern University Evanston, IL USA

2 Outline Review of size-based scheduling Motivation Simulation Setup Simulation Results New applications Research Summary by subjects

3 Non-size-based scheduling FCFS, PS, etc. FCFS: First Come First Serve –Intuitive –Easiest to implement PS: Processor Sharing –Fair: all jobs accept equal resources –Also easy to implement Problem: Unaware of job size information, which results in big mean response time

4 Review of size-based scheduling SRPT, FSP, etc. Utilize the job size (processing time, service time) information for scheduling –Optimal in mean response time –Fair? –Easy to implement? We use Job Size to refer to the Processing Time (Service Time) of the job

5 Shortest Remaining Processing Time (SRPT) Always serve the job with minimum remaining processing time first, Preemptive scheduling Yields minimum mean response time [Schrage, Operations Research, 1968] Performance gains of SRPT over PS do not usually come at the expense of large jobs, in other words, it is Fair for heavy-tail job size distribution [Bansal and Harchol-Balter, Sigmetrics ‘01] Easy to implement? –With accurate a priori job size information, YES –Otherwise, NO

6 Fair Sojourn Protocol (FSP) Combined SRPT with PS, preemptive scheduling Mean response time is close to that of SRPT; and more fair than PS [Friedman, et al, Sigmetrics ‘03] Easy to implement? –With accurate a priori job size information, YES –Otherwise, NO

7 Motivation Size-based scheduling requires accurate knowledge of job sizes In practice, a priori job size information is not always available All the previous work assumes perfect knowledge of job sizes a priori How does performance depend on quality of job size information?

8 Correlation We study the performance of Size-based schedulers as a function of the correlation coefficient (Pearson’s R) between actual job sizes and estimated job sizes.

9 Outline Review of size-based scheduling Motivation Simulation Setup Simulation Results New applications Research Summary by subjects

10 Simulation Setup: Trace generator Trace Generator Correlation (Pearson’s R) Distribution ADistribution B X Y Correlated random pairs of X and Y X has distribution A Y has distribution B X and Y are correlated to R

11 Simulation Setup: Trace generator Algorithm: “Normal-To-Anything” –First developed by Cario and Nelson, on INFORMS Journal on Computing 10, 1 (1998). –We simplified the algorithm and first introduced it into the simulation studies of computer systems

12 Scatter plot of example traces R=0.13 R=0.78 Y X Y X

13 Simulation Setup: Performance metrics Performance metrics –Mean response time: Sojourn time, Turn-around time –Slowdown: the ratio of response time to its size. Fairness metric

14 Simulation Setup: Simulator Simulator –Written in C++ –Supports M/G/1 and G/G/n/m queuing model Simulator validation –Little’s law –Repeat the simulations in the FSP paper [Friedman, et al, Sigmetrics ‘03] –Compare with available theoretical results [Bansal and Harchol-Balter, Sigmetrics ‘01]

15 Simulation Setup: Scheduling Policies PS: Processor sharing Size-based scheduling policies –SRPT: Ideal SRPT scheduler –SRPT-E: SRPT scheduler using estimated job size –FSP: Ideal Fair Sojourn Protocol –FSP-E: FSP scheduler using estimated job size Each simulation is repeated 20 times and we present the average

16 Outline Review of size-based scheduling Motivation Simulation Setup Simulation Results New applications Research Summary by subjects

17 Simulation Results: Mean response time

18 Simulation Results: Slowdown (R=0.0224)

19 Simulation Results: Slowdown (R=0.239)

20 Simulation Results: Slowdown (R=0.4022)

21 Simulation Results: Slowdown (R=0.5366)

22 Simulation Results: Slowdown (R=0.7322)

23 Simulation Results: Slowdown (R=0.9779)

24 Simulation Results: Conclusions Performance heavily depends on correlation –SRPT-E and FSP-E can outperform PS given an effective job size estimator Crossover point of performance metrics is a function of correlation –Also of job size distributions (See TR NWU-CS-04-33)

25 Outline Review of size-based scheduling Motivation Simulation Setup Simulation Results New applications Research Summary by subjects

26 New Applications: Web server scheduling (TR NWU-CS-04-33) Is file size a good estimator of a job’s service time (processing time)? Not Really (R  0.14) Service time (wall clock time) File Size

27 New Applications: Web server scheduling Domain-based estimator: much more accurate prediction of the service time at low overhead

28 New Applications: P2P server side scheduling (LCR ’04) “Server side” of current file sharing P2P applications superficially similar to web server –Both send back files upon requests. However, P2P application can’t even know the file size accurately a priori –Partial downloads Our ongoing work shows that SRPT-E performs well using our time-series based job size estimators.

29 New Applications: Network backup system scheduling Incremental backup copies only the files that have been created or modified since a previous backup With Incremental backup, the actual job sizes is difficult to know until the backup finishes We believe that SRPT-E or FSP-E can be applied with time series based job size predictors

30 Summary Performance of size-based scheduling policies depends on correlation between size estimates and actual sizes –Fairness, mean response time, etc. Estimator must preserve ordering of job sizes for high performance –Performance degrades as correlation degrades Effective new estimators for Web and P2P

31 For More Information Prescience Laboratory – Home Page of Dong Lu –

32 Outline Review of size-based scheduling Motivation Simulation Setup Simulation Results New applications Research Summary by subjects

33 Research Summary by subjects Grid Computing Internet Measurement and Prediction Queuing and Scheduling Fat-Tree Based End System Multicast Wireless Ad Hoc Networks Incentivized Protocol Design for Peer-to-Peer Systems Parallel Computing

34 Grid Computing Dong Lu, Peter Dinda, “GridG: Generating Realistic Computational Grids”, ACM SIGMETRICS Performance Evaluation Review (Per), Volume 30, Number 4, Dong Lu, Peter Dinda, “Synthesizing Realistic Computational Grids,” Proceedings of the 15th ACM/IEEE Supercomputing (SC 2003), Phoenix, AZ, November Peter Dinda, Dong Lu, “Nondeterministic queries in a relational Grid information service”, Proceedings of the 15th ACM/IEEE Supercomputing (SC 2003), Phoenix, AZ, November Dong Lu, Peter Dinda, Jason Skicewicz ”Scoped and Approximated queries in a relational Grid Information Service”, Proceedings of 4th IEEE/ACM International Workshop on Grid Computing (Grid 2003), November, 2003, Phoenix, AZ. Bin Lin, Peter Dinda, Dong Lu, ”User-driven Scheduling of Interactive Virtual Machines”, Proceedings of Grid 2004, PITTSBURGH, PA, November, 2004.

35 Internet Measurement and Prediction Dong Lu, Y. Qiao, Peter Dinda, and F. Bustamante, ”Characterizing and Predicting TCP Throughput on the Wide Area Network”, Proceedings of the 25th IEEE International Conference on Distributed Computing Systems (ICDCS 2005), June 2005, Columbus, Ohio. To appear. Dong Lu, Yi Qiao, Peter Dinda, Fabian Bustamante, ”Modeling and Taming Parallel TCP on the Wide Area Network”, Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2005), April 4-8, 2005, Denver, Colorado.

36 Queuing and Scheduling Dong Lu, Huanyuan Sheng, Peter Dinda, “Size-Based Scheduling Policies with Inaccurate Scheduling Information,” Proceedings of MASCOTS 2004, October 2004, Volendam, The Netherlands. Dong Lu, Peter Dinda, Yi Qiao, Huanyuan Sheng, Fabian Bustamante, “Applications of SRPT Scheduling with Inaccurate Scheduling Information”, (short paper) Proceedings of MASCOTS 2004, October 2004, Volendam, The Netherlands. Yi Qiao, Dong Lu, Fabian Bustamante, Peter Dinda, ”Looking at the Server-Side of Peer-to-Peer Systems”, Proceedings of the 7th ACM Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers (LCR 2004), October 2004, Houston, Texas. Dong Lu, Huanyuan Sheng, and Peter Dinda, “Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies”, Technical Report NWU-CS-04-33, Department of Computer Science, Northwestern University, April, In Submission.

37 Fat-Tree Based End System Multicast: FatNemo Stefan Birrer, Dong Lu, Fabian Bustamante, Yi Qiao, Peter Dinda, “FatNemo: Building a Resilient Multi-Source Multicast Fat-Tree”, Proceedings of the Ninth International Workshop on Web Content Caching and Distribution (WCW 2004), October 2004, Beijing, China. Also appeared in LNCS, Vol. 3293/2004, pp –Long version in submission

38 Wireless Ad Hoc Networks Dong Lu, Haitao Wu, Qian Zhang, Wenwu Zhu, “PARS: Stimulating Cooperation for Power-Aware Routing in Ad-Hoc Networks”. Proceedings of the 40th IEEE International Conference on Communications (ICC 2005), May 2005, Seoul, Korea. To appear.

39 Incentivized Protocol Design for Peer- to-Peer Systems Dong Lu, Yi Qiao, Peter Dinda, Fabian Bustamante, “MultiTorrents: Bandwidth Optimized Hybrid Multicast for Incentivized P2P File Sharing”. In Submission.

40 Parallel Computing Dong Lu, Peter Dinda ”Virtualized Audio: A Highly adaptive Interactive High Performance Computing Application”, Proceedings of the 6th Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers (LCR 2002), Washington, DC, Also to appear in LNCS.