Page 1 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Things I Thought I Knew about Queueing Theory, but was Wrong About (Part 2, Service.

Slides:



Advertisements
Similar presentations
Dispatching to Incentivize Fast Service in Multi-Server Queues Raga Gopalakrishnan and Adam Wierman California Institute of Technology Sherwin Doroudi.
Advertisements

Effective Straggler Mitigation: Attack of the Clones [1]
Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
ISE480 Sequencing and Scheduling Izmir University of Economics ISE Fall Semestre.
The Impact of Server Incentives on Scheduling Raga Gopalakrishnan and Adam Wierman California Institute of Technology Sherwin Doroudi Carnegie Mellon University.
1 Chapter 8 Queueing models. 2 Delay and Queueing Main source of delay Transmission (e.g., n/R) Propagation (e.g., d/c) Retransmission (e.g., in ARQ)
1 Mor Harchol-Balter, CMU, Computer Sci. Alan Scheller-Wolf, CMU, Tepper Business Andrew Young, Morgan Stanley Surprising results on task assignment for.
Page 1 Alan Scheller-Wolf INFORMS Minneapolis October 6, 2013 Discussion of Marty Reiman’s Markov Lecture A Stochastic Programming Based Approach to ATO.
Page 1 Alan Scheller-Wolf Lunteren, The Netherlands January 15, 2013 Things I Thought I Knew About Queueing Theory, but was Wrong About: Part 1, Multiserver.
Load Balancing of Elastic Traffic in Heterogeneous Wireless Networks Abdulfetah Khalid, Samuli Aalto and Pasi Lassila
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
How Useful is Old Information? On the Management and Efficiency of Cloud Based Services seminar MICHAEL MITZENMACHER November 2010 Computer Science faculty.
Queuing Analysis Based on noted from Appendix A of Stallings Operating System text 6/10/20151.
ECS 152A Acknowledgement: slides from S. Kalyanaraman & B.Sikdar
Queues / Lines You manage a call center which can answer an average of 20 calls an hour. Your call center gets 17.5 calls in an average hour. On average.
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Data Communication and Networks Lecture 13 Performance December 9, 2004 Joseph Conron Computer Science Department New York University
Join-the-Shortest-Queue (JSQ) Routing in Web Server Farms
Dynamic routing – QoS routing Load sensitive routing QoS routing.
Fundamental Characteristics of Queues with Fluctuating Load VARUN GUPTA Joint with: Mor Harchol-Balter Carnegie Mellon Univ. Alan Scheller-Wolf Carnegie.
Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers Mor Armony Based on joint papers with Avi Mandelbaum and Amy Ward TexPoint.
Khudyakov Polina Designing a Call Center with an IVR MSc. Seminar Advisor: Professor Avishai Mandelbaum.
Queuing Networks: Burke’s Theorem, Kleinrock’s Approximation, and Jackson’s Theorem Wade Trappe.
Analysis of a Yield Management Model for On Demand IT Services Parijat Dube IBM Watson Research Center with Laura Wynter and Yezekael Hayel.
Effect of higher moments of job size distribution on the performance of an M/G/k system VARUN GUPTA Joint work with: Mor Harchol-Balter Carnegie Mellon.
Fundamental Characteristics of Queues with Fluctuating Load (appeared in SIGMETRICS 2006) VARUN GUPTA Joint with: Mor Harchol-Balter Carnegie Mellon Univ.
Effect of higher moments of job size distribution on the performance of an M/G/k system VARUN GUPTA Joint work with: Mor Harchol-Balter Carnegie Mellon.
1 IOE/MFG 543 Chapter 10: Single machine stochastic models Sections 10.1 and 10.4 You may skip Sections
1 Chapter 7 Dynamic Job Shops Advantages/Disadvantages Planning, Control and Scheduling Open Queuing Network Model.
Queuing Networks. Input source Queue Service mechanism arriving customers exiting customers Structure of Single Queuing Systems Note: 1.Customers need.
Asaf Cohen (joint work with Rami Atar) Department of Mathematics University of Michigan Financial Mathematics Seminar University of Michigan March 11,
L13. Shortest path routing D. Moltchanov, TUT, Spring 2008 D. Moltchanov, TUT, Spring 2014.
1 Exponential distribution: main limitation So far, we have considered Birth and death process and its direct application To single server queues With.
Management of Waiting Lines McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Trust-Aware Optimal Crowdsourcing With Budget Constraint Xiangyang Liu 1, He He 2, and John S. Baras 1 1 Institute for Systems Research and Department.
90288 – Select a Sample and Make Inferences from Data The Mayor’s Claim.
1 Chapter 7 Dynamic Job Shops Advantages/Disadvantages Planning, Control and Scheduling Open Queuing Network Model.
Flows and Networks Plan for today (lecture 6): Last time / Questions? Kelly / Whittle network Optimal design of a Kelly / Whittle network: optimisation.
Control for Stochastic Models via Diffusion Approximations Amy Ward, ANS Lecture Series 2008 TexPoint fonts used in EMF. Read the TexPoint manual before.
Scheduling policies for real- time embedded systems.
Networks of Queues Plan for today (lecture 6): Last time / Questions? Product form preserving blocking Interpretation traffic equations Kelly / Whittle.
Waiting Lines and Queuing Models. Queuing Theory  The study of the behavior of waiting lines Importance to business There is a tradeoff between faster.
Models of multipath resource allocation Damon Wischik, UCL.
1 Scheduling The part of the OS that makes the choice of which process to run next is called the scheduler and the algorithm it uses is called the scheduling.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
1 Markov Decision Processes Infinite Horizon Problems Alan Fern * * Based in part on slides by Craig Boutilier and Daniel Weld.
Competitive Queue Policies for Differentiated Services Seminar in Packet Networks1 Competitive Queue Policies for Differentiated Services William.
“Make to order - Make to Stock Production Systems By: ÖNCÜ HAZIR.
1 Managing Flow Variability: Safety Inventory Operations Management Session 23: Newsvendor Model.
1 Markov Decision Processes Infinite Horizon Problems Alan Fern * * Based in part on slides by Craig Boutilier and Daniel Weld.
Approximating the Performance of Call Centers with Queues using Loss Models Ph. Chevalier, J-Chr. Van den Schrieck Université catholique de Louvain.
Flows and Networks Plan for today (lecture 6): Last time / Questions? Kelly / Whittle network Optimal design of a Kelly / Whittle network: optimisation.
Introduction Many organizations use decision rules to alleviate incentive problems as opposed to incentive contracts The principal typically retains some.
NETE4631: Network Information System Capacity Planning (2) Suronapee Phoomvuthisarn, Ph.D. /
CSCI1600: Embedded and Real Time Software Lecture 24: Real Time Scheduling II Steven Reiss, Fall 2015.
1 Optimizing Decisions over the Long-term in the Presence of Uncertain Response Edward Kambour.
Non-Preemptive Buffer Management for Latency Sensitive Packets Moran Feldman Technion Seffi Naor Technion.
1 Task Assignment with Unknown Duration Mor Harchol-Balter Carnegie Mellon.
© 2015 McGraw-Hill Education. All rights reserved. Chapter 17 Queueing Theory.
Flows and Networks Plan for today (lecture 6): Last time / Questions? Kelly / Whittle network Optimal design of a Kelly / Whittle network: optimisation.
Management of Waiting Lines Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent.
Simple Queueing Theory: Page 5.1 CPE Systems Modelling & Simulation Techniques Topic 5: Simple Queueing Theory  Queueing Models  Kendall notation.
Managerial Decision Making Chapter 13 Queuing Models.
Energy-Aware Scheduling on Heterogeneous Processors
Load Balancing and Data centers
Serve Assignment Policies
Deadline Scheduling and Heavy tail distributionS
Queuing Theory Jackson Networks
Javad Ghaderi, Tianxiong Ji and R. Srikant
Presentation transcript:

Page 1 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Things I Thought I Knew about Queueing Theory, but was Wrong About (Part 2, Service Queues) Alan Scheller-Wolf Tepper School of Business Carnegie Mellon University Joint work with Ying Xu and Katia Sycara

Page 2 Alan Scheller-Wolf Lunteren, The Netherlands January 15, 2013 A Great Day As an advisor, one of the greatest things that can happen is the following: Student: “I can prove that XXX is true!” Advisor: “That can’t be right; everyone knows YYY is true.” Student: “I know, but I can prove it.” Advisor: “There must be a bug, show me.” Time passes... Advisor: “Huh, you are right, XXX is true!”

Page 3 Alan Scheller-Wolf Lunteren, The Netherlands January 15, 2013 My Goals Take you through “my great day” as an advisor Help you understand why everyone believed YYY Help you understand, mathematically, why XXX is true Help you see why, intuitively, XXX “has to be true” Discuss what general insights this gives

Page 4 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Quality Based Services (I) What are quality-based services? –Services in which longer service times generate greater value –Why greater value? For customer: –Better quality of service, e.g. diagnostic accuracy (e.g. Alizamir, de Vericourt and Sun, 2012; Wang et al. 2007, 2012) For firm: –Revenue generation (e.g. Ren and Zhou, 2008)

Page 5 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Quality Based Services (II) Examples of quality based services: –Health care –Call centers –Personal services –Education / Consulting –Others… OK, so why not just extend service times indefinitely? –Customers Don’t Like to Wait! (Duh!)

Page 6 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 The Central Problem Leisurely Service Short Wait SPEED-QUALITY TRADE-OFF PROVIDE VALUE WITHOUT TOO MUCH WAIT Lovejoy and Sethuraman (2000); task completion with deadlines

Page 7 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Providing Value Without Too Much Wait How? –Simple: Speed up when busy, slow down when not Dynamic (state dependent) control –Crabill (1972,1974) –Stidham and Weber (1989) –George and Harrison (2001) –Ata and Shneorson (2006) –Hopp et al. (2007)

Page 8 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 But What if a Dynamic Policy is Unrealistic? Why? –State information is expensive or difficult to get Heyman (1977), Harchol-Balter et al. (2003) –Service provision agreements may be customer- chosen or pre-sold Then we must use a static (state-independent) policy –This is a lower bound for the optimal dynamic policy performance

Page 9 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Let’s Take a Closer Look at the Static Case Why would you ever want to offer a variable service rate if you cannot use information on the state (congestion)? Maybe we have heterogeneous customers: Some value longer service more, others shorter waits –Mendelson and Whang (1990); M/M/1 pricing –Rao and Petersen (1998); M/M/1 pricing, class choice –Van Mieghem (2000); heavy traffic, Gc  –Gurvich and Whitt (2010); FQR routing for SL’s

Page 10 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 OK, What About Static Service and Homogeneous Customers? I could still offer variable service, but why? How? –I can’t use system information –I can’t take advantage of heterogeneous customer preferences And increasing variability in the system would increase delay –This is basic queueing, right, the PK formula???

Page 11 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 No! I will use a static policy For homogeneous customers I will randomly assign the arriving customers different grades, based on service rate –With probabilities and rates of my choosing And I will perform better than if I provide service of uniform rate How? –Let’s see…

Page 12 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Model Poisson BASE Service  I can assign arriving jobs different service rates      K But I must do so randomly, with probabilities p 1, p 2, …, p K I am in effect scaling the service distribution Each class (or grade) arrives Poisson( p k ) Single Server Queue

Page 13 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Service Discipline Assume non-preemptive policy r –Allowing preemption is, in general, a mess! So I have an M/G/1 queue with service discipline r

Page 14 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Performance Metrics Recall our Speed-Quality trade-off: Service Value increases with mean service time Delay cost increases with mean waiting time Assume both of these are linear for now Leisurely Service Short Wait

Page 15 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Objective Service Value is only a function of  and p Delay Cost is a function of  p, and r V(  p,r) =  u  k - h  k p k w k (  p,r)} pkpk kk

Page 16 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Results We first show that providing mixed service (K>1) is superior to providing pure service (K=1) How? –The key is r –We will take advantage of the variability we inject into the system –We will take advantage of the information we inject into the system with the variability or

Page 17 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 The Optimal Scheduling Rule We have a non-preemptive system Rule must be independent of state Rates are ordered:      …>  K Schedule according to SEPT for ANY  and p

Page 18 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 But Will This Really Work? I can use the heterogeneous rates to schedule short jobs before long ones But it is still hard to believe that it is beneficial to inject variability just so I can use SEPT!

Page 19 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Dominance Result (I) Let’s compare K=1 with K=2 Further, let’s assume What does this mean? p1p1 11 p2p2 22 += 1  Mean service time is the same in both systems

Page 20 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Dominance Result (II) Algebra and basic formulae yield: Can we make this < 1? Yes, if p1p1 11 p2p2 22 += E[w 2 ] E[w 1 ] (  1-  1 ) (1-p 1  ) 11 22 1 (1-  ) < When can we ensure this holds? Always! We pick the  ’s!

Page 21 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Dominance Result (III) But what about the service value? Since Service values are the same… So result holds even if not quality based service! p1p1 11 p2p2 22 += 1 

Page 22 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Dominance Result (IV) What does this condition do? Constrains difference in service rates (or variability to inject) As load increases, I have more freedom… Result extends to comparing k and k+1classes 11 22 1 (1-  ) < It is always beneficial to add another service class

Page 23 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Optimal Policy for a Given K Now assume we are unconstrained Lemma: Under optimal policy, p k w k = p k+1 w k+1 (we balance waiting time costs by class) And So, if we fix K and  all other parameters are determined! p k+1 pkpk  k+1 kk = = () 2  k+1 kk = () 2 (1-  ) 1/K

Page 24 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 How Do We Find Optimal  FOC: What do you notice  does NOT depend on?  (!) u(1-  )+CK{(1-  ) 1/K – (1-  ) -1/K }=0 h  2 E[X 2 ]  C =

Page 25 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 What does this mean? Since the server has the flexibility to set the service rates, it can always recover the optimal  no matter what  it faces. Similarly, if  k p k < 1, system can likewise adjust service rates and recover optimal value

Page 26 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Numerical Results: Benefit of Variability

Page 27 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Numerical Results: Relative Performance of Different Grades Shortest jobs: Most likely; Shortest wait; Smallest utility Longest jobs: Least likely; Longest wait; Largest utility

Page 28 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Extensions (I) We assumed linear service value and delay cost functions Results extend to convex value and linear cost –Convex service value just helps us There exist concave value and linear cost that satisfy –Concave service value works against us Non-linear cost makes waiting cost terms intractable.

Page 29 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Extensions (II) We assumed non-preemptive scheduling –If assume preempt-resume and exponential service, results extend. In general, if non-exponential service, optimal scheduling policy will depend on age of job in service, and could be arbitrarily complex (like TAGS)

Page 30 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Conclusions (I) For quality-based services it is beneficial to introduce variability in service times Even if: –You must make decisions statically –Customers are homogeneous –Customers do not care about quality It is always beneficial to add service grades

Page 31 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Conclusions (II) The optimal policy is insensitive to and whether or not you reject jobs Optimal parameters have a geometric structure –If parameters don’t have a geometric structure, then adjusting them can reduce delay, without adding any more classes (or information). Asymptotic benefit is about 5% at most

Page 32 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Dank U! Any Questions?

Page 33 Alan Scheller-Wolf Lunteren, The Netherlands January 15, 2013 Dank U! Any Questions?