Tactical Planning in Healthcare using Approximate Dynamic Programming (tactisch plannen van toegangstijden in de zorg) Peter J.H. Hulshof, Martijn R.K.

Slides:



Advertisements
Similar presentations
Business Information Systems Decision Support Group Dirk Christian Mattfeld Future of DELS Research and Practice Dirk C. Mattfeld TU Braunschweig.
Advertisements

HEALTH PLANNING AND MANAGEMENT
Hadi Goudarzi and Massoud Pedram
Scheduling.
HAROSA 2013 Barcelona July WORKFORCE PLANNING AND SCHEDULING WITH CROSS-TRAINING AND LEARNING Scott E. Grasman Industrial & Systems Engineering Rochester.
Aggregate Planning.
Ai in game programming it university of copenhagen Reinforcement Learning [Outro] Marco Loog.
Operations Management
Advanced Access What can we accomplish in Leader? November 2004.
National Institute of Science & Technology TECHNICAL SEMINAR-2004 Dipanwita Dash [1] UNIT COMMITMENT Under the guidance of Mr. Debasisha Jena Presented.
Planning under Uncertainty
1 Budgeting Dr. Varadraj Bapat Module 14.. Management Accounting Dr. Varadraj Bapat, IIT Mumbai 2 Index  Introduction  Objectives  Advantages  Components.
Approximate Dynamic Programming for High-Dimensional Asset Allocation Ohio State April 16, 2004 Warren Powell CASTLE Laboratory Princeton University
Reinforcement Learning Mitchell, Ch. 13 (see also Barto & Sutton book on-line)
Strategic Decisions Using Dynamic Programming
1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.
1212 / t 1 A SHORT TERM CAPACITY ADJUSTMENT POLICY FOR MINIMIZING LATENESS IN JOB SHOP PODUCTION SYSTEMS Henny P.G. van Ooijen J.Will M. Bertrand.
Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.
1 Lecture 10: Uniprocessor Scheduling. 2 CPU Scheduling n The problem: scheduling the usage of a single processor among all the existing processes in.
Optimal Adaptive Signal Control for Diamond Interchanges Using Dynamic Programming Optimal Adaptive Signal Control for Diamond Interchanges Using Dynamic.
On the convergence of SDDP and related algorithms Speaker: Ziming Guan Supervisor: A. B. Philpott Sponsor: Fonterra New Zealand.
PATIENT SCHEDULING AT COLUMBIA’S RADIATION ONCOLOGY TREATMENT CENTER By David Kuo Chao and Ji Soo Han.
 Group 3 Vahe Karapetyan Dena Rad Eidin Balali Patrick Njathi Zhong Zheng Damilare Adeoye Operations Research & Linear Programming Agenda Introduction.
The MRP II Hierarchy. Long-Range Planning At the top of the hierarchy we have long-range planning. This involves three functions: resource planning,
Optimal Scheduling for ICU Patients SIDDHANT BHATT STEVE BOYLE ERICA CUNNINGHAM.
Operations Research Models
Long term policies for operating room planning
Resource Allocation for E-healthcare Applications
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. 16 Scheduling.
1 Providing Timely Access to Healthcare or Must Patients be Patient? Based on presentation by Professor Linda Green Columbia Business School April 20 th,
Column Generation Approach for Operating Rooms Planning Mehdi LAMIRI, Xiaolan XIE and ZHANG Shuguang Industrial Engineering and Computer Sciences Division.
CPU Scheduling Chapter 6 Chapter 6.
Integrated Tactical and Operational Planning of Police Helicopters Martijn Mes Department of Industrial Engineering and Business Information Systems University.
Planning Production of a Set of Semiconductor Components with Uncertain Wafer and Component Yield Frank W. Ciarallo Assistant Professor Biomedical, Industrial.
Modeling & Simulation What can they offer? March 28, 2012 Ottawa, ON Waiting Time Management Strategies for Scheduled Health Care Services: A Workshop.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. 13 Aggregate Planning.
Stochastic Models for Operating Rooms Planning Mehdi LAMIRI, Xiaolan XIE, Alexandre DOLGUI and Frédéric GRIMAUD Centre Génie Industriel et Informatique.
Strategic Production Planning Now showing at your local university.
ECES 741: Stochastic Decision & Control Processes – Chapter 1: The DP Algorithm 31 Alternative System Description If all w k are given initially as Then,
1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.
Tactical Planning in Healthcare with Approximate Dynamic Programming Martijn Mes & Peter Hulshof Department of Industrial Engineering and Business Information.
Leader-Follower Framework For Control of Energy Services Ali Keyhani Professor of Electrical and Computer Engineering The Ohio State University
Appointment Systems - a Stochastic and Fluid Approach Michal Penn The William Davidson Faculty of Industrial Engineering and Management Technion - Israel.
1 S ystems Analysis Laboratory Helsinki University of Technology Flight Time Allocation Using Reinforcement Learning Ville Mattila and Kai Virtanen Systems.
Com & Net Call admission control (JABA-SD = MBA-SD) May 14, 2002 김 영 덕.
A Method for Transparent Admission Control and Request Scheduling in E-Commerce Web Sites S. Elnikety, E. Nahum, J. Tracey and W. Zwaenpoel Presented By.
MAP the Way to Success in Math: A Hybridization of Tutoring and SI Support Evin Deschamps Northern Arizona University Student Learning Centers.
Outline The role of information What is information? Different types of information Controlling information.
Understanding Variation and Setting Capacity. Why do we get backlog and queues? Because demand exceeds activity Because we want to be efficient Because.
Restless Multi-Arm Bandits Problem (RMAB): An Empirical Study Anthony Bonifonte and Qiushi Chen ISYE8813 Stochastic Processes and Algorithms 4/18/2014.
1 Solving Infinite Horizon Stochastic Optimization Problems John R. Birge Northwestern University (joint work with Chris Donohue, Xiaodong Xu, and Gongyun.
Content caching and scheduling in wireless networks with elastic and inelastic traffic Group-VI 09CS CS CS30020 Performance Modelling in Computer.
Consolidation and Last-mile Costs Reduction in Intermodal Transport Martijn Mes & Arturo Pérez Rivera Department of Industrial Engineering and Business.
Resource Allocation in Hospital Networks Based on Green Cognitive Radios 王冉茵
Simulation in Healthcare Ozcan: Chapter 15 ISE 491 Fall 2009 Dr. Burtner.
Stochastic Optimization for Markov Modulated Networks with Application to Delay Constrained Wireless Scheduling Michael J. Neely University of Southern.
Managerial Economics Linear Programming Aalto University School of Science Department of Industrial Engineering and Management January 12 – 28, 2016 Dr.
Introduction to Integer Programming Integer programming models Thursday, April 4 Handouts: Lecture Notes.
Copyright © 2014 by McGraw-Hill Education (Asia). All rights reserved. 13 Aggregate Planning.
Chapter 13 Aggregate Planning.
Chapter 10 Sales and Operations Planning (Aggregate Planning)
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. 7 Aggregate Planning.
Water resources planning and management by use of generalized Benders decomposition method to solve large-scale MINLP problems By Prof. André A. Keller.
) Recherche & Innovation Multi-Objectives, Multi-Period Optimization of district networks Using Evolutionary Algorithms and MILP: Daily thermal storage.
Managerial Economics Linear Programming
Anticipatory Synchromodal Transportation Planning
13 Aggregate Planning.
Professor Arne Thesen, University of Wisconsin-Madison
Chap 11 Learning Objectives
Presentation transcript:

Tactical Planning in Healthcare using Approximate Dynamic Programming (tactisch plannen van toegangstijden in de zorg) Peter J.H. Hulshof, Martijn R.K. Mes, Richard J. Boucherie, Erwin W. Hans Center for Health care Operations Improvement and Research (CHOIR) University of Twente Friday, January 24, 2014 CHOIR Seminar – University of Twente

CHOIR Seminar OUTLINE  Introduction  Problem formulation  Solution approaches  Integer Linear Programming  Dynamic Programming  Approximate Dynamic Programming  Results  Conclusions 2/26 This research is partly supported by the Dutch Technology Foundation STW, applied science division of NWO and the Technology Program of the Ministry of Economic Affairs.

MOTIVATION & FOCUS  Motivation  Long access times due to lacking match of supply and demand  Varying demand (e.g., seasonality)  Varying resource availability (e.g., holidays, cancellations)  Limited control of serving the strategically agreed number of patients.  Opportunities to improve resource utilization.  Focus  Integrated decision making on the tactical planning level:  Patient care processes connect multiple departments and resources, which require an integrated approach.  Operational decisions often depend on a tactical plan, e.g., tactical allocation of blocks of resource time to specialties and/or patient categories (master schedule / block plan). CHOIR Seminar 3/26

CARE PROCESS & ACCESS TIME CHOIR Seminar 4/26  Care process: a chain of care stages for a patient, e.g., consultation, surgery, or a visit to the outpatient clinic.  Access time is the delay between the request for an appointment or treatment and the actual appointment or treatment.

OBJECTIVES OF TACTICAL PLANNING  To control access times and care pathway durations  To ensure quality of care for the patient and to prevent patients from seeking treatment elsewhere  Decreasing care pathway duration decreases the delay between costs invested and revenues incurred  To serve the strategically agreed number of patients  To achieve high resource utilization and to balance workload  Decreased costs and increased staff satisfaction CHOIR Seminar 5/26 Tactical planning requires coordinated decision making between multiple resources, multiple time periods, and multiple care pathways.

TACTICAL PLANNING IN OUR STUDY  Typical setting: 8 care processes, 8 weeks as a planning horizon, and 4 resource types.  Current way of creating/adjusting tactical plans: biweekly meeting with decision makers using spreadsheet solutions.  Our objective: to provide an optimization step that supports rational decision making in tactical planning.  Allocate resource capacities to care pathways, considering the variations in patient demand and resource availability.  Determine a patient admission plan for each stage in every care pathway. CHOIR Seminar 6/26

PROBLEM FORMULATION [1/2] CHOIR Seminar 7/26

PROBLEM FORMULATION [2/2] CHOIR Seminar 8/26

(2,2) (1,2) (0,2) (2,1) (1,1) (0,1) (2,0) (1,0) (0,0) Time → States → ILLUSTRATION 1 queue (1 care process with 1 stage), 0/1 waiting time

SOLUTION APPROACHES  Mixed Integer Linear Program (MILP):  Deterministic  Dynamic Programming (DP):  Able to incorporate uncertainty, but not scalable to realistic problem sizes  Approximate Dynamic Programming (ADP):  Alternative way to solve DPs, scalable to realistic problem sizes CHOIR Seminar 10/26

Time → States → (2,2) (1,2) (0,2) (2,1) (1,1) (0,1) (2,0) (1,0) (0,0) ILLUSTRATION MIXED INTEGER LINEAR PROGRAM Deterministic problem - expected values: 1 arrival

Time → States → (2,2) (1,2) (0,2) (2,1) (1,1) (0,1) (2,0) (1,0) (0,0) MIXED INTEGER LINEAR PROGRAM (MILP) Evaluate costs for all possible sequences of decisions (transitions only serve as illustration, not always feasible)

MIXED INTEGER LINEAR PROGRAM (MILP) 13/26 Number of patients in queue j at time t with waiting time u Number of patients to treat in queue j at time t with a waiting time u [1] [1] Hulshof PJ, Boucherie RJ, Hans EW, Hurink JL. (2013) Tactical resource allocation and elective patient admission planning in care processes. Health Care Manag Sci. 16(2): Updating waiting list & bound on u Limit on the decision space CHOIR Seminar Assume upper bound U on u

PROS & CONS OF THE MILP  Pros:  Suitable to support integrated decision making for multiple resources, multiple time periods, and multiple patient groups.  Flexible formulation (other objective functions can easily be incorporated).  Cons:  Quite limited in the state space.  Rounding problems with fraction of patients moving from one queue to another after service.  Model does not include any form of randomness. CHOIR Seminar 14/26

Time → States → (2,2) (1,2) (0,2) (2,1) (1,1) (0,1) (2,0) (1,0) (0,0) DYNAMIC PROGRAMMING (DP) Calculate the exact cost-to-go backwards (transitions only serve as illustration, not always feasible) V 4 (0,1) V 4 (1,1) V 4 (2,1) V 4 (0,2) V 4 (1,2) V 4 (2,0) V 4 (1,0) V 4 (0,0) V 4 (2,2) V 3 (0,1) V 3 (1,1) V 3 (2,1) V 3 (0,2) V 3 (1,2) V 3 (2,0) V 3 (1,0) V 3 (0,0) V 3 (2,2) V 2 (0,1) V 2 (1,1) V 2 (2,1) V 2 (0,2) V 2 (1,2) V 2 (2,0) V 2 (1,0) V 2 (0,0) V 2 (2,2) V 1 (0,1) V 1 (1,1) V 1 (2,1) V 1 (0,2) V 1 (1,2) V 1 (2,0) V 1 (1,0) V 1 (0,0) V 1 (2,2) V 0 (0,1) V 0 (1,1) V 0 (2,1) V 0 (0,2) V 0 (1,2) V 0 (2,0) V 0 (1,0) V 0 (0,0) V 0 (2,2)

CHOIR Seminar DYNAMIC PROGRAMMING FORMULATION  Solve:  Where  By backward induction. 16/26

CHOIR Seminar THREE CURSUS OF DIMENSIONALITY 17/26

V 4 (0,1) V 4 (1,1) V 4 (2,1) V 4 (0,2) V 4 (1,2) V 4 (2,0) V 4 (1,0) V 4 (0,0) V 4 (2,2) V 3 (0,1) V 3 (1,1) V 3 (2,1) V 3 (0,2) V 3 (1,2) V 3 (2,0) V 3 (1,0) V 3 (0,0) V 3 (2,2) V 2 (0,1) V 2 (1,1) V 2 (2,1) V 2 (0,2) V 2 (1,2) V 2 (2,0) V 2 (1,0) V 2 (0,0) V 2 (2,2) V 1 (0,1) V 1 (1,1) V 1 (2,1) V 1 (0,2) V 1 (1,2) V 1 (2,0) V 1 (1,0) V 1 (0,0) V 1 (2,2) V 0 (0,1) V 0 (1,1) V 0 (2,1) V 0 (0,2) V 0 (1,2) V 0 (2,0) V 0 (1,0) V 0 (0,0) V 0 (2,2) Time → States → (2,2) (1,2) (0,2) (2,1) (1,1) (0,1) (2,0) (1,0) (0,0) APPROXIMATE DYNAMIC PROGRAMMING (ADP) Learn cost-to-go forwards iteratively V 4 (0,1) V 3 (0,1) V 2 (1,0) V 1 (2,1) V 0 (0,1) (transitions only serve as illustration, not always feasible)

CHOIR Seminar ADP FORMULATION 19/26

CHOIR Seminar VALUE FUNCTION APPROXIMATION [1/2] 20/26

CHOIR Seminar VALUE FUNCTION APPROXIMATION [2/2] 21/26

CHOIR Seminar EXPERIMENTS 22/26

CHOIR Seminar CONVERGENCE RESULTS ON SMALL INSTANCES  Tested on 5000 random initial states.  DP requires 120 hours, ADP seconds for N=500.  ADP overestimates the value functions (+2.5%) caused by the truncated state space. 23/26

CHOIR Seminar PERFORMANCE ON SMALL AND LARGE INSTANCES  Compare with greedy policy: fist serve the queue with the highest costs until another queue has the highest costs, or until resource capacity is insufficient.  We train ADP using 100 replications after which we fix our value functions.  We simulate the performance of using (i) the greedy policy and (ii) the policy determined by the value functions.  We generate 5000 initial states, simulating each policy with 5000 sample paths.  Results:  Small instances: ADP 2% away from optimum and greedy 52% away from optimum.  Large instances: ADP results 29% savings compared to greedy (higher fluctuations in resource availability or patient arrivals results in larger differences between ADP and greedy). 24/26

CHOIR Seminar MANAGERIAL IMPLICATIONS  The ADP approach can be used to establish long-term tactical plans (e.g., three month periods) in two steps:  Run N iterations of the ADP algorithm to find the value functions given by the feature weights for all time periods.  These value functions can be used to determine the tactical planning decision for each state and time period by generating the most likely sample path.  Implementation in a rolling horizon approach:  Finite horizon approach may cause unwanted and short-term focused behavior in the last time periods.  Recalculation of tactical plans ensures that the most recent information is used.  Recalculation can be done using the existing value function approximations and the actual state of the system. 25/26

QUESTIONS? Martijn Mes Assistant professor University of Twente School of Management and Governance Dept. Industrial Engineering and Business Information Systems Contact Phone: Web: