© 2004 Warren B. Powell Slide 1 Outline A car distribution problem.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Chapter 4 Distributed Bellman-Ford Routing
MINIMUM COST FLOWS: NETWORK SIMPLEX ALGORITHM A talk by: Lior Teller 1.
Computational Stochastic Optimization:
1 Reinforcement Learning Introduction & Passive Learning Alan Fern * Based in part on slides by Daniel Weld.
Applications of Stochastic Programming in the Energy Industry Chonawee Supatgiat Research Group Enron Corp. INFORMS Houston Chapter Meeting August 2, 2001.
Network Optimization Models: Maximum Flow Problems
© 2003 Warren B. Powell Slide 1 Approximate Dynamic Programming for High Dimensional Resource Allocation NSF Electric Power workshop November 3, 2003 Warren.
A Constant Factor Approximation Algorithm for the Multicommodity Rent-or-Buy Problem Amit Kumar Anupam Gupta Tim Roughgarden Bell Labs CMU Cornell joint.
Lecture 5: Learning models using EM
Approximate Dynamic Programming for High-Dimensional Asset Allocation Ohio State April 16, 2004 Warren Powell CASTLE Laboratory Princeton University
Network Optimization Models: Maximum Flow Problems In this handout: The problem statement Solving by linear programming Augmenting path algorithm.
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
CISS Princeton, March Optimization via Communication Networks Matthew Andrews Alcatel-Lucent Bell Labs.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
Advanced Topics in Optimization
Michel Gendreau CIRRELT and MAGI École Polytechnique de Montréal
Slide 1 © 2008 Warren B. Powell Slide 1 Approximate Dynamic Programming for High-Dimensional Problems in Energy Modeling Ohio St. University October 7,
Yashar Ganjali High Performance Networking Group Stanford University September 17, 2003 Minimum-delay Routing.
On the convergence of SDDP and related algorithms Speaker: Ziming Guan Supervisor: A. B. Philpott Sponsor: Fonterra New Zealand.
15.082J and 6.855J and ESD.78J November 4, 2010 The Network Simplex Algorithm.
Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University
Chen Cai, Benjamin Heydecker Presentation for the 4th CREST Open Workshop Operation Research for Software Engineering Methods, London, 2010 Approximate.
15.082J and 6.855J and ESD.78J November 30, 2010 The Multicommodity Flow Problem.
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
An Overview of Dynamic Programming Seminar Series Joe Hartman ISE October 14, 2004.
Stochastic Linear Programming by Series of Monte-Carlo Estimators Leonidas SAKALAUSKAS Institute of Mathematics&Informatics Vilnius, Lithuania
1 Unconstrained Optimization Objective: Find minimum of F(X) where X is a vector of design variables We may know lower and upper bounds for optimum No.
Stochastic Multicast with Network Coding Ajay Gopinathan, Zongpeng Li Department of Computer Science University of Calgary ICDCS 2009, June , Montreal.
Optimization Flow Control—I: Basic Algorithm and Convergence Present : Li-der.
Approximate Dynamic Programming Methods for Resource Constrained Sensor Management John W. Fisher III, Jason L. Williams and Alan S. Willsky MIT CSAIL.
Bekkjarvik, A Heuristic Solution Method for a Stochastic Vehicle Routing Problem Lars Magnus Hvattum.
© 2007 Warren B. Powell Slide 1 The Dynamic Energy Resource Model Lawrence Livermore National Laboratories September 24, 2007 Warren Powell Alan Lamont.
CS 4407, Algorithms University College Cork, Gregory M. Provan Network Optimization Models: Maximum Flow Problems In this handout: The problem statement.
Approximate Dynamic Programming and Policy Search: Does anything work? Rutgers Applied Probability Workshop June 6, 2014 Warren B. Powell Daniel R. Jiang.
An Integrated Approach to Load Matching, Routing, and Equipment Balancing Problem Sarah Root June 8, 2005 Joint work with advisor Amy M. Cohn.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
1 Network Models Transportation Problem (TP) Distributing any commodity from any group of supply centers, called sources, to any group of receiving.
Maximum Flow Problem (Thanks to Jim Orlin & MIT OCW)
Outline The role of information What is information? Different types of information Controlling information.
Improving Revenue by System Integration and Cooperative Optimization Reservations & Yield Management Study Group Annual Meeting Berlin April 2002.
© 2009 Ilya O. Ryzhov 1 © 2008 Warren B. Powell 1. Optimal Learning On A Graph INFORMS Annual Meeting October 11, 2009 Ilya O. Ryzhov Warren Powell Princeton.
15.082J and 6.855J March 4, 2003 Introduction to Maximum Flows.
1 Inventory Control with Time-Varying Demand. 2  Week 1Introduction to Production Planning and Inventory Control  Week 2Inventory Control – Deterministic.
Solving Multistage Stochastic Linear Programs on the Computational Grid Jerry Shen June 8, 2004.
Network Simplex Animations Network Simplex Animations.
Eastern Mediterranean University Department Of Industrial Engineering Deterministic Dynamic Programming presented by: Ziad El Masri Supervisor: Prof. Dr.
OSiL: An XML-based schema for stochastic programs H.I. Gassmann, R. Fourer, J. Ma, R.K. Martin SP XI Vienna, August 2007.
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
Tactical Planning in Healthcare using Approximate Dynamic Programming (tactisch plannen van toegangstijden in de zorg) Peter J.H. Hulshof, Martijn R.K.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Tuesday, March 19 The Network Simplex Method for Solving the Minimum Cost Flow Problem Handouts: Lecture Notes Warning: there is a lot to the network.
::Network Optimization:: Minimum Spanning Trees and Clustering Taufik Djatna, Dr.Eng. 1.
Lecture 20 Review of ISM 206 Optimization Theory and Applications.
Signal processing and Networking for Big Data Applications: Lecture 9 Mix Integer Programming: Benders decomposition And Branch & Bound NOTE: To change.
The minimum cost flow problem
Network Simplex Animations
Analytics and OR DP- summary.
6.5 Stochastic Prog. and Benders’ decomposition
Motion Detection And Analysis
Resource Allocation in Non-fading and Fading Multiple Access Channel
Clearing the Jungle of Stochastic Optimization
James B. Orlin Presented by Tal Kaminker
Approximate Dynamic Programming for
Slides for Introduction to Stochastic Search and Optimization (ISSO) by J. C. Spall CHAPTER 15 SIMULATION-BASED OPTIMIZATION II: STOCHASTIC GRADIENT AND.
Matteo Fischetti, University of Padova
Markov Decision Problems
The Ford-Fulkerson Algorithm
6.5 Stochastic Prog. and Benders’ decomposition
Presentation transcript:

© 2004 Warren B. Powell Slide 1 Outline A car distribution problem

© 2004 Warren B. Powell Slide 2 Norfolk Southern

When a boxcar becomes empty, we have three options: Customers Regional depots General depots

Option 1: Send directly to customers

Option 2: Send to distribution areas

Option 1: Send directly to customers Option 2: Send to distribution areas Option 3: Send to general depots

© 2004 Warren B. Powell Slide 7 Forecasts of Car Demands ForecastActual

© 2004 Warren B. Powell Slide 8 Outline ADP for resource allocation problems with low dimensional attribute vectors: »Two-stage problems »Multistage problems

© 2004 Warren B. Powell Slide 9 Outline ADP for resource allocation problems with low dimensional attribute vectors: »Two-stage problems »Multistage problems

© 2004 Warren B. Powell Slide 10 Two-stage problems This period Future

© 2004 Warren B. Powell Slide 11 Two-stage problems Our basic strategy: Separable approximation

© 2004 Warren B. Powell Slide 12 Two-stage problems Two-stage resource allocation under uncertainty

© 2004 Warren B. Powell Slide 13 Two-stage problems

© 2004 Warren B. Powell Slide 14 Two-stage problems

© 2004 Warren B. Powell Slide 15 Two-stage problems

© 2004 Warren B. Powell Slide 16 Two-stage problems We estimate the functions by sampling from our distributions. Marginal value:

© 2004 Warren B. Powell Slide 17 Two-stage problems The time t subproblem: t (i-1,t+3) (i,t+1) (i+1,t+5)

© 2004 Warren B. Powell Slide 18 Two-stage problems Left and right gradients are found by solving flow augmenting path problems. t i (i-1,t+3) Gradients: The right derivative (the value of one more unit of that resource) is a flow augmenting path from that node to the supersink.

© 2004 Warren B. Powell Slide 19 Two-stage problems Left and right derivatives are used to build up a nonlinear approximation of the subproblem.

© 2004 Warren B. Powell Slide 20 Two-stage problems Left and right derivatives are used to build up a nonlinear approximation of the subproblem. Right derivativeLeft derivative

© 2004 Warren B. Powell Slide 21 Two-stage problems Each iteration adds new segments, as well as refining old ones.

© 2004 Warren B. Powell Slide 22 Two-stage problems Number of resources Approximate value function

© 2004 Warren B. Powell Slide 23 Two-stage problems It is important to maintain concavity:

© 2004 Warren B. Powell Slide 24 Value function A concave function… Slopes … has monotonically decreasing slopes. But updating the function with a stochastic gradient may violate this property. Two-stage problems

© 2004 Warren B. Powell Slide 25 A leveling algorithm

© 2004 Warren B. Powell Slide 26 A leveling algorithm

© 2004 Warren B. Powell Slide 27 A leveling algorithm Violates concavity (monotonicity of slopes)

© 2004 Warren B. Powell Slide 28 A leveling algorithm

© 2004 Warren B. Powell Slide 29 A projection algorithm (SPAR)

© 2004 Warren B. Powell Slide 30 A projection algorithm (SPAR)

© 2004 Warren B. Powell Slide 31 A projection algorithm (SPAR)

© 2004 Warren B. Powell Slide 32 A projection algorithm (SPAR)

© 2004 Warren B. Powell Slide 33 Two-stage problems A uniform sampling strategy: »Second stage sampled based on exogenous distribution.

© 2004 Warren B. Powell Slide 34 Two-stage problems Theorem (Topaloglu and Powell, 2003): Theorem (Powell, Ruszczynski and Topaloglu, 200?):

© 2004 Warren B. Powell Slide 35 Two-stage problems Projection algorithm Leveling algorithm

© 2004 Warren B. Powell Slide 36 Two-stage problems An optimizing sampling strategy:

© 2004 Warren B. Powell Slide 37 Two-stage problems Theorem (Powell, Ruszczynski and Topaloglu, 200?):

© 2004 Warren B. Powell Slide 38 Two-stage problems Separability implies:

© 2004 Warren B. Powell Slide 39 Two-stage problems We only sample the points in the second stage which were optimal given the second stage approximation. A stochastic allocation problem:

© 2004 Warren B. Powell Slide 40 Two-stage problems

© 2004 Warren B. Powell Slide 41 Two-stage problems Real problems are nonseparable.

© 2004 Warren B. Powell Slide 42 Two-stage problems Notes: »The use of separable approximations for two-stage problems can produce algorithms that produce solutions that converge almost surely to the optimal solution if the second stage problem is continuously differentiable (Cheung and Powell, Operations Research, 2000). »The use of separable approximations for two-stage problems will not generally produce optimal solutions when the second stage is nondifferentiable and nonseparable.

© 2004 Warren B. Powell Slide 43 Two-stage problems We are solving problems with the structure:

© 2004 Warren B. Powell Slide 44 Two-stage problems Duals reflect flow augmenting cycles through the network:

© 2004 Warren B. Powell Slide 45 Two-stage problems Duals reflect flow augmenting cycles through the network: 1 5 Captures impact on second stage

© 2004 Warren B. Powell Slide 46 Two-stage problems Duals reflect flow augmenting cycles through the network: 1 5

© 2004 Warren B. Powell Slide 47 Two-stage problems We solve a sample realization of the second stage: Capacitated arcs Uncapacitated arcs 1 5

© 2004 Warren B. Powell Slide 48 Two-stage problems (Sample) flow augmenting path from 5 to 1: 1 5

© 2004 Warren B. Powell Slide 49 Two-stage problems (Sample) flow augmenting path from 5 to 1: 1 5 Together they cancel, but one at a time may be infeasible.

© 2004 Warren B. Powell Slide 50 Two-stage problems (Sample) flow augmenting path from 5 to 1: 1 5

© 2004 Warren B. Powell Slide 51 Two-stage problems Questions: »How good is our approximation? »What is the speed of convergence? »How do we compare against deterministic approximations?

© 2004 Warren B. Powell Slide 52 Two-stage problems Nested Benders decomposition:

© 2004 Warren B. Powell Slide 53 Two-stage problems Point forecast Profits Iterations

© 2004 Warren B. Powell Slide 54 Two-stage problems Variations on Bender’s decomposition Point forecast Profits Iterations

© 2004 Warren B. Powell Slide 55 Two-stage problems Variations on Bender’s decomposition SPAR algorithm Point forecast Profits Iterations

© 2004 Warren B. Powell Slide 56 Two-stage problems Variations on Bender’s decomposition SPAR algorithm Deterministic approximation Iterations

© 2004 Warren B. Powell Slide 57 Two-stage problems Variations on Bender’s decomposition SPAR algorithm Deterministic approximation Iterations

© 2004 Warren B. Powell Slide 58 Two-stage problems

© 2004 Warren B. Powell Slide 59 Two-stage problems

© 2004 Warren B. Powell Slide 60 Two-stage problems

© 2004 Warren B. Powell Slide 61 Outline ADP for resource allocation problems with: »Low dimensional attribute vectors Two stage problems Multistage problems

© 2004 Warren B. Powell Slide 62 A pure (deterministic) network: Multistage problems

© 2004 Warren B. Powell Slide 63 A dynamic network: Multistage problems t

© 2004 Warren B. Powell Slide 64 Multistage problems Stepping through time:

© 2004 Warren B. Powell Slide 65 Multistage problems Iterative learning:

© 2004 Warren B. Powell Slide 66 Multistage problems Estimating values: »Single-pass algorithm: »Two-pass algorithm: Previous iterations This iteration

© 2004 Warren B. Powell Slide 67 Resource State-Type Time Multistage problems

© 2004 Warren B. Powell Slide 68 Time Resource State-Type Multistage problems

© 2004 Warren B. Powell Slide 69 Time Resource State-Type Multistage problems

© 2004 Warren B. Powell Slide 70 Resource State-Type Time Multistage problems

© 2004 Warren B. Powell Slide 71 Time Resource State-Type Multistage problems

© 2004 Warren B. Powell Slide 72 Time Resource State-Type Multistage problems

© 2004 Warren B. Powell Slide 73 Multistage problems The single-pass algorithm: »Easy to implement, but… »Suffers from slow backward communication of information: Information captured by value functions captures future time periods in previous iterations.

© 2004 Warren B. Powell Slide 74 Multistage problems Basic idea: »Perform forward pass as we did before, but do note update the value function. »Then step backward through the results, updating values given what happened in this iteration in the next time period.

© 2004 Warren B. Powell Slide 75 Backward pass

© 2004 Warren B. Powell Slide 76 Time Resource State-Type Backward pass

© 2004 Warren B. Powell Slide 77 Time Resource State-Type Backward pass

© 2004 Warren B. Powell Slide 78 Time Resource State-Type Backward pass

© 2004 Warren B. Powell Slide 79 Time Resource State-Type Backward pass

© 2004 Warren B. Powell Slide 80 Multistage problems Adaptive dynamic programming applied to a pure network.

© 2004 Warren B. Powell Slide 81 Chicag o Atlanta V0V0 V1V1 V2V2 V0V0 V1V1 V2V2 tt+1 2 Chicag o Atlanta V0V0 V1V1 V2V2 V0V0 V1V1 V2V2 2. ATL - CHI D 1 A multicommodity flow problem

© 2004 Warren B. Powell Slide 82 Chicag o Atlanta V0V0 V1V1 V2V2 V0V0 V1V1 V2V2 tt+1 Chicag o Atlanta V0V0 V1V1 V2V2 V0V0 V1V1 V2V2 2. ATL - CHI D 1 A multicommodity flow problem

© 2004 Warren B. Powell Slide 83 Chicag o Atlanta V0V0 V1V1 V2V2 V0V0 V1V1 V2V2 tt+1 2 Chicag o Atlanta V0V0 V1V1 V2V2 V0V0 V1V1 V2V2 2. ATL - CHI D 1 A multicommodity flow problem

© 2004 Warren B. Powell Slide 84 A multicommodity flow problem The mathematical optimum Using nonlinear approximations

© 2004 Warren B. Powell Slide 85 A multicommodity flow problem Planning horizon Percent of posterior bound Posterior bound Deterministic, rolling horizon

© 2004 Warren B. Powell Slide 86 A multicommodity flow problem Planning horizon Percent of posterior bound Posterior bound Results using value functions Deterministic, rolling horizon

© 2004 Warren B. Powell Slide 87 Multistage experiments Nested Benders, with 50, 100 and 250 iterationsA rolling horizon procedureAdaptive DP approximation

© 2004 Warren B. Powell Slide 88 Norfolk Southern

When a boxcar becomes empty, we have three options: Customers Regional depots General depots

Option 1: Send directly to customers

Option 2: Send to distribution areas

Option 1: Send directly to customers Option 2: Send to distribution areas Option 3: Send to general depots

© 2004 Warren B. Powell Slide 93 A car distribution problem For railroads, customers call in orders the week before: Requirement becomes knownRequirement becomes actionable Time

© 2004 Warren B. Powell Slide 94 Lagged information processes Ongoing research: we are currently testing an algorithmic strategy that explicitly captures this information process: Order is madeTransit time becomes knownCustomer accepts/rejects carOrder is released for shipping Time Destination of order becomes known

© 2004 Warren B. Powell Slide 95 A car distribution problem Repositioning movements based on forecasts Assignments to booked orders. Using value function approximations, we may reposition cars before orders become known:

© 2004 Warren B. Powell Slide 96 A car distribution problem Repositioning movements based on forecasts Assignments to booked orders. Using value function approximations, we may reposition cars before orders become known:

© 2004 Warren B. Powell Slide 97 A car distribution problem Profits Iterations

© 2004 Warren B. Powell Slide 98 A car distribution problem Profits Total revenue Empty repositioning costs Late service penalties Iterations

© 2004 Warren B. Powell Slide 99 A car distribution problem Empty miles as a percent of total miles History “Optimized” without adaptive learning

© 2004 Warren B. Powell Slide 100 A car distribution problem Empty miles as a percent of total miles History “Optimized” without adaptive learning “Optimized” with adaptive learning