Empirically Testing Decision Making in TAC SCM Erik P. Zawadzki July 23, 2007 Joint work with Kevin Leyton-Brown.

Slides:



Advertisements
Similar presentations
Value-at-Risk: A Risk Estimating Tool for Management
Advertisements

Strategic Decisions (Part II)
Hadi Goudarzi and Massoud Pedram
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
A.M. Alonso, C. García-Martos, J. Rodríguez, M. J. Sánchez Seasonal dynamic factor model and bootstrap inference: Application to electricity market forecasting.
1 Inventory Control for Systems with Multiple Echelons.
Experimental Design, Response Surface Analysis, and Optimization
Dynamic Bayesian Networks (DBNs)
Progress in Linear Programming Based Branch-and-Bound Algorithms
Scenario Trees and Metaheuristics for Stochastic Inventory Routing Problems DOMinant Workshop, Molde, Norway, September , 2009 Lars Magnus Hvattum.
Multi-scale Planning and Scheduling Under Uncertain and Varying Demand Conditions in the Pharmaceutical Industry Hierarchically Structured Integrated Multi-scale.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
Observers and Kalman Filters
Applications of Stochastic Programming in the Energy Industry Chonawee Supatgiat Research Group Enron Corp. INFORMS Houston Chapter Meeting August 2, 2001.
David Pardoe Peter Stone The University of Texas at Austin Department of Computer Sciences TacTex-05: A Champion Supply Chain Management Agent.
1 The Supply Chain Management Game for the Trading Agent Competition 2004 Supervisor: Ishai Menashe Dr. Ilana David final presentation: 10-Oct-04.
Planning under Uncertainty
Visual Recognition Tutorial
Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,
1 Graphical Models in Data Assimilation Problems Alexander Ihler UC Irvine Collaborators: Sergey Kirshner Andrew Robertson Padhraic Smyth.
A Heuristic Bidding Strategy for Multiple Heterogeneous Auctions Patricia Anthony & Nicholas R. Jennings Dept. of Electronics and Computer Science University.
Probabilistic video stabilization using Kalman filtering and mosaicking.
Discrete-Event Simulation: A First Course Steve Park and Larry Leemis College of William and Mary.
Bootstrap in Finance Esther Ruiz and Maria Rosa Nieto (A. Rodríguez, J. Romo and L. Pascual) Department of Statistics UNIVERSIDAD CARLOS III DE MADRID.
1 A Second Stage Network Recourse Problem in Stochastic Airline Crew Scheduling Joyce W. Yen University of Michigan John R. Birge Northwestern University.
Ant Colonies As Logistic Processes Optimizers
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.
Maximum Likelihood (ML), Expectation Maximization (EM)
Predicting Sequential Rating Elicited from Humans Aviv Zohar & Eran Marom.
Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.
Lecture 10 Comparison and Evaluation of Alternative System Designs.
Applied Business Forecasting and Planning
Simulation Output Analysis
Trading Agent Competition (Supply Chain Management) and TacTex-05.
Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University
Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Ann Melissa Campbell, Jan Fabian Ehmke 2013 Service Management and Science Forum Decision.
Kalman filtering techniques for parameter estimation Jared Barber Department of Mathematics, University of Pittsburgh Work with Ivan Yotov and Mark Tronzo.
David Pardoe Doran Chakraborty Peter Stone The University of Texas at Austin Department of Computer Science TacTex-09: A Champion Bidding Agent for Ad.
Trading Agent Competition Bassam Aoun 08/11/2004.
A Framework for Distributed Model Predictive Control
Generalized and Bounded Policy Iteration for Finitely Nested Interactive POMDPs: Scaling Up Ekhlas Sonu, Prashant Doshi Dept. of Computer Science University.
Stochastic Models for Operating Rooms Planning Mehdi LAMIRI, Xiaolan XIE, Alexandre DOLGUI and Frédéric GRIMAUD Centre Génie Industriel et Informatique.
Recursion and Dynamic Programming. Recursive thinking… Recursion is a method where the solution to a problem depends on solutions to smaller instances.
Common Set of Tools for Assimilation of Data COSTA Data Assimilation Summer School, Sibiu, 6 th August 2009 COSTA An Introduction Nils van Velzen
Supply Contracts with Total Minimum Commitments Multi-Product Case Zeynep YILDIZ.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
Learning Market Prices for a Real-time Supply Chain Management Trading Agent David Burke Joint work with Ken Brown, Armagan Tarim and Brahim Hnich David.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
CS Statistical Machine learning Lecture 24
Outline The role of information What is information? Different types of information Controlling information.
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.
SUPPLY CHAIN MANAGEMENT SYSTEMS Part I. 7-2 LEARNING OUTCOMES 1.List and describe the components of a typical supply chain 2.Define the relationship between.
Intelligent Supply Chain Management Strategic Supply Chain Management
1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.
THE INVENTORY ROUTING PROBLEM
Data Mining and Decision Support
Bayesian Speech Synthesis Framework Integrating Training and Synthesis Processes Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda Nagoya Institute.
Machine Learning 5. Parametric Methods.
Tracking with dynamics
Generalized Point Based Value Iteration for Interactive POMDPs Prashant Doshi Dept. of Computer Science and AI Institute University of Georgia
Tommy Messelis * Stefaan Haspeslagh Burak Bilgin Patrick De Causmaecker Greet Vanden Berghe *
Metaheuristics for the New Millennium Bruce L. Golden RH Smith School of Business University of Maryland by Presented at the University of Iowa, March.
Multi-Agents System CMSC 691B Gunjan Kalra Peter DSouza.
Homework 1- Gateway.
Bot Contests - Learnings from Trading Agent Contest for SCM
2016 International Conference on Grey Systems and Uncertainty Analysis
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Demand Management and Forecasting
Presentation transcript:

Empirically Testing Decision Making in TAC SCM Erik P. Zawadzki July 23, 2007 Joint work with Kevin Leyton-Brown

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

Trading Agent Competition Supply Chain Management (TAC SCM) Supply Chain Management (SCM) is an important industrial issue Static and unresponsive SC policies  Large inventories  Unreliable deliveries  Underperformance TAC SCM  Encourages research into SCM solutions  A simpler setting

Subproblems A TAC SCM PC (personal computer) manufacturing agent must make decisions for the following four subproblems:  Customer Bidding Customer Market Simulation Component Market Agent 6Agent 1 …

Subproblems A TAC SCM PC (personal computer) manufacturing agent must make decisions for the following four subproblems:  Customer Bidding  Component Ordering Customer Market Simulation Component Market Agent 6Agent 1 …

Subproblems A TAC SCM PC (personal computer) manufacturing agent must make decisions for the following four subproblems:  Customer Bidding  Component Ordering  Production Scheduling Customer Market Simulation Component Market Agent 6Agent 1 …

Subproblems A TAC SCM PC (personal computer) manufacturing agent must make decisions for the following four subproblems:  Customer Bidding  Component Ordering  Production Scheduling  Delivery Scheduling Customer Market Simulation Component Market Agent 6Agent 1 …

Subproblems A TAC SCM PC (personal computer) manufacturing agent must make decisions for the following four subproblems:  Customer Bidding  Component Ordering  Production Scheduling  Delivery Scheduling Decomposition  For instance [Collins et al 2007] Customer Market Simulation Component Market Agent 6Agent 1 …

Decision Making is Hard Decision making in TAC SCM is hard  Each subproblem solution influences the other three E.g. Customer Bidding  Depends on Delivery Scheduling  Depends on Production Scheduling  Depends on Component Ordering  There is an uncertain future Customer RFQs Component availability and pricing Late component deliveries  There is a hard time-constraint Most agents simplify or approximate this decision (or both).

Many ways to simplify Subproblem connection  Introduce independencies Action  Only build PCs once an order is certain Information  Do not use all the information that can be collected

How Should Algorithms be Compared? How do we determine which approaches are better than others?  The traditional approach is running an agent against a large set of other agents Easy to compare complete agents  Harder to compare particular approaches to the subproblems Test results are immediately relatable to competition performance  Results may be highly variable  Multiagent  Randomness in the simulation

An Alternate Approach to Evaluation We suggest a testing framework makes it easy to: 1. Hold some subproblem algorithms fixed while varying others 2. Large number of experiments Parallelism 3. Control variance Blocked experimental design 4. Focus on particular game events Resource shortages Steady state End game

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

Model Overview Our Model:  Generate RFQs and handle factories like in TAC SCM Simulation Agent

Model Overview Our Model:  Generate RFQs and handle factories like in TAC SCM  Simulate the customer market using a process learned from game data Customer Process Simulation Agent

Model Overview Our Model:  Generate RFQs and handle factories like in TAC SCM  Simulate the customer market using a process learned from game data  Simulate the component market using a process structurally similar to TAC SCM’s market Customer Process Simulation Component Process Agent

Model Overview Processes independent of agent actions  Blocked experimental design Simulation defined random seed Block experiments by simulation seed Customer Process Component Process Agent Simulation Customer Market Simulation Component Market Agent 6Agent 1 …

Model Overview Processes independent of agent actions  Blocked experimental design Simulation defined random seed Block experiments by simulation seed Customer Process Component Process Agent Simulation We will focus on ‘steady-state’ behaviour  Days 40 to 200  Beginning and end game effects We need to validate our model  Want processes to be faithful to game log data Customer Market Simulation Component Market Agent 6Agent 1 …

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

Customer Market Process (CMP) Learn the winning price distribution p(B|θ t,S)  θ t is the model parameters for day t  S is the product type r.v. Assume that each day’s winning price distribution is a Gaussian

Customer Market Process (CMP) Model parameters linearly related to previous day’s with unbiased Gaussian noise θ t = Aθ t-1 + N(0,Q) θ0θ0 θ1θ1 … θTθT

Customer Market Process (CMP) Model parameters linearly related to previous day’s with unbiased Gaussian noise θ t = Aθ t-1 + N(0,Q) Observations (empirical distribution) linearly related to model parameters with unbiased Gaussian noise y t = Cθ t + N(0,R) θ0θ0 θ1θ1 … θTθT y1y1 yTyT

Customer Market Process (CMP) Model parameters linearly related to previous day’s with unbiased Gaussian noise θ t = Aθ t-1 + N(0,Q) Observations (empirical distribution) linearly related to model parameters with unbiased Gaussian noise y t = Cθ t + N(0,R) Linear Dynamic System (LDS) θ0θ0 θ1θ1 … θTθT y1y1 yTyT

How to Learn LDS Parameters Learn the LDS dynamic ( ) with EM  Iteratively improves on an initial model ‘Improvement’ is increasing data likelihood  Unstable Inversion Good initial model helps avoid problems  Can calculate data likelihood and predict future states using Kalman Filters (KFs) Recursive filter for estimating LDS states Very fast Simple to implement

Other LDS consideration Other decisions about the model:  Independent vs `Full’ Model Should the behaviour from other PCs be informative? Can an LDS model this relationship?  Overfitting  Different dimensionality of the model parameters

Picking an LDS Model What makes a good model?  Model easily explains historical data Data likelihood  Predictive power Absolute prediction error

Picking an LDS Model What makes a good model?  Model easily explains historical data Data likelihood  Predictive power Absolute prediction error IndependentFull 1/ / / Model Log-Likelihoods

Picking an LDS Model What makes a good model?  Model easily explains historical data Data likelihood  Predictive power Absolute prediction error Independent Model with 32 model variables  Highest likelihood  Low mean winning price prediction error of 57.0 units IndependentFull 1/ / / Model Log-Likelihoods

KF Predictions Based on Learned LDS

Alternate Approaches Our model is generative  But this is similar to the prediction problem Deep Maize Forecasting [Kiekintveld et al, 2007]  K th -nearest neighbour What in the past looks like what is being seen right now? TacTex Offer Acceptance Predictor [Pardoe and Stone, 2006]  Separated Particle Filters

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

Component Market Process Not the focus of our work Needed a simple model Made one based on structural similarity to TAC SCM  Daily manufacturing capacity determined by random walk  Each component manufacturer maximized daily outgoing components using an ILP

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

Comparing Three Scheduling Techniques We will use our test framework to compare three different scheduling algorithms  We are interested in the interaction between production and delivery scheduling  To maintain consistency, will using the same customer bidding and component ordering algorithms Both done with simple heuristics  Ordering: static daily amount with inventory cap  Bidding: greedily, fixed percentage of production capacity

Myopic  Delivery Scheduling ILP that maximizes current day’s revenue Ignores the future  Production Scheduling Greedy, based on outstanding PC demand Customer Bidding Production Scheduling Delivery Scheduling Component Ordering

Myopic Delivery Program $$$$$ … SKU 1 $$$$$ SKU 16

SILP Stochastic Integer Linear Program (SILP)  SILP from Benisch et al 2004  Delivery and Production Scheduling ILP that maximizes expected profit Fixed n-day horizon Customer Bidding Production Scheduling Delivery Scheduling Component Ordering

SILP Program $ # $ # # $ E[$] # # # # # # … Day 1Day 2Day n

SILP Program $ # $ # # $ E[$] # # # # # # … Day 1Day 2Day n RFQs and Orders

SILP Program $ # $ # # $ E[$] # # # # # # … Day 1Day 2Day n Production

SILP Program $ # $ # # $ E[$] # # # # # # … Day 1Day 2Day n C

SAA Sample Average Approximation (SAA)  Shapiro et al 2001 Benisch et al 2004  Delivery and Production Scheduling ILP that maximizes expected profit n-day horizon k-samples  Drawn from uncertainty distribution Customer Bidding Production Scheduling Delivery Scheduling Component Ordering

SAA Program $ # $ # # $ $ # $ # # $ … Day 1Day 2 $ # $ # # $ $ # $ # # $ $ # $ # # $ $ # $ # # $ $ # $ # # $ Day n

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

Common Test Setup 11-computer cluster ILPs solved with CPLEX 10.1  Told to emphasize feasibility over optimality Used profit as a measure of solution quality  Revenue less late penalties and storage costs

Experiment 1: SAA Question: Given a global time cap, does it make more sense for SAA to quickly consider more samples, or spend more time optimizing fewer samples?  2, 4, 6, or 8 sample  10, 14, or 18 seconds per sample For each combination ran 100 simulations  30-days of simulated steady-state behaviour

Experiment 1: SAA (Results) Flat surface  Neither dimension significant for configurations that could be reasonably solved in TAC

Experiment 2: Algorithm Comparisons Question: Given these three algorithms and a time constraint, which algorithm should one use?  Myopic  2-day SILP with 10s cap  2-day SILP with 50s cap  3-day SAA with 1 sample, 10s cap  3-day SAA with 5 sample, 50s cap For each algorithm ran 100 simulations  30-days of simulated steady-state behaviour CPLEX solved the Myopic ILP in under 10s

Experiment 2: Algorithm Comparisons (Results) SILP and SAA beat Myopic SILP and SAA not significantly different Altering time cap makes no significant difference

Outline Introduction to Problem Model Customer Market Process Component Market Process Application: Scheduling Agents Experiments Conclusions

From experiments  SILP and SAA were not significantly different for examined configurations  Increasing the number of samples in time- constrained SAA optimization did not significantly increase profit  Early approximations were usually quite good

Conclusions From testing approach and framework  Easy to set up and run large experiments 1200 simulation in the first experiment 500 simulations in the second  Simple to parallelize  More control over parameters Time cap and simulation length altered  Accurate model of Customer Market Process Game data likely given model Low prediction error

Future Directions Data generated component market model Improve our model of the customer market  Priors during EM parameter estimation  Larger data set  TAC SCM Prediction Challenge Expand set of metrics Use framework integrate component ordering and customer bidding

Thank You Questions and comments