Scheduling with uncertain resources Elicitation of additional data Ulaş Bardak, Eugene Fink, Chris Martens, and Jaime Carbonell Carnegie Mellon University
Problem Scheduling a conference under uncertainty Uncertain room properties Uncertain equipment needs Uncertain speaker preferences The automated scheduler needs to collaborate with the human user.
Problem The system may not have enough data for producing a good schedule The user may be able to obtain some of the missing data, but not all data The system should identify critical missing data and ask the user only for these data.
Missing info: Invited talk: – Projector need Poster session: – Room size – Projector need Assumptions: Invited talk: – Needs a projector Poster session: – Small room is OK – Needs no projector Initial schedule Available rooms: Room num. Size Proj- ector Yes No Yes Events and constraints: Invited talk, 9–10am: Needs big room Poster session, 9–11am: Needs a room Initial schedule: Talk Posters
Choice of questions Candidate questions: Invited talk: Needs a projector? Poster session: Needs a larger room? Needs a projector? Events and constraints: Invited talk, 9–10am: Needs a large room Poster session, 9–11am: Needs a room Useless info: There are no large rooms w/o a projector × Useless info: There are no unoccupied larger rooms × Potentially useful info √ Initial schedule: Talk Posters
Improved schedule Events and constraints: Invited talk, 9–10am: Needs a large room Poster session, 9–11am: Needs a room System: Does the poster session need a projector? User: A projector may be useful, but not really necessary. Posters 1 New schedule: Talk Initial schedule: Talk Posters Info elicitation:
Architecture Info elicitorParserOptimizer Process new info Update the schedule Choose questions Top-level control and learning Graphical user interface Administrator
Choice of questions For each candidate question, estimate the probabilities of possible answers For each question, compute its expected impact on the schedule quality, and select questions with large expected impacts For each possible answer, compute the respective change of the schedule quality
Experiments Scheduling of a large conference 14 available rooms 84 conference sessions 700 uncertain variables Manual Scheduling 0.61 Auto w/o Elicitation 0.68 Auto with Elicitation 0.72 Schedule Quality
Schedule Quality Number of Questions 0.50 Experiments 0 actual estimated optimal schedule
Extensions Game-tree search for the most important questions Fast heuristics for pruning unimportant questions Learning new strategies for question selection
Conclusions We have developed a system that analyzes the importance of missing data, identifies critical uncertainties, and asks the user to obtain related additional data. It usually finds a near-optimal solution after asking 2% to 6% of all potential questions. The developed technique does not rely on specific properties of scheduling tasks, and it is applicable to a variety of problems that involve optimization under uncertainty.