Download presentation
Presentation is loading. Please wait.
1
Combinatorial Optimization Under Uncertainty
(Probing & Stopping-Time Algorithms) (10th Nov, 2017) Sahil Singla (Carnegie Mellon University) Thesis Committee: Manuel Blum, Anupam Gupta, Robert D. Kleinberg, R. Ravi, and Jan VondrAΜk
2
Modeling Combinatorial Optimization
Given a Finite Universe π={1,2,..,n} Given an Objective Function f : 2 π βR Given Constraints ββ 2 π Select a Solution set Sββ s.t. max/min π(S) Examples: Maximization: k elements, Matching, Max cut, Knapsack, (Bin) Packing, Orienteering, Welfare Max, SAT, Increasing Subseq, Throughput Scheduling, Submod over Matroid Minimization: MST, Vertex Cover, (Set) Covering, Steiner Network, TSP, Facility Location, Makespan Scheduling, Graph Coloring, Multiway/Sparsest/Multi cut, Clustering E.g, sum of weights E.g., matching
3
Modeling Uncertainty Why Consider Uncertainty?
Lack of future knowledge Imperfect prediction: Noise/Errors Reaching certainty difficult/expensive Probing and Stopping-Time Models Uncertainty clears element-by-element Make decisions before all uncertainty clears Motivating examples
4
Example 1: Oil-Drilling
Probing Set up One Oil Drill: Multiple potential locations Have Estimates on their Values: Location, size, old data Conduct Inspections to Find Exact Value Pay price per location Which Locations Should you Inspect? Goal: Maximize value of the oil drill Minimize total inspection price Similar Examples Purchasing a company Purchasing a house Price could be in different units, e.g., time
5
Underlying Combinatorial Problem
X1 X1= 4 X2=π X2 X3 X3=π X4 X4=10 Price of 1 per-box Value = π Price = π Picked Select the Most Valuable Box (Given Value Distributions) Easy in the Full-Info World: Pick the Max-Value Box Probing Algorithms Can control the probing order Donβt probe all boxes: Price incurred Pick at the end More Complicated when Underlying Problem Changed k boxes, matching, forest
6
Example 2: Diamond-Selling
Stopping-Time Sell One Diamond: Multiple potential buyers Buyers Arrive and Make a Take-it-or-Leave-it Bid Decide Immediately and Irrevocably When Should you Accept the Bid? Goal: Maximize value of the accepted bid Similar Examples Selling ad-slots Hiring a secretary Marriage partner Cannot go back to a declined bid
7
Underlying Combinatorial Problem
X1 X1=4 X2=π X2 X3 X3=π X4 X4=10 Value = 6 Picked Select the Most Valuable Box Easy in the Full-Info World: Pick the Max-Value Box Stopping-Time Algorithms Cannot control the box order If picking, do immediately More Complicated when Underlying Problem Changed k boxes, matching, forest
8
Probing & Stopping-Time Models
Similarities Given some estimates Uncertainty clears element-by-element Irrevocable decisions before all uncertainty clears Differences Probing Stopping-Time Can control the order Cannot control the order Pick in the end If picking, do immediately Price to finding value No price to finding value
9
OUTLINE Introduction Probing Algorithms (Oil-Drilling)
Problems: Oil-Drilling and Diamond-Selling What is Comb Opt under Uncertainty Probing Algorithms (Oil-Drilling) Model 1: Constrained Stochastic Probing Model 2: Price of Information Stopping-Time Algorithms (Diamond-Selling) Model 1: Prophet Inequality Model 2: Secretary and Prophet-Secretary Further Directions Unknown or Correlated Distributions Beyond Probing and Stopping-Time Algorithms
10
OUTLINE Introduction Probing Algorithms (Oil-Drilling)
Problems: Oil-Drilling and Diamond-Selling What is Comb Opt under Uncertainty Probing Algorithms (Oil-Drilling) Model 1: Constrained Stochastic Probing Model 2: Price of Information Stopping-Time Algorithms (Diamond-Selling) Model 1: Prophet Inequality Model 2: Secretary and Prophet-Secretary Further Directions Unknown or Correlated Distributions Beyond Probing and Stopping-Time Algorithms
11
Model 1 for Oil-Drilling
Given Indep Bernoulli Distributions π 1 , π 2 , .. , π π To Find Value, Pay a Given Probing Price π 1 , π 2 ,β¦, π π Probing Constraints : Given Probing Budget π΅ πβππππππ π π β€π΅ How to Adaptively Probe a Set ππππππβ{1,2,β¦,π} Objective: Maximize Expected Value max πβππππππ {π π } πΏ π = π π w.p. π π π otherwise Think of time spent How to generalize?
12
Constrained Stochastic Probing
Given Activation Probabilities: π 1 , π 2 , .. , π π Set π΄ contains π w.p. p π Given Value Function π: 2 [π] β π
β₯ E.g., Max, Sum of Top k, Submodular, Subadditive Packing Probing Constraints E.g., Cardinality, Knapsack, Matroid, Orienteering Adaptively Find ππππππβ[π] Satisfying Constraints Objective: Maximize Expected Value π(ππππππβ©π΄) Think of probability that location is feasible [n] A Prob
13
Optimal Adaptive Strategy
No Yes π 1 π 5 π 2 π 3 π 4 π 6 Every Root-Leaf Path Satisfies No variable appears twice Constraints (e.g., budget) Difficult to Find: Can be Exponential Sized! Want a Simple Optimal/Approximate Strategy
14
Optimal Non-Adaptive Strategy
Select a Feasible set ππππππβ π in the Beginning to Maximize E π΄~π [π ππππππβ©π΄ ] Benefits: Easier to Represent: Just output the set Find: g π = E π΄~π [π πβ©π΄ ] is submod in Full-Info world Concern: Large Adaptivity πππ:= π[ππππ] π[ππ] Example: Can probe β€π πΏ 1 & πΏ 2 take π w.p. π.π each πΏ 3 takes ππ w.p. π.ππ π[ππππ] β 0.80 π ππ = 0.75 0.5 NO YES πΏ π πΏ π πΏ π Examples with GAP= π πβπ βπ.ππ
15
Main Result Thm [GNSβ16,β17]: Adaptivity Gap for
Constraints = Downward-Closed Function = Monotone Submodular is at most 3. ADAP NON-ADAP ALGO GAP πΆ πΆ.GAP Downward-Closed: If a set can be probed then also any subset E.g., At most π, Matching, Orienteering Submodular: If SβT then f Sβͺe βf S β₯f Tβͺe βf(T) E.g., Max, Sum of top π Proof uses Two Ideas Random Root-Leaf Path Stem-by-stem Induction
16
Proof Ideas Assume Best ADAP Strategy Known
Thm [GNSβ17]: Adap Gap β€π Only Existence of a βGoodβ NA-Path Assume Best ADAP Strategy Known πΏ π 0.7 0.3 0.5 0.2 0.8 NO YES Random Path with ADAP Probabilities Example: Red path prob = π.πβπ.πβπ.π Here ADAP gets πππ₯{ π£ 2 , π£ 3 } Here NA gets E[πππ₯{ π 1 , π 2 , π 3 }] πΏ π πΏ π πΏ π Stem-by-Stem Induction shows: E[Random Path] β₯ π π ADAP Since NA β₯ E[Random Path] Adaptivity GAP β€3
17
Constrained Stochastic Probing
ADAP NON-ADAP ALGO GAP πΆ πΆ.GAP Question: The Adaptivity Gap for Constraints = Downward-Closed Function = Monotone Submodular is π/(πβπ)? Question: The Adaptivity Gap for Constraints = Downward-Closed Function = Subadditive is π©π¨π₯π²π₯π¨π π§ ? Subadditive: βS,βπ, if f SβͺT β€f S +f(T)
18
OUTLINE Introduction Probing Algorithms (Oil-Drilling)
Problems: Oil-Drilling and Diamond-Selling What is Comb Opt under Uncertainty Probing Algorithms (Oil-Drilling) Model 1: Constrained Stochastic Probing Model 2: Price of Information Stopping-Time Algorithms (Diamond-Selling) Model 1: Prophet Inequality Model 2: Secretary and Prophet-Secretary Further Directions Unknown or Correlated Distributions Beyond Probing and Stopping-Time Algorithms
19
Model 2 for Oil-Drilling
Given Indep Bernoulli Distributions π 1 , π 2 , .. , π π To Find Value, Pay a Given Probing Price π 1 , π 2 ,β¦, π π How to Adaptively Probe a set ππππππβ{1,2,β¦,π} Objective: Maximize Expected Utility max πβππππππ {π π } β πβππππππ π π Same as Weitzmanβs Pandoraβs Box Optimal policy: Index π π based ordering πΏ π = π π w.p. π π π otherwise Price of Information
20
Price of Information Utility Maximization Problem
Given distributions on values/costs: π 1 , π 2 , .. , π π Given probing prices: π 1 , π 2 ,β¦, π π Adaptively find ππππππβ{1,2,β¦,π} Utility Maximization Problem Packing Constraints: β max πβππππππ & π β β { πβπ π π } β πβππππππ π π Disutility Minimization Problem Covering Constraints: ββ² min πβππππππ & π β ββ² { πβπ π π } + πβππππππ π π Price of Information Think of Max Weight Forest Examples Max Matroid Basis Max Wt Matching Knapsack Examples Min Matroid Basis Vertex/Set Cover Feedback Vertex Set Facility Location Prize-Collecting Steiner Tree Think of Min-cost Spanning Tree
21
Main Result ThM [Sβ18]: For any Packing/Covering Problem, an πΆβapprox Frugal alg in the Free-Info World implies an πΆβapprox strategy in the PoI World. Think of Max Wt Matching with πΌ=2 Problem Approx Ratio Max/Min Matroid Basis 1 Max Wt Matching 2 Max π-system π Max Knapsack Min Vertex-/Set-Cover min{f,log n} Min Facility Location 1.861 Min Prize Collecting Steiner Tree 3 Feedback Vertex Set O(log n)
22
Constrained Price of Information
Given distributions on values: π 1 , π 2 , .. , π π Given probing prices: π 1 , π 2 ,β¦, π π Adaptively find ππππππβ{1,2,β¦,π} Constraint: Given Price Budget π΅ πβππππππ π π β€π΅ Maximize Expected Utility max πβππππππ {π π } β πβππππππ π π ThM [Sβ18]: πΆ(π) Approx for Constrained PoI. Use small Adaptivity Gap Use a Frugal Algorithm
23
OUTLINE Introduction Probing Algorithms (Oil-Drilling)
Problems: Oil-Drilling and Diamond-Selling What is Comb Opt under Uncertainty Probing Algorithms (Oil-Drilling) Model 1: Constrained Stochastic Probing Model 2: Price of Information Stopping-Time Algorithms (Diamond-Selling) Model 1: Prophet Inequality Model 2: Secretary and Prophet-Secretary Further Directions Unknown or Correlated Distributions Beyond Probing and Stopping-Time Algorithms
24
Model 1 for Diamond-Selling
Single Item Prophet Inequality Item value distributions known & independent Items arrive in adversarial order Immediately & Irrevocably: compare to π[πππ±] Cannot Beat Β½-Approx Let πΏ π =1 w.p. 1 & let πΏ 2 = 1/π w.p. π 0 otherwise Picked X1=0.3 X1~Unif(0,1) X2=0.6 X2~Exp(2) X3=0.2 X3βΌ0.2 X4~Unif(0,1) X4=0.9
25
Single Item Β½-Prophet Inequality
Thm [KWβ12]: Threshold π=Β½β
E[Max] gives Β½-approx. PROOF: E Alg =πβ
π+E[ Algβπ + ] , where π=Prβ‘[Maxβ₯π] Observe, E[Max]β€ π+E[ Maxβπ + ] β€π+ π E [ π π βπ + ] . And E Algβπ + = Pr Nothing before π β
E π π βπ + β₯ 1βπ β
βE π π βπ + β₯ 1βπ β
π. Hence, E Alg =πβ
π+E[ Algβπ + ]β₯π. Value = Revenue + Utility Q.E.D.
26
Combinatorial Prophet Inequality
Thm [KWβ12]: βΒ½ Prophet Inequalities for Constraints = Matroid Function = Additive Thm [RSβ17]: βπ(π) Prophet Inequalities for Constraints = Matroid Function = Non-Negative Submodular THM [RSβ17]: βπ(π/(π₯π¨π π§β
π₯π¨ π π π«)) Prophet-Inequalities for Constraints = Downward-Closed Function = Non-Negative Monotone Subadditive This result is Information Theoretic.
27
OUTLINE Introduction Probing Algorithms (Oil-Drilling)
Problems: Oil-Drilling and Diamond-Selling What is Comb Opt under Uncertainty Probing Algorithms (Oil-Drilling) Model 1: Constrained Stochastic Probing Model 2: Price of Information Stopping-Time Algorithms (Diamond-Selling) Model 1: Prophet Inequality Model 2: Secretary and Prophet-Secretary Further Directions Unknown or Correlated Distributions Beyond Probing and Stopping-Time Algorithms
28
Model 2 (and 3) for Diamond-Selling
Single Item Secretary Problem Item values unknown Items arrive in a uniformly random order Immediately & Irrevocably: compare to πππ± Can get π π -approx and this is tight Single Item Prophet-Secretary Item value distributions known & independent Immediately & Irrevocably: compare to π[πππ±] Can we beat approx?
29
Single Item (1β π π )-Prophet-Secretary
THM [EHKSβ18]: Threshold T π =(1β e πβ1 )β
E[Max] gives (1β 1 π )-approx. PROOF: βπ, E[Max]β€ π π +E[ Maxβ π π + ] β€ π π +βE [ π π β π π + ]. Now, E Rev =β π=0 1 π π β
ππ π , where π π =Prβ‘[Nothing before π] and E Utility =βE Utility i β₯ π=0 1 π π β
β π π β π π + ππ β₯ π=0 1 π π β
E Max β π π ππ . Hence, E Alg =E Rev +E Util β₯ 1β 1 π β
E[Max] Value = Revenue + Utility Q.E.D.
30
Matroid Prophet-Secretary
THM [EHKSβ18]: β(1β π π )-approx Prophet-Secretary for Constraints = Matroid Function = Additive Moreover, we can find it in Polynomial Time. Techniques Dynamic threshold Value = Revenue + Utility Questions Can we beat (1β π π ) for prophet secretary? Can we get Ξ©(1) for matroid secretary?
31
OUTLINE Introduction Probing Algorithms (Oil-Drilling)
Problems: Oil-Drilling and Diamond-Selling What is Comb Opt under Uncertainty Probing Algorithms (Oil-Drilling) Model 1: Constrained Stochastic Probing Model 2: Price of Information Stopping-Time Algorithms (Diamond-Selling) Model 1: Prophet Inequality Model 2: Secretary and Prophet-Secretary Further Directions Unknown or Correlated Distributions Beyond Probing and Stopping-Time Algorithms
32
Unknown Distrib: Sample Access
Consider Prophet Inequality Underlying Probability Distributions are Unknown Can Obtain Independent Samples from Distributions Goal: Maximize expected value Minimize number of samples Model 1: Two-Phase Learning How many training samples for Β½βπ -approx policy? Model 2: Simultaneous Exploration and Exploitation Repeatedly play the game to maximize average value
33
Correlated Distrib: Markov Models
Consider Prophet Inequality Given π pairs of Indep Distributions π 1 , π 2 ,β¦, π π or π 1 , π 2 ,β¦, π π Donβt Know in Which World Hidden Bernoulli variable π decides, say w.p. Β½ each Cannot observe π but box values give some idea Goal: Maximize Expected Value Questions What is the optimal algorithm? What is the optimal ratio?
34
Beyond Probing & Stopping-Time
Two-Stage Optimization Requests/Elements appear in two-stages Minimization: Costs increase in Stage-2 e.g., Steiner Tree Maximization: Elements disappear after Stage-1 e.g., Matching [LSβ17] Regret Analysis Restrict OPT to only play few strategies Additive guarantees w.r.t. OPT e.g., Scheduling problems to minimize weighted completion-time or makespan
35
Summary Questions? Probing Algorithms Stopping-Time Algorithms
Constrained Stochastic Probing: Adaptivity Gaps Price of Information: Frugal Algorithms Stopping-Time Algorithms Prophet Inequality: Value = Utility + Revenue Secretary Problem: Divide & Conquer: Add Constraints Further Directions Unknown/Correlated Distributions Two-Stage Optimization and Regret Analysis Questions?
36
References S. Ehsani, T. Kesselheim, M.T. Hajiaghayi, and S. Singla. `Prophet Secretary for Matroids and Combinatorial Auctionsβ. SODAβ18. A. Gupta, H. Jiang, Z. Scully, and S. Singla.Β `Markovian Price of Information'Β . Under Preparation. A. Gupta, V. Nagarajan, and S. Singla.Β `Adaptivity Gaps for Stochastic Probing: Submodular and XOS Functions'Β . SODAβ17. A. Gupta, V. Nagarajan, and S. Singla.Β `Algorithms and Adaptivity Gaps for Stochastic Probing'Β . SODAβ16. G. P. Guruganesh and S. Singla.Β `Online Matroid Intersection: Beating Half for Random Arrivalβ. IPCOβ17. E. Lee and S. Singla.Β `Maximum Matching in the Online Batch-Arrival Model'. IPCOβ17. A. Rubinstein and S. Singla.Β `Combinatorial Prophet Inequalities'.Β SODAβ17. S. Singla. `The Price of Information in Combinatorial Optimization'. SODAβ18.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.