Download presentation
Presentation is loading. Please wait.
1
Prophet Inequalities A Crash Course
Brendan Lucier, Microsoft Research EC18: ACM Conference on Economics and Computation Mentoring Workshop, June 18, 2018
3
The Plan Introduction to Prophet Inequalities
Connections to Pricing and Mechanism Design Variations I: Secretaries and Prophet Secretaries Variations II: Multiple Prizes
4
Prophet Inequality The gamblerβs problem: π· 1 π· 2 π· 3 π· 4 π· 5
Describing prophet inequality as a game
5
Prophet Inequality The gamblerβs problem: Keep: win $20, game stops.
π· 1 π· 2 π· 3 π· 4 π· 5 Boxes arrive and are opened one by one. Gambler chooses whether to keep the prize (end game), or move on. The distributions from which prizes are drawn are known in advance. Independent across boxes. Keep: win $20, game stops. Discard: prize is lost, game continues with next box.
6
Letβs Playβ¦ 3.16 2.87 1.14 2.67 π[2,4] π[2,4] π[1,5] π[0,7]
7
Prophet Inequality Theorem: [Krengel, Sucheston, Garling β77]
There exists a strategy for the gambler such that πΈ ππππ§π β₯ 1 2 πΈ max π π£ π and the factor 2 is tight. Prophet Haggai [Samuel-Cahn β84] β¦ a fixed threshold strategy: choose a single threshold π, accept first prize β₯π.
8
Lower Bound: 2 is Tight 1 π w.p. π 0 otherwise 1
9
Application: Posted Pricing
A mechanism design problem: 1 item to sell, n buyers, independent values π£ π ~ π· π . Buyers arrive sequentially, in an arbitrary order. For each buyer: interact according to some protocol, decide whether or not to trade, and at what price. π£ 1 ~ π· 1 π£ 2 ~ π· 2 π£ 3 ~ π· 3 π£ 4 ~ π· 4 An immediate corollary Corollary of Prophet Inequality: Posting an appropriate take-it-or-leave-it price yields at least half of the expected optimal social welfare. [Hajiaghayi Kleinberg Sandholm β07]
10
Applications (Conβt) What about revenue?
[Chawla Hartline Malec Sivan β10]: Can apply prophet inequality to virtual values to achieve half of optimal revenue. Deferred decision-making: A principle wants to choose from among multiple products, each with value π£ π , but canβt observe the products directly. An agent sees the products and makes a recommendation, but wants to optimize a different value function. We wonβt go into detail here about the connections between virtual value and revenue. See Mattβs tutorial for more. [Kleinberg, Kleinberg ECβ18]: Connection to Prophet Inequality. Hint: accept recommended product π iff π£ π exceeds a threshold.
11
(each box: prizes equally likely)
Prophet Inequality Multiple choices of π that achieve the 2-approx. Hereβs one due to [Kleinberg Weinberg 12]: Theorem (prophet inequality): for one item, setting threshold p= 1 2 πΈ max π π£ π yields expected value β₯ 1 2 πΈ max π π£ π . Example: Note: not optimal! (Price 2 is better) Also: comparison with optimal ex-post, not with the optimal price! Also note: threshold p depends on the set of distributions, but not the arrival order! 10 w.p. 1/2 8 w.p. 1/4 6 w.p. 1/8 2 w.p. 1/8 OPT = 1 or 6 0 or 8 2 or 10 E[OPT] = 8 β accept first prize β₯4 (each box: prizes equally likely)
12
Prophet Inequality: Proof
Theorem (prophet inequality): for one item, setting threshold p= 1 2 πΈ max π π£ π yields expected value β₯ 1 2 πΈ max π π£ π . What can go wrong? If threshold is Too low: we might accept a small prize, preventing us from taking a larger prize in a later round. Too high: we donβt accept any prize.
13
A Proof for Full Information
π£ 1 =10 π£ 2 =50 π£ 3 =80 π£ 4 =15 Idea: price max π π£ π is βbalancedβ Let π£ π β = max π π£ π . Case 1: Somebody π< π β buys the item. β revenue β₯ 1 2 π£ π β Case 2: Nobody π< π β buys the item. β utility of π β β₯ π£ π β β 1 2 π£ π β = 1 2 π£ π β In either case: welfare = revenue + buyer utilities β₯ 1 2 π£ π β Again, this price of 40 is not optimal (80 is better). But we present this argument because it extends directly to the probabilistic case on the next slide
14
Extending to Stochastic Setting
Thm: setting price p= 1 2 πΈ max i π£ π yields value β₯ 1 2 πΈ max i π£ π . Proof. Random variable: π£ β = max i π£ π =πππ REVENUE=πβ
Pr item is sold = E[ π£ β ]β
Pr item is sold SURPLUS = π E utility of buyer π β₯ π E π£ π βp + β
π π sees item = π E π£ π βp + β
Pr π sees item β₯ π E π£ π βp + β
Pr item not sold β₯E max i π£ π βp β
Pr item not sold β₯ E[ π£ β ]β
Pr item not sold Total Value=REVENUE+SURPLUSβ₯ 1 2 E[ v β ]. Important: linearity of expectation trick. -> Idea: if prob of not selling is high, then prob. that max-valued player can buy is high. Since expected value of max-valued player is high, and price is low, surplus must be high! [ equality: independence, and doesnβt depend on value. Next line: dominance. Then just take the sum. ] Note: x^+ == max{x, 0}
15
Prophet Inequality: Proof
Thm: for one item, price p= 1 2 πΈ πππ yields value β₯ 1 2 πΈ πππ . Summary: Price is high enough that expected revenue offsets the opportunity cost of selling the item. Price is low enough that expected buyer surplus offsets the value left on the table due to the item going unsold.
16
Secretaries and Prophet Secretaries
17
A Variation Prophet Inequality: A Related Problem:
Prizes drawn from distributions, order is arbitrary A Related Problem: Prizes are arbitrary, order is uniformly random
18
The game of googol [Gardner β60]
Letβs Playβ¦ 5.21 0.003 682,918 1099 ? ? ? ? The game of googol [Gardner β60]
19
A Related Problem The Secretary Problem:
In each round, only the rank of the current prize is revealed, relative to prizes seen already. Goal: maximize prob. of choosing the largest prize. Note: for the game of googol, itβs without loss of generality to only consider rank in your strategy as long as there are at least 3 boxes. But not so for n=2 boxes! There is a better strategy that uses the values. Puzzle: what is it? Answer: choose any distribution over the real numbers that has full support. The strategy is to pick a threshold from that distribution, and accept the first box iff itβs above the threshold. Why does this work? If both values are above the threshold or both are below, probability of winning is Β½. But if the threshold is between the prizes, then we win for sure! Since this happens with positive probability, this strategy always wins with probability strictly greater than Β½. 1 2 1 1 5.21 0.003 682,918 1099
20
Secretary Problem Theorem: [Lindley β61, Dynkin β63, Gilbert and Mosteller β66] There exists a strategy for the secretary problem such that ππ π πππππ‘ ππππππ π‘ β₯ 1 π and the factor π is tight as π grows large. Strategy: observe the first π/π values, then accept the next value that is larger than all previous.
21
Prophets vs Secretaries
Prophet Inequality: Prizes drawn from distributions, order is arbitrary Secretary Problem / Game of Googol: Prizes are arbitrary, order is uniformly random Prophet Secretary: Prizes drawn from distributions, order is uniformly random known and revealed online [Esfandiari, Hajiaghayi, Liaghat, Monemizadeh β15]
22
Recall: π[2,4] π[2,4] π[1,5] π[0,7] Why can random order help? Intuition: some orders are easier than others. E.g., higher-variance boxes first.
23
Recall: π[0,7] π[1,5] π[2,4] π[2,4]
24
Prophet Secretary Theorem: [Esfandiari, Hajiaghayi, Liaghat, Monemizadeh β15] There exists a strategy for the gambler such that πΈ ππππ§π β₯ 1β 1 π πΈ max π π£ π . Note: better than 1/2 [Azar, Chiplunkar, Kaplan ECβ18]: A strategy for the gambler that beats 1β 1 π .
25
Prophet Secretary threshold value prize round 1 2 3 4 5 6 7 8
A way to visualize prize round 1 2 3 4 5 6 7 8
26
Prophet Secretary Higher threshold:
more revenue when we sell the item to this buyer. value Lower threshold: More surplus for this buyer. Can imagine optimizing for one particular buyer. Same forces at play. How is randomization useful? Adversary canβt force buyer 4 to have a very poor draw. So giving buyer-specific bounds makes sense. round 1 2 3 4 5 6 7 8
27
Extension: Multiple Prizes
28
Multiple-Prize Prophet Inequality
Prophet inequality, but gambler can keep up to π prizes π=1: original prophet inequality: 2-approx πβ₯1: [Hajiaghayi, Kleinberg, Sandholm β07] There is a threshold π such that picking the first k values β₯π gives a 1+π( logβ‘π/π ) approximation. Idea: choose π s.t. expected # of prizes taken is πβ 2π log π . Then w.h.p. # prizes taken lies between πβ 4π log π and π. [Alaei β11] [Alaei Hajiaghayi Liaghat β12] Can be improved to 1+π 1 π using a randomized strategy, and this is tight. If k is greater than 1, the problem gets EASIER. Why? Concentration (Hoeffding inequality). Get tight bound up to logs. Can interpret as: come up with right threshold for a fractional relaxation of the problem, and the canonical rounding doesnβt do too badly. Note: revenue is at least p * (k β blah), and buyer surplus is at least (OPT β kp). Idea of Alaei improvement: same approach, but do the randomized rounding more carefully.
29
Aside: Beyond Cardinality
Constraint Upper Bound Lower Bound Single item 2 π items 1+π 1 π 1+Ξ© 1 π Matroid 2 [Kleinberg Weinberg β12] π matroids πβ
(π+1) [Feldman Svensson Zenklusen β15] π +1 [Kleinberg Weinberg β12] Knapsack 5 [Duetting Feldman Kesselheim L. β17] Downward-closed, max set size β€π π( log π log π ) [Rubinstein β16] Ξ© log π log log π [Babaioff Immorlica Kleinberg β07] Lots of literature on thisβ¦ Directly imply posted-price mechanisms for welfare, revenue
30
Multiple-Prize Prophet Inequality
A different variation on cardinality: The gambler can choose up to πβ₯1 prizes Afterward, gambler can keep the largest of the prizes chosen Theorem [Assaf, Samuel-Cahn β00]: There is a strategy for the gambler such that πΈ ππππ§π β₯ 1β 1 π+1 πΈ max π π£ π [Ezra, Feldman, Nehama ECβ18]: An extension to settings where gambler can choose up to π prizes and keep up to β. Includes an improved bound for β=1!
31
Combinatorial Variants
More general valuation functions: Reward for accepting a set of prizes π is a function π(π). Example: arbitrary submodular. [Rubinstein, Singla β17] Multiple prizes per round: Multiple boxes arrive each round. Revealed in round π: valuation function π π (π) for accepting set of prizes π π on round π. (Note: possible correlation!) Application: posted-price mechanisms for selling many goods [Alaei, Hajiaghayi, Liaghat β12], [Feldman Gravin L β13], [Duetting Feldman Kesselheim L β17] Related to mechanisms for combinatorial auctions. Can use posted prices to get a good approximation to the efficient outcome!
32
Summary Thanks! Open Challenge: Best-Order Prophet Inequality
Prophet Inequalities: analyzing the power of sequential decision-making, vs an offline benchmark. Recent connections to pricing and mechanism design MANY variations! A very active area of research Open Challenge: Best-Order Prophet Inequality Suppose the gambler can choose which order to open boxes. What fraction of πΈ max π π£ π can the gambler guarantee? Can the best order be computed efficiently? Thanks!
33
Bonus: Multi-Dimensional Prophets
More information on the βcombinatorial variantsβ slide Bonus: Multi-Dimensional Prophets
34
A General Model Combinatorial allocation Set M of π resources (goods)
Buyers: 1 2 m n A General Model Combinatorial allocation Set M of π resources (goods) π buyers, arrive sequentially online Buyer π has valuation function π£ π : 2 π β π
β₯0 Each π£ π is drawn indep. from a known distribution π· π Allocation: π= π₯ 1 ,β¦, π₯ π There is a downward-closed set πΉ of feasible allocations. Goal: feasible allocation maximizing π π£ π ( π₯ π ) A theoretical abstraction --- the combinatorial auction problem
35
Posted Price Mechanism
For each bidder in some order π: Seller chooses prices π π ( π₯ π ) Bidder πβs valuation is realized: π£ π βΌ πΉ π π chooses some π₯ π β arg max π£ π π₯ π β π π π₯ π Notes: βObviouslyβ strategy proof [Li 2015] Tie-breaking can be arbitrary Prices: static vs dynamic, item vs. bundle Special case: oblivious posted-price mechanism (OPM) prices chosen in advance, arbitrary arrival order Note: a single price vector, fixed for all time, used for all buyers. Supermarket model!
36
Applications Problem Approx. Price Model
Combinatorial auction, XOS valuations 2 Static item prices Bounded complements (MPH-k) [Feige et al. 2014] 4πβ2 Submodular valuations, matroid constraints 2 (existential) 4 (polytime) Dynamic prices Knapsack constraints 5 Static prices d-sparse Packing Integer Programs 8d Can use multi-dimensional prophet inequalities to bound the performance of pricing rules for combinatorial auction variants. Sometimes useful to use dynamic prices: update prices after each buyer, depending on what they purchase. [Feldman Gravin L β13], [Duetting Feldman Kesselheim L β17]
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.