C&O 355 Mathematical Programming Fall 2010 Lecture 15 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.

Slides:



Advertisements
Similar presentations
The Primal-Dual Method: Steiner Forest TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A AA A A A AA A A.
Advertisements

C&O 355 Lecture 15 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A A A A A A A.
C&O 355 Lecture 6 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
C&O 355 Lecture 8 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
C&O 355 Lecture 9 N. Harvey TexPoint fonts used in EMF.
C&O 355 Mathematical Programming Fall 2010 Lecture 6 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A.
C&O 355 Mathematical Programming Fall 2010 Lecture 9
C&O 355 Lecture 5 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
C&O 355 Lecture 23 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
1 LP Duality Lecture 13: Feb Min-Max Theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum.
Geometry and Theory of LP Standard (Inequality) Primal Problem: Dual Problem:
Lecture #3; Based on slides by Yinyu Ye
C&O 355 Mathematical Programming Fall 2010 Lecture 22 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
C&O 355 Lecture 4 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
C&O 355 Mathematical Programming Fall 2010 Lecture 20 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
C&O 355 Mathematical Programming Fall 2010 Lecture 21 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
C&O 355 Lecture 20 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
How should we define corner points? Under any reasonable definition, point x should be considered a corner point x What is a corner point?
L12 LaGrange Multiplier Method Homework Review Summary Test 1.
Linear programming Thomas S. Ferguson University of California at Los Angeles Compressive Sensing Tutorial PART 3 Svetlana Avramov-Zamurovic January 29,
CS38 Introduction to Algorithms Lecture 15 May 20, CS38 Lecture 15.
Solving a system of equations 1 x x x 3 = 3 2 x x x 3 = 2 3 x x x 3 = 3.
Duality Dual problem Duality Theorem Complementary Slackness
Exploiting Duality (Particularly the dual of SVM) M. Pawan Kumar VISUAL GEOMETRY GROUP.
Optimality Conditions for Nonlinear Optimization Ashish Goel Department of Management Science and Engineering Stanford University Stanford, CA 94305, U.S.A.
KKT Practice and Second Order Conditions from Nash and Sofer
C&O 355 Mathematical Programming Fall 2010 Lecture 17 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
C&O 355 Mathematical Programming Fall 2010 Lecture 1 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A.
C&O 355 Lecture 2 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
C&O 355 Mathematical Programming Fall 2010 Lecture 2 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A.
C&O 355 Mathematical Programming Fall 2010 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
C&O 355 Mathematical Programming Fall 2010 Lecture 4 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
Simplex method (algebraic interpretation)
C&O 355 Mathematical Programming Fall 2010 Lecture 18 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A A A A A A A Image:
Hon Wai Leong, NUS (CS6234, Spring 2009) Page 1 Copyright © 2009 by Leong Hon Wai CS6234: Lecture 4  Linear Programming  LP and Simplex Algorithm [PS82]-Ch2.
Advanced Operations Research Models Instructor: Dr. A. Seifi Teaching Assistant: Golbarg Kazemi 1.
Nonlinear Programming I Li Xiaolei. Introductory concepts A general nonlinear programming problem (NLP) can be expressed as follows: objective function.
Optimization - Lecture 4, Part 1 M. Pawan Kumar Slides available online
X y x-y · 4 -y-2x · 5 -3x+y · 6 x+y · 3 Given x, for what values of y is (x,y) feasible? Need: y · 3x+6, y · -x+3, y ¸ -2x-5, and y ¸ x-4 Consider the.
C&O 355 Mathematical Programming Fall 2010 Lecture 5 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A.
Hon Wai Leong, NUS (CS6234, Spring 2009) Page 1 Copyright © 2009 by Leong Hon Wai CS6234: Lecture 4  Linear Programming  LP and Simplex Algorithm [PS82]-Ch2.
OR Simplex method (algebraic interpretation) Add slack variables( 여유변수 ) to each constraint to convert them to equations. (We may refer it as.
C&O 355 Lecture 7 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
CPSC 536N Sparse Approximations Winter 2013 Lecture 1 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAAAAAA.
OR Chapter 7. The Revised Simplex Method  Recall Theorem 3.1, same basis  same dictionary Entire dictionary can be constructed as long as we.
Inequalities for Stochastic Linear Programming Problems By Albert Madansky Presented by Kevin Byrnes.
Linear Programming Chapter 9. Interior Point Methods  Three major variants  Affine scaling algorithm - easy concept, good performance  Potential.
C&O 355 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
Linear Programming Chap 2. The Geometry of LP  In the text, polyhedron is defined as P = { x  R n : Ax  b }. So some of our earlier results should.
OR II GSLM
Submodularity Reading Group Matroid Polytopes, Polymatroid M. Pawan Kumar
Chap 10. Sensitivity Analysis
L11 Optimal Design L.Multipliers
Computational Optimization
Chap 9. General LP problems: Duality and Infeasibility
Polyhedron Here, we derive a representation of polyhedron and see the properties of the generators. We also see how to identify the generators. The results.
Chapter 5. Sensitivity Analysis
Chap 3. The simplex method
Analysis of Algorithms
Polyhedron Here, we derive a representation of polyhedron and see the properties of the generators. We also see how to identify the generators. The results.
Chapter 5. The Duality Theorem
I.4 Polyhedral Theory (NW)
Flow Feasibility Problems
C&O 355 Lecture 3 N. Harvey Review of Lecture 2:
C&O 355 Mathematical Programming Fall 2010 Lecture 7
I.4 Polyhedral Theory.
Chapter 2. Simplex method
BASIC FEASIBLE SOLUTIONS
Chapter 2. Simplex method
Presentation transcript:

C&O 355 Mathematical Programming Fall 2010 Lecture 15 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A A A A A A A

Topics Minimizing over a convex set: Necessary & Sufficient Conditions (Mini)- KKT Theorem Minimizing over a polyhedral set: Necessary & Sufficient Conditions Smallest Enclosing Ball Problem

Proof: ( direction Direct from subgradient inequality. (Theorem 3.5) f(z) ¸ f(x) + r f(x) T (z-x) ¸ f(x) Subgradient inequality Our hypothesis Thm 3.12: Let C µ R n be a convex set. Let f : R n ! R be convex and differentiable. Then x minimizes f over C iff r f(x) T (z-x) ¸ 0 8 z 2 C. Minimizing over a Convex Set: Necessary & Sufficient Conditions

Thm 3.12: Let C µ R n be a convex set. Let f : R n ! R be convex and differentiable. Then x minimizes f over C iff r f(x) T (z-x) ¸ 0 8 z 2 C. Proof: ) direction Let x be a minimizer, let z 2 C and let y = z-x. Recall that r f(x) T y = f’(x;y) = lim t ! 0 f(x+ty)-f(x). If limit is negative then we have f(x+ty)<f(x) for some t 2 [0,1], contradicting that x is a minimizer. So the limit is non-negative, and r f(x) T y ¸ 0. ¥ t

Smallest Ball Problem Let {p 1,…,p n } be points in R d. Find (unique!) ball (not an ellipsoid!) of smallest volume that contains all the p i ’s. In other words, we want to solve: min { r : 9 y 2 R d s.t. p i 2 B(y,r) 8 i } y r

Smallest Ball Problem Let {p 1,…,p n } be points in R d. Find (unique!) ball (not an ellipsoid!) of smallest volume that contains all the p i ’s. In other words, we want to solve: min { r : 9 y 2 R d s.t. p i 2 B(y,r) 8 i } We will formulate this as a convex program. In fact, our convex program will be of the form min { f(x) : Ax=b, x ¸ 0 }, where f is convex. Minimizing a convex function over a polyhedron To solve this, we will need optimality conditions for convex programs.

LP Optimality Conditions Theorem: Let x 2 R n be a feasible solution to the linear program max { c T x : Ax=b, x ¸ 0 } Then x is optimal iff 9 dual solution y 2 R m s.t. 1) A T y ¸ c, 2) For all j, if x j >0 then A j T y = c j. j th column of A Proof: Dual is min { b T y : A T y ¸ c }. x optimal ) dual has optimal solution y. So (1) holds by feasibility of y. By optimality of x & y, c T x=b T y. Weak duality says: Equality holds here ) (2) holds. (This is “complementary slackness”)

LP Optimality Conditions Theorem: Let x 2 R n be a feasible solution to the linear program min { -c T x : Ax=b, x ¸ 0 } Then x is optimal iff 9 dual solution y 2 R m s.t. 1) -c T + y T A ¸ 0, 2) For all j, if x j >0 then -c j + y T A j = 0. j th column of A Proof: Dual is min { b T y : A T y ¸ c }. x optimal ) dual has optimal solution y. So (1) holds by feasibility of y. By optimality of x & y, c T x=b T y. Weak duality says: Equality holds here ) (2) holds. (This is “complementary slackness”)

(Mini)-KKT Theorem j th column of A Theorem: Let f: R n ! R be a convex, C 2 function. Let x 2 R n be a feasible solution to the convex program min { f(x) : Ax=b, x ¸ 0 } Then x is optimal iff 9 y 2 R m s.t. 1) r f(x) T + y T A ¸ 0, 2) For all j, if x j >0 then r f(x) j + y T A j = 0. Natural generalization of LP optimality conditions: approximation at x is Proven by Karush in 1939 (his Master’s thesis!), and by Kuhn and Tucker in 1951.

Full KKT Theorem Even stating it requires a lot of details! See Section 3.7 and 3.8 of the course notes

Theorem: Let f: R n ! R be a convex, C 2 function. Let x 2 R n be a feasible solution to the convex program min { f(x) : Ax=b, x ¸ 0 } Then x is optimal iff 9 y 2 R m s.t. 1) r f(x) T + y T A ¸ 0, 2) For all j, if x j >0 then r f(x) j + y T A j = 0. Proof: ( direction. Suppose such a y exists. Then ( r f(x) T + y T A) x = 0. (Just like complementary slackness) For any feasible z 2 R n, we have ( r f(x) T + y T A) z ¸ 0. Subtracting these, and using Ax=Az=b, we get r f(x) T (z-x) ¸ 0 8 feasible z. So x is optimal. (By Thm 3.12: “Minimizing over a Convex Set”)

Theorem: Let f: R n ! R be a convex, C 2 function. Let x 2 R n be a feasible solution to the convex program min { f(x) : Ax=b, x ¸ 0 } Then x is optimal iff 9 y 2 R m s.t. 1) r f(x) T + y T A ¸ 0, 2) For all j, if x j >0 then r f(x) j + y T A j = 0. Proof: ) direction. Suppose x is optimal. Let c=- r f(x). Then r f(x) T (z-x) ¸ 0 ) c T z · c T x for all feasible points z. By Thm 3.12: “Minimizing over a Convex Set”

Theorem: Let f: R n ! R be a convex, C 2 function. Let x 2 R n be a feasible solution to the convex program min { f(x) : Ax=b, x ¸ 0 } Then x is optimal iff 9 y 2 R m s.t. 1) r f(x) T + y T A ¸ 0, 2) For all j, if x j >0 then r f(x) j + y T A j = 0. Proof: ) direction. Suppose x is optimal. Let c=- r f(x). Then r f(x) T (z-x) ¸ 0 ) c T z · c T x for all feasible points z. So x is optimal for the LP max { c T x : Ax=b, x ¸ 0 }. So there is an optimal solution y to dual LP min { b T y : A T y ¸ c }. So r f(x) T + y T A = -c T + y T A ¸ 0 ) (1) holds. Furthermore, x and y are both optimal so C.S. holds. ) whenever x j >0, the j th dual constraint is tight ) y T A j = c j ) (2) holds. ¥

Smallest Ball Problem Let P={p 1,…,p n } be points in R d. Let Q be d x n matrix s.t. Q i =p i. (Q i = i th column of Q) Let z 2 R n satisfy z i =p i T p i. Define f : R n ! R by f(x) = x T Q T Qx - x T z. Claim 1: f is convex. Consider the convex program min { f(x) :  j x j = 1, x ¸ 0 }. (Hessian is PSD) Interpretation Qx is an “average” (convex combination) of the p i ’s x T Q T Qx is norm 2 of this average point x T z is average norm 2 of the p i ’s If x T Q T Qx ¿ x T z, the p i ’s are “spread out” p1p1 p3p3 p2p2 x = [1/3, 1/3, 1/3, 0, 0, … ] QxQx

Smallest Ball Problem Let P={p 1,…,p n } be points in R d. Let Q be d x n matrix s.t. Q i =p i. (Q i = i th column of Q) Let z 2 R n satisfy z i =p i T p i. Define f : R n ! R by f(x) = x T Q T Qx - x T z. Claim 1: f is convex. Consider the convex program min { f(x) :  j x j = 1, x ¸ 0 }. Claim 2: This program has an optimal solution x. Claim 3: Assume x is optimal. Let p * =Qx and r= √ -f(x). Then P ½ B(p *,r). Claim 4: B(p *,r) is the smallest ball containing P. (Hessian is 2Q T Q, which is PSD) (By Weierstrass’ Theorem: the high-dimensional extreme value theorem)Weierstrass’ Theorem

Claim 3: The ball B(p *,r) contains P. Proof: By KKT, 9 y 2 R s.t. r f(x) + A T y ¸ 0 For us r f(x) = 2Q T Qx - z = 2Q T p * - z and A = [1, …, 1] So 2p j T p * - p j T p j + y ¸ 0 8 j. (Here y 2 R ) KKT also says: equality holds 8 j s.t. x j >0. So y =  j x j y =  j x j p j T p j - 2  j x j p j T p * =  j x j p j T p j - 2p *T p *. So y + p *T p * =  j x j p j T p j - p *T p * = -f(x) ) r = √ y + p *T p * It remains to show that B(p *,r) contains P. This holds iff k p j -p * k · r 8 j. Now k p j -p * k 2 = (p j -p * ) T (p j -p * ) = p *T p * -2p j T p * +p j T p j · p *T p * +y = r 2 8 j. ¤ y ¸ p j T p j - 2p j T p * 8 j.

Claim 4: B(p *,r) is the smallest ball containing P. Proof: See Matousek-Gartner Section 8.7.

Smallest Ball Problem: Summary Consider the convex program min { f(x) :  j x j = 1, x ¸ 0 }. Claim 2: This program has an optimal solution x. Claim 3: Let p * =Qx and r= √ -f(x). Then P ½ B(p *,r). Claim 4: B(p *,r) is the smallest ball containing P. This example is a bit strange: – Not obvious how convex program relates to balls – Claim 3 is only valid when x is the optimal point Still, KKT tells us interesting things: – optimal value of convex program gives radius of smallest ball containing P