Understanding the Power of Convex Relaxation Hierarchies: Effectiveness and Limitations Yuan Zhou Computer Science Department Carnegie Mellon University.

Presentation transcript:

Understanding the Power of Convex Relaxation Hierarchies: Effectiveness and Limitations
Yuan Zhou
Computer Science Department, Carnegie Mellon University

Combinatorial Optimization
Goal: optimize an objective function of n 0-1 variables
Subject to: certain constraints
Arises everywhere in Computer Science, Operations Research, Scheduling, etc.

Example 1: MaxCut
Input: graph G = (V, E)
Goal: partition V into two parts A & B = V - A such that edges(A, B), the number of edges between A & B, is maximized
Can also be formulated as maximizing an objective over 0-1 variables x_i
A fundamental (and very easily stated) combinatorial optimization problem
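
A minimal brute-force sketch of the MaxCut definition above. The slide's objective formula is not preserved in the transcript, so the code assumes the standard 0-1 formulation, the sum over edges (i, j) of (x_i - x_j)^2. The function name and the example graphs are illustrative only.

```python
from itertools import product

def max_cut_brute_force(n, edges):
    """Return (best_value, best_assignment) over all 0-1 assignments."""
    best_value, best_x = -1, None
    for x in product((0, 1), repeat=n):
        # (x[i] - x[j]) ** 2 equals 1 exactly when edge (i, j) crosses the cut.
        value = sum((x[i] - x[j]) ** 2 for i, j in edges)
        if value > best_value:
            best_value, best_x = value, x
    return best_value, best_x

triangle = [(0, 1), (1, 2), (0, 2)]
four_cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]
print(max_cut_brute_force(3, triangle)[0])    # 2
print(max_cut_brute_force(4, four_cycle)[0])  # 4
```

The search takes 2^n steps, which is why the talk turns to relaxations.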

Example 2: SparsestCut
Input: graph G = (V, E)
Goal: partition V into two parts A & B = V - A such that the sparsity is minimized
Closely related to the NormalizedCut problem in Image Segmentation (pictures from [ShiMalik00])

Convex relaxations
For most optimization problems, computing the exact optimum is NP-hard
Various approaches to approximate the optimal solution: greedy, heuristics, convex relaxations

Convex relaxations
Linear programming (LP) / semidefinite programming (SDP) relaxations
– SDP: “super LP”, computationally tractable
Pipeline: integer program of the optimization problem (NP-hard) → relax the constraints → convex program, LP/SDP (computationally tractable) → solve → optimal solution to the convex program, which approximates the integer optimum

Convex relaxations
Linear programming (LP) / semidefinite programming (SDP) relaxations
Focus of this talk: LP/SDP relaxation hierarchies (Relaxation #1, #2, #3, #4, …)
– A sequence of more and more powerful relaxations
– Extremely successful at approximating the optimum
– Imply almost all known approximation algorithms

Outline of my research on hierarchies
Introduction to convex relaxation hierarchies
Use hierarchies to design approximation algorithms
– dense MaxCut, dense k-CSP, metric MaxCut, locally-dense k-CSP, dense MaxGraphIsomorphism, (dense & metric) MaxGraphIsomorphism [Yoshida-Zhou’14]
What problems are resistant to hierarchies (the limitation of hierarchies)?
– SparsestCut [Guruswami-Sinop-Zhou’13], Dense k Subgraph [Bhaskara-Charikar-Guruswami-Vijayaraghavan-Zhou’12], GraphIsomorphism [O’Donnell-Wright-Wu-Zhou’14]
New perspective for hierarchies [Barak-Brandão-Harrow-Kelner-Steurer-Zhou’12, O’Donnell-Zhou’13, …]
– Connection from the theory of algebraic proof complexity
– New insight into the big open problem in approximation algorithms

Outline of this talk
Introduction to convex relaxation hierarchies
Use hierarchies to design approximation algorithms
– dense MaxCut, dense k-CSP, metric MaxCut, locally-dense k-CSP, dense MaxGraphIsomorphism, (dense & metric) MaxGraphIsomorphism [Yoshida-Zhou’14]
What problems are resistant to hierarchies (the limitation of hierarchies)?
– SparsestCut [Guruswami-Sinop-Zhou’13], Dense k Subgraph [Bhaskara-Charikar-Guruswami-Vijayaraghavan-Zhou’12], GraphIsomorphism [O’Donnell-Wright-Wu-Zhou’14]
New perspective for hierarchies
– Connection from the theory of algebraic proof complexity
– New insight into the big open problem in approximation algorithms

Writing linear programming (LP) relaxations
Toy problem #1: Integer Program → LP relaxation (variables relaxed from {0, 1} to [0, 1])
[Figure: the unit square with corners (0, 0), (1, 0), (0, 1), (1, 1); the line x+y = 1 marks the integer optimum and the line x+y = 3/2, through (3/4, 3/4), marks the relaxation optimum]
True Optimum: 1
Relaxation Optimum: 3/2, attained at (3/4, 3/4)
Integrality gap (IG) = True Optimum / Relaxation Optimum = 1 / (3/2) = 2/3, i.e. a “2/3-approximation”; the closer to 1, the better the approximation
Typical way of approximating the true optimum
Analysis of the approximation ratio needs to understand the extra solutions introduced by the relaxation
This example is credited to Madhur Tulsiani.
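
A small sketch of the integrality-gap computation for a toy problem of this shape. The slide's actual constraints are not preserved, so the constraints below (x + 3y ≤ 3 and 3x + y ≤ 3) are hypothetical, chosen only so that the integer optimum is 1 and the LP optimum is 3/2 at (3/4, 3/4) as on the slide. The LP is solved by checking every vertex of the feasible polygon with exact fractions.

```python
from fractions import Fraction as F
from itertools import combinations, product

# Hypothetical constraints a*x + b*y <= c (not taken from the slide).
CONSTRAINTS = [
    (F(1), F(3), F(3)),    # x + 3y <= 3
    (F(3), F(1), F(3)),    # 3x + y <= 3
    (F(1), F(0), F(1)),    # x <= 1
    (F(0), F(1), F(1)),    # y <= 1
    (F(-1), F(0), F(0)),   # x >= 0
    (F(0), F(-1), F(0)),   # y >= 0
]

def feasible(x, y):
    return all(a * x + b * y <= c for a, b, c in CONSTRAINTS)

def lp_optimum():
    """Maximize x + y over the relaxed region by checking every vertex."""
    best = None
    for (a1, b1, c1), (a2, b2, c2) in combinations(CONSTRAINTS, 2):
        det = a1 * b2 - a2 * b1
        if det == 0:
            continue  # parallel boundary lines never meet
        x = (c1 * b2 - c2 * b1) / det  # Cramer's rule
        y = (a1 * c2 - a2 * c1) / det
        if feasible(x, y) and (best is None or x + y > best[0]):
            best = (x + y, (x, y))
    return best

def ip_optimum():
    """Maximize x + y over the 0-1 points that satisfy the constraints."""
    return max(x + y for x, y in product((0, 1), repeat=2) if feasible(x, y))

lp_value, lp_point = lp_optimum()
print(lp_value, lp_point)        # 3/2 (Fraction(3, 4), Fraction(3, 4))
print(ip_optimum() / lp_value)   # 2/3
```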

Writing semidefinite programming (SDP) relaxations
Toy problem #2: MaxCut on a triangle (vertices x, y, z) → SDP relaxation (BasicSDP): integers relaxed to vectors
True Optimum: 2
Relaxation Optimum: 9/4
Integrality gap (IG) = 2 / (9/4) = 8/9 ≈ .889
Can write similar SDP relaxations for every MaxCut instance
– Integrality gap might be worse
[Goemans-Williamson’95] IG > .878 for every MaxCut instance
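
A short numerical check of the 9/4 value. The transcript does not preserve the SDP itself, so this assumes the standard Goemans-Williamson form: maximize the sum over edges of (1 - <v_i, v_j>) / 2 over unit vectors. The code only evaluates one feasible solution (three unit vectors 120 degrees apart); it does not prove that this solution is optimal.

```python
import math

def unit_vectors_120_degrees():
    """Three unit vectors in the plane, pairwise 120 degrees apart."""
    return [(math.cos(2 * math.pi * k / 3), math.sin(2 * math.pi * k / 3))
            for k in range(3)]

def sdp_objective(vectors, edges):
    """Sum over edges of (1 - <v_i, v_j>) / 2, the vector analogue of cut size."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))
    return sum((1 - dot(vectors[i], vectors[j])) / 2 for i, j in edges)

triangle = [(0, 1), (1, 2), (0, 2)]
value = sdp_objective(unit_vectors_120_degrees(), triangle)
print(round(value, 6))      # 2.25
print(round(2 / value, 3))  # 0.889
```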

Tighten the relaxations
Toy problem #2: MaxCut on a triangle (vertices x, y, z) → BasicSDP relaxation with triangle inequalities
True Optimum: 2
Relaxation Optimum: 2
Integrality gap (IG) = 2 / 2 = 1
Do triangle ineq.’s always improve the BasicSDP in the worst cases?
[Khot-Vishnoi’05] No. The worst-case integrality gap is still ≈
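
A sketch of why triangle inequalities close the gap on the triangle. The slide does not show which inequalities are added; the code assumes the standard squared-distance triangle inequalities on the vectors and their negations, one of which is equivalent to <v_i, v_j> + <v_j, v_k> + <v_i, v_k> ≥ -1. Every true cut satisfies it, while the 120-degree vector solution with value 9/4 violates it.

```python
from itertools import product

def triangle_inequality_holds(x_ij, x_jk, x_ik):
    """Check x_ij + x_jk + x_ik >= -1 for the three pairwise inner products."""
    return x_ij + x_jk + x_ik >= -1

# Every genuine cut (+1/-1 labels) satisfies the inequality.
all_cuts_ok = all(
    triangle_inequality_holds(a * b, b * c, a * c)
    for a, b, c in product((1, -1), repeat=3)
)

# The 120-degree vector solution has all three inner products equal to -1/2.
vector_solution_ok = triangle_inequality_holds(-0.5, -0.5, -0.5)

def objective_from_sum(inner_product_sum):
    """On the triangle the objective is (3 - sum of inner products) / 2."""
    return (3 - inner_product_sum) / 2

# With the sum forced to be at least -1, the objective is at most 2.
print(all_cuts_ok, vector_solution_ok, objective_from_sum(-1))  # True False 2.0
```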

Tighten the relaxations
[Khot-Vishnoi’05] Triangle ineq.’s do not improve the worst-case integrality gap for MaxCut
On many occasions, triangle ineq.’s do help
Famous example: SparsestCut on an n-vertex graph
– IG of BasicSDP:
– IG after triangle ineq.’s: [Arora-Rao-Vazirani’04]
Can add even more constraints, leading to even better approximation guarantees

LP/SDP relaxation hierarchies
Automatic ways to generate more and more variables & constraints, leading to tighter and tighter relaxations
[Figure: nested relaxations around the integer points (0, 0), (1, 0), (0, 1), (1, 1), labeled BasicRelaxation (Level-1), Level-2, Level-3]
Start from the BasicRelaxation; power of the program increases as the level goes up
Hierarchies studied in Operations Research
– Lovász-Schrijver LP (LS)
– Sherali-Adams (SA LP, SA+ SDP)
– Lasserre-Parrilo SDP (Las)
[Diagram: the level-k relaxations LS(k), SA(k), SA+(k), Las(k) compared by strength with ≥ relations]
Powerful algorithmic framework capturing most known approximation algorithms within constant levels
– E.g. Arora-Rao-Vazirani algorithm
At Level-k: n^O(k) var.’s, solvable in n^O(k) time
Level-n tight (n: input size)
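
A quick illustration of the n^O(k) size bound. It assumes, as in common presentations of Sherali-Adams, that the level-k relaxation has one variable per subset of at most k of the n original variables; the exact count depends on the formulation, and the function name is illustrative.

```python
from math import comb

def num_local_variables(n, k):
    """Number of subsets of an n-element set with size at most k."""
    return sum(comb(n, size) for size in range(k + 1))

for k in (1, 2, 3):
    # The count stays below (n + 1) ** k, so it is polynomial for constant k.
    print(k, num_local_variables(100, k), (100 + 1) ** k)

# At level n every subset gets a variable, matching "Level-n tight".
print(num_local_variables(10, 10) == 2 ** 10)  # True
```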

Our results: Sherali-Adams LP hierarchy for dense MaxCut
Theorem. [Yoshida-Zhou’14] For dense MaxCut, the Sherali-Adams LP hierarchy approximates the optimum arbitrarily well in a constant number of levels (polynomial time)
– Integrality gap of level-O(1/ε^2) Sherali-Adams LP is (1-ε) for dense MaxCut, for any constant ε
A graph with n vertices has at most n^2 edges
Say it’s dense if it has at least .01n^2 edges
General MaxCut
– .878-approximable by SDP [Goemans-Williamson’95]
– NP-hard to .941-approximate [Håstad’01, TSSW’00]

Our results: summary
Within a few levels, the Sherali-Adams LP hierarchy approximates arbitrarily well
– dense MaxCut
– dense k-CSP
– metric MaxCut
– locally-dense k-CSP
– dense MaxGraphIsomorphism
– (dense & metric) MaxGraphIsomorphism
Many of our algorithmic results were known via other techniques (some are new, not known before):
– [dlV’96] via sampling and exhaustive search
– [FK’96] via weak Szemerédi’s regularity lemma
– [dlVK’01] via copying important variables
– [dlVKKV’05] via a variant of SVD
– [AFK’02] via LP relaxation for “assignment problems with extra constraints”
Our results show that the Sherali-Adams LP hierarchy is a unified approach implying all previous techniques!

Limitations of hierarchies
We will prove theorems in the following style: fix a problem (e.g. MaxCut); even using many levels (e.g. >100, >log n, >.1n) of the hierarchy, the integrality gap is still bad
– Design a (MaxCut) instance I
– Prove real MaxCut of I small
– Prove relaxation thinks MaxCut of I large
I.e. the hierarchy does not give good approximation
Recall the triangle example: True Optimum: 2, Relaxation Optimum: 9/4, Integrality gap (IG) = 2 / (9/4) ≈ .889; here we want it far from 1

Motivation
The big open problem in approximation algorithms research
– Is it NP-hard to beat .878-approximation for MaxCut (Goemans-Williamson SDP)?
– I.e. is Goemans-Williamson SDP optimal?

Motivation
Big open problem
– NP-hardness of beating .878-approximation for MaxCut (Goemans-Williamson SDP)?
Why?
– Mysterious true answer
– (If no) better algorithm, disprove Unique Games Conjecture
– (If yes) optimality of BasicSDP (for many problems), connect geometry and computation
How?
– Hmm… we are working on it
What to do instead/as a first step
– Do our most powerful algorithms (hierarchies) fail to beat the Goemans-Williamson SDP?
Why?
– Predicts the true answer
– (If no) better algorithm, disprove Unique Games Conjecture
– (If yes) BasicSDP optimal in a huge class of convex relaxations
– New ways of reasoning about convex relaxation hierarchies

Limitations for hierarchies
Recall: Lasserre-Parrilo is the strongest hierarchy known
Have seen that a few levels (O(1)) of the Sherali-Adams LP hierarchy are already powerful
Will prove limitations of the Lasserre-Parrilo SDP hierarchy with many levels (n^.01) for several problems
– Predict the NP-hardness of approximating these problems
– At least substantially new algorithmic ideas needed

Our results: SparsestCut & Dense k Subgraph
Theorem. [Guruswami-Sinop-Zhou’13] factor integrality gap of Ω(n)-level Lasserre-Parrilo for SparsestCut
Theorem. [Bhaskara-Charikar-Guruswami-Vijayaraghavan-Zhou’12] n^(2/53)-factor integrality gap of Ω(n^.01)-level Lasserre-Parrilo for Dense k Subgraph
– Dense k Subgraph: given graph G = (V, E), find a set A of k vertices such that the number of edges in A is maximized
– Frequently arises in community detection (social networks)

Problem            Best Approx. Alg    Best NP-Hardness    Our IG
SparsestCut        [ARV’04]            None known
Dense k Subgraph   [BCCFV’10]          None known          n^(2/53)
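
A brute-force sketch of the Dense k Subgraph definition above, for tiny graphs only. The function name and the example graph are illustrative, not from the talk.

```python
from itertools import combinations

def densest_k_subgraph(n, edges, k):
    """Return (edge_count, vertex_set) maximizing the edges inside a k-vertex set."""
    best = (-1, None)
    for subset in combinations(range(n), k):
        chosen = set(subset)
        # Count the edges with both endpoints inside the chosen set.
        inside = sum(1 for u, v in edges if u in chosen and v in chosen)
        if inside > best[0]:
            best = (inside, subset)
    return best

# A triangle {0, 1, 2} with a pendant path 2-3-4.
edges = [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4)]
print(densest_k_subgraph(5, edges, 3))  # (3, (0, 1, 2))
```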

Our results: GraphIsomorphism
[Figure: a pair of isomorphic graphs and a pair of non-isomorphic graphs]

Our results: GraphIsomorphism
Sherali-Adams LP hierarchy for GraphIsomorphism (GIso)
– A.k.a. high dimensional color refinement/Weisfeiler-Lehman alg.
– A widely used heuristic
– A subroutine of Babai-Luks - time GIso algorithm
Once conjectured: O(1)-level Sherali-Adams LP solves GIso
Refuted by [Cai-Fürer-Immerman’92]: even when .1n-level Sherali-Adams LP says isomorphic, the two graphs might be non-isomorphic
Theorem. [O’Donnell-Wright-Wu-Zhou’14] Even when .1n-level Lasserre-Parrilo SDP says isomorphic, the two graphs might be far from being isomorphic
– i.e. one has to modify an Ω(1)-fraction of the edges to align the graphs
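
A minimal sketch of the lowest level of this heuristic, 1-dimensional color refinement, run on the disjoint union of two graphs. It is only meant to show how the test can answer "isomorphic" on non-isomorphic graphs; the example (a 6-cycle versus two triangles) is a textbook one and far simpler than the Cai-Fürer-Immerman construction. Function names are illustrative.

```python
from collections import Counter

def refine(n, edges):
    """Return stable vertex colors from 1-dimensional color refinement."""
    neighbors = [[] for _ in range(n)]
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)
    colors = [0] * n  # every vertex starts with the same color
    for _ in range(n):  # the partition can be strictly refined fewer than n times
        # New color = (old color, sorted multiset of the neighbors' colors).
        signatures = [
            (colors[v], tuple(sorted(colors[u] for u in neighbors[v])))
            for v in range(n)
        ]
        palette = {sig: idx for idx, sig in enumerate(sorted(set(signatures)))}
        new_colors = [palette[sig] for sig in signatures]
        if new_colors == colors:
            break
        colors = new_colors
    return colors

def indistinguishable(n, edges_g, edges_h):
    """True if refinement gives both n-vertex graphs the same color histogram."""
    union = list(edges_g) + [(u + n, v + n) for u, v in edges_h]
    colors = refine(2 * n, union)
    return Counter(colors[:n]) == Counter(colors[n:])

six_cycle = [(i, (i + 1) % 6) for i in range(6)]
two_triangles = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)]
print(indistinguishable(6, six_cycle, two_triangles))  # True, yet not isomorphic
```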

Hierarchy integrality gaps for MaxCut
Recall
– Big open problem: is Goemans-Williamson SDP the best polynomial-time algorithm for MaxCut?
– As the first step: do hierarchies give .879-approximation (beat Goemans-Williamson)?
Known results for Sherali-Adams+ SDP [KV’05, RS’09, BGHMRS’12]
– Level- SA+ SDP does not .879-approximate MaxCut
– I.e. there exist MaxCut instances hard for SA+ SDP (integrality gap)
– Hardest instances known for MaxCut

Applying Lasserre-Parrilo to hard instances for Sherali-Adams+ SDP
Known results. Instances hard for the Sherali-Adams+ SDP hierarchy
Question. Are these MaxCut instances also .878-integrality gap instances for the Lasserre-Parrilo SDP hierarchy?
Our answer. No!
– Theorem. [Barak-Brandão-Harrow-Kelner-Steurer-Zhou’12, O’Donnell-Zhou’13] O(1)-level Lasserre-Parrilo gives better-than-.878 approximation to these MaxCut instances

Why is this interesting?
The big open question: Is Goemans-Williamson the best polynomial-time algorithm for MaxCut?
Evidence for Yes: [KV’05, RS’09, BGHMRS’12] GW is optimal in the Sherali-Adams+ hierarchy
Evidence for No (our results): the hard instances behind the evidence for Yes are solved by Lasserre-Parrilo

Why is this interesting?
Lasserre-Parrilo succeeds on the hardest known MaxCut instances, with the potential to work for all MaxCut instances
– Seriously questions possible optimality of GW
Separates Lasserre-Parrilo from Sherali-Adams+
Our proof technique
– A surprising connection from the theory of algebraic proof complexity

The connection from algebraic proof complexity
We relate the power of Lasserre-Parrilo to the power of an algebraic proof system
– Sum-of-Squares (SOS) proof system
– Proof system where the only way to deduce an inequality is by p(x)^2 ≥ 0
– Dates back to Hilbert’s 17th Problem: given a multivariate polynomial that takes only non-negative values over the reals, can it be represented as a sum of squares of rational functions?

Our proof method
Recall: how to prove integrality gaps for MaxCut
– Design a MaxCut instance I
– Prove real MaxCut of I small
– Prove relaxation thinks MaxCut of I large
Our goal. Prove I is not a Lasserre-Parrilo SDP integrality gap instance
– Prove Lasserre-Parrilo SDP certifies MaxCut of I small
Our method. By the weak duality theorem for SDPs (primal optimum ≤ value of any dual solution), design a dual solution with small objective value

Algebraic proof systems – a new perspective for Lasserre-Parrilo
Our method. Design a dual solution with small objective value
What is Lasserre-Parrilo SDP?
– Omitted due to time constraints…
What is the dual SDP of Lasserre-Parrilo?
Our key observation. (new view of the dual) An SOS proof corresponds to a dual solution
– i.e. an SOS proof that MaxCut is small gives a small dual value
Our goal. Translate the proof into the SOS proof system
Proofs of the known MaxCut IG [KV’05]
– Design a MaxCut instance I
– Prove real MaxCut of I small
– Prove relaxation thinks MaxCut of I large

A comparison
Both tasks: prove the MaxCut of the instance I is at most β
Construct integrality gaps
– Can use all mathematical proof techniques
– Give a deep proof to a deep theorem
Our goal
– Can only use the limited axioms (as given by the SOS proof system)
– Give a “simple” (restricted) proof to a deep theorem
What is the Sum-of-Squares (SOS) proof system?

Example of Sum-of-Squares proof system
Goal: assume, prove
Step 1: turn to refute
Step 2: assume there were a solution
Step 3: come up with the following identity (its terms are marked as squared polynomial and non-negative)
Step 4: contradiction
A degree-2 SOS proof

Another example: MaxCut on triangle graph
To prove MaxCut at most 2 (triangle with vertices x, y, z)
Step 1: turn to refute (for any ε > 0)
Step 2: assume there were a solution
Step 3: an identity of the form 0 = … whose terms are marked as non-negative or as squared polynomials
Step 4: contradiction
Degree-4 SOS proof
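
The slide's identity is not preserved in the transcript. As a hedged illustration, here is one possible degree-4 certificate, which may differ from the one on the slide. With x, y, z in {+1, -1} and s = xy + yz + xz, the relations x^2 = y^2 = z^2 = 1 give (1 + s)^2 = 4 + 4s, so 2 - cut = (1 + s)/2 = (1 + s)^2 / 8, a scaled square of a degree-2 polynomial. The code checks this identity on all eight assignments.

```python
from itertools import product

def cut_value(x, y, z):
    """Number of triangle edges cut by the +1/-1 labeling (x, y, z)."""
    return ((1 - x * y) + (1 - y * z) + (1 - x * z)) // 2

def certificate(x, y, z):
    """(1 + xy + yz + xz)^2 / 8: a scaled square, hence never negative."""
    return (1 + x * y + y * z + x * z) ** 2 / 8

for x, y, z in product((1, -1), repeat=3):
    # 2 - cut equals a square on every assignment, so the cut is at most 2.
    assert 2 - cut_value(x, y, z) == certificate(x, y, z)

print(max(cut_value(x, y, z) for x, y, z in product((1, -1), repeat=3)))  # 2
```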

Lasserre-Parrilo and the Sum-of-Squares proof system
A degree-d (for constant d) SOS proof can be found by an SDP in n^O(d) time
Key observation. A degree-d SOS proof corresponds to a solution of the dual of level-d Lasserre-Parrilo

Lasserre-Parrilo succeeds on known MaxCut instances: one-slide proof
Theorem. MaxCut of this graph is ≤ blah
Proof. …Influence Decoding… …Invariance Principle… …Majority-Is-Stablest… …Smallset Expansion… …Hypercontractivity…
Our new proof. “Check out these polynomials.”
However, giving elementary proofs to deep theorems is more challenging and needs new mathematical ideas (38 pages, 40 pages, 52 pages).

Other works along this line
[De-Mossel-Neeman’13] O(1)-level Lasserre-Parrilo almost exactly computes the optimum of the known MaxCut instances
– Improves our work [O’Donnell-Zhou’13], which states that Lasserre-Parrilo gives better-than-.878 approximation
[Barak-Brandão-Harrow-Kelner-Steurer-Zhou’12] O(1)-level Lasserre-Parrilo succeeds on all known UniqueGames instances (UniqueGames: central problem in approximation algorithms)
[O’Donnell-Zhou’13] O(1)-level Lasserre-Parrilo succeeds on the known BalancedSeparator instances (BalancedSeparator: a similar problem to SparsestCut)
[Kauers-O’Donnell-Tan-Zhou’14] O(1)-level Lasserre-Parrilo succeeds on the hard instances for 3-Coloring

Summary
We utilize the connection between convex programming relaxations and the theory of algebraic proof complexity
– Lasserre-Parrilo solves the hardest known instances for MaxCut, UniqueGames, BalancedSeparator, 3-Coloring, …
– Motivates study of the SOS proof system to further understand the power of Lasserre-Parrilo
– Optimality of BasicSDP (Goemans-Williamson) seems more mysterious

Future directions
The big open question: Is Goemans-Williamson the best polynomial-time algorithm for MaxCut?
Our first step: Is Goemans-Williamson the best in the Lasserre-Parrilo hierarchy?
Maybe No?
– Lasserre-Parrilo gives better approximation for all MaxCut instances?
– We made an initial step in this direction
Maybe Yes?
– We gave insight into designing integrality gap instances: avoid the power of the SOS proof system!

Future directions
Concrete open problem. Does level-2 Lasserre-Parrilo improve Goemans-Williamson?
Other future directions
– Improve our integrality gap theorems for SparsestCut and Dense k Subgraph
– Beyond worst-case analysis via Lasserre-Parrilo: real-world instances, random instances
– Initial results (for the 2->4 MatrixNorm problem) in [Barak-Brandão-Harrow-Kelner-Steurer-Zhou’12]

The End
Thanks!

Questions?