The Theory of NP-Completeness
Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking stage of a nondeterministic algorithm is of polynomial time-complexity, then this algorithm is called an NP (nondeterministic polynomial) algorithm. NP problems : (must be decision problems) e.g. searching, MST sorting satisfiability problem (SAT) traveling salesperson problem (TSP)
Nondeterministic operations and functions Choice(S) : arbitrarily chooses one of the elements in set S Failure : an unsuccessful completion Success : a successful completion Nonderministic searching algorithm: j ← choice(1 : n) /* guessing */ if A(j) = x then success /* checking */ else failure 3- 3
A nondeterministic algorithm terminates unsuccessfully iff there exist no a set of choices leading to a success signal. The time required for choice(1 : n) is O(1). A deterministic interpretation of a non-deterministic algorithm can be made by allowing unbounded parallelism in computation. 3- 4
Nondeterministic sorting B ← 0 /* guessing */ for i = 1 to n do j ← choice(1 : n) if B[j] ≠ 0 then failure B[j] = A[i] /* checking */ for i = 1 to n-1 do if B[i] > B[i+1] then failure success 3- 5
NP : the class of decision problem which can be solved by a non-deterministic polynomial algorithm. P: the class of problems which can be solved by a deterministic polynomial algorithm. NP-hard: the class of problems to which every NP problem reduces. NP-complete (NPC): the class of problems which are NP-hard and belong to NP.
Some concepts of NPC Definition of reduction: Problem A reduces to problem B (A B) iff A can be solved by a deterministic polynomial time algorithm using a deterministic algorithm that solves B in polynomial time. B is harder. Up to now, none of the NPC problems can be solved by a deterministic polynomial time algorithm in the worst case. It does not seem to have any polynomial time algorithm to solve the NPC problems.
Theory of NP-completeness The theory of NP-completeness always considers the worst case. The lower bound of any NPC problem seems to be in the order of an exponential function. Not all NP problems are difficult. (e.g. the MST problem is an NP problem.) If A, B NPC, then A B and B A. Theory of NP-completeness If any NPC problem can be solved in polynomial time, then all NP problems can be solved in polynomial time. (NP = P)
Decision problems The solution is simply “Yes” or “No”. Optimization problems are more difficult. e.g. the traveling salesperson problem Optimization version: Find the shortest tour Decision version: Is there a tour whose total length is less than or equal to a constant c ?
Solving an optimization problem by a decision algorithm : Solving TSP optimization problem by decision algorithm : Give c1 and test (decision algorithm) Give c2 and test (decision algorithm) Give cn and test (decision algorithm) We can easily find the smallest ci
The satisfiability problem The logical formula : x1 v x2 v x3 & - x1 & - x2 the assignment : x1 ← F , x2 ← F , x3 ← T will make the above formula true . (-x1, -x2 , x3) represents x1 ← F , x2 ← F , x3 ← T
An unsatisfiable formula : If there is at least one assignment which satisfies a formula, then we say that this formula is satisfiable; otherwise, it is unsatisfiable. An unsatisfiable formula : x1 v x2 & x1 v -x2 & -x1 v x2 & -x1 v -x2
Definition of the satisfiability problem: Given a Boolean formula, determine whether this formula is satisfiable or not. A literal : xi or -xi A clause : x1 v x2 v -x3 Ci A formula : conjunctive normal form C1& C2 & … & Cm
Cook’s theorem NP = P iff the satisfiability problem is a P problem. SAT is NP-complete. It is the first NP-complete problem. Every NP problem reduces to SAT.
The vertex cover problem Def: Given a graph G=(V, E), S is the node cover if S V and for every edge (u, v) E, either u S or v S or both. node cover : {1, 3} {5, 2, 4} Decision problem : S S K
Chromatic number decision problem (CN) Def: A coloring of a graph G=(V, E) is a function f : V { 1, 2, 3,…, k } such that if (u, v) E, then f(u)f(v). The CN problem is to determine if G has a coloring for k. E.g. <Theorem> Satisfiability with at most 3 literals per clause (SATY) CN. 3-colorable f(a)=1, f(b)=2, f(c)=1 f(d)=2, f(e)=3
Set cover decision problem Def: F = {Si} = { S1, S2, …, Sk } Si = { u1, u2, …, un } T is a set cover of F if T F and Si = Si The set cover decision problem is to determine if F has a cover T containing no more than c sets. example F = {(a1, a3), (a2, a4), (a2, a3), (a4), (a1, a3 , a4)} s1 s2 s3 s4 s5 T = { s1, s3, s4 } set cover T = { s1, s2 } set cover, exact cover
Sum of subsets problem a constant C Determine if A A ai = C Def: A set of positive numbers A = { a1, a2, …, an } a constant C Determine if A A ai = C e.g. A = { 7, 5, 19, 1, 12, 8, 14 } C = 21, A = { 7, 14 } C = 11, no solution <Theorem> Exact cover sum of subsets.
Partition problem <Theorem> sum of subsets partition Def: Given a set of positive numbers A = { a1,a2,…,an }, determine if a partition P, ai = ai ip ip e. g. A = {3, 6, 1, 9, 4, 11} partition : {3, 1, 9, 4} and {6, 11} <Theorem> sum of subsets partition
Bin packing problem <Theorem> partition bin packing. Def: n items, each of size ci , ci > 0 bin capacity : C Determine if we can assign the items into k bins, ci C , 1jk. ibinj <Theorem> partition bin packing.
VLSI discrete layout problem Given: n rectangles, each with height hi (integer) width wi and an area A Determine if there is a placement of the n rectangles within the area A according to the rules : Boundaries of rectangles parallel to x axis or y axis. Corners of rectangles lie on integer points. No two rectangles overlap. Two rectangles are separated by at least a unit distance.
<Theorem> bin packing VLSI discrete layout. A Successful Placement <Theorem> bin packing VLSI discrete layout.
Max clique problem Def: A maximal complete subgraph of a graph G=(V,E) is a clique. The max (maximum) clique problem is to determine the size of a largest clique in G. e. g. <Theorem> SAT clique decision problem. maximal cliques : {a, b}, {a, c, d} {c, d, e, f} maximum clique : (largest)
Hamiltonian cycle problem Def: A Hamiltonian cycle is a round trip path along n edges of G which visits every vertex once and returns to its starting vertex. e.g. Hamiltonian cycle : 1, 2, 8, 7, 6, 5, 4, 3, 1. <Theorem> SAT directed Hamiltonian cycle ( in a directed graph )
Traveling salesperson problem Def: A tour of a directed graph G=(V, E) is a directed cycle that includes every vertex in V. The problem is to find a tour of minimum cost. <Theorem> Directed Hamiltonian cycle traveling salesperson decision problem.
0/1 knapsack problem Def: n objects, each with a weight wi > 0 a profit pi > 0 capacity of knapsack : M Maximize pixi 1in Subject to wixi M xi = 0 or 1, 1 i n Decision version : Given K, pixi K ? Knapsack problem : 0 xi 1, 1 i n. <Theorem> partition 0/1 knapsack decision problem.
Toward NP-Completeness: Cook’s theorem: The SAT problem is NP-complete. Once we have found an NP-complete problem, proving that other problems are also NP-complete becomes easier. Given a new problem Y, it is sufficient to prove that Cook’s problem, or any other NP-complete problems, is polynomially reducible to Y. Known problem -> unknown problem
NP-Completeness Proof The following problems are NP-complete: vertex cover(VC) and clique. Definition: A vertex cover of G=(V, E) is V’V such that every edge in E is incident to some vV’. Vertex Cover(VC): Given undirected G=(V, E) and integer k, does G have a vertex cover with k vertices? CLIQUE: Does G contain a clique of size k?
NP-Completeness Proof: Reduction To prove NP-completeness of a new problem, we first prove that the problem belongs to NP, which is usually easy, then reduce a known NP-complete problem to our problem in polynomial time.
NP-Completeness Proof: Vertex Cover(VC) Problem: Given undirected G=(V, E) and integer k, does G have a vertex cover with k vertices? Theorem: the VC problem is NP-complete. Proof: (Reduction from CLIQUE, i.e., given CLIQUE is NP-complete) VC is in NP. This is trivial since we can check it easily in poly-time. Goal: Transform arbitrary CLIQUE instance into VC instance such that CLIQUE answer is “yes” iff VC answer is “yes”.
NP-Completeness Proof: Vertex Cover(VC) Claim: CLIQUE(G, k) has same answer as VC ( , n-k), where n = |V|. Observe: There is a clique of size k in G iff there is a VC of size n-k in .
NP-Completeness Proof: Vertex Cover(VC) Observe: If D is a VC in , then has no edge between vertices in V-D. So, we have k-clique in G n-k VC in Can transform in polynomial time.
NP-Completeness Proof: CLIQUE Problem: Does G=(V,E) contain a clique of size k? Theorem: Clique is NP-Complete. (reduction from SAT) Idea: Make “column” for each of k clauses. No edge within a column. All other edges present except between x and x’
NP-Completeness Proof: CLIQUE Example: G = G has m-clique (m is the number of clauses in E), iff E is satisfiable. (Assign value 1 to all variables in clique)