More On Intractability & Beyond CS161: Online Algorithms (Monday, August 11th)
Announcements
1. PS#6 due Wednesday at midnight
2. Project evaluation/competition results on Wednesday
3. Final exam information (later this lecture)
4. Evaluations for the class are open on Axess
Outline For Today
1. Approximate Set Cover
2. Approximate Vertex Cover
3. Final Exam Information
4. Beyond CS 161: Online Algorithms
Recap: Knapsack FPTAS
The FPTAS takes the Knapsack input (w_i, v_i, W) plus an accuracy parameter ε (say 0.01) and returns a solution of value ≥ (1-ε)·OPT, i.e., a (1-ε)-approximation.
Key Takeaway: We are approximating an NP-complete problem to arbitrary precision.
Note On Approximating NP-complete Problems
Knapsack is NP-complete, so all NP problems (e.g., TSP) reduce to it. If we solved Knapsack exactly, we would solve all NP problems exactly:
Π_1 ∈ TSP → [poly-time TSP-to-KNPS converter] → Π_2 ∈ KNPS → [exact algorithm for KNPS] → solution to KNPS → [poly-time KNPS-solution-to-TSP-solution converter] → solution to TSP.
Note On Approximating NP-complete Problems
But there are NP problems that can't be approximated to within any constant factor (e.g., general TSP)! The same pipeline breaks for approximations:
Π_1 ∈ TSP → [poly-time TSP-to-KNPS converter] → Π_2 ∈ KNPS → [poly-time approx. algorithm for KNPS] → approx. solution to KNPS → ✗: an approximate solution to KNPS does not convert back into an approximate solution to TSP.
Note On Approximating NP-complete Problems
Key Takeaway: Although exact solutions are preserved through reductions, approximate solutions are not preserved in general. In other words: the approximation guarantee can be lost across reductions (even though exact solutions are maintained)!
Outline For Today
1. Approximate Set Cover
2. Randomized Approximate Vertex Cover
3. Final Exam Information
4. Beyond CS 161: Online Algorithms
Set Cover Problem (Sec 11.3)
Input: U = {1, …, n} items and sets S_1, …, S_m s.t. S_1 ∪ S_2 ∪ … ∪ S_m = U
Output: minimum # of sets required to cover U
Fact: Set Cover is NP-complete; it is one of Karp's 21 NP-complete problems (Vertex Cover ≤_p Set Cover).
Set Cover Example (figures): a universe U of 12 elements covered by sets S_1, …, S_6; the optimal cover is C_opt = {S_1, S_2, S_3}, using 3 sets.
Greedy Set-Cover Algorithm
Idea: Iteratively pick the set that covers the most "uncovered" elements.

procedure Greedy-SetCover(U, S_1, …, S_m):
    C = ∅
    while U is not empty:
        pick the S_i that maximizes |S_i ∩ U|
        C = C ∪ {S_i}
        U = U \ S_i
    return C

There are min(|U|, m) iterations, each taking O(m·|U|) time.
Total: O(m·|U|·min(|U|, m)) time.
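To make the greedy procedure concrete, here is a minimal Python sketch; the function name, the set representation, and the tiny example instance are our own assumptions, not from the lecture.

def greedy_set_cover(universe, sets):
    # Iteratively pick the set covering the most uncovered elements.
    uncovered = set(universe)
    cover = []
    while uncovered:
        best = max(sets, key=lambda s: len(s & uncovered))  # argmax |S_i ∩ U|
        if not (best & uncovered):
            raise ValueError("the given sets do not cover the universe")
        cover.append(best)
        uncovered -= best   # U = U \ S_i
    return cover

# Hypothetical instance: greedy returns [{1, 2, 3}, {4, 5, 6}] here, which is optimal.
U = {1, 2, 3, 4, 5, 6}
S = [{1, 2, 3}, {3, 4}, {4, 5, 6}, {1, 5}]
print(greedy_set_cover(U, S))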
Greedy Algorithm Simulation (figures): greedy first picks S_4 (it covers the most uncovered elements), then S_2, then S_3, and finally S_1.
C_greedy = {S_4, S_2, S_3, S_1}: size 4, not optimal (C_opt has size 3).
Thought Experiment
The cost of each set in the output is 1. Distribute the cost of each set S_i equally over the new elements that S_i covers when it's picked.
Thought Experiment Simulation (figures): when S_4 is picked it covers 6 new elements (1, 2, 5, 6, 9, 10), so each gets cost 1/6; S_2 then covers 3 new elements (3, 7, 8), each getting cost 1/3; S_3 covers 2 new elements (11, 12), each getting cost 1/2; finally S_1 covers 1 new element (4), which gets cost 1/1.
Q1: What is the total "cost of the universe" U?
A: |C_greedy|, because each time the greedy algorithm picks a new set, it distributes a total cost of exactly 1 to the newly covered elements.
Q2: What is the sum of the "costs of the sets" in C_opt, where the cost of a set is the sum of the costs of its elements?
From the figure: cost(S_1) = 1/6 + 1/6 + 1/3 + 1/1; cost(S_2) = 1/6 + 1/6 + 1/3 + 1/3; cost(S_3) = 1/6 + 1/6 + 1/2 + 1/2.
A: The sum is ≥ cost(U) = |C_greedy|, because C_opt is a set cover: every element's cost is counted at least once.
Goal
Bound the cost of each set in C_opt; then we can bound |C_greedy| in terms of |C_opt|, i.e., show that C_greedy is not much larger than C_opt.
We currently have: |C_greedy| = cost(U) ≤ Σ_{S ∈ C_opt} cost(S).
"Cost of Each Set S" is ≤ H(|S|) = O(ln |S|)
Claim: the cost of any set S (not just the sets in C_opt) is at most H(|S|) = 1 + 1/2 + … + 1/|S| = O(ln |S|), the |S|-th harmonic number.
If the claim is true, then: |C_greedy| ≤ Σ_{S ∈ C_opt} H(|S|) ≤ |C_opt| · H(n) = O(|C_opt| · ln n).
Proof of Claim: by picture
Consider a set S = {a, b, c, d, e, f, g} with |S| = 7 (figure; the labels 1, 1/2, …, 1/7 mark the terms of H(7)).
Q: Suppose that the first time any of S's elements are covered, 3 of them are covered: e, f, g. What can you assert about the costs they get?
A: Each gets cost ≤ 1/7, because at that moment S had 7 uncovered elements but S was not picked, so the set that was picked must have covered at least 7 new elements.
Q: Suppose the 2nd time S's elements are covered, 1 is covered: d. What can you assert about the cost of d?
A: ≤ 1/4, by the same argument (S still had 4 uncovered elements).
Q: Suppose the 3rd time S's elements are covered, 2 are covered: b and c. What can you assert about their costs?
A: Each is ≤ 1/3, by the same argument.
Q: What can you assert about the cost of a?
A: ≤ 1, by the same argument.
Conclusion: cost(a) + cost(b) + … + cost(g) ≤ 1 + 1/3 + 1/3 + 1/4 + 1/7 + 1/7 + 1/7 ≤ 1 + 1/2 + 1/3 + … + 1/7 = H(7) ≤ ln(7) + 1. Q.E.D.
A More Formal Proof-By-Induction Template
Let k = |S| and list the elements of S in the reverse order in which they are covered: e_1, e_2, …, e_k (so e_k is covered first).
Claim: cost(e_i) ≤ 1/i.
Proof by (reverse) induction:
- Base case i = k (argue that it holds: when e_k is covered, all k elements of S are still uncovered).
- Assume the claim holds for k, k-1, …, i; show it holds for i-1 by the same argument as in the proof by picture.
Summing over i gives cost(S) ≤ Σ_{i=1}^{k} 1/i = H(k).
Summary: Greedy is an O(log n)-approximation
1. Assigned a cost to each element by distributing the cost 1 of each new set S added by greedy equally among the new elements covered by S.
2. By construction: cost(U) = |C_greedy|.
3. Because C_opt covers all elements: |C_greedy| = cost(U) ≤ Σ_{S ∈ C_opt} cost(S).
4. Put an H(|S|) = O(log |S|) bound on the cost of each set S.
5. Concluded: |C_greedy| ≤ |C_opt| · H(n) = O(log n) · |C_opt|.
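The whole analysis can be written as one chain of inequalities (a LaTeX restatement of steps 1 to 5 above, using cost(·) as defined in the thought experiment):

\begin{align*}
|C_{\mathrm{greedy}}|
  &= \sum_{e \in U} \mathrm{cost}(e)
     && \text{each greedy pick distributes total cost } 1 \\
  &\le \sum_{S \in C_{\mathrm{opt}}} \sum_{e \in S} \mathrm{cost}(e)
     && C_{\mathrm{opt}} \text{ covers every element at least once} \\
  &\le \sum_{S \in C_{\mathrm{opt}}} H(|S|)
     && \text{the per-set claim} \\
  &\le |C_{\mathrm{opt}}| \cdot H(n) = O\!\left(|C_{\mathrm{opt}}| \ln n\right).
\end{align*}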
Key Takeaway
For NP-complete problems, the algorithmic tools in our toolbox can still be used as-is. But we have to give up something: (1) generality, (2) exactness, or (3) efficiency.
Outline For Today
1. Approximate Set Cover
2. Approximate Vertex Cover
3. Final Exam Information
4. Beyond CS 161: Online Algorithms
Recap: Vertex Cover
Input: undirected graph G = (V, E)
Output: a minimum vertex cover of G
Vertex Cover: S ⊆ V s.t. for each (u, v) ∈ E, either u ∈ S or v ∈ S.
Fact: Vertex Cover is NP-complete: 3-SAT ≤_p CLIQUE ≤_p VERTEX-COVER.
Vertex Cover Example (figures): a graph on vertices A, B, C, D, E, F; the minimum vertex cover is {A, B}.
2-Approximation VC

procedure 2-Approx-VC(G(V, E)):
    VC-OUT = ∅
    for (u, v) ∈ E:
        if neither u nor v is in VC-OUT:
            VC-OUT = VC-OUT ∪ {u, v}
    return VC-OUT

Run-time: can be done in O(n + m) time (exercise).
Claim 1: 2-Approx-VC returns a vertex cover.
Proof: By construction, each edge (u, v) is either already covered when we loop over it, or we cover it by adding both u and v.
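A minimal Python sketch of 2-Approx-VC, assuming the graph is given as an edge list; the names and the particular edge ordering below are our assumptions.

def two_approx_vertex_cover(edges):
    vc_out = set()
    for u, v in edges:
        if u not in vc_out and v not in vc_out:   # edge not yet covered
            vc_out.update((u, v))                 # add BOTH endpoints
    return vc_out

# On a hypothetical ordering of the example graph's edges, this outputs
# {'A', 'B', 'C', 'F'}: the 4-vertex cover from the example below.
print(two_approx_vertex_cover(
    [("A", "C"), ("B", "F"), ("A", "D"), ("A", "E"), ("B", "D")]))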
2-Approximation VC Example (figures): on the same graph, the algorithm outputs {A, C, B, F}: 4 vertices.
Output In Terms of Disjoint Edges (figure): the output corresponds to the edges {(A, C), (B, F)}: 2 disjoint edges (a matching).
Proof idea: for each such edge, any vertex cover has to contain at least one of its endpoints.
Claim 2: 2-Approx-VC is a 2-approximation
Proof: The edges that trigger additions form a set of "disjoint edges" (u_i, v_i), i.e., no two of them share a vertex. Since any vertex cover has to contain either u_i or v_i for each such edge, any vertex cover must have at least |VC-OUT|/2 vertices. In particular, the optimal VC has size ≥ |VC-OUT|/2, so |VC-OUT| ≤ 2·|VC_opt|.
Fact: This is the best approximation ratio known. It is open whether a better one can exist.
Randomized 2-Approx Vertex Cover

procedure Rand-2-Approx-VC(G(V, E)):
    VC-OUT = ∅
    for (u, v) ∈ E:
        if neither u nor v is in VC-OUT:
            put either u or v into VC-OUT, chosen uniformly at random
    return VC-OUT

Again, this algorithm outputs a vertex cover by construction.
Exercise: Show that E[|VC-OUT|] ≤ 2·|VC_opt|.
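And the randomized variant as a sketch, plus a quick empirical check of the exercise's expectation bound; everything here, including the sample graph, is our assumption.

import random

def rand_two_approx_vc(edges):
    vc_out = set()
    for u, v in edges:
        if u not in vc_out and v not in vc_out:
            vc_out.add(random.choice((u, v)))   # add ONE random endpoint
    return vc_out

# Sanity check of E[|VC-OUT|] <= 2*|VC_opt|: on this graph VC_opt = {A, B}.
edges = [("A", "C"), ("B", "F"), ("A", "D"), ("A", "E"), ("B", "D")]
avg = sum(len(rand_two_approx_vc(edges)) for _ in range(10_000)) / 10_000
print(avg)   # should come out <= 4 = 2 * |{A, B}|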
Outline For Today
1. Approximate Set Cover
2. Randomized Approximate Vertex Cover
3. Final Exam Information
4. Beyond CS 161: Online Algorithms
Final Exam Information
This Saturday at 3:30pm, at Gates B01. Closed book/notes, etc.; one double-sided A4 cheat sheet is allowed. 140 points total.
- 1 problem consisting of 10 T/F questions (no proofs required): +2 points for each correct answer, -2 for each incorrect one.
- 1 problem testing the mathematical tools we've used.
- 4 or 5 questions on designing and analyzing algorithms.
You can use any algorithm we have covered as a subroutine without re-proving its run-time and correctness claims, but you have to know the run-times of the algorithms we covered.
Topics Covered
Cumulative, up to the first half of today's lecture. 8 categories of topics/algorithms:
1. Mathematical Tools: big-O notation, Master Theorem, substitution method, linearity of expectation, independence
2. Data Structures: heaps, union-find, hash tables, Bloom filters
3. Fundamental Graph Primitives: BFS/DFS, topological sort of DAGs, undirected connected components, directed (strongly) connected components
Topics Covered DC & Algs: MergeSort, Strassen 5. Greedy & Algs: Dijkstra, Prim, Kruskal (for MST), Cut Property and Lemmas for MST, Huffman, Scheduling Problems (and others in PSs), Greedy proof techniques: Greedy Stays Ahead, Exchange Arguments 6. Randomized Algs: QuickSort/QuickSelect, Karger, Approximate Max- Cut/Vertex Cover 7. DP & Algs: DP Recipe, Linear Ind. Set, Sequence Alignment, Bellman-Ford, Floyd-Warshall, Pseudo-polynomial Knapsack Algorithms 8. Intractability: P, NP, NP-complete, reductions, Options for Confronting NP-complete Problems, Knapsack Greedy Approx, Knapsack FPTAS, Set Cover, TSP with Triangle Inequality
A Final Note About The Final
For most problems, we will give you a computational problem and ask you to solve it, just as in the PSs.
Outline For Today
1. Approximate Set Cover
2. Randomized Approximate Vertex Cover
3. Final Exam Information
4. Beyond CS 161: Online Algorithms
CS 161's Computational Model Assumptions
1. Inputs to computational problems are of a fixed size n.
2. The input is correct/error-free.
3. Computation is performed on a serial machine (single processor).
4. Computation is performed on a classical machine, i.e., each bit stores a 0 or a 1 (vs. quantum machines with qubits).
And others, such as the random-access memory (RAM) model. Different computational models drop one or more of these assumptions.
Streaming Applications
The input is a possibly infinite stream, and at each point in time the application needs to make an algorithmic decision. E.g., caching in OSs: an infinite stream of disk lookup requests hits the OS cache.
Algorithmic decision: if there is a miss, what do we evict?
Question: how good is the algorithm's eviction strategy?
Streaming Applications
News feeds: FB, Twitter, Google receive continuous tweets/user updates. At each point, these apps need to decide which news/update should appear in whose news feed. CS 161's tools cannot analyze the algorithmic decisions these apps make.
Online Algorithms
An online algorithm takes as input a possibly infinite stream, and at each point in time t makes a decision based only on what has been seen so far, without knowing the rest of the input.
Type of optimality analysis: competitive ratio (CR), the worst ratio (cost of online algorithm)/(cost of OPT) over all possible input streams, where OPT is the best solution possible if we knew the entire input in advance.
Example 1: Skiing in Tahoe
Buying equipment costs $500; renting costs $50 per trip.
Q: Should we buy or rent?
A: If we will go 9 times or fewer, rent; otherwise buy.
An online algorithm for this problem decides whether to buy each time we go to Tahoe. Once the algorithm buys, there are no further decisions: every later trip is free.
Example 1: Skiing in Tahoe (figure): an online skiing algorithm rents at t = 1, rents at t = 2, …, and buys at t = k.
Observation: Any online algorithm is completely described by the time k at which it buys the equipment.
Q: What's the optimal choice of k?
Competitive Ratio If We Pick k = 1 (buy immediately at t = 1)
Q1: What's the cost of this algorithm? A1: $500.
Q2: What's the competitive ratio of the algorithm that picks k = 1, i.e., what's the worst-case input for this algorithm?
A: Going only once. Then the optimal solution would be to just rent for $50 ⇒ CR = $500/$50 = 10.
Competitive Ratio If We Pick k = 2 (rent at t = 1, buy at t = 2)
Q1: What's the cost of this algorithm? A1: $550 (one rental plus the purchase).
Q2: What's the CR?
Case 1: If we go once: we pay $50, OPT is $50, ratio = 1.
Case 2: If we go twice: we pay $550, OPT is $100, ratio = 5.5.
Case 3: If we go three times: we pay $550, OPT is $150, ratio ≈ 3.67.
Case 4: If we go four times: we pay $550, OPT is $200, ratio = 2.75.
…
A: CR is 5.5 (much better than the k = 1 algorithm).
Competitive Ratio If We Pick k < 10 (rent for t ≤ k-1, buy at t = k)
Q1: What's the cost of this algorithm? A1: (k-1)·$50 + $500 if we go at least k times.
Q2: What's the CR?
A: If we go ≤ k-1 times, we're optimal. If we go exactly k times (the worst case), OPT rents every time for 50k, so the ratio is ((k-1)·50 + 500)/(50k) = (k+9)/k.
Competitive Ratio If We Pick k > 10
Q1: What's the cost of this algorithm? A1: (k-1)·$50 + $500 if we go at least k times.
Q2: What's the CR?
A: If we go < 10 times, we're optimal. What if we go ≥ 10 times? Then OPT is $500.
If we go t times with 10 ≤ t < k, the ratio is 50t/500 = t/10 (so it increases by 0.1 per extra trip).
If we go exactly k times (or more), the ratio is ((k-1)·50 + 500)/500 = (k+9)/10.
Optimal k
Case 1: k < 10, CR = (k+9)/k ≥ 2.
Case 2: k > 10, CR = (k+9)/10 ≥ 2.
Case 3: k = 10, CR = (9·50 + 500)/500 = 1.9.
**Optimal k = 10 ⇒ CR = 1.9**
The best online strategy is to rent until we have gone 9 times and then buy the equipment on the 10th trip.
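The case analysis above is easy to verify numerically; here is a small sketch (the prices come from the example, the code itself is ours):

BUY, RENT = 500, 50

def alg_cost(k, t):     # cost of online strategy k if we end up going t times
    return RENT * t if t < k else RENT * (k - 1) + BUY

def opt_cost(t):        # offline optimum: rent every time, or buy up front
    return min(RENT * t, BUY)

def competitive_ratio(k, horizon=1000):
    return max(alg_cost(k, t) / opt_cost(t) for t in range(1, horizon))

for k in (1, 2, 9, 10, 11):
    print(k, competitive_ratio(k))   # prints 10.0, 5.5, 2.0, 1.9, 2.0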
Caching
Requests hit a fast cache in front of a slow disk. If the page is in the cache (hit), reply directly from the cache. Otherwise (miss), send the request to the disk and put the page into the cache.
Q: Which page do we evict?
Caching
Input: N pages on disk and an infinite stream of page requests.
Online algorithm: decide which page to evict from the cache when it's full and there's a miss.
Goal: minimize the number of misses.
Idea: LRU: evict the Least Recently Used page.
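A minimal LRU sketch in Python; using collections.OrderedDict to track recency is our implementation choice, not from the lecture.

from collections import OrderedDict

def lru_misses(requests, k):
    cache, misses = OrderedDict(), 0
    for page in requests:
        if page in cache:
            cache.move_to_end(page)        # hit: mark as most recently used
        else:
            misses += 1                    # miss: fetch from disk
            if len(cache) == k:
                cache.popitem(last=False)  # evict the least recently used page
            cache[page] = True
    return misses

# e.g. lru_misses([1, 2, 3, 1, 4, 1, 2], k=3) returns 5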
LRU Simulation with k = 3 (figures): a walkthrough of LRU on a request stream with a size-3 cache; each request is marked hit or miss, and on a miss with a full cache the least recently used page is evicted. And so on and so forth…
Competitive Ratio Claim
Claim: If the optimal sequence of eviction choices for a size-h cache causes m misses, then for the same sequence of requests, LRU with a size-k cache (k > h) causes at most (k/(k-h))·m misses.
Interpretation: If LRU had twice the cache size of an algorithm OPT that knew the future (k = 2h), it would have at most twice the misses of OPT.
Note: we prove the claim below; the interpretation above is the special case k = 2h.
Proof of Competitive Ratio
Recursively break the sequence of requests into phases.
Let t be the time when we see the (k+1)-st distinct request. Phase 1: a_1 … a_{t-1}.
Let t′ be the time we see the (k+1)-st distinct request starting from a_t. Phase 2: a_t … a_{t′-1}. And so on.
Proof of Competitive Ratio (figure: a request stream broken into Phases 1, 2, 3, 4 for k = 3)
By construction, each phase contains exactly k distinct requests.
Q: At most how many misses does LRU have in each phase?
A: k, because a phase contains only k distinct pages, and LRU misses each of them at most once per phase: once a page is brought in, fewer than k other distinct pages are requested before the phase ends, so LRU never evicts it within the phase.
Proof of Competitive Ratio (continued)
Q: What's the minimum number of misses that any size-h cache must have in any phase?
A: k-h, because k distinct items are requested during the phase while the cache holds only h of them at a time, so at least k-h of them must trigger misses.
Therefore the CR is k/(k-h). Q.E.D.
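As a sanity check (not a proof), we can compare LRU against Belady's offline-optimal policy (evict the page whose next use is furthest in the future) on random streams; the claim's bound should hold comfortably. All names here are ours, and lru_misses is the sketch from the caching slide above.

import random

def opt_misses(requests, h):
    # Belady / furthest-in-future: offline-optimal eviction for a size-h cache.
    cache, misses = set(), 0
    for i, page in enumerate(requests):
        if page in cache:
            continue
        misses += 1
        if len(cache) == h:
            def next_use(p):
                for j in range(i + 1, len(requests)):
                    if requests[j] == p:
                        return j
                return float("inf")      # never used again: best to evict
            cache.remove(max(cache, key=next_use))
        cache.add(page)
    return misses

h, k = 3, 6                              # k = 2h, so k/(k-h) = 2
reqs = [random.randrange(10) for _ in range(500)]
assert lru_misses(reqs, k) <= (k / (k - h)) * opt_misses(reqs, h)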
Wednesday: More on Beyond CS 161: Parallel Algorithms