CPSC 411, Fall 2008: Set 10 1 CPSC 411 Design and Analysis of Algorithms Set 10: Randomized Algorithms Prof. Jennifer Welch Fall 2008.

Slides:



Advertisements
Similar presentations
Analysis of Algorithms CS 477/677 Instructor: Monica Nicolescu Lecture 6.
Advertisements

Theory of Computing Lecture 3 MAS 714 Hartmut Klauck.
Spring 2015 Lecture 5: QuickSort & Selection
Section 7.4 (partially). Section Summary Expected Value Linearity of Expectations Independent Random Variables.
April 2, 2015Applied Discrete Mathematics Week 8: Advanced Counting 1 Random Variables In some experiments, we would like to assign a numerical value to.
Probabilistic Analysis and Randomized Algorithm. Worst case analysis Probabilistic analysis  Need the knowledge of the distribution of the inputs Indicator.
Chapter 5. Probabilistic Analysis and Randomized Algorithms
UMass Lowell Computer Science Analysis of Algorithms Spring, 2002 Chapter 5 Lecture Randomized Algorithms Sections 5.1 – 5.3 source: textbook.
Probabilistic (Average-Case) Analysis and Randomized Algorithms Two different approaches –Probabilistic analysis of a deterministic algorithm –Randomized.
Analysis of Algorithms CS 477/677 Randomizing Quicksort Instructor: George Bebis (Appendix C.2, Appendix C.3) (Chapter 5, Chapter 7)
1 Discrete Structures & Algorithms Discrete Probability.
1 Hiring Problem and Generating Random Permutations Andreas Klappenecker Partially based on slides by Prof. Welch.
Probabilistic (Average-Case) Analysis and Randomized Algorithms Two different but similar analyses –Probabilistic analysis of a deterministic algorithm.
Analysis of Algorithms CS 477/677 Instructor: Monica Nicolescu.
Probability Distributions Finite Random Variables.
Great Theoretical Ideas in Computer Science.
Qsort - 1 Lin / Devi Comp 122 d(n)d(n) Today’s puzzle  How far can you reach with a stack of n blocks, each 2 units long?
Quicksort!. A practical sorting algorithm Quicksort  Very efficient Tight code Good cache performance Sort in place  Easy to implement  Used in older.
1 Randomized Algorithms Andreas Klappenecker [using some slides by Prof. Welch]
1 Discrete Probability Hsin-Lung Wu Assistant Professor Advanced Algorithms 2008 Fall.
Design & Analysis of Algorithms COMP 482 / ELEC 420 John Greiner.
Sets, Combinatorics, Probability, and Number Theory Mathematical Structures for Computer Science Chapter 3 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesProbability.
1 9/8/2015 MATH 224 – Discrete Mathematics Basic finite probability is given by the formula, where |E| is the number of events and |S| is the total number.
1 9/23/2015 MATH 224 – Discrete Mathematics Basic finite probability is given by the formula, where |E| is the number of events and |S| is the total number.
Great Theoretical Ideas In Computer Science Steven Rudich, Anupam GuptaCS Spring 2004 Lecture 22April 1, 2004Carnegie Mellon University
Order Statistics The ith order statistic in a set of n elements is the ith smallest element The minimum is thus the 1st order statistic The maximum is.
Analysis of Algorithms
Great Theoretical Ideas in Computer Science.
CSCI 115 Chapter 3 Counting. CSCI 115 §3.1 Permutations.
3. Counting Permutations Combinations Pigeonhole principle Elements of Probability Recurrence Relations.
BY Lecturer: Aisha Dawood. The hiring problem:  You are using an employment agency to hire a new office assistant.  The agency sends you one candidate.
CSCE 411 Design and Analysis of Algorithms Set 9: Randomized Algorithms Prof. Jennifer Welch Fall 2014 CSCE 411, Fall 2014: Set 9 1.
Introduction to Algorithms Jiafen Liu Sept
Chapter 5: Probability Analysis of Randomized Algorithms Size is rarely the only property of input that affects run time Worst-case analysis most common.
CSC401 – Analysis of Algorithms Lecture Notes 2 Asymptotic Analysis Objectives: Mathematics foundation for algorithm analysis Amortization analysis techniques.
Randomized Algorithms CSc 4520/6520 Design & Analysis of Algorithms Fall 2013 Slides adopted from Dmitri Kaznachey, George Mason University and Maciej.
COMPSCI 102 Introduction to Discrete Mathematics.
QuickSort (Ch. 7) Like Merge-Sort, based on the three-step process of divide- and-conquer. Input: An array A[1…n] of comparable elements, the starting.
Analysis of Algorithms CS 477/677 Instructor: Monica Nicolescu Lecture 7.
Sixth lecture Concepts of Probabilities. Random Experiment Can be repeated (theoretically) an infinite number of times Has a well-defined set of possible.
Introduction to Algorithms Randomized Algorithms – Ch5 Lecture 5 CIS 670.
COSC 3101A - Design and Analysis of Algorithms 4 Quicksort Medians and Order Statistics Many of these slides are taken from Monica Nicolescu, Univ. of.
Great Theoretical Ideas in Computer Science for Some.
Master Method (4. 3) Recurrent formula T(n) = a  T(n/b) + f(n) 1) if for some  > 0 then 2) if then 3) if for some  > 0 and a  f(n/b)  c  f(n) for.
Tutorial 5: Discrete Probability I Reference: cps102/lecture11.ppt Steve Gu Feb 15, 2008.
CSC317 1 Randomized algorithms Hiring problem We always want the best hire for a job! Using employment agency to send one candidate at a time Each day,
Great Theoretical Ideas In Computer Science John LaffertyCS Fall 2006 Lecture 10 Sept. 28, 2006Carnegie Mellon University Classics.
Introduction to Algorithms Probability Review – Appendix C CIS 670.
CSC317 1 Hiring problem-review Cost to interview (low C i ) Cost to fire/hire … (expensive C h ) n number of candidates m hired O (c i n + c h m) Independent.
Theory of Computational Complexity Probability and Computing Ryosuke Sasanuma Iwama and Ito lab M1.
1 Chapter 7 Quicksort. 2 About this lecture Introduce Quicksort Running time of Quicksort – Worst-Case – Average-Case.
Great Theoretical Ideas in Computer Science.
PROBABILITY AND COMPUTING RANDOMIZED ALGORITHMS AND PROBABILISTIC ANALYSIS CHAPTER 1 IWAMA and ITO Lab. M1 Sakaidani Hikaru 1.
Introduction to Discrete Mathematics
Introduction to Discrete Mathematics
Sequences, Series, and Probability
5.2 Probability
Econ 113 Lecture Module 2.
Ch 7: Quicksort Ming-Te Chi
CSCE 411 Design and Analysis of Algorithms
Great Theoretical Ideas In Computer Science
CS200: Algorithm Analysis
CS 583 Analysis of Algorithms
Great Theoretical Ideas In Computer Science
Tomado del libro Cormen
Applied Discrete Mathematics Week 12: Discrete Probability
Chapter 7: Quick Sort Recursive sorting using Partition
Quicksort and Randomized Algs
Independence and Counting
Presentation transcript:

CPSC 411, Fall 2008: Set 10 1 CPSC 411 Design and Analysis of Algorithms Set 10: Randomized Algorithms Prof. Jennifer Welch Fall 2008

CPSC 411, Fall 2008: Set 102 The Hiring Problem You need to hire a new employee. The headhunter sends you a different applicant every day for n days. If the applicant is better than the current employee then fire the current employee and hire the applicant. Firing and hiring is expensive. How expensive is the whole process?

CPSC 411, Fall 2008: Set 103 Hiring Problem: Worst Case Worst case is when the headhunter sends you the n applicants in increasing order of goodness. Then you hire (and fire) each one in turn: n hires.

CPSC 411, Fall 2008: Set 104 Hiring Problem: Best Case Best case is when the headhunter sends you the best applicant on the first day. Total cost is just 1 (fire and hire once).

CPSC 411, Fall 2008: Set 105 Hiring Problem: Average Cost What about the "average" cost? First, we have to decide what is meant by average. An input to the hiring problem is an ordering of the n applicants. There are n! different inputs. Assume there is some distribution on the inputs for instance, each ordering is equally likely but other distributions are also possible Average cost is expected value…

CPSC 411, Fall 2008: Set 106 Probability Every probabilistic claim ultimately refers to some sample space, which is a set of elementary events Think of each elementary event as the outcome of some experiment Ex: flipping two coins gives sample space {HH, HT, TH, TT} An event is a subset of the sample space Ex: event "both coins flipped the same" is {HH, TT}

CPSC 411, Fall 2008: Set 107 Sample Spaces and Events S A HH TH TT HT

CPSC 411, Fall 2008: Set 108 Probability Distribution A probability distribution Pr on a sample space S is a function from events of S to real numbers s.t. Pr[A] ≥ 0 for every event A Pr[S] = 1 Pr[A U B] = Pr[A] + Pr[B] for every two non- intersecting ("mutually exclusive") events A and B Pr[A] is the probability of event A

CPSC 411, Fall 2008: Set 109 Probability Distribution Useful facts: Pr[Ø] = 0 If A  B, then Pr[A] ≤ Pr[B] Pr[S — A] = 1 — Pr[A] // complement Pr[A U B] = Pr[A] + Pr[B] – Pr[A  B] ≤ Pr[A] + Pr[B]

CPSC 411, Fall 2008: Set 1010 Probability Distribution A B Pr[A U B] = Pr[A] + Pr[B] – Pr[A  B]

CPSC 411, Fall 2008: Set 1011 Example Suppose Pr[{HH}] = Pr[{HT}] = Pr[{TH}] = Pr[{TT}] = 1/4. Pr["at least one head"] = Pr[{HH U HT U TH}] = Pr[{HH}] + Pr[{HT}] + Pr[{TH}] = 3/4. Pr["less than one head"] = 1 — Pr["at least one head"] = 1 — 3/4 = 1/4 HH TH TT HT 1/4

CPSC 411, Fall 2008: Set 1012 Specific Probability Distribution discrete probability distribution: sample space is finite or countably infinite Ex: flipping two coins once; flipping one coin infinitely often uniform probability distribution: sample space S is finite and every elementary event has the same probability, 1/|S| Ex: flipping two fair coins once

CPSC 411, Fall 2008: Set 1013 Flipping a Fair Coin Suppose we flip a fair coin n times Each elementary event in the sample space is one sequence of n heads and tails, describing the outcome of one "experiment" The size of the sample space is 2 n. Let A be the event "k heads and n  k tails occur". Pr[A] = C(n,k)/2 n. There are C(n,k) sequences of length n in which k heads and n–k tails occur, and each has probability 1/2 n.

CPSC 411, Fall 2008: Set 1014 Example n = 5, k = 3 HHHTT HHTTH HTTHH TTHHH HHTHT HTHTH THTHH HTHHT THHTH THHHT Pr(3 heads and 2 tails) = C(5,3)/2 5 = 10/32

CPSC 411, Fall 2008: Set 1015 Flipping Unfair Coins Suppose we flip two coins, each of which gives heads two-thirds of the time What is the probability distribution on the sample space? HH TH TT HT 4/9 2/9 1/9 Pr[at least one head] = 8/9

CPSC 411, Fall 2008: Set 1016 In-Class Problem #1 What is the sample space associated with rolling two 6-sided dice? Assume the dice are fair. What are the probabilities associated with each elementary event in the sample space?

CPSC 411, Fall 2008: Set 1017 Independent Events Two events A and B are independent if Pr[A  B] = Pr[A]·Pr[B] I.e., probability that both A and B occur is the product of the separate probabilities that A occurs and that B occurs.

CPSC 411, Fall 2008: Set 1018 Independent Events Example In two-coin-flip example with fair coins: A = "first coin is heads" B = "coins are different" HH TH TT HT 1/4 A B Pr[A] = 1/2 Pr[B] = 1/2 Pr[A  B] = 1/4 = (1/2)(1/2) so A and B are independent

CPSC 411, Fall 2008: Set 1019 In-Class Problem #2 In the 2-dice example, consider these two events: A = "first die rolls 6" B = "first die is smaller than second die" Are A and B independent? Explain.

CPSC 411, Fall 2008: Set 1020 Discrete Random Variables A discrete random variable X is a function from a finite or countably infinite sample space to the real numbers. Associates a real number with each possible outcome of an experiment Define the event "X = v" to be the set of all the elementary events s in the sample space with X(s) = v. So Pr["X = v"] is the sum of Pr[{s}] over all s with X(s) = v.

CPSC 411, Fall 2008: Set 1021 Discrete Random Variable X=v Add up the probabilities of all the elementary events in the orange event to get the probability that X = v

CPSC 411, Fall 2008: Set 1022 Random Variable Example Roll two fair 6-sided dice. Sample space contains 36 elementary events (1:1, 1:2, 1:3, 1:4, 1:5, 1:6, 2:1,…) Probability of each elementary event is 1/36 Define random variable X to be the maximum of the two values rolled What is Pr["X = 3"]? It is 5/36, since there are 5 elementary events with max value 3 (1:3, 2:3, 3:3, 3:2, and 3:1)

CPSC 411, Fall 2008: Set 1023 Independent Random Variables It is common for more than one random variable to be defined on the same sample space. E.g.: X is maximum value rolled Y is sum of the two values rolled Two random variables X and Y are independent if for all v and w, the events "X = v" and "Y = w" are independent.

CPSC 411, Fall 2008: Set 1024 Expected Value of a Random Variable Most common summary of a random variable is its "average", weighted by the probabilities called expected value, or expectation, or mean Definition: E[X] = ∑ v Pr[X = v] v

CPSC 411, Fall 2008: Set 1025 Expected Value Example Consider a game in which you flip two fair coins. You get $3 for each head but lose $2 for each tail. What are your expected earnings? I.e., what is the expected value of the random variable X, where X(HH) = 6, X(HT) = X(TH) = 1, and X(TT) = —4? Note that no value other than 6, 1, and —4 can be taken on by X (e.g., Pr[X = 5] = 0). E[X] = 6(1/4) + 1(1/4) + 1(1/4) + (—4)(1/4) = 1

CPSC 411, Fall 2008: Set 1026 Properties of Expected Values E[X+Y] = E[X] + E[Y], for any two random variables X and Y, even if they are not independent! E[a·X] = a·E[X], for any random variable X and any constant a. E[X·Y] = E[X]·E[Y], for any two independent random variables X and Y

CPSC 411, Fall 2008: Set 1027 In-Class Problem #3 Suppose you roll one fair 6-sided die. What is the expected value of the result? Be sure to write down the formula for expected value.

CPSC 411, Fall 2008: Set 1028 Back to the Hiring Problem We want to know the expected cost of our hiring algorithm, in terms of how many times we hire an applicant Elementary event s is a sequence of the n applicants Sample space is all n! sequences of applicants Assume uniform distribution, so each sequence is equally likely, i.e., has probability 1/n! Random variable X(s) is the number of applicants that are hired, given the input sequence s What is E[X]?

CPSC 411, Fall 2008: Set 1029 Solving the Hiring Problem Break the problem down using indicator random variables and properties of expectation Change viewpoint: instead of one random variable that counts how many applicants are hired, consider n random variables, each one keeping track of whether or not a particular applicant is hired. indicator random variable X i for applicant i: 1 if applicant i is hired, 0 otherwise

CPSC 411, Fall 2008: Set 1030 Indicator Random Variables Important fact: X = X 1 + X 2 + … + X n number hired is sum of all the indicator r.v.'s Important fact: E[X i ] = Pr["applicant i is hired"] Why? Plug in definition of expected value. Probability of hiring i is probability that i is better than the previous i-1 applicants…

CPSC 411, Fall 2008: Set 1031 Probability of Hiring i-th Applicant Suppose n = 4 and i = 3. In what fraction of all the inputs is the 3rd applicant better than the 2 previous ones? /24 = 1/3

CPSC 411, Fall 2008: Set 1032 Probability of Hiring i-th Applicant In general, since all permutations are equally likely, if we only consider the first i applicants, the largest of them is equally likely to occur in each of the i positions. Thus Pr[X i = 1] = 1/i.

CPSC 411, Fall 2008: Set 1033 Expected Number of Hires Recall that X is random variable equal to the number of hires Recall that X = the sum of the X i 's (each X i is the random variable that tells whether or not the i-th applicant is hired) E[X] = E[ ∑ X i ] = ∑ E[X i ], by property of E = ∑ Pr[X i = 1], by property of X i = ∑ 1/i, by argument on previous slide ≤ ln n + 1, by formula for harmonic number

CPSC 411, Fall 2008: Set 1034 In-Class Problem #4 Use indicator random variables to calculate the expected value of the sum of rolling n dice.

CPSC 411, Fall 2008: Set 1035 Discussion of Hiring Problem So average number of hires is ln n, which is much better than worst case number (n). But this relies on the headhunter sending you the applicants in random order. What if you cannot rely on that? maybe headhunter always likes to impress you, by sending you better and better applicants If you can get access to the list of applicants in advance, you can create your own randomization, by randomly permuting the list and then interviewing the applicants. Move from (passive) probabilistic analysis to (active) randomized algorithm by putting the randomization under your control!

CPSC 411, Fall 2008: Set 1036 Randomized Algorithms Instead of relying on a (perhaps incorrect) assumption that inputs exhibit some distribution, make your own input distribution by, say, permuting the input randomly or taking some other random action On the same input, a randomized algorithm has multiple possible executions No one input elicits worst-case behavior Typically we analyze the average case behavior for the worst possible input

CPSC 411, Fall 2008: Set 1037 Randomized Hiring Algorithm Suppose we have access to the entire list of candidates in advance Randomly permute the candidate list Then interview the candidates in this random sequence Expected number of hirings/firings is O(log n) no matter what the original input is

CPSC 411, Fall 2008: Set 1038 Probabilistic Analysis vs. Randomized Algorithm Probabilistic analysis of a deterministic algorithm: assume some probability distribution on the inputs Randomized algorithm: use random choices in the algorithm

CPSC 411, Fall 2008: Set 1039 How to Randomly Permute an Array input: array A[1..n] for i := 1 to n do j := value between i and n chosen with uniform probability (each value equally likely) swap A[i] with A[j]

CPSC 411, Fall 2008: Set 1040 Why Does It Work? Show that after i-th iteration of the for loop: A[1..i] equals each permutation of i elements from {1,…,n} with probability (n–i)!/n! Basis: After first iteration, A[1] contains each permutation of 1 element from {1,…,n} with probability (n–1)!/n! = 1/n true since A[1] is swapped with an element drawn from the entire array uniformly at random

CPSC 411, Fall 2008: Set 1041 Why Does It Work? Induction: Assume that after (i–1)-st iteration of the for loop A[1..i–1] equals each permutation of i–1 elements from {1,…,n} with probability (n–(i–1))!/n! The probability that A[1..i] contains permutation x 1, x 2, …, x i is the probability that A[1..i–1] contains x 1, x 2, …, x i–1 after the (i–1)-st iteration AND that the i-th iteration puts x i in A[i].

CPSC 411, Fall 2008: Set 1042 Why Does It Work? Let e 1 be the event that A[1..i–1] contains x 1, x 2, …, x i–1 after the (i–1)-st iteration. Let e 2 be the event that the i-th iteration puts x i in A[i]. We need to show that Pr[e 1  e 2 ] = (n–i)!/n!. Unfortunately, e 1 and e 2 are not independent: if some element appears in A[1..i –1], then it is not available to appear in A[i]. We need some more probability…

CPSC 411, Fall 2008: Set 1043 Conditional Probability Formalizes having partial knowledge about the outcome of an experiment Example: flip two fair coins. Probability of two heads is 1/4 Probability of two heads when you already know that the first coin is a head is 1/2 Conditional probability of A given that B occurs is Pr[A|B] is defined to be Pr[A  B]/Pr[B]

CPSC 411, Fall 2008: Set 1044 Conditional Probability A B Pr[A] = 5/12 Pr[B] = 7/12 Pr[A  B] = 2/12 Pr[A|B] = (2/12)/(7/12) = 2/7

CPSC 411, Fall 2008: Set 1045 Conditional Probability Definition is Pr[A|B] = Pr[A  B]/Pr[B] Equivalently, Pr[A  B] = Pr[A|B]·Pr[B] Back to analysis of random array permutation…

CPSC 411, Fall 2008: Set 1046 Why Does It Work? Recall: e 1 is event that A[1..i–1] = x 1,…,x i–1 Recall: e 2 is event that A[i] = x i Pr[e 1  e 2 ] = Pr[e 2 |e 1 ]·Pr[e 1 ] Pr[e 2 |e 1 ] = 1/(n–i+1) because x i is available in A[i..n] to be chosen since e 1 already occurred and did not include x i every element in A[i..n] is equally likely to be chosen Pr[e 1 ] = (n–(i–1))!/n! by inductive hypothesis So Pr[e 1  e 2 ] = [1/(n–i+1)]·[(n–(i–1))!/n!] = (n–i)!/n!

CPSC 411, Fall 2008: Set 1047 Why Does It Work? After the last iteration (the n-th), the inductive hypothesis tells us that A[1..n] equals each permutation of n elements from {1,…,n} with probability (n– n)!/n! = 1/n! Thus the algorithm gives us a uniform random permutation.

CPSC 411, Fall 2008: Set 1048 Quicksort Deterministic quicksort:  (n 2 ) worst-case running time  (n log n) average case running time, assuming every input permutation is equally likely Randomized quicksort: don't rely on possibly faulty assumption about input distribution instead, randomize!

CPSC 411, Fall 2008: Set 1049 Randomized Quicksort Two approaches One is to randomly permute the input array and then do deterministic quicksort The other is to randomly choose the pivot element at each recursive call called "random sampling" easier to analyze still gives  (n log n) expected running time

CPSC 411, Fall 2008: Set 1050 Randomized Quicksort Given array A[1..n], call recursive algorithm RandQuickSort(A,1,n). Definition of RandQuickSort(A,p,r): if p < r then q := RandPartition(A,p,r) RandQuickSort(A,p,q–1) RandQuickSort(A,q+1,r)

CPSC 411, Fall 2008: Set 1051 Randomized Partition RandPartition(A,p,r): i := randomly chosen index between p and r swap A[r] and A[i] return Partition(A,p,r)

CPSC 411, Fall 2008: Set 1052 Partition Partition(A,p,r): x := A[r] // the pivot i := p–1 for j := p to r–1 do if A[j] ≤ x then i := i+1 swap A[i] and A[j] swap A[i+1] and A[r] return i+1 A[r]: holds pivot A[p,i]: holds elts ≤ pivot A[i+1,j]: holds elts > pivot A[j+1,r-1]: holds elts not yet processed

CPSC 411, Fall 2008: Set 1053 Partition ip,jr p,i j r r r j j jpi pij r r rpij pi ip r r

CPSC 411, Fall 2008: Set 1054 Expected Running Time of Randomized QuickSort Proportional to number of comparisons done in Partition (comparing current array element against the pivot). Compute the expected total number of comparisons done, over all executions of Partition.