Algorithm Analysis. Analysis of Algorithms / Slide 2 Introduction * Data structures n Methods of organizing data * What is Algorithm? n a clearly specified.

Slides:



Advertisements
Similar presentations
MATH 224 – Discrete Mathematics
Advertisements

CSE 373: Data Structures and Algorithms Lecture 5: Math Review/Asymptotic Analysis III 1.
Razdan with contribution from others 1 Algorithm Analysis What is the Big ‘O Bout? Anshuman Razdan Div of Computing.
Chapter 1 – Basic Concepts
Fall 2006CENG 7071 Algorithm Analysis. Fall 2006CENG 7072 Algorithmic Performance There are two aspects of algorithmic performance: Time Instructions.
Chapter 2: Algorithm Analysis
Introduction to Analysis of Algorithms
Analysis of Algorithms Review COMP171 Fall 2005 Adapted from Notes of S. Sarkar of UPenn, Skiena of Stony Brook, etc.
Algorithm Analysis. Math Review – 1.2 Exponents –X A X B = X A+B –X A /X B =X A-B –(X A ) B = X AB –X N +X N = 2X N ≠ X 2N –2 N+ 2 N = 2 N+1 Logarithms.
Algorithm Analysis CS 201 Fundamental Structures of Computer Science.
CSC 2300 Data Structures & Algorithms January 26, 2007 Chapter 2. Algorithm Analysis.
 Last lesson  Arrays for implementing collection classes  Performance analysis (review)  Today  Performance analysis  Logarithm.
Data Structure Algorithm Analysis TA: Abbas Sarraf
Analysis of Algorithms 7/2/2015CS202 - Fundamentals of Computer Science II1.
Cpt S 223 – Advanced Data Structures
Summary of Algo Analysis / Slide 1 Algorithm complexity * Bounds are for the algorithms, rather than programs n programs are just implementations of an.
Analysis of Algorithms COMP171 Fall Analysis of Algorithms / Slide 2 Introduction * What is Algorithm? n a clearly specified set of simple instructions.
Analysis of Algorithms Spring 2015CS202 - Fundamentals of Computer Science II1.
Insertion Sort CSE 331 Section 2 James Daly. Insertion Sort Basic idea: keep a sublist sorted, then add new items into the correct place to keep it sorted.
CSE373: Data Structures and Algorithms Lecture 4: Asymptotic Analysis Aaron Bauer Winter 2014.
Chapter 6 Algorithm Analysis Bernard Chen Spring 2006.
Asymptotic Notations Iterative Algorithms and their analysis
CS 201 Data Structures and Algorithms Chapter 2: Algorithm Analysis - II Text: Read Weiss, §2.4.3 – Izmir University of Economics.
Program Performance & Asymptotic Notations CSE, POSTECH.
Week 2 CS 361: Advanced Data Structures and Algorithms
1 Chapter 24 Developing Efficient Algorithms. 2 Executing Time Suppose two algorithms perform the same task such as search (linear search vs. binary search)
Introduction to complexity. 2 Analysis of Algorithms Why do we need to analyze algorithms? –To measure the performance –Comparison of different algorithms.
Vishnu Kotrajaras, PhD.1 Data Structures. Vishnu Kotrajaras, PhD.2 Introduction Why study data structure?  Can understand more code.  Can choose a correct.
Algorithm Analysis. Algorithm An algorithm is a clearly specified set of instructions which, when followed, solves a problem. recipes directions for putting.
Algorithm Efficiency CS 110: Data Structures and Algorithms First Semester,
Iterative Algorithm Analysis & Asymptotic Notations
Algorithm Analysis An algorithm is a clearly specified set of simple instructions to be followed to solve a problem. Three questions for algorithm analysis.
Analysis of Algorithms
CS 221 Analysis of Algorithms Instructor: Don McLaughlin.
1 COMP3040 Tutorial 1 Analysis of algorithms. 2 Outline Motivation Analysis of algorithms Examples Practice questions.
CSS342: Algorithm Analysis1 Professor: Munehiro Fukuda.
Introduction to Analysis of Algorithms COMP171 Fall 2005.
Program Efficiency & Complexity Analysis. Algorithm Review An algorithm is a definite procedure for solving a problem in finite number of steps Algorithm.
Chapter 10 Algorithm Analysis.  Introduction  Generalizing Running Time  Doing a Timing Analysis  Big-Oh Notation  Analyzing Some Simple Programs.
Algorithm Efficiency There are often many approaches (algorithms) to solve a problem. How do we choose between them? At the heart of a computer program.
Algorithm Analysis Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2008.
Recap Introduction to Algorithm Analysis Different Functions Function’s Growth Rate Three Problems Related to Algorithm Running Time Find Minimum in an.
1 Asymptotic Notations Iterative Algorithms and their analysis Asymptotic Notations –Big O,  Notations Review of Discrete Math –Summations –Logarithms.
CE 221 Data Structures and Algorithms Chapter 2: Algorithm Analysis - I Text: Read Weiss, §2.1 – Izmir University of Economics.
DATA STRUCTURES AND ALGORITHMS Lecture Notes 2 Prepared by İnanç TAHRALI.
Algorithm Analysis Part of slides are borrowed from UST.
CSC Devon M. Simmonds University of North Carolina, Wilmington CSC231 Data Structures.
1/6/20161 CS 3343: Analysis of Algorithms Lecture 2: Asymptotic Notations.
Algorithm Analysis (Big O)
CS 150: Analysis of Algorithms. Goals for this Unit Begin a focus on data structures and algorithms Understand the nature of the performance of algorithms.
Algorithm Analysis. What is an algorithm ? A clearly specifiable set of instructions –to solve a problem Given a problem –decide that the algorithm is.
E.G.M. PetrakisAlgorithm Analysis1  Algorithms that are equally correct can vary in their utilization of computational resources  time and memory  a.
Big O David Kauchak cs302 Spring Administrative Assignment 1: how’d it go? Assignment 2: out soon… Lab code.
Vishnu Kotrajaras, PhD.1 Data Structures
1 Chapter 2 Algorithm Analysis All sections. 2 Complexity Analysis Measures efficiency (time and memory) of algorithms and programs –Can be used for the.
1 Chapter 2 Algorithm Analysis Reading: Chapter 2.
Ch03-Algorithms 1. Algorithms What is an algorithm? An algorithm is a finite set of precise instructions for performing a computation or for solving a.
Analysis of Algorithms Spring 2016CS202 - Fundamentals of Computer Science II1.
GC 211:Data Structures Week 2: Algorithm Analysis Tools Slides are borrowed from Mr. Mohammad Alqahtani.
Chapter 15 Running Time Analysis. Topics Orders of Magnitude and Big-Oh Notation Running Time Analysis of Algorithms –Counting Statements –Evaluating.
Data Structures I (CPCS-204) Week # 2: Algorithm Analysis tools Dr. Omar Batarfi Dr. Yahya Dahab Dr. Imtiaz Khan.
Chapter 2 Algorithm Analysis
Algorithm Analysis CSE 2011 Winter September 2018.
Algorithm Analysis (not included in any exams!)
CS 201 Fundamental Structures of Computer Science
Programming and Data Structure
Programming and Data Structure
CE 221 Data Structures and Algorithms
Estimating Algorithm Performance
Presentation transcript:

Algorithm Analysis

Analysis of Algorithms / Slide 2 Introduction * Data structures n Methods of organizing data * What is Algorithm? n a clearly specified set of simple instructions on the data to be followed to solve a problem  Takes a set of values, as input and  produces a value, or set of values, as output n May be specified  In English  As a computer program  As a pseudo-code * Program = data structures + algorithms

Analysis of Algorithms / Slide 3 Introduction * Why need algorithm analysis ? n writing a working program is not good enough n The program may be inefficient! n If the program is run on a large data set, then the running time becomes an issue

Analysis of Algorithms / Slide 4 Example: Selection Problem * Given a list of N numbers, determine the kth largest, where k  N. * Algorithm 1: (1) Read N numbers into an array (2) Sort the array in decreasing order by some simple algorithm (3) Return the element in position k

Analysis of Algorithms / Slide 5 * Algorithm 2: (1) Read the first k elements into an array and sort them in decreasing order (2) Each remaining element is read one by one  If smaller than the kth element, then it is ignored  Otherwise, it is placed in its correct spot in the array, bumping one element out of the array. (3) The element in the kth position is returned as the answer.

Analysis of Algorithms / Slide 6 * Which algorithm is better when n N =100 and k = 100? n N =100 and k = 1? * What happens when n N = 1,000,000 and k = 500,000? * We come back after sorting analysis, and there exist better algorithms

Analysis of Algorithms / Slide 7 Algorithm Analysis * We only analyze correct algorithms * An algorithm is correct n If, for every input instance, it halts with the correct output * Incorrect algorithms n Might not halt at all on some input instances n Might halt with other than the desired answer * Analyzing an algorithm n Predicting the resources that the algorithm requires n Resources include  Memory  Communication bandwidth  Computational time (usually most important)

Analysis of Algorithms / Slide 8 * Factors affecting the running time n computer n compiler n algorithm used n input to the algorithm  The content of the input affects the running time  typically, the input size (number of items in the input) is the main consideration l E.g. sorting problem  the number of items to be sorted l E.g. multiply two matrices together  the total number of elements in the two matrices * Machine model assumed n Instructions are executed one after another, with no concurrent operations  Not parallel computers

Analysis of Algorithms / Slide 9 Different approaches * Empirical: run an implemented system on real-world data. Notion of benchmarks. * Simulational: run an implemented system on simulated data. * Analytical: use theoretic-model data with a theoretical model system. We do this in 171!

Analysis of Algorithms / Slide 10 Example * Calculate * Lines 1 and 4 count for one unit each * Line 3: executed N times, each time four units * Line 2: (1 for initialization, N+1 for all the tests, N for all the increments) total 2N + 2 * total cost: 6N + 4  O(N) N+2 4N 1

Analysis of Algorithms / Slide 11 Worst- / average- / best-case * Worst-case running time of an algorithm n The longest running time for any input of size n n An upper bound on the running time for any input  guarantee that the algorithm will never take longer n Example: Sort a set of numbers in increasing order; and the data is in decreasing order n The worst case can occur fairly often  E.g. in searching a database for a particular piece of information * Best-case running time n sort a set of numbers in increasing order; and the data is already in increasing order * Average-case running time n May be difficult to define what “average” means

Analysis of Algorithms / Slide 12 Running-time of algorithms * Bounds are for the algorithms, rather than programs n programs are just implementations of an algorithm, and almost always the details of the program do not affect the bounds * Algorithms are often written in pseudo-codes n We use ‘almost’ something like C++. * Bounds are for algorithms, rather than problems n A problem can be solved with several algorithms, some are more efficient than others

Analysis of Algorithms / Slide 13 Growth Rate * The idea is to establish a relative order among functions for large n *  c, n 0 > 0 such that f(N)  c g(N) when N  n 0 * f(N) grows no faster than g(N) for “large” N

Analysis of Algorithms / Slide 14 Typical Growth Rates

Analysis of Algorithms / Slide 15 Growth rates … * Doubling the input size n f(N) = c  f(2N) = f(N) = c n f(N) = log N  f(2N) = f(N) + log 2 n f(N) = N  f(2N) = 2 f(N) n f(N) = N 2  f(2N) = 4 f(N) n f(N) = N 3  f(2N) = 8 f(N) n f(N) = 2 N  f(2N) = f 2 (N) * Advantages of algorithm analysis n To eliminate bad algorithms early n pinpoints the bottlenecks, which are worth coding carefully

Analysis of Algorithms / Slide 16 Asymptotic notations * Upper bound O(g(N) * Lower bound  (g(N)) * Tight bound  (g(N))

Analysis of Algorithms / Slide 17 Asymptotic upper bound: Big-Oh * f(N) = O(g(N)) * There are positive constants c and n 0 such that f(N)  c g(N) when N  n 0 * The growth rate of f(N) is less than or equal to the growth rate of g(N) * g(N) is an upper bound on f(N)

Analysis of Algorithms / Slide 18 * In calculus: the errors are of order Delta x, we write E = O(Delta x). This means that E <= C Delta x. * O(*) is a set, f is an element, so f=O(*) is f in O(*) * 2N^2+O(N) is equivelent to 2N^2+f(N) and f(N) in O(N).

Analysis of Algorithms / Slide 19 Big-Oh: example * Let f(N) = 2N 2. Then n f(N) = O(N 4 ) n f(N) = O(N 3 ) n f(N) = O(N 2 ) (best answer, asymptotically tight) * O(N 2 ): reads “order N-squared” or “Big-Oh N-squared”

Analysis of Algorithms / Slide 20 Some rules for big-oh * Ignore the lower order terms * Ignore the coefficients of the highest-order term * No need to specify the base of logarithm n Changing the base from one constant to another changes the value of the logarithm by only a constant factor * T 1 (N) + T 2 (N) = max( O(f(N)), O(g(N)) ), * T 1 (N) * T 2 (N) = O( f(N) * g(N) ) If T1(N) = O(f(N) and T2(N) = O(g(N)),

Analysis of Algorithms / Slide 21 Big Oh: more examples * N 2 / 2 – 3N = O(N 2 ) * 1 + 4N = O(N) * 7N N + 3 = O(N 2 ) = O(N 3 ) * log 10 N = log 2 N / log 2 10 = O(log 2 N) = O(log N) * sin N = O(1); 10 = O(1), = O(1) * * log N + N = O(N) * log k N = O(N) for any constant k * N = O(2 N ), but 2 N is not O(N) * 2 10N is not O(2 N )

Analysis of Algorithms / Slide 22 Math Review

Analysis of Algorithms / Slide 23 lower bound *  c, n 0 > 0 such that f(N)  c g(N) when N  n 0 * f(N) grows no slower than g(N) for “large” N

Analysis of Algorithms / Slide 24 Asymptotic lower bound: Big- Omega * f(N) =  (g(N)) * There are positive constants c and n 0 such that f(N)  c g(N) when N  n 0 * The growth rate of f(N) is greater than or equal to the growth rate of g(N). * g(N) is a lower bound on f(N).

Analysis of Algorithms / Slide 25 Big-Omega: examples * Let f(N) = 2N 2. Then n f(N) =  (N) n f(N) =  (N 2 ) (best answer)

Analysis of Algorithms / Slide 26 tight bound * the growth rate of f(N) is the same as the growth rate of g(N)

Analysis of Algorithms / Slide 27 Asymptotically tight bound: Big-Theta * f(N) =  (g(N)) iff f(N) = O(g(N)) and f(N) =  (g(N)) * The growth rate of f(N) equals the growth rate of g(N) * Big-Theta means the bound is the tightest possible. * Example: Let f(N)=N 2, g(N)=2N 2 n Since f(N) = O(g(N)) and f(N) =  (g(N)), thus f(N) =  (g(N)).

Analysis of Algorithms / Slide 28 Some rules * If T(N) is a polynomial of degree k, then T(N) =  (N k ). * For logarithmic functions, T(log m N) =  (log N).

Analysis of Algorithms / Slide 29 General Rules * Loops n at most the running time of the statements inside the for-loop (including tests) times the number of iterations. n O(N) * Nested loops n the running time of the statement multiplied by the product of the sizes of all the for-loops. n O(N 2 )

Analysis of Algorithms / Slide 30 * Consecutive statements n These just add n O(N) + O(N 2 ) = O(N 2 )  Conditional: If S1 else S2 n never more than the running time of the test plus the larger of the running times of S1 and S2. n O(1)

Analysis of Algorithms / Slide 31 Using L' Hopital's rule * ‘rate’ is the first derivative * L' Hopital's rule n If and then = * Determine the relative growth rates (using L' Hopital's rule if necessary) n compute n if 0: f(N) = o(g(N)) and f(N) is not  (g(N)) n if constant  0: f(N) =  (g(N)) n if  : f(N) =  (f(N)) and f(N) is not  (g(N)) n limit oscillates: no relation This is rarely used in 171, as we know the relative growth rates of most of functions used in 171!

Analysis of Algorithms / Slide 32 Our first example: search of an ordered array * Linear search and binary search * Upper bound, lower bound and tight bound

Analysis of Algorithms / Slide 33 O(N) // Given an array of ‘size’ in increasing order, find ‘x’ int linearsearch(int* a[], int size,int x) { int low=0, high=size-1; for (int i=0; i<size;i++) if (a[i]==x) return i; return -1; } Linear search:

Analysis of Algorithms / Slide 34 int bsearch(int* a[],int size,int x) { int low=0, high=size-1; while (low<=higt) { int mid=(low+high)/2; if (a[mid]<x) low=mid+1; else if (x<a[mid]) high=mid-1; else return mid; } return -1 } Iterative binary search:

Analysis of Algorithms / Slide 35 int bsearch(int* a[],int size,int x) { int low=0, high=size-1; while (low<=higt) { int mid=(low+high)/2; if (a[mid]<x) low=mid+1; else if (x<a[mid]) high=mid-1; else return mid; } return -1 } Iterative binary search: l n=high-low l n_i+1 <= n_i / 2 l i.e. n_i <= (N-1)/2^{i-1} l N stops at 1 or below l there are at most 1+k iterations, where k is the smallest such that (N-1)/2^{k-1} <= 1 l so k is at most 2+log(N-1) l O(log N)

Analysis of Algorithms / Slide 36 O(1) T(N/2) int bsearch(int* a[],int low, int high, int x) { if (low>high) return -1; else int mid=(low+high)/2; if (x=a[mid]) return mid; else if(a[mid]<x) bsearch(a,mid+1,high,x); else bsearch(a,low,mid-1); } O(1) Recursive binary search:

Analysis of Algorithms / Slide 37 * With 2 k = N (or asymptotically), k=log N, we have * Thus, the running time is O(log N) Solving the recurrence:

Analysis of Algorithms / Slide 38 * Consider a sequence of 0, 1, 2, …, N-1, and search for 0 * At least log N steps if N = 2^k * An input of size n must take at least log N steps * So the lower bound is Omega(log N) * So the bound is tight, Theta(log N) Lower bound, usually harder than upper bound to prove, informally, l find one input example, l that input has to do ‘at least’ an amount of work l that amount is a lower bound

Analysis of Algorithms / Slide 39 Another Example * Maximum Subsequence Sum Problem * Given (possibly negative) integers A 1, A 2,...., A n, find the maximum value of n For convenience, the maximum subsequence sum is 0 if all the integers are negative * E.g. for input –2, 11, -4, 13, -5, -2 n Answer: 20 (A 2 through A 4 )

Analysis of Algorithms / Slide 40 Algorithm 1: Simple * Exhaustively tries all possibilities (brute force) * O(N 3 ) N N-i, at most N j-i+1, at most N

Analysis of Algorithms / Slide 41 Algorithm 2: improved N N-i, at most N O(N^2) // Given an array from left to right int maxSubSum(const int* a[], const int size) { int maxSum = 0; for (int i=0; i< size; i++) { int thisSum =0; for (int j = i; j < size; j++) { thisSum += a[j]; if(thisSum > maxSum) maxSum = thisSum; } return maxSum; }

Analysis of Algorithms / Slide 42 Algorithm 3: Divide-and-conquer * Divide-and-conquer n split the problem into two roughly equal subproblems, which are then solved recursively n patch together the two solutions of the subproblems to arrive at a solution for the whole problem  The maximum subsequence sum can be  Entirely in the left half of the input  Entirely in the right half of the input  It crosses the middle and is in both halves

Analysis of Algorithms / Slide 43 * The first two cases can be solved recursively * For the last case: n find the largest sum in the first half that includes the last element in the first half n the largest sum in the second half that includes the first element in the second half n add these two sums together

Analysis of Algorithms / Slide 44 O(1) T(N/2) O(N) O(1) T(N/2) // Given an array from left to right int maxSubSum(a,left,right) { if (left==right) return a[left]; else mid=(left+right)/2 maxLeft=maxSubSum(a,left,mid); maxRight=maxSubSum(a,mid+1,right); maxLeftBorder=0; leftBorder=0; for(i = mid; i>= left, i--) { leftBorder += a[i]; if (leftBorder>maxLeftBorder) maxLeftBorder=leftBorder; } // same for the right maxRightBorder=0; rightBorder=0; for … { } return max3(maxLeft,maxRight, maxLeftBorder+maxRightBorder); } O(N)

Analysis of Algorithms / Slide 45 * Recurrence equation n 2 T(N/2): two subproblems, each of size N/2 n N: for “patching” two solutions to find solution to whole problem

Analysis of Algorithms / Slide 46 * With 2 k = N (or asymptotically), k=log N, we have * Thus, the running time is O(N log N)  faster than Algorithm 1 for large data sets

Analysis of Algorithms / Slide 47 * It is also easy to see that lower bounds of algorithm 1, 2, and 3 are Omega(N^3), Omega(N^2), and Omega(N log N). * So these bounds are tight.