Searching and Sorting Hint at Asymptotic Complexity

Slides:



Advertisements
Similar presentations
Introduction to Algorithms Quicksort
Advertisements

SORTING Lecture 12B CS2110 – Spring InsertionSort 2 pre: b 0 b.length ? post: b 0 b.length sorted inv: or: b[0..i-1] is sorted b 0 i b.length sorted.
MATH 224 – Discrete Mathematics
SEARCHING AND SORTING HINT AT ASYMPTOTIC COMPLEXITY Lecture 9 CS2110 – Spring 2015 We may not cover all this material.
Recitation on analysis of algorithms. runtimeof MergeSort /** Sort b[h..k]. */ public static void mS(Comparable[] b, int h, int k) { if (h >= k) return;
CS 1110 Prelim III: Review Session 1. Info My name: Bruno Abrahao – We have two other TA’s in the room to help you individually Beibei Zhu Suyong Zhao.
1 Algorithmic analysis Introduction. This handout tells you what you are responsible for concerning the analysis of algorithms You are responsible for:
Sorting and Asymptotic Complexity
(c) , University of Washington
Recitation 11 Analysis of Algorithms and inductive proofs 1.
C++ Programming: From Problem Analysis to Program Design, Second Edition Chapter 19: Searching and Sorting.
CS 61B Data Structures and Programming Methodology July 28, 2008 David Sun.
CS 100Lecture 131 Announcements Exam stats P3 due on Thursday.
CS November 2010 Developing array algorithms. Reading: Haikus (5-7-5) seen on Japanese computer monitors Yesterday it worked. Today it is.
Recitation on analysis of algorithms. Formal definition of O(n) We give a formal definition and show how it is used: f(n) is O(g(n)) iff There is a positive.
1 CS November 2010 insertion sort, selection sort, quick sort Do exercises on pp to get familiar with concepts and develop skill. Practice.
1 CS April 2010 binary search, linear search insertion sort, selection sort, quick sort Do exercises on pp to get familiar with concepts.
1 CS April 2009 Sorting: insertion sort, selection sort, quick sort Do exercises on pp to get familiar with concepts and develop skill.
0 Introduction to asymptotic complexity Search algorithms You are responsible for: Weiss, chapter 5, as follows: 5.1 What is algorithmic analysis? 5.2.
Quicksort This is probably the most popular sorting algorithm. It was invented by the English Scientist C.A.R. Hoare It is popular because it works well.
1 CS November 2008 Sorting: insertion sort, selection sort, quick sort Do exercises on pp to get familiar with concepts and develop skill.
CORRECTNESS ISSUES AND LOOP INVARIANTS Lecture 8 CS2110 – Fall 2014.
1 CS April 2010 insertion sort, selection sort, quick sort Do exercises on pp to get familiar with concepts and develop skill. Practice.
CORRECTNESS ISSUES AND LOOP INVARIANTS Lecture 8 CS2110 – Fall 2015.
16 Searching and Sorting.
Correctness issues and Loop invariants
COP 3503 FALL 2012 Shayan Javed Lecture 15
Recitation 13 Searching and Sorting.
Simple Sorting Algorithms
Quicksort
Quicksort "There's nothing in your head the sorting hat can't see. So try me on and I will tell you where you ought to be." -The Sorting Hat, Harry Potter.
Teach A level Computing: Algorithms and Data Structures
Quicksort 1.
Big-Oh and Execution Time: A Review
Building Java Programs
Analysis of Algorithms
Algorithm design and Analysis
Announcements P2 is due tomorrow Prelim on Monday
Algorithm An algorithm is a finite set of steps required to solve a problem. An algorithm must have following properties: Input: An algorithm must have.
Data Structures Review Session
8/04/2009 Many thanks to David Sun for some of the included slides!
Searching, Sorting, and Asymptotic Complexity
"Organizing is what you do before you do something, so that when you do it, it is not all mixed up." ~ A. A. Milne Sorting Lecture 11 CS2110 – Fall 2018.
CS100A Lecture 15, 17 22, 29 October 1998 Sorting
25 Searching and Sorting Many slides modified by Prof. L. Lilien (even many without an explicit message indicating an update). Slides added or modified.
Searching CLRS, Sections 9.1 – 9.3.
24 Searching and Sorting.
Sub-Quadratic Sorting Algorithms
Searching, Sorting, and Asymptotic Complexity
Searching, Sorting, and Asymptotic Complexity
Asymptotic complexity Searching/sorting
Quicksort.
"Organizing is what you do before you do something, so that when you do it, it is not all mixed up." ~ A. A. Milne Sorting Lecture 11 CS2110 – Spring 2019.
Binary Search and Loop invariants
Asymptotic complexity
CSC 143 Java Sorting.
CS November 2010 Developing array algorithms. Reading:
Sorting and Asymptotic Complexity
Quicksort.
Searching and Sorting Hint at Asymptotic Complexity
Sum this up for me Let’s write a method to calculate the sum from 1 to some n public static int sum1(int n) { int sum = 0; for (int i = 1; i
CS100A Lecture 15, 17 22, 29 October 1998 Sorting
Searching and Sorting Hint at Asymptotic Complexity
Developing loops from invariants
Linear and Binary Search
Searching and Sorting Hint at Asymptotic Complexity
Quicksort.
Sorting and selection Prof. Noah Snavely CS1114
Algorithm Analysis How can we demonstrate that one algorithm is superior to another without being misled by any of the following problems: Special cases.
CMPT 225 Lecture 16 – Heap Sort.
Presentation transcript:

Searching and Sorting Hint at Asymptotic Complexity We may not cover all this material Searching and Sorting Hint at Asymptotic Complexity Lecture 9 CS2110 – Fall 2015

Miscellaneous Prelim a week from now. Thursday night. By tonight, all people with conflicts should either have emailed Megan or completed assignment P1Conflict. (36 did so, till now.) Review session Sunday 1-3PM, Kimball B11. Next week’s recitation also a review. A3 due Monday night. Group early! Only 328 views of the piazza A3 FAQ. Piazza Supplemental study material. We will be putting something on it soon about loop invariants –up to last lecture. Sorry for the mistakes in uploading todays’ lecture to the CMS. My mistake. Usually I check when I upload something. This time, in a hurry, I didn’t.

Last lecture: binary search pre: b 0 b.length post: b 0 h b.length <= v > v ? inv: b 0 h t b.length <= v ? > v Methodology: Draw the invariant as a combination of pre and post Develop loop using 4 loopy questions. Practice doing this! h= –1; t= b.length; while (h != t–1) { int e= (h+t)/2; if (b[e] <= v) h= e; else t= e; }

Binary search: find position h of v = 5 pre: array is sorted h = -1 t = 11 1 4 5 6 8 10 11 12 1 4 5 6 8 10 11 12 Loop invariant: entries h and below are <= v entries t and above are > v entries between h and t are sorted h = -1 t = 5 1 4 5 6 8 10 11 12 h = 2 t = 5 1 4 5 6 8 10 11 12 h = 3 t = 5 1 4 5 6 8 10 11 12 h = 3 t = 4 5 1 4 5 6 8 10 11 12 post: <= v h > v

Binary search: an O(log n) algorithm inv: b 0 h t b.length = n <= v ? > v h= –1; t= b.length; while (h != t–1) { int e= (h+t)/2; if (b[e] <= v) h= e; else t= e; } Suppose initially: b.length = 2k – 1 Initially, h = -1, t = 2k -1, t - h = 2k Can show that one iteration sets h or t so that t - h = 2k-1 e.g. Set e to (h+t)/2 = (2k – 2)/2 = 2k-1 – 1 Set t to e, i.e. to 2k-1 – 1 Then t - h = 2k-1 – 1 + 1 = 2k-1 Careful calculation shows that: each iteration halves t – h !! Initially t - h = 2k Loop iterates exactly k times

Binary search: an O(log n) algorithm Search array with 32767 elements, only 15 iterations! Bsearch: h= –1; t= b.length; while (h != t–1) { int e= (h+t)/2; if (b[e] <= v) h= e; else t= e; } If n = 2k, k is called log(n) That’s the base 2 logarithm n log(n) 1 = 20 0 2 = 21 1 4 = 22 2 8 = 23 3 31768 = 215 15 Each iteration takes constant time (a few assignments and an if). Bsearch executes ~log n iterations for an array of size n. So the number of assignments and if-tests made is proportional to log n. Therefore, Bsearch is called an order log n algorithm, written O(log n). (We’ll formalize this notation later.)

Linear search: Find first position of v in b (if in) pre: b 0 b.length ? Store in h to truthify: post: b 0 h b.length v not here ? and h = b.length or b[h] = v Stop when this is true inv: b 0 h b.length v not here ? loopy question 1? h= 0; loopy question 2? h= 0; while (h != b.length && b[h] != v) h= h+1; loopy question 3? h= h+1; loopy question 4? OK! B: h != b.length && b[h] != v

Linear search: Find first position of v in b (if in) pre: b 0 b.length ? Store in h to truthify: post: b 0 h b.length v not here ? and h = b.length or b[h] = v Worst case: for array of size n, requires n iterations, each taking constant time. Worst-case time: O(n). Expected or average time? n/2 iterations. O(n/2) —is also O(n) inv: b 0 h b.length v not here ? h= 0; while (h != b.length && b[h] != v) h= h+1;

Looking at execution speed n*n ops Looking at execution speed Process an array of size n Number of operations executed 2n+2, n+2, n are all “order n” O(n) Called linear in n, proportional to n 2n + 2 ops n + 2 ops n ops Constant time 0 1 2 3 … size n

InsertionSort pre: b 0 b.length ? post: b 0 b.length sorted inv: or: b[0..i-1] is sorted b 0 i b.length sorted ? A loop that processes elements of an array in increasing order has this invariant inv: b 0 i b.length processed ?

Each iteration, i= i+1; How to keep inv true? b 0 i b.length sorted ? b 0 i b.length 2 5 5 5 7 3 ? e.g. b 0 i b.length 2 3 5 5 5 7 ? Push b[i] down to its shortest position in b[0..i], then increase i Will take time proportional to the number of swaps needed

What to do in each iteration? 12 inv: b 0 i b.length sorted ? b 0 i b.length 2 5 5 5 7 3 ? e.g. 2 5 5 5 3 7 ? Push b[i] to its sorted position in b[0..i], then increase i Loop body (inv true before and after) 2 5 5 3 5 7 ? 2 5 3 5 5 7 ? 2 3 5 5 5 7 ? b 0 i b.length 2 3 5 5 5 7 ?

InsertionSort Note English statement in body. // sort b[], an array of int // inv: b[0..i-1] is sorted for (int i= 1; i < b.length; i= i+1) { Push b[i] down to its sorted position in b[0..i] } Note English statement in body. Abstraction. Says what to do, not how. This is the best way to present it. Later, show how to implement that with a loop Many people sort cards this way Works well when input is nearly sorted

InsertionSort // sort b[], an array of int // inv: b[0..i-1] is sorted for (int i= 1; i < b.length; i= i+1) { Push b[i] down to its sorted position in b[0..i] } Worst-case: O(n2) (reverse-sorted input) Best-case: O(n) (sorted input) Expected case: O(n2) Pushing b[i] down can take i swaps. Worst case takes 1 + 2 + 3 + … n-1 = (n-1)*n/2 Swaps. Let n = b.length

SelectionSort pre: b 0 b.length ? post: b 0 b.length sorted inv: b 0 i b.length sorted , <= b[i..] >= b[0..i-1] Additional term in invariant Keep invariant true while making progress? e.g.: b 0 i b.length 1 2 3 4 5 6 9 9 9 7 8 6 9 Increasing i by 1 keeps inv true only if b[i] is min of b[i..]

SelectionSort Another common way for people to sort cards Runtime //sort b[], an array of int // inv: b[0..i-1] sorted // b[0..i-1] <= b[i..] for (int i= 1; i < b.length; i= i+1) { int m= index of minimum of b[i..]; Swap b[i] and b[m]; } Another common way for people to sort cards Runtime Worst-case O(n2) Best-case O(n2) Expected-case O(n2) sorted, smaller values larger values b 0 i length Each iteration, swap min value of this section into b[i]

Swapping b[i] and b[m] // Swap b[i] and b[m] int t= b[i]; b[i]= b[m]; b[m]= t;

Partition algorithm of quicksort Swap array values around until b[h..k] looks like this: x ? h h+1 k x is called the pivot pre: <= x x >= x h j k post:

20 31 24 19 45 56 4 20 5 72 14 99 pivot partition j 19 4 5 14 20 31 24 45 56 20 72 99 Not yet sorted The 20 could be in the other partition these can be in any order these can be in any order

Partition algorithm x ? h h+1 k pre: b <= x x >= x h j k post: b invariant needs at least 4 sections Combine pre and post to get an invariant <= x x ? >= x h j t k b

Partition algorithm h j t k <= x x ? >= x h j t k b Initially, with j = h and t = k, this diagram looks like the start diagram j= h; t= k; while (j < t) { if (b[j+1] <= b[j]) { Swap b[j+1] and b[j]; j= j+1; } else { Swap b[j+1] and b[t]; t= t-1; } Terminate when j = t, so the “?” segment is empty, so diagram looks like result diagram Takes linear time: O(k+1-h)

QuickSort procedure /** Sort b[h..k]. */ public static void QS(int[] b, int h, int k) { if (b[h..k] has < 2 elements) return; Base case Function does the partition algorithm and returns position j of pivot int j= partition(b, h, k); // We know b[h..j–1] <= b[j] <= b[j+1..k] } //Sort b[h..j-1] and b[j+1..k] QS(b, h, j-1); QS(b, j+1, k);

QuickSort Quicksort developed by Sir Tony Hoare (he was knighted by the Queen of England for his contributions to education and CS). 81 years old. Developed Quicksort in 1958. But he could not explain it to his colleague, so he gave up on it. Later, he saw a draft of the new language Algol 58 (which became Algol 60). It had recursive procedures. First time in a procedural programming language. “Ah!,” he said. “I know how to write it better now.” 15 minutes later, his colleague also understood it.

Worst case quicksort: pivot always smallest value x0 >= x0 j partioning at depth 0 x0 x1 >= x1 j partioning at depth 1 x0 x1 x2 >= x2 j partioning at depth 2 /** Sort b[h..k]. */ public static void QS(int[] b, int h, int k) { if (b[h..k] has < 2 elements) return; int j= partition(b, h, k); QS(b, h, j-1); QS(b, j+1, k);

Best case quicksort: pivot always middle value <= x0 x0 >= x0 0 j n depth 0. 1 segment of size ~n to partition. <=x1 x1 >= x1 x0 <=x2 x2 >=x2 Depth 2. 2 segments of size ~n/2 to partition. Depth 3. 4 segments of size ~n/4 to partition. Max depth: about log n. Time to partition on each level: ~n Total time: O(n log n). Average time for Quicksort: n log n. Difficult calculation

QuickSort procedure /** Sort b[h..k]. */ public static void QS(int[] b, int h, int k) { if (b[h..k] has < 2 elements) return; int j= partition(b, h, k); // We know b[h..j–1] <= b[j] <= b[j+1..k] // Sort b[h..j-1] and b[j+1..k] QS(b, h, j-1); QS(b, j+1, k); } Worst-case: quadratic Average-case: O(n log n) Worst-case space: O(n*n)! --depth of recursion can be n Can rewrite it to have space O(log n) Average-case: O(n * log n)

Partition algorithm Key issue: Choosing pivot How to choose a pivot? Choosing pivot Ideal pivot: the median, since it splits array in half But computing median of unsorted array is O(n), quite complicated Popular heuristics: Use first array value (not good) middle array value median of first, middle, last, values GOOD! Choose a random element

Quicksort with logarithmic space Problem is that if the pivot value is always the smallest (or always the largest), the depth of recursion is the size of the array to sort. Eliminate this problem by doing some of it iteratively and some recursively

Quicksort with logarithmic space Problem is that if the pivot value is always the smallest (or always the largest), the depth of recursion is the size of the array to sort. Eliminate this problem by doing some of it iteratively and some recursively

QuickSort with logarithmic space /** Sort b[h..k]. */ public static void QS(int[] b, int h, int k) { int h1= h; int k1= k; // invariant b[h..k] is sorted if b[h1..k1] is sorted while (b[h1..k1] has more than 1 element) { Reduce the size of b[h1..k1], keeping inv true }

QuickSort with logarithmic space /** Sort b[h..k]. */ public static void QS(int[] b, int h, int k) { int h1= h; int k1= k; // invariant b[h..k] is sorted if b[h1..k1] is sorted while (b[h1..k1] has more than 1 element) { int j= partition(b, h1, k1); // b[h1..j-1] <= b[j] <= b[j+1..k1] if (b[h1..j-1] smaller than b[j+1..k1]) { QS(b, h, j-1); h1= j+1; } else {QS(b, j+1, k1); k1= j-1; } } Only the smaller segment is sorted recursively. If b[h1..k1] has size n, the smaller segment has size < n/2. Therefore, depth of recursion is at most log n