Lecture 10 CS203
Sorting

Sorting is a classic subject in computer science. There are three reasons for studying sorting algorithms. First, sorting algorithms illustrate many creative approaches to problem solving that can be applied to other problems. Second, sorting algorithms are good for practicing fundamental programming techniques using selection statements, loops, methods, and arrays. Third, sorting algorithms are excellent examples to demonstrate algorithm performance.
Sorting

These sorting algorithms apply to sorting any type of object, as long as we can find a way to order them. For simplicity, though, we will first sort numeric values, then more complex objects. When we sort more complex objects, we will sort them by some key. For example, if class Student has instance variables representing CIN, GPA, and name, we will sort according to one of these or some combination of them. We have already done this with the priority queue and other examples.
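A minimal sketch of sorting by key in Java. The Student fields and values here are hypothetical stand-ins, not taken from the course code; Comparator lets us pick which instance variable serves as the key:

import java.util.Arrays;
import java.util.Comparator;

public class SortByKeyDemo {
    // hypothetical Student class; the field names are assumptions for illustration
    static class Student {
        String cin;
        double gpa;
        String name;
        Student(String cin, double gpa, String name) {
            this.cin = cin; this.gpa = gpa; this.name = name;
        }
    }

    public static void main(String[] args) {
        Student[] roster = {
            new Student("301", 3.2, "Ana"),
            new Student("100", 3.9, "Ben")
        };
        // sort by one key (GPA); cin or name could serve as the key instead
        Arrays.sort(roster, Comparator.comparingDouble((Student s) -> s.gpa));
        System.out.println(roster[0].name); // prints Ana (lowest GPA first)
    }
}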
Sorting

Arrays and Lists are reference types; that is, the variable we pass between methods does not contain the array or list, but a reference to it. Therefore, you can sort an array or list in Java with a void method that takes the reference variable and sorts the elements without returning anything. All other references to the data structure can still be used to access the sorted structure. On the other hand, you can also construct a new sorted list and return it. This does not necessarily require copying the objects themselves. This practice may become more common in the near future for reasons you will learn about in a few weeks.
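A minimal sketch of the in-place, void-method style (the names are ours, and the library call java.util.Arrays.sort stands in for the sorts we write later):

import java.util.Arrays;

public class ReferenceDemo {
    // a void sort: it mutates the array through the reference and returns nothing
    static void sortInPlace(int[] data) {
        Arrays.sort(data);
    }

    public static void main(String[] args) {
        int[] numbers = { 3, 1, 2 };
        int[] alias = numbers;          // a second reference to the same array
        sortInPlace(numbers);
        System.out.println(alias[0]);   // prints 1: the alias sees the sorted array
    }
}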
Bubble Sort

Recall that bubble sort repeatedly iterates through the list to be sorted, comparing each pair of adjacent items and swapping them if they are in the wrong order. This iteration is repeated until no swaps are needed, which indicates that the list is sorted. The algorithm gets its name from the way smaller elements "bubble" to the top of the list. Text adapted from Wikipedia
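A minimal Java sketch of bubble sort, assuming an int array to be placed in any class (the method name and the early-exit swapped flag are our choices):

public static void bubbleSort(int[] a) {
    boolean swapped = true;
    // each pass floats the largest remaining key to the end;
    // stop early if a pass makes no swaps (the list is sorted)
    for (int pass = 1; pass < a.length && swapped; pass++) {
        swapped = false;
        for (int i = 0; i < a.length - pass; i++) {
            if (a[i] > a[i + 1]) {      // adjacent pair in the wrong order
                int temp = a[i];        // swap them
                a[i] = a[i + 1];
                a[i + 1] = temp;
                swapped = true;
            }
        }
    }
}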
Bubble Sort

The number of comparisons is always at least as large as the number of swaps, so in studying the time complexity we count the comparisons. The largest key always floats to its final position at the right end in the first pass, the next largest rises to the next position in the second pass, and so on.
Bubble Sort

In the best case, the data is already sorted, so we only need one pass, making bubble sort O(n) in this case. More importantly, though, in the worst case the number of comparisons is:

(n-1) + (n-2) + … + 2 + 1 = n(n-1)/2 = n²/2 − n/2

Recall that 0 + 1 + … + n = n(n+1)/2. Thus, 0 + 1 + … + (n-1) = n(n-1)/2.

We are estimating the effect of growth in n, not the exact number of CPU cycles we need. We do not care about the division by 2, since this does not affect the rate of growth. As n increases, the lower-order term n/2 is dominated by the n² term, so we also disregard the lower-order term.

Bubble sort time: O(n²)
Selection Sort and Insertion Sort

These two sorts grow a sorted sublist one element at a time. Selection sort finds the lowest value and swaps it with the value at the bottom, or finds the largest value and swaps it with the value at the top. It then repeats the process for the rest of the values, ignoring the already-sorted portion, until the list is sorted. We will assume we are moving elements to the beginning of a list. Insertion sort takes one value at a time and places it in the correct spot in the sorted sublist, in the same way most people would sort a hand of cards.
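A minimal selection sort sketch in Java, moving elements to the beginning of the array as described above (method and variable names are ours):

public static void selectionSort(int[] a) {
    for (int i = 0; i < a.length - 1; i++) {
        // find the smallest value in the unsorted portion a[i..end]
        int min = i;
        for (int j = i + 1; j < a.length; j++) {
            if (a[j] < a[min]) min = j;
        }
        // swap it into position i, growing the sorted sublist at the front
        int temp = a[i];
        a[i] = a[min];
        a[min] = temp;
    }
}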
Analyzing Selection Sort

The number of comparisons in selection sort is n-1 for the first iteration, n-2 for the second iteration, and so on. Let T(n) denote the complexity for selection sort and c denote the total number of other operations in each iteration. So,

T(n) = (n-1) + c + (n-2) + c + … + 1 + c
     = n(n-1)/2 + (n-1)c

Ignoring constants and smaller terms, the complexity of selection sort is O(n²).
Analyzing Insertion Sort With an Array

Where selection sort always inserts an element at the end of the sorted sublist, insertion sort requires inserting some elements in arbitrary places. At the kth iteration, inserting an element into a sorted array of size k may take k comparisons to find the insertion position. If the structure we are sorting is an array, it may also require k moves to insert the element, as subsequent elements are shifted back in the array until we reach the spot freed for the value being inserted. Let T(n) denote the complexity for insertion sort and c denote the total number of other operations such as assignments in each iteration. So,

T(n) = (2 × 1 + c) + (2 × 2 + c) + … + (2 × (n-1) + c)
     = n(n-1) + (n-1)c

The first term is just twice the corresponding value for selection sort. Ignoring constants and smaller terms, the complexity of the insertion sort algorithm is O(n²).
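A minimal insertion sort sketch over an array, showing the k comparisons and k moves per iteration described above (names are ours):

public static void insertionSort(int[] a) {
    for (int k = 1; k < a.length; k++) {
        int current = a[k];   // next value to place into the sorted sublist a[0..k-1]
        int j = k - 1;
        // shift larger elements back until the insertion position is free
        while (j >= 0 && a[j] > current) {
            a[j + 1] = a[j];
            j--;
        }
        a[j + 1] = current;   // drop the value into the freed spot
    }
}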
Quadratic Time

Note again that the common O(n²) complexity means that bubble sort, selection sort, and insertion sort have comparable growth in expense as n increases. It does not mean that their performance will be the same in any particular case.
Quadratic Time

An algorithm with O(n²) time complexity is called a quadratic algorithm. Algorithms with nested loops are often quadratic. A quadratic algorithm's expense grows quickly as the problem size increases. If you double the input size, the time for the algorithm is quadrupled.
Sorting

We teach bubble sort, selection sort, and insertion sort first because they are easy to understand. Other sort methods are more efficient in the average and worst cases. In particular, there are sort algorithms with O(n log n) complexity. In other words, the consumption of CPU cycles grows proportionally to n times the logarithm of n. These algorithms usually involve performing an operation that is O(log n) n times. Since log n grows much more slowly than n, this is a dramatic improvement over O(n²) (using log base 2, rounded):

If n = 2, n² = 4 and n log n = 2
If n = 100, n² = 10,000 and n log n ≈ 664
If n = 10,000, n² = 100,000,000 and n log n ≈ 132,877
Comparing Common Growth Functions

O(1)         Constant time
O(log n)     Logarithmic time
O(n)         Linear time
O(n log n)   Log-linear time
O(n²)        Quadratic time
O(n³)        Cubic time
O(2ⁿ)        Exponential time
Merge Sort

Merge sort is a recursive divide-and-conquer algorithm, like the Towers of Hanoi algorithm. The recursive method splits the list repeatedly until it is made up of single-element sublists, then merges them as the recursion unwinds:

mergeSort(list):
    split list into firstHalf and secondHalf
    firstHalf = mergeSort(firstHalf)
    secondHalf = mergeSort(secondHalf)
    list = merge(firstHalf, secondHalf)

merge:
    add the lesser of firstHalf[0] and secondHalf[0] to the new, larger list
    repeat until one of the (already sorted) sublists is exhausted
    add the rest of the remaining sublist to the larger list
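A minimal Java sketch of the pseudocode above, returning new arrays rather than sorting in place (the names and the use of Arrays.copyOfRange are our choices; place the methods in any class):

import java.util.Arrays;

public static int[] mergeSort(int[] list) {
    if (list.length <= 1) return list;   // a single-element list is already sorted
    int mid = list.length / 2;           // split the list into two halves
    int[] firstHalf  = mergeSort(Arrays.copyOfRange(list, 0, mid));
    int[] secondHalf = mergeSort(Arrays.copyOfRange(list, mid, list.length));
    return merge(firstHalf, secondHalf);
}

private static int[] merge(int[] first, int[] second) {
    int[] result = new int[first.length + second.length];
    int i = 0, j = 0, k = 0;
    // repeatedly take the lesser front element until one sublist is exhausted
    while (i < first.length && j < second.length) {
        result[k++] = (first[i] <= second[j]) ? first[i++] : second[j++];
    }
    // add the rest of the remaining sublist to the larger list
    while (i < first.length)  result[k++] = first[i++];
    while (j < second.length) result[k++] = second[j++];
    return result;
}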
Merge Sort Time

Assume n is a power of 2. This assumption makes the math simpler; if n is not a power of 2, the difference is irrelevant to the order of complexity. Merge sort splits the list into two sublists, sorts the sublists using the same algorithm recursively, and then merges the sublists. Each recursive call merge sorts half the list, so the depth of the recursion is the number of times you need to split n to get lists of size 1, i.e., log n. The single-item lists are, obviously, sorted. Merge sort reassembles the list in log n steps, just as it broke the list down. The total size of all the sublists is n, the original size of the unsorted list. Merging the sublists across one level of recursion takes at most n-1 comparisons and n copies of elements from a sublist to the merged list, so the total merge time at each level is 2n-1, which is O(n). The O(n) merging happens log n times as the full sorted array is built. Thus, merge sort is O(n log n).
Merge Sort Time

Here it is again, but with more math. Let T(n) denote the time required for sorting an array of n elements using merge sort. Without loss of generality, assume n is a power of 2. The merge sort algorithm splits the array into two sublists, sorts the sublists using the same algorithm recursively, and then merges the sublists. So,

T(n) = T(n/2) + T(n/2) + mergetime

The first T(n/2) is the time required for sorting the first half of the list, and the second T(n/2) is the time required for sorting the second half.
Merge Sort Time

Merging two sublists takes at most n-1 comparisons plus n copies, so the merge time is 2n-1, and

T(n) = 2T(n/2) + 2n − 1
     = 2(2T(n/4) + 2(n/2) − 1) + 2n − 1
     = 2²T(n/2²) + 2(2n) − 2¹ − 2⁰
     = …
     = 2ᵏT(n/2ᵏ) + 2kn − (2ᵏ⁻¹ + … + 2¹ + 2⁰)
     = 2ᵏT(n/2ᵏ) + 2kn − (2ᵏ − 1)

Setting k = log n, so that 2^(log n) = n and T(1) = 1:

T(n) = nT(1) + 2n log n − (n − 1)
     = 2n log n + 1
     = O(n log n)

The subtracted constants form a geometric series with ratio a = 2, which sums to (2ᵏ − 1)/(a − 1) = 2ᵏ − 1; since it appears as a subtractive term, the −1 from the summation becomes +1 in the final expression.
Quick Sort

Quick sort, developed by C. A. R. Hoare (1962), works as follows. These slides assume we are sorting an array, but the sort works similarly with a list:

Select an element, called the pivot, in the array.
Partition the array into one part that contains only elements less than or equal to the pivot, and another part that contains only elements greater than the pivot.
Recursively apply the quick sort algorithm to the first part and then the second part.
Quick Sort

function quicksort(a)
    // an array of zero or one elements is already sorted
    if length(a) ≤ 1
        return a
    select and remove a pivot element pivot from a
    create empty lists less and greater
    for each x in a
        if x ≤ pivot then append x to less
        else append x to greater
    // two recursive calls
    return concatenate(quicksort(less), list(pivot), quicksort(greater))

The earliest version of quicksort used the first index as the pivot, and demos of quicksort often still do this for simplicity. However, on an already-sorted array this causes the worst-case O(n²) behavior. The middle index is safer, although more complex solutions to this problem also exist.
Quick Sort

How to accomplish the partition in a single array: Search from the beginning of the list for the first element greater than the pivot, and from the end for the last element not greater than the pivot. When found, swap them. Continue until the list is partitioned. Finally, swap the last element in the left partition with the pivot.
package demos;

public class Demo {
    // Adapted from http://www.mycstutorials.com/articles/sorting/quicksort

    public void quickSort(int array[]) {
        for (int x : array) System.out.print(x + " ");
        System.out.println();
        quickSort(array, 0, array.length - 1);
    }

    public void quickSort(int array[], int start, int end) {
        int i = start; // index of left-to-right scan
        int k = end;   // index of right-to-left scan

        if (end - start >= 1) { // check that there are at least two elements
            int pivot = array[start]; // set the first element as the pivot
            System.out.println("pivot " + pivot);
            System.out.println("i: " + i + " k: " + k);

            while (k > i) { // while the scan indices have not met
                // from the left, look for the first element greater than the pivot
                while (array[i] <= pivot && i <= end && k > i) {
                    i++;
                    System.out.println("i: " + i);
                }
                // from the right, look for the first element not greater than the pivot
                while (array[k] > pivot && k >= start && k >= i) {
                    k--;
                    System.out.println("k: " + k);
                }
                // if the left scan index is still smaller than the right index,
                // swap the corresponding elements
                if (k > i)
                    swap(array, i, k);
            }
            // after the indices cross, swap the last element in the left
            // partition with the pivot
            swap(array, start, k);
            quickSort(array, start, k - 1); // quicksort the left partition
            quickSort(array, k + 1, end);   // quicksort the right partition
        }
        // a partition with zero or one elements is already sorted, so do nothing
    }

    public void swap(int array[], int index1, int index2) {
        int temp = array[index1];      // store the first value in a temp
        array[index1] = array[index2]; // copy the value of the second into the first
        array[index2] = temp;          // copy the value of the temp into the second
        for (int x : array) System.out.print(x + " ");
        System.out.println();
    }

    public static void main(String[] args) {
        Demo q = new Demo();
        int[] myArray = { 5, 4, 10, 11, 9, 8, 1 };
        q.quickSort(myArray);
    }
}
Quick Sort Partition Time

To partition an array of n elements takes n-1 comparisons and n moves in the worst case. So, the time required for partition is O(n).
Worst-Case Time

In the worst case, the pivot divides the array each time into one big subarray, with the other empty. The size of the big subarray is one less than the size of the array just divided, so the O(n) partitioning occurs n-1 times. Worst-case time:

(n-1) + (n-2) + … + 2 + 1 = n(n-1)/2 = O(n²)
Best-Case Time

In the best case, the pivot divides the array each time into two parts of about the same size, so we partition log n times. Since the O(n) partitioning occurs log n times, quicksort is O(n log n) in this case.
Average-Case Time

On average, the pivot will divide the array into two parts that are neither the same size nor one empty. Statistically, though, the sizes of the two parts are close on average, so the average time is O(n log n). The exact performance depends on the data.
Bucket Sort

All sort algorithms discussed so far are general sorting algorithms that work for any type of key (e.g., integers, strings, and any comparable objects). These algorithms sort the elements by comparing their keys. The lower bound for such comparison-based sorting is n log n, so no sorting algorithm based on comparisons can perform better than O(n log n). However, if the keys are small integers, you can use bucket sort without having to compare the keys.
Bucket Sort

The bucket sort algorithm works as follows. Assume the keys are in the range from 0 to N-1. We need N buckets labeled 0, 1, ..., N-1. If an element's key is i, the element is put into bucket i. Each bucket holds the elements with the same key value. You can use an ArrayList to implement a bucket. Bucket sort runs in O(n + N) time; when N is small relative to n, this is O(n).
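A minimal bucket sort sketch in Java, assuming integer keys in the range 0 to N-1 as above. For simplicity it sorts the keys themselves; with full objects, each bucket would hold the elements carrying that key:

import java.util.ArrayList;
import java.util.List;

public static void bucketSort(int[] keys, int N) {
    // one bucket per possible key value 0 .. N-1
    List<List<Integer>> buckets = new ArrayList<>();
    for (int b = 0; b < N; b++) buckets.add(new ArrayList<>());
    // distribute: an element with key i goes into bucket i -- O(n), no comparisons
    for (int key : keys) buckets.get(key).add(key);
    // collect: read the buckets back in order 0 .. N-1 -- O(n + N)
    int k = 0;
    for (List<Integer> bucket : buckets)
        for (int key : bucket) keys[k++] = key;
}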
Common Recurrence Relations

T(n) = T(n/2) + O(1)     T(n) = O(log n)    (binary search)
T(n) = T(n-1) + O(1)     T(n) = O(n)        (linear search)
T(n) = T(n-1) + O(n)     T(n) = O(n²)       (selection sort, insertion sort)
T(n) = 2T(n/2) + O(n)    T(n) = O(n log n)  (merge sort)
T(n) = 2T(n-1) + O(1)    T(n) = O(2ⁿ)       (Towers of Hanoi)