Data Structures Using C++

Slides:



Advertisements
Similar presentations
Chapter 11. Hash Tables.
Advertisements

CS Data Structures Chapter 8 Hashing.
Hash Tables.
Part II Chapter 8 Hashing Introduction Consider we may perform insertion, searching and deletion on a dictionary (symbol table). Array Linked list Tree.
CSCE 3400 Data Structures & Algorithm Analysis
Lecture 11 oct 6 Goals: hashing hash functions chaining closed hashing application of hashing.
Skip List & Hashing CSE, POSTECH.
Data Structures Using C++ 2E
What we learn with pleasure we never forget. Alfred Mercier Smitha N Pai.
CHAPTER 10 ARRAYS II Applications and Extensions.
Hashing Chapters What is Hashing? A technique that determines an index or location for storage of an item in a data structure The hash function.
Searching Kruse and Ryba Ch and 9.6. Problem: Search We are given a list of records. Each record has an associated key. Give efficient algorithm.
Hashing Techniques.
© 2006 Pearson Addison-Wesley. All rights reserved13 A-1 Chapter 13 Hash Tables.
Hash Tables and Associative Containers CS-212 Dick Steflik.
Searching Algorithms. Lecture Objectives Learn how to implement the sequential search algorithm Learn how to implement the binary search algorithm To.
Data Structures Using Java1 Chapter 8 Search Algorithms.
Hash Tables1 Part E Hash Tables  
COMP 171 Data Structures and Algorithms Tutorial 10 Hash Tables.
CS Data Structures Chapter 8 Hashing (Concentrating on Static Hashing)
Lecture 11 oct 7 Goals: hashing hash functions chaining closed hashing application of hashing.
Hashing General idea: Get a large array
Data Structures Using C++ 2E Chapter 9 Searching and Hashing Algorithms.
Introducing Hashing Chapter 21 Copyright ©2012 by Pearson Education, Inc. All rights reserved.
L. Grewe. Computing hash function for a string Horner’s rule: (( … (a 0 x + a 1 ) x + a 2 ) x + … + a n-2 )x + a n-1 ) int hash( const string & key )
Searching Chapter 2.
Data Structures Using Java1 Chapter 8 Search Algorithms.
Data Structures Using C++1 Chapter 9 Search Algorithms.
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture8.
CHAPTER 09 Compiled by: Dr. Mohammad Omar Alhawarat Sorting & Searching.
Chapter 19: Searching and Sorting Algorithms
Lecture 12. Searching Algorithms and its analysis 1.
Hashing Table Professor Sin-Min Lee Department of Computer Science.
Hashing Chapter 20. Hash Table A hash table is a data structure that allows fast find, insert, and delete operations (most of the time). The simplest.
Advance Data Structure 1 College Of Mathematic & Computer Sciences 1 Computer Sciences Department م. م علي عبد الكريم حبيب.
Algorithm Course Dr. Aref Rashad February Algorithms Course..... Dr. Aref Rashad Part: 4 Search Algorithms.
Comp 335 File Structures Hashing.
Data Structures Using C++ 2E Chapter 9 Searching and Hashing Algorithms.
Prof. Amr Goneid, AUC1 CSCI 210 Data Structures and Algorithms Prof. Amr Goneid AUC Part 5. Dictionaries(2): Hash Tables.
Hashing Hashing is another method for sorting and searching data.
Hashing as a Dictionary Implementation Chapter 19.
Searching Given distinct keys k 1, k 2, …, k n and a collection of n records of the form »(k 1,I 1 ), (k 2,I 2 ), …, (k n, I n ) Search Problem - For key.
Data Structures and Algorithms Hashing First Year M. B. Fayek CUFE 2010.
Been-Chian Chien, Wei-Pang Yang, and Wen-Yang Lin 8-1 Chapter 8 Hashing Introduction to Data Structure CHAPTER 8 HASHING 8.1 Symbol Table Abstract Data.
Chapter 10 Hashing. The search time of each algorithm depend on the number n of elements of the collection S of the data. A searching technique called.
Chapter 11 Hash Tables © John Urrutia 2014, All Rights Reserved1.
CHAPTER 8 SEARCHING CSEB324 DATA STRUCTURES & ALGORITHM.
Searching CS 244 Brent M. Dingle, Ph.D. Game Design and Development Program Department of Mathematics, Statistics, and Computer Science University of Wisconsin.
Hashtables. An Abstract data type that supports the following operations: –Insert –Find –Remove Search trees can be used for the same operations but require.
Hash Tables © Rick Mercer.  Outline  Discuss what a hash method does  translates a string key into an integer  Discuss a few strategies for implementing.
Data Structures Using C++ 2E Chapter 9 Searching and Hashing Algorithms.
Chapter 13 C Advanced Implementations of Tables – Hash Tables.
Chapter 9 Hashing Dr. Youssef Harrath
Hashing. Search Given: Distinct keys k 1, k 2, …, k n and collection T of n records of the form (k 1, I 1 ), (k 2, I 2 ), …, (k n, I n ) where I j is.
1 Hash Tables Chapter Motivation Many applications require only: –Insert –Search –Delete Examples –Symbol tables –Memory management mechanisms.
Prof. Amr Goneid, AUC1 CSCI 210 Data Structures and Algorithms Prof. Amr Goneid AUC Part 5. Dictionaries(2): Hash Tables.
Data Structures Using C++ 2E
Data Structures Using C++ 2E
CSCI 210 Data Structures and Algorithms
Hashing Alexandra Stefan.
Hashing Alexandra Stefan.
Data Structures Using C++ 2E
Advanced Associative Structures
Hash Table.
Hashing.
CSCE 3110 Data Structures & Algorithm Analysis
CH 9.2 : Hash Tables Acknowledgement: These slides are adapted from slides provided with Data Structures and Algorithms in C++, Goodrich, Tamassia and.
CH 9.2 : Hash Tables Acknowledgement: These slides are adapted from slides provided with Data Structures and Algorithms in C++, Goodrich, Tamassia and.
What we learn with pleasure we never forget. Alfred Mercier
Data Structures Using C++ 2E
Presentation transcript:

Data Structures Using C++ Chapter 9 Search Algorithms Data Structures Using C++

Data Structures Using C++ Chapter Objectives Learn the various search algorithms Explore how to implement the sequential and binary search algorithms Discover how the sequential and binary search algorithms perform Become aware of the lower bound on comparison-based search algorithms Learn about hashing Data Structures Using C++

Data Structures Using C++ Sequential Search template<class elemType> int arrayListType<elemType>::seqSearch(const elemType& item) { int loc; bool found = false; for(loc = 0; loc < length; loc++) if(list[loc] == item) found = true; break; } if(found) return loc; else return -1; }//end seqSearch What is the time complexity? Data Structures Using C++

Data Structures Using C++ Search Algorithms Search item: target To determine the average number of comparisons in the successful case of the sequential search algorithm: Consider all possible cases Find the number of comparisons for each case Add the number of comparisons and divide by the number of cases Data Structures Using C++

Data Structures Using C++ Search Algorithms Suppose that there are n elements in the list. The following expression gives the average number of comparisons, assuming that each element is equally likely to be sought: It is known that Therefore, the following expression gives the average number of comparisons made by the sequential search in the successful case: Data Structures Using C++

Data Structures Using C++ Binary Search (assumes list is sorted) Data Structures Using C++

Binary Search: middle element first + last mid = 2 Data Structures Using C++

Data Structures Using C++ Binary Search template<class elemType> int orderedArrayListType<elemType>::binarySearch (const elemType& item) { int first = 0; int last = length - 1; int mid; bool found = false; while(first <= last && !found) mid = (first + last) / 2; if(list[mid] == item) found = true; else if(list[mid] > item) last = mid - 1; first = mid + 1; } if(found) return mid; return –1; }//end binarySearch Data Structures Using C++

Binary Search: Example Data Structures Using C++

Binary Search: Example Unsuccessful search Total number of comparisons is 6 Data Structures Using C++

Performance of Binary Search Data Structures Using C++

Performance of Binary Search Data Structures Using C++

Performance of Binary Search Unsuccessful search for a list of length n, a binary search makes approximately 2 log2 (n + 1) key comparisons Successful search for a list of length n, on average, a binary search makes 2 log2 n – 4 key comparisons Worst case upper bound: 2 + 2 log2 n Data Structures Using C++

Search Algorithm Analysis Summary Data Structures Using C++

Lower Bound on Comparison-Based Search Definition: A comparison-based search algorithm performs its search by repeatedly comparing the target element to the list elements. Theorem: Let L be a list of size n > 1. Suppose that the elements of L are sorted. If SRH(n) denotes the minimum number of comparisons needed, in the worst case, by using a comparison-based algorithm to recognize whether an element x is in L, then SRH(n) = log2 (n + 1). If list not sorted, worst case is n comparisons Corollary: The binary search algorithm is the optimal worst-case algorithm for solving search problems by the comparison method (when the list is sorted). For unsorted lists, sequential search is optimal Data Structures Using C++

Data Structures Using C++ Hashing An alternative to comparison-based search Requires storing data in a special data structure, called a hash table Main objectives to choosing hash functions: Choose a hash function that is easy to compute Minimize the number of collisions Data Structures Using C++

Commonly Used Hash Functions Mid-Square Hash function, h, computed by squaring the identifier Using appropriate number of bits from the middle of the square to obtain the bucket address Middle bits of a square usually depend on all the characters, it is expected that different keys will yield different hash addresses with high probability, even if some of the characters are the same Data Structures Using C++

Commonly Used Hash Functions Folding Key X is partitioned into parts such that all the parts, except possibly the last parts, are of equal length Parts then added, in convenient way, to obtain hash address Division (Modular arithmetic) Key X is converted into an integer iX This integer divided by size of hash table to get remainder, giving address of X in HT Data Structures Using C++

Commonly Used Hash Functions Suppose that each key is a string. The following C++ function uses the division method to compute the address of the key: int hashFunction(char *key, int keyLength) { int sum = 0; for(int j = 0; j <= keyLength; j++) sum = sum + static_cast<int>(key[j]); return (sum % HTSize); }//end hashFunction Data Structures Using C++

Data Structures Using C++ Collision Resolution Algorithms to handle collisions Two categories of collision resolution techniques Open addressing (closed hashing) Chaining (open hashing) Data Structures Using C++

Collision Resolution: Open Addressing Pseudocode implementing linear probing: hIndex = hashFunction(insertKey); found = false; while(HT[hIndex] != emptyKey && !found) if(HT[hIndex].key == key) found = true; else hIndex = (hIndex + 1) % HTSize; if(found) cerr<<”Duplicate items are not allowed.”<<endl; HT[hIndex] = newItem; Data Structures Using C++

Data Structures Using C++ Linear Probing 9 will be next location if h(x) = 6,7,8, or 9 Probability of 9 being next = 4/20, for 14, it’s 5/20, but only 1/20 for 0 or 1 Clustering Data Structures Using C++

Data Structures Using C++ Random Probing Uses a random number generator to find the next available slot ith slot in the probe sequence is: (h(X) + ri) % HTSize where ri is the ith value in a random permutation of the numbers 1 to HTSize – 1 All insertions and searches use the same sequence of random numbers Data Structures Using C++

Data Structures Using C++ Quadratic Probing ith slot in the probe sequence is: (h(X) + i2) % HTSize (start i at 0) Reduces primary clustering of linear probing We do not know if it probes all the positions in the table When HTSize is prime, quadratic probing probes about half the table before repeating the probe sequence Data Structures Using C++

Deletion: Open Addressing When deleting, need to remove the item from its spot, but cannot reset it to empty (Why?) Data Structures Using C++

Deletion: Open Addressing IndexStatusList[i] set to –1 to mark item i as deleted Data Structures Using C++

Collision Resolution: Chaining (Open Hashing) No probing needed; instead put linked list at each hash position Data Structures Using C++

Data Structures Using C++ Hashing Analysis Let Then a is called the load factor Data Structures Using C++

Average Number of Comparisons Linear probing: Successful search: Unsuccessful search: Quadratic probing: Successful search: Unsuccessful search: Data Structures Using C++

Chaining: Average Number of Comparisons 1. Successful search 2. Unsuccessful search Data Structures Using C++