Presentation is loading. Please wait.

Presentation is loading. Please wait.

C++ Classes and Data Structures Jeffrey S. Childs

Similar presentations


Presentation on theme: "C++ Classes and Data Structures Jeffrey S. Childs"— Presentation transcript:

1 C++ Classes and Data Structures Jeffrey S. Childs
Chapter 11 Hash Tables Jeffrey S. Childs Clarion University of PA © 2008, Prentice Hall 1 1

2 Contents Terminology: key, value, hash function
Operations: insert, retrieve.. Collisions and chaining Ideal Hash Table Implementation using Linked Lists 2

3 Hash Table ADT The hash table is a table of elements that have keys
A hash function is used for locating a position in the table The table of elements is the set of data acted upon by the hash table operations Key -> Element in the table 3

4 Hash Table ADT Operations
insert, to insert an element into a table retrieve, to retrieve an element from the table remove, to remove an element from the table update, to update an element in the table an operation to empty out the hash table 4

5 Fast Search A hash table uses a function of the key value of an element to identify its location in an array. A search for an element can be done in ( 1 ) time. The function of the key value is called a hash function. 5 5

6 Hash Functions The input into a hash function is a key value
The output from a hash function is an index of an array (hash table) where the object containing the key is located Example of a hash function: h( k ) = k % 100 6

7 Example Using a Hash Function
Suppose our hash function is: h( k ) = k % 100 We wish to search for the object containing key value 214 k is set to 214 in the hash function The result is 14 The object containing key value 214 is stored at index 14 of the array (hash table) The search is done in ( 1 ) time 7

8 Inserting an Element An element is inserted into a hash table using the same hash function h( k ) = k % 100 To find where an element is to be inserted, use the hash function on its key If the key value is 214, the object is to be stored in index 14 of the array Insertion is done in ( 1 ) time 8

9 Consider the Big Picture …
If we have millions of key values, it may take a long time to search a regular array or a linked list for a specific part number (on average, we might compare 500,000 key values) Using a hash table, we simply have a function which provides us with the index of the array where the object containing the key is located 9

10 Collisions Consider the hash function
h( k ) = k % 100 A key value of 393 is used for an object, and the object is stored at index 93 Then a key value of 193 is used for a second object; the result of the hash function is 93, but index 93 is already occupied This is called a collision 10

11 How are Collisions Resolved?
The most popular way to resolve collisions is by chaining Instead of having an array of objects, we have an array of linked lists, each node of which contains an object An element is still inserted by using the hash function -- the hash function provides an index of a linked list, and the element is inserted at the front of that (usually short) linked list When searching for an element, the hash function is used to get the correct linked list, then the linked list is searched for the key (still much faster than comparing 500,000 keys) 11

12 Example Using Chaining
1 2 3 4 5 6 A hash table which is initially empty. Every element is a LinkedList object. Only the start pointer of the LinkedList object is shown, which is set to NULL. The hash function is: h( k ) = k % 7 12

13 Example Using Chaining (cont.)
1 2 3 4 5 6 INSERT object with key 31 The hash function is: h( k ) = k % 7 13

14 Example Using Chaining (cont.)
1 2 3 4 5 6 INSERT object with key 31 31 % 7 is 3 The hash function is: h( k ) = k % 7 14

15 Example Using Chaining (cont.)
1 2 3 4 5 6 INSERT object with key 31 31 % 7 is 3 31 The hash function is: h( k ) = k % 7 15

16 Example Using Chaining (cont.)
1 2 3 4 5 6 Note: The whole object is stored but only the key value is shown INSERT object with key 31 31 % 7 is 3 31 The hash function is: h( k ) = k % 7 16

17 Example Using Chaining (cont.)
1 2 3 4 5 6 31 The hash function is: h( k ) = k % 7 17

18 Example Using Chaining (cont.)
1 2 3 4 5 6 INSERT object with key 9 31 The hash function is: h( k ) = k % 7 18

19 Example Using Chaining (cont.)
1 2 3 4 5 6 INSERT object with key 9 9 % 7 is 2 31 The hash function is: h( k ) = k % 7 19

20 Example Using Chaining (cont.)
1 2 3 4 5 6 9 INSERT object with key 9 9 % 7 is 2 31 The hash function is: h( k ) = k % 7 20

21 Example Using Chaining (cont.)
1 2 3 4 5 6 9 31 The hash function is: h( k ) = k % 7 21

22 Example Using Chaining (cont.)
1 2 3 4 5 6 9 INSERT object with key 36 31 The hash function is: h( k ) = k % 7 22

23 Example Using Chaining (cont.)
1 2 3 4 5 6 9 INSERT object with key 36 36 % 7 is 1 31 The hash function is: h( k ) = k % 7 23

24 Example Using Chaining (cont.)
1 2 3 4 5 6 36 9 INSERT object with key 36 36 % 7 is 1 31 The hash function is: h( k ) = k % 7 24

25 Example Using Chaining (cont.)
1 2 3 4 5 6 36 9 31 The hash function is: h( k ) = k % 7 25

26 Example Using Chaining (cont.)
1 2 3 4 5 6 36 9 INSERT object with key 42 31 The hash function is: h( k ) = k % 7 26

27 Example Using Chaining (cont.)
1 2 3 4 5 6 36 9 INSERT object with key 42 42 % 7 is 0 31 The hash function is: h( k ) = k % 7 27

28 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 42 42 % 7 is 0 31 The hash function is: h( k ) = k % 7 28

29 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 31 The hash function is: h( k ) = k % 7 29

30 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 46 31 The hash function is: h( k ) = k % 7 30

31 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 46 46 % 7 is 4 31 The hash function is: h( k ) = k % 7 31

32 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 46 46 % 7 is 4 31 46 The hash function is: h( k ) = k % 7 32

33 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 31 46 The hash function is: h( k ) = k % 7 33

34 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 20 31 46 The hash function is: h( k ) = k % 7 34

35 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 20 20 % 7 is 6 31 46 The hash function is: h( k ) = k % 7 35

36 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 20 20 % 7 is 6 31 46 The hash function is: h( k ) = k % 7 20 36

37 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 31 46 The hash function is: h( k ) = k % 7 20 37

38 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 31 46 The hash function is: h( k ) = k % 7 20 38

39 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 39

40 Example Using Chaining (cont.)
1 2 3 4 5 6 42 COLLISION occurs… 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 40

41 Example Using Chaining (cont.)
1 2 3 4 5 6 42 But key 2 is just inserted in the linked list 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 41

42 Example Using Chaining (cont.)
1 2 3 4 5 6 42 The insert function of LinkedList inserts a new element at the BEGINNING of the list 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 42

43 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 43

44 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 44

45 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 45

46 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 46

47 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 47

48 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 2 2 % 7 is 2 31 46 The hash function is: h( k ) = k % 7 20 48

49 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 31 46 The hash function is: h( k ) = k % 7 20 49

50 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 31 46 The hash function is: h( k ) = k % 7 20 50

51 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 24 % 7 is 3 31 46 The hash function is: h( k ) = k % 7 20 51

52 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 24 % 7 is 3 31 46 The hash function is: h( k ) = k % 7 20 52

53 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 24 % 7 is 3 31 46 The hash function is: h( k ) = k % 7 20 53

54 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 24 % 7 is 3 31 46 The hash function is: h( k ) = k % 7 20 54

55 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 24 % 7 is 3 31 46 The hash function is: h( k ) = k % 7 20 55

56 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 INSERT object with key 24 24 % 7 is 3 24 31 46 The hash function is: h( k ) = k % 7 20 56

57 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 24 31 46 The hash function is: h( k ) = k % 7 20 57

58 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 **FIND** the object with key 9 24 31 46 The hash function is: h( k ) = k % 7 20 58

59 Example Using Chaining (cont.)
1 2 3 4 5 6 42 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 59

60 Example Using Chaining (cont.)
1 2 3 4 5 6 42 We search this linked list for the object with key 9 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 60

61 Example Using Chaining (cont.)
1 2 3 4 5 6 42 Remember…the whole object is stored, only the key is shown 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 61

62 Example Using Chaining (cont.)
1 2 3 4 5 6 42 Does this object contain key 9? 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 62

63 Example Using Chaining (cont.)
1 2 3 4 5 6 42 Does this object contain key 9? No, so go on to the next object. 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 63

64 Example Using Chaining (cont.)
1 2 3 4 5 6 42 Does this object contain key 9? 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 64

65 Example Using Chaining (cont.)
1 2 3 4 5 6 42 Does this object contain key 9? YES, found it! Return the object. 36 2 9 **FIND** the object with key 9 9 % 7 is 2 24 31 46 The hash function is: h( k ) = k % 7 20 65

66 Uniform Hashing When the elements are spread evenly (or near evenly) among the indexes of a hash table, it is called uniform hashing If elements are spread evenly, such that the number of elements at an index is less than some small constant, uniform hashing allows a search to be done in ( 1 ) time The hash function largely determines whether or not we will have uniform hashing 66

67 Bad Hash Functions h( k ) = 5 is obviously a bad hash function
h( k ) = k % 100 could be a bad hash function if there is meaning attached to parts of a key Consider that the key might be an employee id The last two digits may give the state of birth 67

68 Ideal Hash Function for Uniform Hashing
The hash table size should be a prime number that is not too close to a power of 2 31 is a prime number but is too close to a power of 2 97 is a prime number not too close to a power of 2 A good hash function might be: h( k ) = k % 97 If the constant use in a hash and the number of buckets are coprime, collisions are minimized 68 68

69 Hash Functions Can be Made for Keys that are Strings
1 int sum = 0; 2 for ( int i = 0; i < int( str.length( ) ); i++ ) 3 sum += str[ i ]; 4 hash_index = sum % 97; 69

70 Speed vs. Memory Conservation
Speed comes from reducing the number of collisions In a search, if there are no collisions, the first element in the linked list in the one we want to find (fast) Therefore, the greatest speed comes about by making a hash table much larger than the number of keys (but there will still be an occasional collision) 70

71 Speed vs. Memory Conservation (cont.)
Each empty LinkedList object in a hash table wastes 8 bytes of memory (4 bytes for the start pointer and 4 bytes for the current pointer) The best memory conservation comes from trying to reduce the number of empty LinkedList objects The hash table size would be made much smaller than the number of keys (there would still be an occasional empty linked list) 71

72 Hash Table Design Decide whether speed or memory conservation is more important (and how much more important) for the application Come up with a good table size which Allows for the use of a good hash function Strikes the appropriate balance between speed and memory conservation 72

73 Ideal Hash Tables Can we have a hash function which guarantees that there will be no collisions? Yes: h( k ) = k Each key k is unique; therefore, each index produced from h( k ) is unique Consider 300 employees that have a 4 digit id A hash table size of with the hash function above guarantees the best possible speed 73

74 Ideal Hash Tables (cont.)
Should we use LinkedList objects if there are no collisions? Suppose each Employee object takes up 100 bytes An array size of Employee objects with only 300 used indexes will have 9700 unused indexes, each taking up 100 bytes Best to use LinkedList objects (in this case) – the unused indexes will only use 8 bytes each Additional space can be saved by not storing the employee id in the object (if no collisions) 74

75 Ideal Hash Tables (cont.)
Can we have a hash table without any collisions and without any empty linked lists? Sometimes. Consider 300 employees with id’s from 0 to We can make a hash table size of 300, and use h( k ) = k LinkedList objects wouldn’t be necessary and in fact, would waste space It would also not be necessary to store the employee id in the object This is actually an array 75 75

76 Implementing a Hash Table
We’ll implement a HashTable with linked lists (chaining) without chaining, a hash table can become full If the client has the ideal hash table mentioned on the previous slide, he/she would be better off to just use an Array for the hash table 76

77 Implementing a Hash Function
We shouldn’t write the hash function The client should write the hash function that he/she would like to use Then, the client should pass the hash function that he/she wrote as a parameter into the constructor of the HashTable class This can be implemented with function pointers 77

78 Function Pointers A function pointer is a pointer that holds the address of a function The function can be called using the function pointer instead of the function name 78

79 Function Pointers (cont.)
Example of a function pointer declaration: float (*funcptr) (string); 79

80 Function Pointers (cont.)
Example of a function pointer declaration: float (*funcptr) (string); funcptr is the name of the pointer; the name can be chosen like any other pointer name 80

81 Function Pointers (cont.)
Example of a function pointer declaration: float (*funcptr) (string); The parentheses are necessary. 81

82 Function Pointers (cont.)
Example of a function pointer declaration: float (*funcptr) (string); The return type of the function that funcptr can point to is given here (in this case, the return type is a float) 82

83 Function Pointers (cont.)
Example of a function pointer declaration: float (*funcptr) (string); The parameter list of a function that funcptr can point to is given here – in this case, there is only one parameter of string type. 83

84 Function Pointers (cont.)
Example of a function pointer declaration: float (*funcptr) (string); What would a function pointer declaration look like if the function it can point to has a void return type and accepts two integer parameters? void (*fptr) (int, int) 84 84

85 Function Pointers (cont.)
void (*fp) (int, int); 85

86 Function Pointers (cont.)
void (*fp) (int, int); void foo( int a, int b ) { cout << “a is: “ << a << endl; cout << “b is: “ << b << endl; } A function that fp can point to 86

87 Assigning the Address of a Function to a Function Pointer
void (*fp) (int, int); void foo( int a, int b ) { cout << “a is: “ << a << endl; cout << “b is: “ << b << endl; } fp = foo; The address of foo is assigned to fp like this 87

88 Calling a Function by Using a Function Pointer
void (*fp) (int, int); void foo( int a, int b ) { cout << “a is: “ << a << endl; cout << “b is: “ << b << endl; } fp( 5, 10 ); Once the address of foo has been assigned to fp, the foo function can be called using fp like this 88

89 Design of the HashTable Constructor
Once the client designs the hash function, the client passes the name of the hash function, as a parameter into the HashTable constructor The HashTable constructor accepts the parameter using a function pointer in this parameter location The address of the function is saved to a function pointer in the private section Then, the hash table can call the hash function that the client made by using the function pointer 89

90 HashTable.h 1 #include "LinkedList.h" 2 #include "Array.h“ 3
4 template <class DataType> 5 class HashTable 6 { 7 public: 8 HashTable( int (*hf)(const DataType &), int s ); 9 bool insert( const DataType & newObject ); 10 bool retrieve( DataType & retrieved ); 11 bool remove( DataType & removed ); 12 bool update( DataType & updateObject ); 13 void makeEmpty( ); HashTable.h continued… 90

91 HashTable.h Space is necessary here 14 private: 15 Array< LinkedList<DataType> > table; 16 int (*hashfunc)(const DataType &); 17 }; 18 19 #include "HashTable.cpp" 91

92 Clientele The LinkedList class is being used in the HashTable class, along with the Array class Note that when one writes a class the clientele extends beyond the main programmers who might use the class The clientele extends to people who write other classes Remember! Check LinkedList.h because its operations are needed in implementation. 92 92

93 HashTable Constructor
1 template <class DataType> 2 HashTable<DataType>::HashTable( int (*hf)(const DataType &), int s ) 4 : table( s ) 5 { 6 hashfunc = hf; 7 } This call to the Array constructor creates an Array of LinkedList’s of type DataType Check the private part of .h 93 93

94 HashTable Constructor (cont.)
1 template <class DataType> 2 HashTable<DataType>::HashTable( int (*hf)(const DataType &), int s ) 4 : table( s ) 5 { 6 hashfunc = hf; 7 } The DataType for Array is LinkedList<DataType> (DataType in Array is different than DataType in HashTable) 94

95 HashTable Constructor (cont.)
1 template <class DataType> 2 HashTable<DataType>::HashTable( int (*hf)(const DataType &), int s ) 4 : table( s ) 5 { 6 hashfunc = hf; 7 } In the Array constructor, an Array of size s is made, having LinkedList elements – when this array is created, the LinkedList constructor is called for each element. 95

96 HashTable Constructor (cont.)
1 template <class DataType> 2 HashTable<DataType>::HashTable( int (*hf)(const DataType &), int s ) 4 : table( s ) 5 { 6 hashfunc = hf; 7 } 96

97 insert 8 template <class DataType>
9 bool HashTable<DataType>::insert( const DataType & newObject ) 11 { 12 int location = hashfunc( newObject ); 13 if ( location < 0 || location >= table.length( ) ) 14 return false; 15 table[ location ].insert( newObject ); 16 return true; 17 } Keep in mind that this is a LinkedList object. 97

98 retrieve 18 template <class DataType>
19 bool HashTable<DataType>::retrieve( DataType & retrieved ) 21 { 22 int location = hashfunc( retrieved ); 23 if ( location < 0 || location >= table.length( ) ) 24 return false; 25 if ( !table[ location ].retrieve( retrieved ) ) 26 return false; 27 return true; 28 } 98

99 remove 29 template <class DataType>
30 bool HashTable<DataType>::remove( DataType & removed ) 32 { 33 int location = hashfunc( removed ); 34 if ( location < 0 || location >= table.length( ) ) 35 return false; 36 if ( !table[ location ].remove( removed ) ) 37 return false; 38 return true; 39 } 99

100 update 40 template <class DataType>
41 bool HashTable<DataType>::update( DataType & updateObject ) 43 { 44 int location = hashfunc( updateObject ); 45 if ( location < 0 || location >= table.length( ) ) 46 return false; 47 if ( !table[location].find( updateObject ) ) 48 return false; 49 table[location].replace( updateObject ); 50 return true; 51 } 100

101 makeEmpty 50 template <class DataType>
51 void HashTable<DataType>::makeEmpty( ) 52 { 53 for ( int i = 0; i < table.length( ); i++ ) 54 table[ i ].makeEmpty( ); 55 } 101

102 Using HashTable 1 #include <iostream> 2 #include <string>
3 #include "HashTable.h" 4 5 using namespace std; 6 7 struct MyStruct { 8 string str; 9 int num; 10 bool operator ==( const MyStruct & r ) { return str == r.str; } 11 }; str will be the key 102

103 Using HashTable (cont.)
1 #include <iostream> 2 #include <string> 3 #include "HashTable.h" 4 5 using namespace std; 6 7 struct MyStruct { 8 string str; 9 int num; 10 bool operator ==( const MyStruct & r ) { return str == r.str; } 11 }; It is necessary to overload the == operator for the LinkedList functions 103

104 Using HashTable (cont.)
1 #include <iostream> 2 #include <string> 3 #include "HashTable.h" 4 5 using namespace std; 6 7 struct MyStruct { 8 string str; 9 int num; 10 bool operator ==( const MyStruct & r ) { return str == r.str; } 11 }; In the actual code, a comment is placed above HashTable, telling the client that this is needed and what is required. 104

105 Using HashTable (cont.)
12 const int SIZE1 = 97, SIZE2 = 199; 13 14 int hash1( const MyStruct & obj ); 15 int hash2( const MyStruct & obj ); 16 17 int main( ) 18 { 19 HashTable<MyStruct> ht1( hash1, SIZE1 ), 20 ht2( hash2, SIZE2); 105

106 Using HashTable (cont.)
21 MyStruct myobj; 22 23 myobj.str = "elephant"; 24 myobj.num = 25; 25 ht1.insert( myobj ); 26 27 myobj.str = "giraffe"; 28 myobj.num = 50; 29 ht2.insert( myobj ); // other code using the hash tables 106

107 Using HashTable (cont.)
30 return 0; 31 } 32 33 int hash1( const MyStruct & obj ) 34 { 35 int sum = 0; 36 for ( int i = 0; i < 3 && i < int( obj.str.length( ) ); i++ ) 37 sum += obj.str[ i ]; 38 return sum % SIZE1; 39 } 107

108 A Hash Table is Like a List
The hash table ADT description is very close to the list ADT description The only items missing from the hash table ADT description are: an iterator a function to determine when the “list” is empty find, to determine whether an element is in the “list” a current position If we had all of these, we would have a fast list (or an enhanced hash table) 108

109 Iterator Everything would be easy to implement for the hash table, except for the iterator The iterator is an important part of a “list”, so we would like it to be as fast as possible We can iterate through a collision list, but finding the next collision list to iterate through might not be so fast… 109

110 Iterator (cont.) table Large gap with empty linked lists . 110

111 Iterator (cont.) Instead, we can have a linked list run through the collision lists, so that the linked list contains every element Then, iterating through the linked list is simple and fast 111

112 Time Complexities for List ADT
insert – we’ll insert at the head of the linked list – ( 1 ) iterator – each step will be ( 1 ) find – element is found by hashing, so it is ( 1 ) for uniform hashing (the hash function and hash table are designed so that the length of the collision list is bounded by some small constant) retrieve – is ( 1 ) for uniform hashing more… 112

113 Time Complexities for List ADT (cont.)
replace – is ( 1 ) using the current position an operation to determine whether or not the list is empty – is ( 1 ), because we just test the linked list to see if it is empty an operation to empty out the list – is ( n ), the best we can do, since each node must be freed remove – to make this last operation as fast as possible, consider using a doubly-linked list for the linked list… 113

114 Remove In Chapter 10, we’ve seen that a node in a doubly-linked list can be removed in ( 1 ) time, given a pointer to it Using hashing, we obtain a collision list, which will have a pointer to the node we wish to remove 114

115 Doubly-Linked List ADT
The description for the doubly-linked list ADT is the same as that for the list ADT We don’t consider implementation in the ADT description, and double links have to do with implementation The implementation of the doubly-linked list using the HashTable is also not a part of the ADT description 115

116 Avoiding Special Cases
To avoid special cases, we’ll have a header node and a trailer node in the doubly-linked list Few data structures use arrays of doubly- linked lists – if such a use arises, we could create a doubly-linked list without header and trailer nodes to avoid wasting memory 116

117 An Example trailer hash function: h( k ) = k % 11
The red line is the doubly-linked list 33 22 47 31 header 99 36 70 63 53 65 117

118 An Example (cont.) trailer hash function: h( k ) = k % 11
Each node has three pointers (not just one). 33 22 47 31 header 99 36 70 63 53 65 118

119 An Example (cont.) trailer hash function: h( k ) = k % 11
The elements were inserted in this order: 33, 65, 63, 31, 53, 22, 47, 99, 36, 70 33 22 47 31 header 99 36 70 63 53 65 119

120 An Example (cont.) trailer hash function: h( k ) = k % 11
33, 65, 63, 31, 53, 22, 47, 99, 36, 70 – since each node is inserted at the beginning … 33 22 47 31 header 99 36 70 63 53 65 120

121 An Example (cont.) trailer hash function: h( k ) = k % 11
33, 65, 63, 31, 53, 22, 47, 99, 36, 70 – you’ll see these nodes from trailer to header. 33 22 47 31 header 99 36 70 63 53 65 121

122 An Example (cont.) trailer hash function: h( k ) = k % 11 INSERT: 52
33 22 47 31 header 99 36 70 63 53 65 122

123 An Example (cont.) trailer hash function: h( k ) = k % 11
INSERT: The hash function gives us index 8. 33 22 47 31 header 99 36 70 63 53 65 123

124 An Example (cont.) trailer hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there… 33 22 47 31 header 99 36 70 63 53 65 124

125 An Example (cont.) trailer hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there… 33 22 47 31 header 63 99 36 70 53 65 125

126 An Example (cont.) trailer hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there. 33 22 47 63 31 header 99 36 70 53 65 126

127 An Example (cont.) trailer hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there. 33 22 47 63 31 header 99 36 70 52 53 65 127

128 An Example (cont.) trailer hash function: h( k ) = k % 11
The new node, 52, must also be inserted at the beginning of the doubly-linked list… 33 22 47 63 31 header 99 36 70 52 53 65 128

129 An Example (cont.) trailer hash function: h( k ) = k % 11
The new node, 52, must also be inserted at the beginning of the doubly-linked list… 33 22 47 63 31 header 99 36 70 52 53 65 129

130 An Example (cont.) trailer hash function: h( k ) = k % 11
The new node, 52, must also be inserted at the beginning of the doubly-linked list 33 22 47 63 31 header 99 36 70 52 53 65 130

131 An Example (cont.) trailer hash function: h( k ) = k % 11 REMOVE: 31
33 22 47 63 31 header 99 36 70 52 53 65 131

132 An Example (cont.) trailer hash function: h( k ) = k % 11
REMOVE: The hash function gives us index 9, where we’ll find 31 33 22 47 63 31 header 99 36 70 52 53 65 132

133 An Example (cont.) trailer hash function: h( k ) = k % 11
REMOVE: Node 53 contains the pointer to it… 33 22 47 63 31 header 99 36 70 52 53 65 133

134 An Example (cont.) trailer hash function: h( k ) = k % 11
so 31 can be removed from the doubly-linked list in ( 1 ) time… 33 22 47 63 31 header 99 36 70 52 53 65 134

135 An Example (cont.) trailer hash function: h( k ) = k % 11
so 31 can be removed from the doubly-linked list in ( 1 ) time 33 22 47 63 31 header 99 36 70 52 53 65 135

136 An Example (cont.) trailer hash function: h( k ) = k % 11
31 is also removed from the collision list using LinkedList remove… 33 22 47 63 31 header 99 36 70 52 53 65 136

137 An Example (cont.) trailer hash function: h( k ) = k % 11
31 is also removed from the collision list using LinkedList remove 33 22 47 63 header 99 36 70 52 53 65 137

138 Application of HashTable and LinkedList: LRU
We want to maintain a cache of N elements that are inserted and accessed frequently 138

139 Application of HashTable and LinkedList: LRU
We already have N elements. A new element e comes in: we need to remove the oldest, then insert e e 139

140 Application of HashTable and LinkedList: LRU
But iterating the HashTable’s collision lists to look for the oldest member would be too slow 140

141 Application of HashTable and LinkedList: LRU
Instead: maintain a list trailer header ……... 141

142 Application of HashTable and LinkedList: LRU
header trailer This was the first (oldest) 142

143 Application of HashTable and LinkedList: LRU
If a new element comes in, add it in the front of the list header trailer This was the first (oldest) 143

144 Application of HashTable and LinkedList: LRU
If a new element comes in, add it in the front of the list header trailer This was the first (oldest) 144

145 Application of HashTable and LinkedList: LRU
If an element in the HashTable is accessed, move it in the front of the list header trailer access This was the first (oldest) 145

146 Application of HashTable and LinkedList: LRU
Eventually the HashTable gets full. But now we know that we need to remove the last item trailer header ……... 146

147 Application of HashTable and LinkedList: LRU
Removed.. trailer header ……... 147

148 Application of HashTable and LinkedList: LRU
Now add “e” and put it in the head of the list header trailer e ……... 148

149 Doubly-Linked List Order
The order of the doubly-linked list can be maintained independently of the singly- linked list If we wanted a sorted doubly-linked list using the same insertion order of the elements, it would look like this… 149

150 Sorted Example hash function: h( k ) = k % 11 33 22 47 31 header 99 36
70 63 53 65 trailer 150

151 Sorted Example (cont.) hash function: h( k ) = k % 11 INSERT: 52 33 22
47 31 header 99 36 70 63 53 65 trailer 151

152 Sorted Example (cont.) hash function: h( k ) = k % 11
INSERT: The hash function gives us index 8. 33 22 47 31 header 99 36 70 63 53 65 trailer 152

153 Sorted Example (cont.) hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there… 33 22 47 31 header 99 36 70 63 53 65 trailer 153

154 Sorted Example (cont.) hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there… 33 22 47 31 header 63 99 36 70 53 65 trailer 154

155 Sorted Example (cont.) hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there. 33 22 47 63 31 header 99 36 70 53 65 trailer 155

156 Sorted Example (cont.) hash function: h( k ) = k % 11
INSERT: 52 So 52 is inserted at the beginning of the collision list there. 33 22 47 63 31 header 99 36 70 52 53 65 trailer 156

157 Sorted Example (cont.) hash function: h( k ) = k % 11
52 must also be inserted in the doubly-linked list… 33 22 47 63 31 header 99 36 70 52 53 65 trailer 157

158 Sorted Example (cont.) hash function: h( k ) = k % 11
52 must also be inserted in the doubly-linked list. 33 22 47 63 31 header 99 36 70 52 53 65 trailer 158

159 Sorted Example (cont.) hash function: h( k ) = k % 11
52 must also be inserted in the doubly-linked list. However… 33 22 47 63 31 header 99 36 70 52 53 65 trailer 159

160 Sorted Example (cont.) hash function: h( k ) = k % 11
it would take ( n ) time since its position in the doubly-linked list must be found 33 22 47 63 31 header 99 36 70 52 53 65 trailer 160

161 Sorted Example (cont.) hash function: h( k ) = k % 11 REMOVE: 31 33 22
47 63 31 header 99 36 70 52 53 65 trailer 161

162 Sorted Example (cont.) hash function: h( k ) = k % 11
REMOVE: The hash function gives us index 9, where we’ll find 31 33 22 47 63 31 header 99 36 70 52 53 65 trailer 162

163 Sorted Example (cont.) hash function: h( k ) = k % 11
REMOVE: In ( 1 ) time, it is removed from the doubly-linked list… 33 22 47 63 31 header 99 36 70 52 53 65 trailer 163

164 Sorted Example (cont.) hash function: h( k ) = k % 11
REMOVE: In ( 1 ) time, it is removed from the doubly-linked list 33 22 47 63 31 header 99 36 70 52 53 65 trailer 164

165 Sorted Example (cont.) hash function: h( k ) = k % 11
REMOVE: In ( 1 ) time, it is also removed from the collision list… 33 22 47 63 31 header 99 36 70 52 53 65 trailer 165

166 Sorted Example (cont.) hash function: h( k ) = k % 11
REMOVE: In ( 1 ) time, it is also removed from the collision list 33 22 47 63 header 99 36 70 52 53 65 trailer 166

167 Memory Considerations
Each node has three pointers now: template <class DataType> struct Node { DataType info; Node<DataType> *next; Node<DataType> *dlnext; Node<DataType> *dlback; }; For the collision list 167

168 Memory Considerations (cont.)
Each node has three pointers now: template <class DataType> struct Node { DataType info; Node<DataType> *next; Node<DataType> *dlnext; Node<DataType> *dlback; }; For the doubly-linked list 168

169 Memory Considerations (cont.)
If there is only one node in the collision list, on average, then the LinkedList used for each element also has two pointers: start and current This gives us a total of 20 bytes of wasted memory per node (housekeeping) 169

170 Memory Considerations (cont.)
If each element is 20 bytes, 50% of memory is wasted However, if each element is 980 bytes, only 2% of memory is wasted Element size is an important consideration 170

171 LinkedList Considerations
In order to make the implementation easy, we’ll have to make some changes to the LinkedList class It will become a specialized “helper” class for the doubly-linked list 171

172 Changes to LinkedList change the name of the class from LinkedList to CollisionList modify the Node struct so it has the three pointers we need we need a function to retrieve the current pointer, called getcurrent instead of eliminating the current position when we insert a node, we will set the current position to the node we inserted 172

173 HashTable Class Considerations
We’ll make some changes to the HashTable class, too It will, again, be a specialized “helper” class just for use in the doubly-linked list 173

174 HashTable Changes rename the HashTable class to DLHashTable, short for “Doubly-Linked Hash Table” keep the location, used throughout the class, as a private data member have a function getcurrent, which retrieves the current pointer of the CollisionList that was used in the last use of location; that is, return table[location].getcurrent( ) 174

175 HashTable Changes We’ll also need some functions which are convenient for the copy constructor and deepCopy gethashfunc, which will return the hash function pointer sethashfunc, which will set a hash function pointer getsize, which will get the Array size of the hash table changeSize, which will change the Array size of the hash table 175

176 Returning a Function Pointer
int (*gethashfunc( ) const)(const DataType &); This is the function prototype for gethashfunc, the function to return the hash function pointer. 176

177 Returning a Function Pointer (cont.)
int (*gethashfunc( ) const)(const DataType &); This is the return type of the function – the return type is spread out instead of being at the beginning of the function heading, where it normally is. 177

178 Returning a Function Pointer (cont.)
int (*gethashfunc( ) const)(const DataType &); The return type indicates we are returning a function pointer which can point to a function that passes in DataType by const reference and returns an integer (describing a hash function) 178

179 Returning a Function Pointer (cont.)
int (*gethashfunc( ) const)(const DataType &); This is the name of the function that returns the function pointer – it has an empty parameter list in this case. 179

180 Returning a Function Pointer (cont.)
int (*gethashfunc( ) const)(const DataType &); This const means that we are not going to change any of the data members in this function (it is the same use of const that you normally see at the end of an isEmpty function heading). 180

181 Returning a Function Pointer (cont.)
The function definition in the implementation file template <class DataType> int (*DLHashTable<DataType>::gethashfunc( ) const) (const DataType &) { return hashfunc; } 181

182 DoublyLinkedList.h 1 #include "DLHashTable.h" 2
3 template <class DataType> 4 class DoublyLinkedList 5 { 6 public: 7 DoublyLinkedList( int (*hf)(const DataType &), int s ); 9 DoublyLinkedList( const DoublyLinkedList<DataType> & aplist ); 11 ~DoublyLinkedList( ); 12 DoublyLinkedList<DataType> & operator =( const DoublyLinkedList<DataType> & rlist ); 182

183 DoublyLinkedList.h (cont.)
14 bool insert( const DataType & element ); 15 bool first( DataType & listEl ); 16 inline bool getNext( DataType & listEl ); 17 bool last( DataType & listEl ); 18 inline bool getPrevious( DataType & listEl ); 19 bool find ( const DataType & element ); 20 bool retrieve( DataType & element ); 21 bool remove( DataType & element ); 22 bool replace( const DataType & newElement ); 23 bool isEmpty( ) const; 24 void makeEmpty( ); another iterator 183

184 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" These are static nodes (not made in the heap). 184

185 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" header will point to headerNode 185

186 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" likewise, trailer will point to trailerNode 186

187 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" We don’t really need the header and trailer pointers, but without them the code may be confusing… 187

188 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" Without header, we would access the first data node with: headerNode.dlnext 188

189 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" WITH header, we would access the first data node with: header->dlnext 189

190 DoublyLinkedList.h (cont.)
With this private section, do we really need a destructor, copy constructor, and overloaded assignment operator? 25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" 190

191 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" YES. There can be pointers to dynamic memory here (for example, header->next). 191

192 DoublyLinkedList.h (cont.)
25 private: 26 DLHashTable<DataType> table; 27 Node<DataType> *current; 28 Node<DataType> headerNode; 29 Node<DataType> trailerNode; 30 Node<DataType> *header; 31 Node<DataType> *trailer; 32 inline void deepCopy( 33 const DoublyLinkedList<DataType> & original ); 34 }; 35 36 #include "DoublyLinkedList.cpp" Sometimes pointers to dynamic memory are hidden. 192

193 Constructor 1 template <class DataType>
2 DoublyLinkedList<DataType>::DoublyLinkedList( int (*hf)(const DataType &), int s ) 4 : table( hf, s ), header( &headerNode ), 5 trailer( &trailerNode ) 6 { 7 current = header->dlnext = trailer; 8 trailer->dlback = header; 9 } When current is set to the trailer node, it means there is no current position 193

194 Copy Constructor 10 template <class DataType>
11 DoublyLinkedList<DataType>::DoublyLinkedList( const DoublyLinkedList<DataType> & aplist ) 13 : table( aplist.table.gethashfunc( ), aplist.table.getsize( ) ) 15 { 16 deepCopy( aplist ); 17 } Calls the constructor for DLHashTable in the copy – passes in the hash function and size of the aplist hash table. 194

195 Destructor 18 template <class DataType>
19 DoublyLinkedList<DataType>::~DoublyLinkedList( ) 20 { 21 makeEmpty( ); 22 } 195

196 Overloaded Assignment Operator
23 template <class DataType> 24 DoublyLinkedList<DataType> & DoublyLinkedList<DataType>::operator =( const DoublyLinkedList<DataType> & rlist ) 27 { 28 if ( this == &rlist ) 29 return *this; 30 makeEmpty( ); 31 deepCopy( rlist ); 32 return *this; 33 } 196

197 insert 34 template <class DataType>
35 bool DoublyLinkedList<DataType>::insert( const DataType & element ) 37 { 38 if ( !table.insert( element ) ) 39 return false; 40 41 current = table.getcurrent( ); When insert in the CollisionList is (eventually) used, it now makes the inserted node the current node. 197

198 insert (cont.) 34 template <class DataType>
35 bool DoublyLinkedList<DataType>::insert( const DataType & element ) 37 { 38 if ( !table.insert( element ) ) 39 return false; 40 41 current = table.getcurrent( ); This function eventually calls getcurrent in the CollisionList, which returns the current pointer there… 198

199 insert (cont.) 34 template <class DataType>
35 bool DoublyLinkedList<DataType>::insert( const DataType & element ) 37 { 38 if ( !table.insert( element ) ) 39 return false; 40 41 current = table.getcurrent( ); So the address of the node that was just inserted is assigned to the (different) current pointer here 199

200 insert (cont.) 34 template <class DataType>
35 bool DoublyLinkedList<DataType>::insert( const DataType & element ) 37 { 38 if ( !table.insert( element ) ) 39 return false; 40 41 current = table.getcurrent( ); insert continued… 200

201 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } In showing how insert works, we’ll use this convention for the parts of Node. info dlback dlnext next 201

202 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start The current node has been inserted into the collision list (line 38 of insert) doubly-linked list 202

203 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start However, it still needs to be placed into the doubly-linked list. doubly-linked list 203

204 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 204

205 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 205

206 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 206

207 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 207

208 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 208

209 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 209

210 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 210

211 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 211

212 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start doubly-linked list 212

213 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } header current collision list start Now, current has been inserted into the doubly- linked list. doubly-linked list 213

214 insert (cont.) 42 current->dlnext = header->dlnext;
43 current->dlback = header; 44 header->dlnext = header->dlnext->dlback = current; 45 current = trailer; 46 47 return true; 48 } There won’t be a current position in the DoublyLinkedList after the insertion – the client is using the DoublyLinkedList and this is what the ADT describes. 214

215 first 49 template <class DataType>
50 bool DoublyLinkedList<DataType>::first( DataType & listEl ) 52 { 53 if ( header->dlnext == trailer ) 54 return false; 55 56 current = header->dlnext; 57 listEl = current->info; 58 return true; 59 } 215

216 getNext 60 template <class DataType>
61 inline bool DoublyLinkedList<DataType>::getNext( DataType & listEl ) 63 { 64 if ( current->dlnext == trailer ) 65 current = trailer; 66 if ( current == trailer ) 67 return false; 68 69 current = current->dlnext; 70 listEl = current->info; 71 return true; 72 } 216

217 last 73 template <class DataType>
74 bool DoublyLinkedList<DataType>::last( DataType & listEl ) 76 { 77 if ( header->dlnext == trailer ) 78 return false; 79 80 current = trailer->dlback; 81 listEl = current->info; 82 return true; 83 } 217

218 getPrevious 84 template <class DataType>
85 inline bool DoublyLinkedList<DataType>::getPrevious( DataType & listEl ) 87 { 88 if ( current->dlback == header ) 89 current = trailer; 90 if ( current == trailer ) 91 return false; 92 93 current = current->dlback; 94 listEl = current->info; 95 return true; 96 } 218

219 find 97 template <class DataType>
98 bool DoublyLinkedList<DataType>::find( const DataType & element ) 100 { 101 DataType el = element; 102 if ( table.retrieve( el ) ) { 103 current = table.getcurrent( ); 104 return true; 105 } 106 107 current = trailer; 108 return false; 109 } If we pass in element here, retrieve will change the value of it… 219

220 find (cont.) 97 template <class DataType>
98 bool DoublyLinkedList<DataType>::find( const DataType & element ) 100 { 101 DataType el = element; 102 if ( table.retrieve( el ) ) { 103 current = table.getcurrent( ); 104 return true; 105 } 106 107 current = trailer; 108 return false; 109 } which will give us an error…in find, we are not supposed to retrieve 220

221 find (cont.) 97 template <class DataType>
98 bool DoublyLinkedList<DataType>::find( const DataType & element ) 100 { 101 DataType el = element; 102 if ( table.retrieve( el ) ) { 103 current = table.getcurrent( ); 104 return true; 105 } 106 107 current = trailer; 108 return false; 109 } So element is copied to el first, then passed into retreive. 221

222 find (cont.) 97 template <class DataType>
98 bool DoublyLinkedList<DataType>::find( const DataType & element ) 100 { 101 DataType el = element; 102 if ( table.retrieve( el ) ) { 103 current = table.getcurrent( ); 104 return true; 105 } 106 107 current = trailer; 108 return false; 109 } We could have passed by value here (instead of by const reference) to avoid this problem… 222

223 find (cont.) 97 template <class DataType>
98 bool DoublyLinkedList<DataType>::find( const DataType & element ) 100 { 101 DataType el = element; 102 if ( table.retrieve( el ) ) { 103 current = table.getcurrent( ); 104 return true; 105 } 106 107 current = trailer; 108 return false; 109 } but element copying (pass by value or here) can be avoided altogether by making a find function in DLHashTable (an exercise) 223

224 find (cont.) 97 template <class DataType>
98 bool DoublyLinkedList<DataType>::find( const DataType & element ) 100 { 101 DataType el = element; 102 if ( table.retrieve( el ) ) { 103 current = table.getcurrent( ); 104 return true; 105 } 106 107 current = trailer; 108 return false; 109 } 224

225 retrieve 110 template <class DataType>
111 bool DoublyLinkedList<DataType>::retrieve( DataType & element ) 113 { 114 if ( !find( element ) ) 115 return false; 116 117 element = current->info; 118 return true; 119 } 225

226 remove 120 template <class DataType>
121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } sets element before it is removed 226

227 remove (cont.) 120 template <class DataType>
121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } also sets current to the node that element is in 227

228 remove (cont.) 120 template <class DataType>
121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } removes current node from doubly-linked list 228

229 remove (cont.) 126 current->dlback->dlnext = current->dlnext;
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } current collision list doubly-linked list 229

230 remove (cont.) 126 current->dlback->dlnext = current->dlnext;
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } current collision list doubly-linked list 230

231 remove (cont.) 126 current->dlback->dlnext = current->dlnext;
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } current collision list doubly-linked list 231

232 remove (cont.) 126 current->dlback->dlnext = current->dlnext;
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } current collision list doubly-linked list 232

233 remove (cont.) 126 current->dlback->dlnext = current->dlnext;
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } current collision list doubly-linked list 233

234 remove (cont.) 128 current = trailer; current collision list
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } current collision list doubly-linked list 234

235 remove (cont.) 128 current = trailer; collision list current
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } collision list current doubly-linked list trailer 235

236 remove (cont.) 129 table.remove( element ); collision list current
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } collision list current doubly-linked list trailer 236

237 remove (cont.) 129 table.remove( element );
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } element is in the node current was pointing to collision list current doubly-linked list trailer 237

238 remove (cont.) 129 table.remove( element );
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } element is in the node current was pointing to collision list current doubly-linked list trailer 238

239 remove (cont.) 131 return true; collision list current
120 template <class DataType> 121 bool DoublyLinkedList<DataType>::remove( DataType & element ) 122 123 { 124 if ( !retrieve( element ) ) 125 return false; 126 current->dlback->dlnext = current->dlnext; 127 current->dlnext->dlback = current->dlback; 128 current = trailer; 129 table.remove( element ); 130 131 return true; 132 } collision list current doubly-linked list trailer 239

240 replace 133 template <class DataType>
134 bool DoublyLinkedList<DataType>::replace( const DataType & newElement ) 136 { 137 if ( current == trailer ) 138 return false; 139 current->info = newElement; 140 return true; 141 } 240

241 isEmpty / makeEmpty 142 template <class DataType>
143 bool DoublyLinkedList<DataType>::isEmpty( ) const 144 { 145 return header->dlnext == trailer; 146 } 147 148 template <class DataType> 149 void DoublyLinkedList<DataType>::makeEmpty( ) 150 { 151 table.makeEmpty( ); 152 current = header->dlnext = trailer; 153 trailer->dlback = header; 154 } 241

242 deepCopy 155 template <class DataType>
156 inline void DoublyLinkedList<DataType>::deepCopy( const DoublyLinkedList<DataType> & original ) 158 { 159 if ( original.table.getsize( ) != table.getsize( ) ) 160 table.changeSize( original.table.getsize( ) ); 161 table.sethashfunc( original.table.gethashfunc( ) ); 162 header = &headerNode; 163 trailer = &trailerNode; 164 Node<DataType> *save = header->dlnext = trailer; 165 trailer->dlback = header; used later to set current 242

243 deepCopy (cont.) start at the end of the original doubly-linked list
166 Node<DataType> *originalptr = original.trailer->dlback; 167 if ( (originalptr == original.header) || !insert( originalptr->info ) ) 169 return; 243

244 deepCopy (cont.) 166 Node<DataType> *originalptr = original.trailer->dlback; 167 if ( (originalptr == original.header) || !insert( originalptr->info ) ) 169 return; If original doubly-linked list is empty, we want to return; we’ve already created an empty copy 244

245 deepCopy (cont.) 166 Node<DataType> *originalptr = original.trailer->dlback; 167 if ( (originalptr == original.header) || !insert( originalptr->info ) ) 169 return; We may not be able to insert because of an error in the client’s hash function (detected in DLHashTable) 245

246 deepCopy (cont.) 166 Node<DataType> *originalptr = original.trailer->dlback; 167 if ( (originalptr == original.header) || !insert( originalptr->info ) ) 169 return; By starting at the back of the original list and going forwards, we insert each node encountered into the front of the copy – ensuring a duplication. 246

247 deepCopy (cont.) 192 while ( originalptr->dlback != original.header ) { 193 originalptr = originalptr->dlback; 194 if ( !insert( originalptr->info ) ) { 195 makeEmpty( ); 196 return; 197 } 198 if ( original.current == originalptr ) 199 save = header->dlnext; 200 } 201 202 current = save; 203 } We want the current pointer in the original to correspond to the current pointer in the copy 247

248 deepCopy (cont.) 192 while ( originalptr->dlback != original.header ) { 193 originalptr = originalptr->dlback; 194 if ( !insert( originalptr->info ) ) { 195 makeEmpty( ); 196 return; 197 } 198 if ( original.current == originalptr ) 199 save = header->dlnext; 200 } 201 202 current = save; 203 } But we can’t set current in the copy yet – on each insert, the current pointer is changed. 248

249 deepCopy (cont.) 192 while ( originalptr->dlback != original.header ) { 193 originalptr = originalptr->dlback; 194 if ( !insert( originalptr->info ) ) { 195 makeEmpty( ); 196 return; 197 } 198 if ( original.current == originalptr ) 199 save = header->dlnext; 200 } 201 202 current = save; 203 } So we just save the current position of the copy in the save pointer. 249

250 deepCopy (cont.) 192 while ( originalptr->dlback != original.header ) { 193 originalptr = originalptr->dlback; 194 if ( !insert( originalptr->info ) ) { 195 makeEmpty( ); 196 return; 197 } 198 if ( original.current == originalptr ) 199 save = header->dlnext; 200 } 201 202 current = save; 203 } Whether save was set to trailer (before the while loop), or save was set in the while loop, this sets current correctly. 250


Download ppt "C++ Classes and Data Structures Jeffrey S. Childs"

Similar presentations


Ads by Google