CHAPTER 9 HASH TABLES, MAPS, AND SKIP LISTS ACKNOWLEDGEMENT: THESE SLIDES ARE ADAPTED FROM SLIDES PROVIDED WITH DATA STRUCTURES AND ALGORITHMS IN C++,

CHAPTER 9 HASH TABLES, MAPS, AND SKIP LISTS ACKNOWLEDGEMENT: THESE SLIDES ARE ADAPTED FROM SLIDES PROVIDED WITH DATA STRUCTURES AND ALGORITHMS IN C++, GOODRICH, TAMASSIA AND MOUNT (WILEY 2004) AND SLIDES FROM NANCY M. AMATO   0 1 2 3 4 451-229-0004 981-101-0002 025-612-0001

READING Map ADT (Ch. 9.1) Dictionary ADT (Ch. 9.5) Ordered Maps (Ch. 9.3) Hash Tables (Ch. 9.2)

MAP ADT A map models a searchable collection of key-value pair (called entries) Multiple items with the same key are not allowed Applications: address book or student records mapping host names (e.g., cs16.net) to internet addresses (e.g., 128.148.34.101) Often called “associative” containers

LIST-BASED MAP IMPLEMENTATION tail header nodes/positions entries 9 c 6 c 5 c 8 c

DIRECT ADDRESS TABLE MAP IMPLEMENTATION

DICTIONARY ADT The dictionary ADT models a searchable collection of key-value entries The main difference from a map is that multiple items with the same key are allowed Any data structure that supports a dictionary also supports a map Applications: Dictionary which has multiple definitions for the same word

ORDERED MAP/DICTIONARY ADT An Ordered Map/Dictionary supports the usual map/dictionary operations, but also maintains an order relation for the keys. Naturally supports Ordered search tables - store dictionary in a vector by non-decreasing order of the keys Utilizes binary search

EXAMPLE OF ORDERED MAP: BINARY SEARCH 13457 8 91114161819 1 3 457891114161819 134 5 7891114161819 1345 7 891114161819 0 0 0 0 m l h m l h m l h l  m  h

MAP/DICTIONARY IMPLEMENTATIONS Space Unsorted list Direct Address Table (map only) Ordered Search Table (ordered map/dictionary)

CH. 9.2 HASH TABLES

HASH TABLES

ISSUES WITH HASH TABLES Issues Collisions - some keys will map to the same index of H (otherwise we have a Direct Address Table). Chaining - put values that hash to same location in a linked list (or a “bucket”) Open addressing - if a collision occurs, have a method to select another location in the table. Load factor Rehashing

EXAMPLE     0 1 2 3 4 9997 9998 9999 … 451-229-0004 981-101-0002 200-751-9998 025-612-0001

HASH FUNCTIONS

HASH CODES Memory address: We reinterpret the memory address of the key object as an integer Good in general, except for numeric and string keys Integer cast: We reinterpret the bits of the key as an integer Suitable for keys of length less than or equal to the number of bits of the integer type (e.g., byte, short, int and float in C++) Component sum: We partition the bits of the key into components of fixed length (e.g., 16 or 32 bits) and we sum the components (ignoring overflows) Suitable for numeric keys of fixed length greater than or equal to the number of bits of the integer type (e.g., long and double in C++)

HASH CODES Cyclic Shift: Like polynomial accumulation except use bit shifts instead of multiplications and bitwise or instead of addition Can be used on floating point numbers as well by converting the number to an array of characters

COMPRESSION FUNCTIONS

COLLISION RESOLUTION WITH SEPARATE CHAINING Collisions occur when different elements are mapped to the same cell Separate Chaining: let each cell in the table point to a linked list of entries that map there Chaining is simple, but requires additional memory outside the table    0 1 2 3 4 451-229-0004981-101-0004 025-612-0001

EXERCISE SEPARATE CHAINING

COLLISION RESOLUTION WITH OPEN ADDRESSING - LINEAR PROBING 0123456789101112 41 18445932223173 0123456789101112

SEARCH WITH LINEAR PROBING

UPDATES WITH LINEAR PROBING

EXERCISE OPEN ADDRESSING – LINEAR PROBING

COLLISION RESOLUTION WITH OPEN ADDRESSING – QUADRATIC PROBING

COLLISION RESOLUTION WITH OPEN ADDRESSING - DOUBLE HASHING

PERFORMANCE OF HASHING

UNIFORM HASHING ASSUMPTION

PERFORMANCE OF UNIFORM HASHING

ON REHASHING Keeping the load factor low is vital for performance When resizing the table: Reallocate space for the array Design a new hash function (new parameters) for the new array size For each item you reinsert it into the table

SUMMARY MAPS/DICTIONARIES (SO FAR) Space Log File Direct Address Table (map only) Lookup Table (ordered map/dictionary) Hashing (chaining) Hashing (open addressing)

CH. 9.4 SKIP LISTS  S0S0 S1S1 S2S2 S3S3  103623 15   2315

RANDOMIZED ALGORITHMS

WHAT IS A SKIP LIST? 566478  313444  122326   31  64  3134  23 S0S0 S1S1 S2S2 S3S3

IMPLEMENTATION x quad-node

 S0S0 S1S1 S2S2 S3S3  31  64  3134  23 566478  313444  122326

EXERCISE SEARCH  S0S0 S1S1 S2S2 S3S3  31  64  3134  23 566478  313444  122326

 S0S0 S1S1 S2S2 S3S3  103623 15   2315  1036  23  S0S0 S1S1 S2S2 p0p0 p1p1 p2p2

 4512  23  S0S0 S1S1 S2S2  S0S0 S1S1 S2S2 S3S3  451223 34   2334 p0p0 p1p1 p2p2

SPACE USAGE

HEIGHT

SEARCH AND UPDATE TIMES

EXERCISE You are working for ObscureDictionaries.com a new online start-up which specializes in sci-fi languages. The CEO wants your team to describe a data structure which will efficiently allow for searching, inserting, and deleting new entries. You believe a skip list is a good idea, but need to convince the CEO. Perform the following: Illustrate insertion of “X-wing” into this skip list. Randomly generated (1, 1, 1, 0). Illustrate deletion of an incorrect entry “Enterprise” Argue the complexity of deleting from a skip list   YodaBoba Fett  Enterprise   S0S0 S1S1 S2S2

SUMMARY Using a more complex probabilistic analysis, one can show that these performance bounds also hold with high probability Skip lists are fast and simple to implement in practice

CHAPTER 9 HASH TABLES, MAPS, AND SKIP LISTS ACKNOWLEDGEMENT: THESE SLIDES ARE ADAPTED FROM SLIDES PROVIDED WITH DATA STRUCTURES AND ALGORITHMS IN C++,

Similar presentations

Presentation on theme: "CHAPTER 9 HASH TABLES, MAPS, AND SKIP LISTS ACKNOWLEDGEMENT: THESE SLIDES ARE ADAPTED FROM SLIDES PROVIDED WITH DATA STRUCTURES AND ALGORITHMS IN C++,"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

CHAPTER 9 HASH TABLES, MAPS, AND SKIP LISTS ACKNOWLEDGEMENT: THESE SLIDES ARE ADAPTED FROM SLIDES PROVIDED WITH DATA STRUCTURES AND ALGORITHMS IN C++,

Similar presentations

Presentation on theme: "CHAPTER 9 HASH TABLES, MAPS, AND SKIP LISTS ACKNOWLEDGEMENT: THESE SLIDES ARE ADAPTED FROM SLIDES PROVIDED WITH DATA STRUCTURES AND ALGORITHMS IN C++,"— Presentation transcript:

Similar presentations

About project

Feedback