Binary Search Trees One of the tree applications in Chapter 10 is binary search trees. In Chapter 10, binary search trees are used to implement bags.

Slides:



Advertisements
Similar presentations
Trees Types and Operations
Advertisements

L l One of the tree applications in Chapter 9 is binary search trees. l l In Chapter 9, binary search trees are used to implement bags and sets. l l This.
 One of the tree applications in Chapter 10 is binary search trees.  In Chapter 10, binary search trees are used to implement bags and sets.  This presentation.
CSC212 Data Structure Lecture 14 Binary Search Trees Instructor: Prof. George Wolberg Department of Computer Science City College of New York.
B+-Trees (PART 1) What is a B+ tree? Why B+ trees? Searching a B+ tree
Binary Trees, Binary Search Trees CMPS 2133 Spring 2008.
Binary Trees, Binary Search Trees COMP171 Fall 2006.
Trees, Binary Trees, and Binary Search Trees COMP171.
Data Structures Topic #9. Today’s Agenda Continue Discussing Trees Examine the algorithm to insert Examine the algorithm to remove Begin discussing efficiency.
Lec 15 April 9 Topics: l binary Trees l expression trees Binary Search Trees (Chapter 5 of text)
1 Trees III: Binary Search Trees. 2 A forest full of trees The generic toolkit of functions we have seen thus far can be applied to many types of data.
Trees Main and Savitch Chapter 10. Binary Trees A binary tree has nodes, similar to nodes in a linked list structure. Data of one sort or another may.
B-Tree. B-Trees a specialized multi-way tree designed especially for use on disk In a B-tree each node may contain a large number of keys. The number.
Data Structures - CSCI 102 Binary Tree In binary trees, each Node can point to two other Nodes and looks something like this: template class BTNode { public:
BINARY SEARCH TREE. Binary Trees A binary tree is a tree in which no node can have more than two children. In this case we can keep direct links to the.
Priority Queues and Binary Heaps Chapter Trees Some animals are more equal than others A queue is a FIFO data structure the first element.
Binary Trees, Binary Search Trees RIZWAN REHMAN CENTRE FOR COMPUTER STUDIES DIBRUGARH UNIVERSITY.
 One of the tree applications in Chapter 10 is binary search trees.  In Chapter 10, binary search trees are used to implement bags and sets.  This presentation.
Trees, Binary Trees, and Binary Search Trees COMP171.
Lec 15 Oct 18 Binary Search Trees (Chapter 5 of text)
CSC211 Data Structures Lecture 18 Heaps and Priority Queues Instructor: Prof. Xiaoyan Li Department of Computer Science Mount Holyoke College.
Binary Search Trees Chapter 7 Objectives
COMP 53 – Week Fourteen Trees.
TCSS 342, Winter 2006 Lecture Notes
AA Trees.
BCA-II Data Structure Using C
Binary Search Trees One of the tree applications in Chapter 10 is binary search trees. In Chapter 10, binary search trees are used to implement bags.
CSC212 Data Structure - Section AB
Multiway Search Trees Data may not fit into main memory
Binary Search Trees.
Binary Search Tree (BST)
B+-Trees.
B+-Trees.
B+-Trees.
Binary Search Tree Chapter 10.
Lecture 22 Binary Search Trees Chapter 10 of textbook
Binary Search Trees Why this is a useful data structure. Terminology
CSE373: Data Structures & Algorithms Lecture 7: AVL Trees
Binary Trees, Binary Search Trees
Chapter 22 : Binary Trees, AVL Trees, and Priority Queues
(edited by Nadia Al-Ghreimil)
Map interface Empty() - return true if the map is empty; else return false Size() - return the number of elements in the map Find(key) - if there is an.
Data Structures Balanced Trees CSCI
Complete Binary Trees Chapter 9 introduces trees.
original list {67, 33,49, 21, 25, 94} pass { } {67 94}
Lec 12 March 9, 11 Mid-term # 1 (March 21?)
i206: Lecture 14: Heaps, Graphs intro.
Balanced-Trees This presentation shows you the potential problem of unbalanced tree and show two way to fix it This lecture introduces heaps, which are.
Heaps Chapter 10 has several programming projects, including a project that uses heaps. This presentation shows you what a heap is, and demonstrates two.
Heaps Chapter 10 has several programming projects, including a project that uses heaps. This presentation shows you what a heap is, and demonstrates two.
B-Trees This presentation shows you the potential problem of unbalanced tree and show one way to fix it This lecture introduces heaps, which are used.
Heaps Chapter 11 has several programming projects, including a project that uses heaps. This presentation shows you what a heap is, and demonstrates.
Hash Tables Chapter 12 discusses several ways of storing information in an array, and later searching for the information. Hash tables are a common.
Balanced-Trees This presentation shows you the potential problem of unbalanced tree and show two way to fix it This lecture introduces heaps, which are.
Lecture 12 CS203 1.
Binary Search Trees.
CSE 332: Data Abstractions AVL Trees
Binary Search Trees One of the tree applications in Chapter 9 is binary search trees. In Chapter 9, binary search trees are used to implement bags and.
Heaps Chapter 11 has several programming projects, including a project that uses heaps. This presentation shows you what a heap is, and demonstrates.
Binary Trees, Binary Search Trees
Hash Tables Chapter 12 discusses several ways of storing information in an array, and later searching for the information. Hash tables are a common.
Hash Tables Chapter 12 discusses several ways of storing information in an array, and later searching for the information. Hash tables are a common.
Binary Search Trees One of the tree applications in Chapter 9 is binary search trees. In Chapter 9, binary search trees are used to implement bags and.
Richard Anderson Spring 2016
B-Trees.
B-Trees This presentation shows you the potential problem of unbalanced tree and show one way to fix it This lecture introduces heaps, which are used.
Binary Trees, Binary Search Trees
Heaps Chapter 10 has several programming projects, including a project that uses heaps. This presentation shows you what a heap is, and demonstrates two.
Chapter 11 Trees © 2011 Pearson Addison-Wesley. All rights reserved.
Presentation transcript:

Binary Search Trees One of the tree applications in Chapter 10 is binary search trees. In Chapter 10, binary search trees are used to implement bags. This presentation illustrates how another data type called a dictionary is implemented with binary search trees. This lecture shows a common application of binary trees: Binary Search Trees used to implement a dictionary data type. Before this lecture, students should have a good understanding of binary trees, and should have seen some basic container data types similar to a dictionary (for example, a bag or a set).

Binary Search Trees (BST) A data structure for efficient searching, insertion and deletion Binary search tree property For every node x All the keys in its left subtree are smaller than the key value in x All the keys in its right subtree are larger than the key value in x

Binary Search Trees A binary search tree Not a binary search tree

Binary Search Trees Average depth of a BST is O(log n) Maximum depth of a BST is O(n) The same set of keys may have different BSTs

The Dictionary Data Type A dictionary is a collection of items, similar to a bag. But unlike a bag, each item has a string attached to it, called the item's key. This lecture shows how to use a binary tree to implement an abstract data structure called a dictionary. We'll start by explaining what a dictionary is, without any reference to trees. Then we will show how the trees can be used to actually implement a dictionary. It's important to realize that trees are but one possible way to implement a dictionary, and the actual explanation of "What is a dictionary?" will not refer to trees at all. In other words, a dictionary is an abstract data type, and the trees are one of the mechanisms that can be used to implement the dictionary. So, what is a dictionary? In many ways it is like other ADTs that you have seen, such as a bag which contains a collection of items. The difference is that each item in a dictionary is attached to a string called the item's key.

The Dictionary Data Type A dictionary is a collection of items, similar to a bag. But unlike a bag, each item has a string attached to it, called the item's key. For this example, each item that I'm putting in the dictionary is a record which contain a bunch of geographical information about a state. Example: The items I am storing are records containing data about a state.

The Dictionary Data Type A dictionary is a collection of items, similar to a bag. But unlike a bag, each item has a string attached to it, called the item's key. The key for each record is the name of the state. In general, the keys could be some other sort of value such as social security numbers. The keys must have the property that they form a total order under some comparison operation such as “less than”. Example: The key for each record is the name of the state. Washington

The Dictionary Data Type void dictionary::insert(The key for the new item, The new item); The insertion procedure for a dictionary has two parameters. When an item is placed into the dictionary, we must specify both the record of information and the key that is attached to that information. For example, if the dictionary is implemented as an object type, then there will be an Insert method. The Insert method will have two parameters: a string (which is the key) and a record (which is the item being inserted). Washington

The Dictionary Data Type When you want to retrieve an item, you specify the key... When you want to retrieve information from the dictionary, you call a retrieval method, and specify the key of the item that you are looking for. This key is the parameter of the retrieval procedure. Item dictionary::retrieve("Washington");

The Dictionary Data Type When you want to retrieve an item, you specify the key... ... and the retrieval procedure returns the item. The dictionary finds the information, and returns it. Item dictionary::retrieve("Washington");

The Dictionary Data Type We'll look at how a binary tree can be used as the internal storage mechanism for the dictionary. That's enough about the abstract workings of a dictionary. Now we are going to look at how a binary tree can be used to store the information of a dictionary in a way that makes it fairly easy to add new items, to retrieve existing items, and to remove items. (You never know when a state might want to secede from the union.)

A Binary Search Tree of States Florida The data in the dictionary will be stored in a binary tree, with each node containing an item and a key. Oklahoma Colorado Arizona Arkansas Mass. Washington As you might have guessed, the data in the dictionary will be stored in a binary tree, with each node containing both a record of information and the key that's attached to that information. In this example, the dictionary currently has only 9 of the 50 states, but that's enough to illustrate the idea. Hampshire New Virginia West

A Binary Search Tree of States Florida Storage rules: Every key to the left of a node is alphabetically before the key of the node. Oklahoma Colorado Arizona Arkansas Colorado Mass. Washington The nodes cannot appear in just any order. The nodes must follow the special storage rules of a binary search tree. There are two such rules: 1. Every key to the left of a node is alphabetically before the key of the node. Hampshire New Virginia West

A Binary Search Tree of States Florida Storage rules: Every key to the left of a node is alphabetically before the key of the node. Oklahoma Colorado Arizona Mass. Washington Example: “Massachusetts” and “ New Hampshire” are alphabetically before “Oklahoma” Notice that this rule applies to every which can be reached by starting down the left branch of a node. For example, if I start at Oklahoma, and head down the left branch, I can reach Massachusetts and New Hampshire. So, both Massachusetts and New Hampshire must be alphabetically before Oklahoma. Hampshire New Virginia West Arkansas

A Binary Search Tree of States Florida Storage rules: Every key to the left of a node is alphabetically before the key of the node. Every key to the right of a node is alphabetically after the key of the node. Oklahoma Colorado Arizona Mass. Washington The second rule is the mirror image of the first rule: 2. Every key to the right of a node is alphabetically after the key of the node. Hampshire New Virginia West Arkansas

A Binary Search Tree of States Florida Storage rules: Every key to the left of a node is alphabetically before the key of the node. Every key to the right of a node is alphabetically after the key of the node. Oklahoma Colorado Arizona Mass. Washington Here's an example: Arkansas is alphabetically after Arizona. Hampshire New Virginia West Arkansas

Searching BST If we are searching for 15, then we are done. If we are searching for a key < 15, then we should search in the left subtree. If we are searching for a key > 15, then we should search in the right subtree.

Searching (Find) Find X: return a pointer to the node that has key X, or NULL if there is no such node Time complexity: O(depth of the tree)

Retrieving Data Start at the root. Florida Start at the root. If the current node has the key, then stop and retrieve the data. If the current node's key is too large, move left and repeat 1-3. If the current node's key is too small, move right and repeat 1-3. Oklahoma Colorado Arizona Mass. Washington Once a tree is organized according to the storage rule of a binary search tree, it is easy to find any particular key, following this algorithm. The algorithm starts at the root and repeatedly executes these steps. 1. First check to see if we found the key we were looking for. If so, then we can stop and return the associated information. 2. On the other hand, if the key at the current node is larger than the key that we're searching for, then we'll continue our search by moving leftward. 3. And if the key at the current node is smaller than the key that we're searching for, then we'll continue our search by moving rightward. Hampshire New Virginia West Arkansas

Retrieve " New Hampshire" Florida Start at the root. If the current node has the key, then stop and retrieve the data. If the current node's key is too large, move left and repeat 1-3. If the current node's key is too small, move right and repeat 1-3. Oklahoma Colorado Arizona Mass. Washington As an example, suppose we are searching for New Hampshire. We start at Florida, and since this is not the node we are after we must move down. Do we move left or right? Hampshire New Virginia West Arkansas

Inorder Traversal of BST Inorder traversal of BST prints out all the keys in sorted order Inorder: 2, 3, 4, 6, 7, 9, 13, 15, 17, 18, 20

findMin/ findMax Goal: return the node containing the smallest (largest) key in the tree Algorithm: Start at the root and go left (right) as long as there is a left (right) child. The stopping point is the smallest (largest) element Time complexity = O(depth of the tree)

Insertion Proceed down the tree as you would with a find If x is found, do nothing (or update something) Otherwise, insert x at the last spot on the path traversed Time complexity = O(depth of the tree)

Adding a New Item with a Given Key Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington Adding a new node requires two steps, the first of which is similar to searching. In this first step we pretend that we are trying to find the key. Of course, we won't find the key, because it is not yet in the tree. So eventually we will reach a spot where there is no "next node" to step onto. At this point we stop and add the new node. Hampshire New Virginia West Arkansas

Adding Iowa Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington For example, suppose that Iowa wants to join our dictionary. We start by pretending to search for Iowa, beginning at the root... Hampshire New Virginia West Arkansas

Adding Iowa Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington Which way will we move from the root if we are searching for Iowa? Hampshire New Virginia West Arkansas

Adding Iowa Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington From the root we have moved right because Florida was smaller than Iowa. Hampshire New Virginia West Arkansas

Adding Iowa Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington From Oklahoma we move left, onto Massachusetts,... Hampshire New Virginia West Arkansas

Adding Iowa Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington ...and from Massachusetts we would move left again, if we could. But there is no node here. So we stop... Hampshire New Virginia West Arkansas

Adding Florida Pretend that you are trying to find the key, but stop when there is no node to move to. Add the new node at the spot where you would have moved to if there had been a node. Oklahoma Colorado Arizona Mass. Washington ...and this is the location for Iowa. Later, when we are searching for Iowa, we will follow the same path down to Massachusetts. We will step left from Massachusetts, and there we find our goal of Iowa. Important note: New nodes are always added at the leaves. Iowa Hampshire New Virginia West Arkansas

Adding Where would you add this state? Kazakhstan Florida Oklahoma Colorado Arizona Mass. Washington One more example: Where would you add the new state of Kazakhstan?... Iowa Hampshire New Virginia West Arkansas

Adding Kazakhstan is the new right child of Iowa? Florida Oklahoma Colorado Arizona Mass. Washington If you followed the algorithm, then you saw that Kazakhstan is added as the right child of Iowa. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Deletion When we delete a node, we need to consider how we take care of the children of the deleted node. This has to be done such that the property of the search tree is maintained.

Deletion under Different Cases Case 1: the node is a leaf Delete it immediately Case 2: the node has one child Adjust a pointer from the parent to bypass that node

Deletion Case 3 Case 3: the node has 2 children Replace the key of that node with the minimum element at the right subtree Delete that minimum element Has either no child or only right child because if it has a left child, that left child would be smaller and would have been chosen. So invoke case 1 or 2. Time complexity = O(depth of the tree)

Removing an Item with a Given Key Florida Find the item. If necessary, swap the item with one that is easier to remove. Remove the item. Oklahoma Colorado Arizona Mass. Washington Removing an item requires three steps. We can look at the steps in detail in a moment, but first let's go through this outline so that you know roughly what to expect. 1. Find the item that you want to remove. 2. Some items are harder to remove than others. In particular, an item with two children is particularly hard to remove. So, in this second step we sometimes take an item that is hard to remove, and exchange it with an item that is easier to remove. 3. Finally, once the item is in a spot that is easy to remove, we remove it. Let's look at the three steps in detail. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" Find the item. Florida Oklahoma Colorado Arizona Mass. Washington As an example, let's remove Florida. (It's way too hot to be part of the union anyway). First we find it, which is easy enough by using the usual method to search for a key. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" Florida cannot be removed at the moment... Florida Oklahoma Colorado Arizona Mass. Florida cannot be removed at the moment... Washington In the second step we need to swap Florida with another item that is easier to remove. The reason that we need to do this swap is because of a problem that occurs if we just try to remove Florida... Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" ... because removing Florida would Oklahoma Colorado Arizona Mass. ... because removing Florida would break the tree into two pieces. Washington We end up breaking the tree into two separate parts. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" If necessary, do some rearranging. The problem of Oklahoma Colorado Arizona Mass. The problem of breaking the tree happens because Florida has 2 children. Washington In general it is hard to recombine these two parts into a single tree. So, our goal is to find another item that is easier to remove, and copy that other item spot that we are trying to remove. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" If necessary, do some rearranging. Oklahoma Colorado Arizona Mass. For the rearranging, take the smallest item in the right subtree... Washington In general, there are two items that we could copy on top of Florida. In the book you'll see that one of these items is in the left subtree. In this lecture I'll use the other one, from the right subtree. In an actual program you can follow either approach. Anyway, the approach we'll take here is to copy the smallest item in the right subtree onto Florida. To find that smallest item, step onto the right subtree (Oklahoma), and then race as far left as you can -- onto Iowa. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" If necessary, do some rearranging. Iowa If necessary, do some rearranging. Oklahoma Colorado Arizona Mass. ...copy that smallest item onto the item that we're removing... Washington Copy Iowa on top of the item that we are removing. Iowa Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" If necessary, do some rearranging. Iowa If necessary, do some rearranging. Oklahoma Colorado Arizona Mass. ... and then remove the extra copy of the item we copied... Washington We have eliminated Florida, but now we have two copies of Iowa. If you've ever been to Iowa, you know that one Iowa is more than enough, so next we must remove the extra Iowa... Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" If necessary, do some rearranging. Iowa If necessary, do some rearranging. Oklahoma Colorado Arizona Mass. Washington Note that the extra Iowa had one child (Kazakhstan), so that child is reconnected to the parent. Here's a good question for you: Remember that it's hard to remove nodes with two children. How do you know that the smallest item in the right subtree does not have two children? Answer: Since it is the smallest item, it can't have a left child because that left child would be even smaller. ... and reconnect the tree Kazakhstan Hampshire New Virginia West Arkansas

Removing "Florida" Why did I choose the smallest item Oklahoma Colorado Arizona Mass. Why did I choose the smallest item in the right subtree? Washington In fact, the guarantee that the smallest item has at most one child is one of the reasons why I selected the smallest item. There's a second reason, can you think of it? Hampshire New Virginia West Arkansas Kazakhstan

Removing "Florida" Because every key must be smaller than Iowa Oklahoma Colorado Arizona Mass. Because every key must be smaller than the keys in its right subtree Washington The second reason is that I am take this smallest item and place it in Florida's location. In order to maintain the binary search tree storage rules, this item must be smaller than anything that remains in the right subtree--therefore I must choose the smallest item in the right subtree. Kazakhstan Hampshire New Virginia West Arkansas

Removing an Item with a Given Key Find the item. If the item has a right child, rearrange the tree: Find smallest item in the right subtree Copy that smallest item onto the one that you want to remove Remove the extra copy of the smallest item (making sure that you keep the tree connected) else just remove the item. Here's a summary of the removal steps. Note that if the item that we want to remove does not have a right child, then we can just remove it (and reconnect its left child if there is one). In the text book, the process is done in a symmetrical way--using the largest item in the left subtree. Either approach works fine. Also, the textbook implements a Bag rather than a dictionary. One of the resultant differences is that the Bag may have several copies of the same item, so that items in the left subtree can be less than or equal to the item in a node.

Summary Binary search trees are a good implementation of data types such as sets, bags, and dictionaries. Searching for an item is generally quick since you move from the root to the item, without looking at many other items. Adding and deleting items is also quick. A quick summary . . .