CSE373: Data Structures & Algorithms Lecture 5: Dictionary ADTs; Binary Trees Linda Shapiro Spring 2016.

Slides:



Advertisements
Similar presentations
Binary Trees, Binary Search Trees CMPS 2133 Spring 2008.
Advertisements

CSE332: Data Abstractions Lecture 6: Dictionaries; Binary Search Trees Dan Grossman Spring 2010.
CSE 326: Data Structures Binary Search Trees Ben Lerner Summer 2007.
Lec 15 April 9 Topics: l binary Trees l expression trees Binary Search Trees (Chapter 5 of text)
1 abstract containers hierarchical (1 to many) graph (many to many) first ith last sequence/linear (1 to 1) set.
CSE373: Data Structures & Algorithms Lecture 5: Dictionaries; Binary Search Trees Aaron Bauer Winter 2014.
CSE373: Data Structures & Algorithms Lecture 6: Binary Search Trees Nicki Dell Spring 2014 CSE373: Data Structures & Algorithms1.
CSE332: Data Abstractions Lecture 6: Dictionaries; Binary Search Trees Tyler Robison Summer
Lecture 11COMPSCI.220.FS.T Balancing an AVLTree Two mirror-symmetric pairs of cases to rebalance the tree if after the insertion of a new key to.
CSE373: Data Structures & Algorithms Lecture 6: Binary Search Trees continued Aaron Bauer Winter 2014 CSE373: Data Structures & Algorithms1.
Week 10 - Friday.  What did we talk about last time?  Graph representations  Adjacency matrix  Adjacency lists  Depth first search.
CSE373: Data Structures & Algorithms Lecture 4: Dictionaries; Binary Search Trees Dan Grossman Fall 2013.
Week 15 – Wednesday.  What did we talk about last time?  Review up to Exam 1.
CSE373: Data Structures & Algorithms Lecture 6: Binary Search Trees Linda Shapiro Winter 2015.
CSE373: Data Structures & Algorithms Lecture 5: Dictionary ADTs; Binary Trees Lauren Milne Summer 2015.
CSE373: Data Structures & Algorithms Lecture 4: Dictionaries; Binary Search Trees Kevin Quinn Fall 2015.
BSTs, AVL Trees and Heaps Ezgi Shenqi Bran. What to know about Trees? Height of a tree Length of the longest path from root to a leaf Height of an empty.
CSE373: Data Structures & Algorithms Lecture 8: AVL Trees and Priority Queues Linda Shapiro Spring 2016.
Final Exam Review COP4530.
Instructor: Lilian de Greef Quarter: Summer 2017
CSE373: Data Structures & Algorithms Priority Queues
CSE373: Data Structures & Algorithms Lecture 5: Dictionary ADTs; Binary Trees Linda Shapiro Winter 2015.
CSE373: Data Structure & Algorithms
CSC317 Selection problem q p r Randomized‐Select(A,p,r,i)
COMP261 Lecture 23 B Trees.
CSE373: Data Structures & Algorithms
Instructor: Lilian de Greef Quarter: Summer 2017
Week 7 - Friday CS221.
October 2nd – Dictionary ADT
Multiway Search Trees Data may not fit into main memory
CSE373: Data Structures & Algorithms Lecture 6: Hash Tables
CSE373: Data Structures & Algorithms More Heaps; Dictionaries; Binary Search Trees Riley Porter Winter 2016 Winter 2017.
Binary Search Trees.
Week 6 - Wednesday CS221.
October 30th – Priority QUeues
Trees Lecture 12 CS2110 – Fall 2017.
CSE373: Data Structures & Algorithms Lecture 7: AVL Trees
CSE373: Data Structures & Algorithms Lecture 4: Dictionaries; Binary Search Trees Hunter Zahn Summer 2016 Summer 2016.
Week 11 - Friday CS221.
Hashing Exercises.
abstract containers sequence/linear (1 to 1) hierarchical (1 to many)
Cse 373 April 26th – Exam Review.
i206: Lecture 13: Recursion, continued Trees
Binary Trees, Binary Search Trees
Chapter 22 : Binary Trees, AVL Trees, and Priority Queues
Map interface Empty() - return true if the map is empty; else return false Size() - return the number of elements in the map Find(key) - if there is an.
CSE373: Data Structures & Algorithms Lecture 5: Dictionary ADTs; Binary Trees Catie Baker Spring 2015.
CMSC 341 Lecture 10 B-Trees Based on slides from Dr. Katherine Gibson.
CSE373: Data Structures & Algorithms Lecture 6: Binary Search Trees
CSE373: Data Structures & Algorithms Lecture 6: Binary Search Trees
CSE332: Data Abstractions Lecture 5: Binary Heaps, Continued
CSE373: Data Structures & Algorithms Lecture 11: Implementing Union-Find Linda Shapiro Spring 2016.
Final Exam Review COP4530.
B-Tree Insertions, Intro to Heaps
Heaps and the Heapsort Heaps and priority queues
Ch. 8 Priority Queues And Heaps
CSE 373: Data Structures and Algorithms
Hassan Khosravi / Geoffrey Tien
CSE 332: Data Abstractions Binary Search Trees
CSE373: Data Structures & Algorithms Lecture 5: AVL Trees
CMSC 202 Trees.
CSE 326: Data Structures Priority Queues & Binary Heaps
CSE 332: Data Abstractions AVL Trees
Binary Trees, Binary Search Trees
CSE373: Data Structures & Algorithms Lecture 7: Binary Heaps, Continued Dan Grossman Fall 2013.
Binary SearchTrees [CLRS] – Chap 12.
Richard Anderson Spring 2016
B-Trees.
Binary Trees, Binary Search Trees
Presentation transcript:

CSE373: Data Structures & Algorithms Lecture 5: Dictionary ADTs; Binary Trees Linda Shapiro Spring 2016

CSE 373 Algorithms and Data Structures Today’s Outline Announcements Homework 1 due TODAY at 11:59 pm  Homework 2 out (paper and pencil assignment) Due in class Wednesday April 13 at the START of class Today’s Topics Finish Asymptotic Analysis Dictionary ADT (a.k.a. Map): associate keys with values Extremely common Binary Trees Spring 2016 CSE 373 Algorithms and Data Structures

Summary of Asymptotic Analysis Analysis can be about: The problem or the algorithm (usually algorithm) Time or space (usually time) Or power or dollars or … Best-, worst-, or average-case (usually worst) Upper-, lower-, or tight-bound (usually upper) The most common thing we will do is give an O upper bound to the worst-case running time of an algorithm. g(n) ≤ f(n) O g(n) ≥ f(n) Ω both θ Spring 2016 CSE 373 Algorithms and Data Structures

How to apply the definition easily Theory (mine) Let g(n) = c1*g1(n) + c2*g2(n) + ... + ck*gk(n) + c0 Suppose the functions g1, g2, ... gk are already arranged in order with highest complexity at the far left. Select the LARGEST of the constants C = max(c1, c2, ..ck, c0) Then g(n) ≤ C*g1(n) + C*g2(n) + ... + C*gk(n) + C Or g(n) ≤ C(g1(n) + g2(n) + ... + gk(n) + 1) But g1(n) is bigger than g2(n) and all the others beyond some known n0. So g(n) ≤ C(g1(n) + g1(n) + ... +g1(n) + g1(n)) Or g(n) ≤ C(k+1)*g1(n) = C’*g1(n) for all n greater than n0. Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Example g(n) = 25 n4 + 30 n2 + 100 nlogn + 54 g(n) ≤ 100 n4 + 100n2 + 100 nlogn + 100 g(n) ≤ 100(n4 + n2 + nlogn + 1) g(n) ≤ 100(n4 + n4 + n4 + n4) g(n) ≤ 100*4*n4 g(n) ≤ 400*n4 for all n ≥ 1 Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Big-Oh Caveats Asymptotic complexity focuses on behavior for large n and is independent of any computer / coding trick But you can “abuse” it to be misled about trade-offs Example: n1/10 vs. log n Asymptotically n1/10 grows more quickly But the “cross-over” point is around 5 * 1017 So if you have input size less than 258, prefer n1/10 For small n, an algorithm with worse asymptotic complexity might be faster If you care about performance for small n then the constant factors can matter Spring 2016 CSE 373 Algorithms and Data Structures

Addendum: Timing vs. Big-Oh Summary Big-oh is an essential part of computer science’s mathematical foundation Examine the algorithm itself, not the implementation Reason about (even prove) performance as a function of n Timing also has its place Compare implementations Focus on data sets you care about (versus worst case) Determine what the constant factors “really are” Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Let’s take a breath So far we’ve covered Some simple ADTs: stacks, queues, lists Some math (proof by induction) How to analyze algorithms Asymptotic notation (Big-Oh) Coming up…. Many more ADTs Starting with dictionaries Spring 2016 CSE 373 Algorithms and Data Structures

The Dictionary (a.k.a. Map) ADT 7/5/2018 The Dictionary (a.k.a. Map) ADT Data: set of (key, value) pairs keys must be comparable Operations: insert(key,value) find(key) delete(key) … ezgi Ezgi Mercan OH: Thurs 10.30-11.30 … mert Mert Sagalm OH: TTH 3.30-4.30 bran Bran Hagger OH: Mon 10.00-11.00 insert(ezgi, ….) find(bran) Bran Hagger, … Will tend to emphasize the keys; don’t forget about the stored values Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 A Modest Few Uses key | attr1 | attr2 | attr3 k1 | v11 | v12 | v13 k2 | v21 | v22 | v23 Any time you want to store information according to some key and be able to retrieve it efficiently Lots of programs do that! Search: inverted indexes, phone directories, … Networks: router tables Operating systems: page tables Compilers: symbol tables Databases: dictionaries with other nice properties Biology: genome maps … Possibly the most widely used ADT Spring 2016 CSE 373 Algorithms and Data Structures

Simple implementations 7/5/2018 Simple implementations For dictionary with n key/value pairs insert find delete Unsorted linked-list Unsorted array Sorted linked list Sorted array * Unless we need to check for duplicates We’ll see a Binary Search Tree (BST) probably does better but not in the worst case (unless we keep it balanced) O(1)* O(n) O(n) O(1)* O(n) O(n) O(n) O(n) O(n) LL: O(1), O(n), O(n) Uns: O(1), O(n), O(n) Sorted: O(n), O(log n), O(n) Sorted array is oh-so-close. O(log n) find time and almost O(log n) insert time. What’s wrong? Let’s look at how that search goes: Draw recursive calls (and potential recursive calls) in binary search. Note how it starts looking like a binary tree where the left subtrees have smaller elements and the right subtrees have bigger elements. What if we could store the whole thing in the structure this recursive search is building? O(n) O(log n) O(n) Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Lazy Deletion 10 12 24 30 41 42 44 45 50   A general technique for making delete as fast as find: Instead of actually removing the item just mark it deleted Plusses: Simpler Can do removals later in batches If re-added soon thereafter, just unmark the deletion Minuses: Extra space for the “is-it-deleted” flag Data structure full of deleted nodes wastes space May complicate other operations Spring 2016 CSE 373 Algorithms and Data Structures

Better dictionary data structures There are many good data structures for (large) dictionaries Binary trees AVL trees Binary search trees with guaranteed balancing B-Trees Also always balanced, but different and shallower B-Trees are not the same as Binary Trees B-Trees generally have large branching factor Hash Tables Not tree-like at all Skipping: Other balanced trees (e.g., red-black, splay) Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Tree terms (review?) Tree T A E B D F C G I H L J M K N Depth 0 Root (tree) Leaves (tree) Children (node) Parent (node) Siblings (node) Ancestors (node) Descendents (node) Subtree (node) Depth (node) Height (tree) Degree (node) Branching factor (tree) Depth 1 4 Depth 2 Depth 3 Depth 4 Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures More tree terms There are many kinds of trees Every binary tree is a tree Every list is kind of a tree (think of “next” as the one child) There are many kinds of binary trees Every binary search tree is a binary tree Later: A binary heap is a different kind of binary tree A tree can be balanced or not A balanced tree with n nodes has a height of O(log n) Different tree data structures have different “balance conditions” to achieve this Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Kinds of trees Certain terms define trees with specific structure Binary tree: Each node has at most 2 children (branching factor 2) n-ary tree: Each node has at most n children (branching factor n) Perfect tree: Each row completely full Complete tree: Each row completely full except maybe the bottom row, which is filled from left to right What is the height of a perfect binary tree with n nodes? A complete binary tree? └ log2n ┘ Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 Binary Trees Binary tree: Each node has at most 2 children (branching factor 2) Binary tree is A root (with data) A left subtree that’s a binary tree A right subtree that’s a binary tree These subtrees may be empty. Representation: A B C D E F Data right pointer left G H For a dictionary, data will include a key and a value I J Spring 2016 CSE 373 Algorithms and Data Structures

Binary Tree Representation 7/5/2018 Binary Tree Representation Spring 2016 CSE 373 Algorithms and Data Structures

Binary Trees: Some Numbers 7/5/2018 Binary Trees: Some Numbers Recall: height of a tree = longest path from root to leaf (count edges) For binary tree of height h: max # of leaves: max # of nodes: min # of leaves: min # of nodes: 2h 2h, for perfect tree 2(h + 1) - 1 2h+1 – 1, for perfect tree 1 Some tree calculations, for tree of height h: - Max number of leaves (perfect tree): 2^h - Max number of nodes (perfect tree): 2^(h+1) - 1 - Min number of nodes/leaves (degenerate tree): h-1/1 - What fraction of the tree is located in the last level of a perfect tree? 1/2 - Average depth for N nodes: sqrt(N) (We won’t go into this, but if you take N nodes and assume all distinct trees of the nodes are equally likely, you get an average depth of SQRT(N). Is that bigger or smaller than log n? Bigger, so it’s not good enough!) 1, for “list” tree h + 1 h+1, for “list” tree For n nodes, we cannot do better than O(log n) height and we want to avoid O(n) height Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Calculating height What is the height of a tree with root root? int treeHeight(Node root) { ??? } Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Calculating height What is the height of a tree with root root? int treeHeight(Node root) { if(root == null) return -1; return 1 + max(treeHeight(root.left), treeHeight(root.right)); } Running time for tree with n nodes: O(n) – single pass over tree Note: non-recursive is painful – need your own stack of pending nodes; much easier to use recursion’s call stack Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 Tree Traversals A traversal is an order for visiting all the nodes of a tree Pre-order: root, left subtree, right subtree + * 2 4 5 In-order: left subtree, root, right subtree 2 * 4 + 5 Post-order: left subtree, right subtree, root 2 4 * 5 + + * 2 4 5 (an expression tree) Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A B C D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A B C D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ A = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A B C D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ A = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A B C ✓ D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ A = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A ✓ B C ✓ D E F G A = current node = processing (on the call stack) = completed node = element has been processed A ✓ = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A ✓ B C ✓ ✓ D E F G A = current node = processing (on the call stack) = completed node = element has been processed A ✓ = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals ✓ void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A ✓ B C ✓ ✓ D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ A = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals ✓ void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A ✓ B C ✓ ✓ D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ A = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals ✓ void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A ✓ ✓ B C ✓ ✓ ✓ D E F G A = current node = processing (on the call stack) = completed node = element has been processed A ✓ = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals ✓ void inOrderTraversal(Node t){ if(t != null) { inOrderTraversal(t.left); process(t.element); inOrderTraversal(t.right); } A ✓ ✓ B C ✓ ✓ ✓ ✓ D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ A = current node = processing (on the call stack) = completed node A A Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures 7/5/2018 More on traversals void preOrderTraversal(Node t){ if(t != null) { process(t.element); preOrderTraversal(t.left); preOrderTraversal(t.left) } A B C D E F G = current node = processing (on the call stack) = completed node = element has been processed A ✓ Spring 2016 CSE 373 Algorithms and Data Structures

CSE 373 Algorithms and Data Structures Preorder Exercise Q R S T U v w Spring 2016 CSE 373 Algorithms and Data Structures