CPS 100, Spring 2009 7.1 Data Structures revisited l Linked lists and arrays and ArrayLists and …  Linear structures, operations include insert, delete,

Slides:



Advertisements
Similar presentations
Comp 122, Spring 2004 Binary Search Trees. btrees - 2 Comp 122, Spring 2004 Binary Trees  Recursive definition 1.An empty tree is a binary tree 2.A node.
Advertisements

Binary Trees, Binary Search Trees CMPS 2133 Spring 2008.
CPS 100, Spring Spirits of Linear Structures l Past, Present, Future, … l Linked-lists and arrays and ArrayLists  ADT: insert, delete, iterate/traverse,
Binary Trees, Binary Search Trees COMP171 Fall 2006.
CS 171: Introduction to Computer Science II
Trees Chapter 8.
Fall 2007CS 2251 Trees Chapter 8. Fall 2007CS 2252 Chapter Objectives To learn how to use a tree to represent a hierarchical organization of information.
Trees Chapter 8. Chapter 8: Trees2 Chapter Objectives To learn how to use a tree to represent a hierarchical organization of information To learn how.
Trees Chapter 8. Chapter 8: Trees2 Chapter Objectives To learn how to use a tree to represent a hierarchical organization of information To learn how.
©Brooks/Cole, 2003 Chapter 12 Abstract Data Type.
Lec 15 April 9 Topics: l binary Trees l expression trees Binary Search Trees (Chapter 5 of text)
TCSS 342, Winter 2005 Lecture Notes
Fundamentals of Python: From First Programs Through Data Structures
CS 146: Data Structures and Algorithms June 18 Class Meeting Department of Computer Science San Jose State University Summer 2015 Instructor: Ron Mak
Advanced Algorithms Analysis and Design Lecture 8 (Continue Lecture 7…..) Elementry Data Structures By Engr Huma Ayub Vine.
Lecture Objectives  To learn how to use a tree to represent a hierarchical organization of information  To learn how to use recursion to process trees.
CompSci 100 Prog Design and Analysis II October 14, 2010 Prof. Rodger CompSci 100, Fall
CPS Wordcounting, sets, and the web l How does Google store all web pages that include a reference to “peanut butter with mustard”  What efficiency.
CompSci Binary Trees l Linked lists: efficient insertion/deletion, inefficient search  ArrayList: search can be efficient, insertion/deletion.
(c) University of Washington20d-1 CSC 143 Java Applications of Trees.
CPS 100, Spring Trees: no data structure lovelier?
Information and Computer Sciences University of Hawaii, Manoa
Lecture 10 Trees –Definiton of trees –Uses of trees –Operations on a tree.
S EARCHING AND T REES COMP1927 Computing 15s1 Sedgewick Chapters 5, 12.
Spring 2010CS 2251 Trees Chapter 6. Spring 2010CS 2252 Chapter Objectives Learn to use a tree to represent a hierarchical organization of information.
CPS Data Structures revisited l Linked lists and arrays and ArrayLists and …  Linear structures, operations include insert, delete, traverse,
Tree (new ADT) Terminology:  A tree is a collection of elements (nodes)  Each node may have 0 or more successors (called children)  How many does a.
Binary Trees, Binary Search Trees RIZWAN REHMAN CENTRE FOR COMPUTER STUDIES DIBRUGARH UNIVERSITY.
CompSci 100e Program Design and Analysis II March 29, 2011 Prof. Rodger CompSci 100e, Spring20111.
Chapter 19: Binary Trees Java Programming: Program Design Including Data Structures Program Design Including Data Structures.
CPS Sequential search revisited l What does the code below do? How would it be called initially?  Another overloaded function search with 2 parameters?
1 Chapter 17 Object-Oriented Data Structures. 2 Objectives F To describe what a data structure is (§17.1). F To explain the limitations of arrays (§17.1).
Starting at Binary Trees
Binary Search Trees Nilanjan Banerjee. 2 Goal of today’s lecture Learn about Binary Search Trees Discuss the first midterm.
CompSci 100E 9.1 Stack: What problems does it solve?  Stacks are used to avoid recursion, a stack can replace the implicit/actual stack of functions called.
Recursive Data Structures and Grammars  Themes  Recursive Description of Data Structures  Grammars and Parsing  Recursive Definitions of Properties.
CompSci 100e 7.1 From doubly-linked lists to binary trees l Instead of using prev and next to point to a linear arrangement, use them to divide the universe.
CPS 100, Fall Catching Up, Moving ahead l Last week and this week  How did you study for the exam? Next time?  How can you/should you work with.
CompSci 100e Program Design and Analysis II March 15, 2011 Prof. Rodger CompSci 100e, Spring20111.
CompSci 100e 7.1 Plan for the week l Review:  Union-Find l Understand linked lists from the bottom up and top-down  As clients of java.util.LinkedList.
CPS Scoreboard l What else might we want to do with a data structure? l What are maps’ limitations? AlgorithmInsertionDeletionSearch Unsorted.
IKI 10100I: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100I: Data.
Week 10 - Friday.  What did we talk about last time?  Graph representations  Adjacency matrix  Adjacency lists  Depth first search.
TREES K. Birman’s and G. Bebis’s Slides. Tree Overview 2  Tree: recursive data structure (similar to list)  Each cell may have zero or more successors.
Week 15 – Wednesday.  What did we talk about last time?  Review up to Exam 1.
BINARY TREES Objectives Define trees as data structures Define the terms associated with trees Discuss tree traversal algorithms Discuss a binary.
CPS Review of Data Structures l We’ve studied concrete data structures/type (CDT)  Vectors Homogeneous aggregates supporting random access  Linked.
18-1 Chapter 18 Binary Trees Data Structures and Design in Java © Rick Mercer.
CS6045: Advanced Algorithms Data Structures. Dynamic Sets Next few lectures will focus on data structures rather than straight algorithms In particular,
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture13.
CPS 100e 6.1 What’s the Difference Here? l How does find-a-track work? Fast forward?
CPS Recurrences l Summing Numbers int sum(int n) { if (0 == n) return 0; else return n + sum(n-1); } l What is complexity? justification? l T(n)
CPS 100, Fall Linear Structures Revisited l Linked-lists and arrays and ArrayLists and …  Linear structures, operations include insert, delete,
Prefix notation in action
From doubly-linked lists to binary trees
Stack: What problems does it solve?
PFTFBH Binary Search trees Fundamental Data Structure
Binary Trees Linked lists: efficient insertion/deletion, inefficient search ArrayList: search can be efficient, insertion/deletion not Binary trees: efficient.
Data Structures revisited
Plan for Week Linear structures: Stack, Queue
Binary Trees Linked lists: efficient insertion/deletion, inefficient search ArrayList: search can be efficient, insertion/deletion not Binary trees: efficient.
PFTWBH More examples of recursion and alternatives
What’s the Difference Here?
Compsci 201 Trees, Tries, Tradeoffs
Binary Trees, Binary Search Trees
Map interface Empty() - return true if the map is empty; else return false Size() - return the number of elements in the map Find(key) - if there is an.
CS6045: Advanced Algorithms
Binary Trees, Binary Search Trees
Compsci 201 Binary Trees Stacks+Queues
Binary Trees, Binary Search Trees
Presentation transcript:

CPS 100, Spring Data Structures revisited l Linked lists and arrays and ArrayLists and …  Linear structures, operations include insert, delete, traverse, …  Advantages and trade-offs include … l We want to move toward structures that support very efficient insertion and lookup, lists can't do better than O(n) for one of these: consider binary search and insert for arrays, or insert and lookup for linked lists l Interlude: linear structures that facilitate certain algorithms: Stack and Queue (and Dequeue)  Restricted access linear structures

CPS 100, Spring Wordladder Story l Ladder from ‘white’ to ‘house’  White, while, whale, shale, … l I can do that… optimally  My brother was an English major  My ladder is 16, his is 15, how? l There’s a ladder that’s 14 words!  The key is ‘sough’ l Guarantee optimality!

CPS 100, Spring Stack: What problems does it solve? l Stacks are used to avoid recursion, a stack can replace the implicit/actual stack of functions called recursively l Stacks are used to evaluate arithmetic expressions, to implement compilers, to implement interpreters  The Java Virtual Machine (JVM) is a stack-based machine  Postscript is a stack-based language  Stacks are used to evaluate arithmetic expressions in many languages l Small set of operations: LIFO or last in is first out access  Operations: push, pop, top, create, clear, size  More in postscript, e.g., swap, dup, rotate, …

CPS 100, Spring Simple stack example Stack is part of java.util.Collections hierarchy  It's an OO abomination, extends Vector (like ArrayList) Should be implemented using Vector Doesn't model "is-a" inheritance  what does pop do? What does push do? Stack s = new Stack (); s.push("panda"); s.push("grizzly"); s.push("brown"); System.out.println("size = "+s.size()); System.out.println(s.peek()); String str = s.pop(); System.out.println(s.peek()); System.out.println(s.pop());

CPS 100, Spring Postfix, prefix, and infix notation l Postfix notation used in some HP calculators  No parentheses needed, precedence rules still respected * *  Read expression For number/operand: push For operator: pop, pop, operate, push l See Postfix.java for example code, key ideas:  Use StringTokenizer, handy tool for parsing  Note: Exceptions thrown, what are these? l What about prefix and infix notations, advantages?

CPS 100, Spring Exceptions l Exceptions are raised or thrown in exceptional cases  Bad indexes, null pointers, illegal arguments, …  File not found, URL malformed, … l Runtime exceptions aren't meant to be handled or caught  Bad index in array, don't try to handle this in code  Null pointer stops your program, don't code that way! l Other exceptions must be caught or rethrown  See FileNotFoundException and IOException in Scanner class implementation l RuntimeException extends Exception, catch not required

CPS 100, Spring Prefix notation in action l Scheme/LISP and other functional languages tend to use a prefix notation (define (square x) (* x x)) (define (expt b n) (if (= n 0) 1 (* b (expt b (- n 1)))))

CPS 100, Spring Postfix notation in action l Practical example of use of stack abstraction l Put operator after operands in expression  Use stack to evaluate operand: push onto stack operator: pop operands push result l PostScript is a stack language mostly used for printing  drawing an X with two equivalent sets of code %! moveto rlineto moveto 100 –100 rlineto stroke showpage %! 100 – moveto rlineto stroke showpage

CPS 100, Spring Queue: another linear ADT l FIFO: first in, first out, used in many applications  Scheduling jobs/processes on a computer  Tenting policy?  Computer simulations l Common operations: add (back), remove (front), peek ??  java.util.Queue is an interface (jdk5) offer(E), remove(), peek(), size()  java.util.LinkedList implements the interface add(), addLast(), getFirst(), removeFirst() l Downside of using LinkedList as queue  Can access middle elements, remove last, etc. why?

CPS 100, Spring Stack and Queue implementations l Different implementations of queue (and stack) aren’t really interesting from an algorithmic standpoint  Complexity is the same, performance may change (why?)  Use ArrayList, growable array, Vector, linked list, … Any sequential structure l As we'll see java.util.LinkedList is good basis for all  In Java 5+, LinkedList implements the Queue interface, low-level linked lists/nodes facilitate (circular list!) l ArrayList for queue is tricky, ring buffer implementation, add but wrap-around if possible before growing  Tricky to get right (exercise left to reader)

CPS 100, Spring Implementation is very simple l Extends Vector, so simply wraps Vector/ArrayList methods in better names  push==add, pop==remove (also peek and empty )  Note: code below for ArrayList, Vector is used Stack is generic, so Object replaced by generic reference (see next slide) public Object push(Object o){ add(o); return o; } public Object pop(){ return remove(size()-1); }

CPS 100, Spring Implementation is very simple l Extends Vector, so simply wraps Vector/ArrayList methods in better names  What does generic look like? public class Stack extends ArrayList { public E push(E o){ add(o); return o; } public E pop(Object o){ return remove(size()-1); }

CPS 100, Spring Uses rather than "is-a" Suppose there's a private ArrayList myStorage  Doesn't extend Vector, simply uses Vector/ArrayList  Disadvantages of this approach? Synchronization issues public class Stack { private ArrayList myStorage; public E push(E o){ myStorage.add(o); return o; } public E pop(o){ return myStorage.remove(size()-1); }

CPS 100, Spring Using linear data structures l We’ve studied arrays, stacks, queues, which to use?  It depends on the application  ArrayList is multipurpose, why not always use it? Make it clear to programmer what’s being done Other reasons? l Other linear ADTs exist  List: add-to-front, add-to-back, insert anywhere, iterate Alternative: create, head, tail, Lisp or Linked-list nodes are concrete implementation  Deque: add-to-front, add-to-back, random access Why is this “better” than an ArrayList? How to implement?

CPS 100, Spring Maria Klawe Chair of Computer Science at UBC, Dean of Engineering at Princeton, President of Harvey Mudd College, ACM Fellow,… Klawe's personal interests include painting, long distance running, hiking, kayaking, juggling and playing electric guitar. She describes herself as "crazy about mathematics" and enjoys playing video games. "I personally believe that the most important thing we have to do today is use technology to address societal problems, especially in developing regions"

CPS 100, Spring Queue applications l Simulation, discrete-event simulation  How many toll-booths do we need? How many express lanes or self-checkout at grocery store? Runway access at aiport?  Queues facilitate simulation with mathematical distributions governing events, e.g., Poisson distribution for arrival times l Shortest path, e.g., in flood-fill to find path to some neighbor or in word-ladder  How do we get from "white" to "house" one-letter at a time? white, while, whale, shale, shake, …?

CPS 100, Spring Queue for shortest path (see APT) public boolean ladderExists(String[] words, String from, String to){ Queue q = new LinkedList (); Set used = new TreeSet (); for(String s : words){ if (oneAway(from,s)){ q.add(s); used.add(s); } while (q.size() != 0){ String current = q.remove(); if (oneAway(current,to)) return true; // add code here, what? } return false; }

CPS 100, Spring Shortest Path reprised l How does use of Queue ensure we find shortest path?  Where are words one away from start?  Where are words two away from start? l Why do we need to avoid revisiting a word, when?  Why do we use a set for this? Why a TreeSet?  Alternatives? l What if we want the ladder, not just whether it exists  What’s path from white to house? We know there is one.  Ideas? Options?

CPS 100, Spring Shortest path proof l Every word one away from start is on queue before loop  Obvious from code l All one-away words dequeued before two-away..n-away  See previous assertion, property of queues l Every two-away word is one away from a one-away word  So all enqueued after one-away, before three-away How do we find three-away word? l Word w put on queue is one-away from an n-away word  w is n+1(away), can’t be earlier than that, why?

CPS 100, Spring Binary Trees l Linked lists: efficient insertion/deletion, inefficient search  ArrayList: search can be efficient, insertion/deletion not l Binary trees: efficient insertion, deletion, and search  trees used in many contexts, not just for searching, e.g., expression trees  search in O(log n) like sorted array  insertion/deletion O(1) like list, once location found!  binary trees are inherently recursive, difficult to process trees non-recursively, but possible recursion never required, often makes coding simpler

CPS 100, Spring From doubly-linked lists to binary trees l Instead of using prev and next to point to a linear arrangement, use them to divide the universe in half  Similar to binary search, everything less goes left, everything greater goes right  How do we search?  How do we insert? “llama” “tiger” “monkey” “jaguar” “elephant” “ giraffe ” “pig” “hippo” “leopard” “koala”

CPS 100, Spring Basic tree definitions l Binary tree is a structure:  empty  root node with left and right subtrees l terminology: parent, children, leaf node, internal node, depth, height, path link from node N to M then N is parent of M –M is child of N leaf node has no children –internal node has 1 or 2 children path is sequence of nodes, N 1, N 2, … N k –N i is parent of N i+1 –sometimes edge instead of node depth (level) of node: length of root-to-node path –level of root is 1 (measured in nodes) height of node: length of longest node-to-leaf path –height of tree is height of root A B ED F C G

CPS 100, Spring A TreeNode by any other name… l What does this look like?  What does the picture look like? public class TreeNode { TreeNode left; TreeNode right; String info; TreeNode(String s, TreeNode llink, TreeNode rlink){ info = s; left = llink; right = rlink; } “llama” “tiger” “ giraffe ”

CPS 100, Spring Printing a search tree in order l When is root printed?  After left subtree, before right subtree. void visit(TreeNode t) { if (t != null) { visit(t.left); System.out.println(t.info); visit(t.right); } l Inorder traversal “llama” “tiger” “monkey” “jaguar” “elephant” “ giraffe ” “pig” “hippo” “leopard”

CPS 100, Spring Insertion and Find? Complexity? l How do we search for a value in a tree, starting at root?  Can do this both iteratively and recursively, contrast to printing which is very difficult to do iteratively  How is insertion similar to search? l What is complexity of print? Of insertion?  Is there a worst case for trees?  Do we use best case? Worst case? Average case? l How do we define worst and average cases  For trees? For vectors? For linked lists? For arrays of linked-lists?

CPS 100, Spring See SetTiming code What about ISimpleSet interface  How does this compare to java.util?  Why are we looking at this, what about Java source? l How would we implement most simply?  What are complexity repercussions: add, contains  What about iterating? l What would linked list get us? Scenarios where better?  Consider N adds and M contains operations  Move to front heuristic?

CPS 100, Spring What does contains look like? public boolean contains(E element) { return myList.indexOf(element) >= 0; } public boolean contains(E element){ returns contains(myHead, element); } private boolean contains(Node list, E element) { if (list == null) return false; if (list.info.equals(element)) return true; return contains(list.next,element); } l Why is there a private, helper method?  What will be different about Tree?

CPS 100, Spring What does contains look like? public boolean contains(E element){ returns contains(myRoot, element); } private boolean contains(TreeNode root, E element) { if (root == null) return false; if (list.info.equals(element)) return true; if (element.compareTo(root.info) <= 0){ return contains(root.left,element); else return contains(root.right,element); } l What is recurrence? Complexity?  When good trees go bad, how can this happen?

CPS 100, Spring What does insertion look like? l Simple recursive insertion into tree (accessed by root)  root = insert("foo", root); public TreeNode insert(TreeNode t, String s) { if (t == null) t = new Tree(s,null,null); else if (s.compareTo(t.info) <= 0) t.left = insert(t.left,s); else t.right = insert(t.right,s); return t; } l Note: in each recursive call, the parameter t in the called clone is either the left or right pointer of some node in the original tree  Why is this important?  Why must the idiom t = treeMethod(t,…) be used?

CPS 100, Spring Removal from tree? l For insertion we can use iteration (see BSTSet)  Look below, either left or right If null, stop and add Otherwise go left when l Removal is tricky, depends on number of children  Straightforward when zero or one child  Complicated when two children, find successor See set code for complete cases If right child, straightforward Otherwise find node that’s left child of its parent (why?)

CPS 100, Spring Implementing binary trees l Trees can have many shapes: short/bushy, long/stringy  if height is h, number of nodes is between h and 2 h -1  single node tree: height = 1, if height = 3  Java implementation, similar to doubly-linked list public class Tree { String info; TreeNode left; TreeNode right; TreeNode(String s, TreeNode llink, TreeNode rlink){ info = s; left = llink; right = rlink; } };

CPS 100, Spring Tree functions l Compute height of a tree, what is complexity? int height(Tree root) { if (root == null) return 0; else { return 1 + Math.max(height(root.left), height(root.right) ); } l Modify function to compute number of nodes in a tree, does complexity change?  What about computing number of leaf nodes?

CPS 100, Spring Tree traversals l Different traversals useful in different contexts  Inorder prints search tree in order Visit left-subtree, process root, visit right-subtree  Preorder useful for reading/writing trees Process root, visit left-subtree, visit right-subtree  Postorder useful for destroying trees Visit left-subtree, visit right-subtree, process root “llama” “tiger” “monkey” “jaguar” “elephant” “ giraffe ”

CPS 100, Spring Balanced Trees and Complexity l A tree is height-balanced if  Left and right subtrees are height-balanced  Left and right heights differ by at most one boolean isBalanced(Tree root) { if (root == null) return true; return isBalanced(root.left) && isBalanced(root.right) && Math.abs(height(root.left) – height(root.right)) <= 1; }

CPS 100, Spring What is complexity? l Assume trees are “balanced” in analyzing complexity  Roughly half the nodes in each subtree  Leads to easier analysis l How to develop recurrence relation?  What is T(n)?  What other work is done? l How to solve recurrence relation  Plug, expand, plug, expand, find pattern  A real proof requires induction to verify correctness

CPS 100, Spring Danny Hillis l The third culture consists of those scientists and other thinkers in the empirical world who, through their work and expository writing, are taking the place of the traditional intellectual in rendering visible the deeper meanings of our lives, redefining who and what we are. (Wired 1998) And now we are beginning to depend on computers to help us evolve new computers that let us produce things of much greater complexity. Yet we don't quite understand the process - it's getting ahead of us. We're now using programs to make much faster computers so the process can run much faster. That's what's so confusing - technologies are feeding back on themselves; we're taking off. We're at that point analogous to when single-celled organisms were turning into multicelled organisms. We are amoebas and we can't figure out what the hell this thing is that we're creating.