Presentation is loading. Please wait.

Presentation is loading. Please wait.

B-Trees ( Rizwan Rehman) Large degree B-trees used to represent very large dictionaries that reside on disk. Smaller degree B-trees used for internal-memory.

Similar presentations


Presentation on theme: "B-Trees ( Rizwan Rehman) Large degree B-trees used to represent very large dictionaries that reside on disk. Smaller degree B-trees used for internal-memory."— Presentation transcript:

1 B-Trees ( Rizwan Rehman) Large degree B-trees used to represent very large dictionaries that reside on disk. Smaller degree B-trees used for internal-memory dictionaries to overcome cache-miss penalties.

2 A B-tree is a keyed index structure, comparable to a number of memory resident keyed lookup structures such as balanced binary tree, the AVL tree, and the 2-3 tree. The difference is that a B-tree is meant to reside on disk, being made partially memory-resident only when entries int the structure are accessed. The B-tree structure is the most common used index type in databases today. It is provided by ORACLE, DB2, and INGRES.

3 For a binary search tree, the search time is O(h), where h is the height of the tree. The height cannot be less than roughly log 2 n for a tree with n nodes. If the tree is too large for internal memory, each access of a node is an I/O. For a tree with 10,000 nodes: log 2 10000 = 100 disk accesses We need a structure that would require only 3 or 4 disk accesses.

4 AVL Trees n = 2 30 = 10 9 (approx). 30 <= height <= 43. When the AVL tree resides on a disk, up to 43 disk access are made for a search. This takes up to (approx) 4 seconds. Not acceptable.

5 m-way Search Trees Each node has up to m – 1 pairs and m children. m = 2 => binary search tree.

6 4-Way Search Tree 10 30 35 k < 10 10 < k < 30 30 < k < 35 k > 35

7 Definition Of B-Tree Definition assumes external nodes (extended m-way search tree). B-tree of order m.  m-way search tree.  Not empty => root has at least 2 children.  Remaining internal nodes (if any) have at least ceil(m/2) children.  External (or failure) nodes on same level.

8 2-3 And 2-3-4 Trees B-tree of order m.  m-way search tree.  Not empty => root has at least 2 children.  Remaining internal nodes (if any) have at least ceil(m/2) children.  External (or failure) nodes on same level. 2-3 tree is B-tree of order 3. 2-3-4 tree is B-tree of order 4.

9 B-Trees Of Order 5 And 2 B-tree of order m.  m-way search tree.  Not empty => root has at least 2 children.  Remaining internal nodes (if any) have at least ceil(m/2) children.  External (or failure) nodes on same level. B-tree of order 5 is 3-4-5 tree (root may be 2-node though). B-tree of order 2 is full binary tree.

10 Minimum # Of Pairs n = # of pairs. # of external nodes = n + 1. Height = h => external nodes on level h + 1. 1 1 2 >= 2 3 >= 2*ceil(m/2) h + 1 >= 2*ceil(m/2) h-1 n + 1 >= 2*ceil(m/2) h-1, h >= 1 level# of nodes

11 Minimum # Of Pairs m = 200. n + 1 >= 2*ceil(m/2) h-1, h >= 1 height# of pairs 2 >= 199 3 >= 19,999 4 >= 2 * 10 6 – 1 5 >= 2 * 10 8 – 1 h <= log ceil(m/2) [(n+1)/2] + 1

12 Insert 15 20 8 4 1 35 630 409 Insertion into a full leaf triggers bottom-up node splitting pass. 16 17

13 Split An Overfull Node a i is a pointer to a subtree. p i is a dictionary pair. m a 0 p 1 a 1 p 2 a 2 … p m a m ceil(m/2)-1 a 0 p 1 a 1 p 2 a 2 … p ceil(m/2)-1 a ceil(m/2)-1 m-ceil(m/2) a ceil(m/2) p ceil(m/2)+1 a ceil(m/2)+1 … p m a m p ceil(m/2) plus pointer to new node is inserted in parent.

14 Insert 15 20 8 4 1 35 630 409 Insert a pair with key = 2. New pair goes into a 3-node. 16 17

15 Insert Into A Leaf 3-node Insert new pair so that the 3 keys are in ascending order. Split overflowed node around middle key. 1 2 3 Insert middle key and pointer to new node into parent. 13 2

16 Insert 15 20 8 4 1 35 630 409 Insert a pair with key = 2. 16 17

17 Insert 15 20 8 4 5 630 409 16 17 3 1 2 Insert a pair with key = 2 plus a pointer into parent.

18 Insert Now, insert a pair with key = 18. 15 20 8 1 2 4 5 630 409 16 17 3

19 Insert Into A Leaf 3-node Insert new pair so that the 3 keys are in ascending order. Split the overflowed node. 16 17 18 Insert middle key and pointer to new node into parent. 1816 17

20 Insert Insert a pair with key = 18. 15 20 8 1 2 4 5 630 409 16 17 3

21 Insert Insert a pair with key = 17 plus a pointer into parent. 15 20 8 1 2 4 5 630 4093 18 16 17

22 Insert Insert a pair with key = 17 plus a pointer into parent. 8 1 2 4 5 630 409316 17 15 18 20

23 Insert Now, insert a pair with key = 7. 1 2 4 5 630 409316 15 18 20 8 17

24 Insert Insert a pair with key = 6 plus a pointer into parent. 30 40 1 2 4 9316 15 18 20 8 17 5 7 6

25 Insert Insert a pair with key = 4 plus a pointer into parent. 30 40 1 9 3 16 15 18 20 8 17 6 4 2 57

26 Insert Insert a pair with key = 8 plus a pointer into parent. There is no parent. So, create a new root. 30 40 1 9 3 16 15 18 20 6 8 2 57 4 17

27 Insert Height increases by 1. 30 40 1 9 3 16 15 18 20 6 2 57 4 17 8


Download ppt "B-Trees ( Rizwan Rehman) Large degree B-trees used to represent very large dictionaries that reside on disk. Smaller degree B-trees used for internal-memory."

Similar presentations


Ads by Google