Database Applications (15-415) DBMS Internals- Part V Lecture 14, Oct 18, 2016 Mohammad Hammoud.

Slides:



Advertisements
Similar presentations
Hash-Based Indexes Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY.
Advertisements

Hash-based Indexes CS 186, Spring 2006 Lecture 7 R &G Chapter 11 HASH, x. There is no definition for this word -- nobody knows what hash is. Ambrose Bierce,
1 Hash-Based Indexes Module 4, Lecture 3. 2 Introduction As for any index, 3 alternatives for data entries k* : – Data record with key value k – –Choice.
Hash-Based Indexes The slides for this text are organized into chapters. This lecture covers Chapter 10. Chapter 1: Introduction to Database Systems Chapter.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 11.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 11.
CPSC 404, Laks V.S. Lakshmanan1 Hash-Based Indexes Chapter 11 Ramakrishnan & Gehrke (Sections )
Chapter 11 (3 rd Edition) Hash-Based Indexes Xuemin COMP9315: Database Systems Implementation.
Index tuning Hash Index. overview Introduction Hash-based indexes are best for equality selections. –Can efficiently support index nested joins –Cannot.
ICS 421 Spring 2010 Indexing (2) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 2/23/20101Lipyeow Lim.
1 Hash-Based Indexes Yanlei Diao UMass Amherst Feb 22, 2006 Slides Courtesy of R. Ramakrishnan and J. Gehrke.
1 Hash-Based Indexes Chapter Introduction  Hash-based indexes are best for equality selections. Cannot support range searches.  Static and dynamic.
FALL 2004CENG 3511 Hashing Reference: Chapters: 11,12.
1 Hash-Based Indexes Chapter Introduction : Hash-based Indexes  Best for equality selections.  Cannot support range searches.  Static and dynamic.
Database Management 8. course. Query types Equality query – Each field has to be equal to a constant Range query – Not all the fields have to be equal.
Hashing and Hash-Based Index. Selection Queries Yes! Hashing  static hashing  dynamic hashing B+-tree is perfect, but.... to answer a selection query.
1 Database Systems ( 資料庫系統 ) November 8, 2004 Lecture #9 By Hao-hua Chu ( 朱浩華 )
Database Management 7. course. Reminder Disk and RAM RAID Levels Disk space management Buffering Heap files Page formats Record formats.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 11 Modified by Donghui Zhang Jan 30, 2006.
Introduction to Database, Fall 2004/Melikyan1 Hash-Based Indexes Chapter 10.
1.1 CS220 Database Systems Indexing: Hashing Slides courtesy G. Kollios Boston University via UC Berkeley.
Static Hashing (using overflow for collision managment e.g., h(key) mod M h key Primary bucket pages 1 0 M-1 Overflow pages(as separate link list) Overflow.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Indexed Sequential Access Method.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 10.
Database Applications (15-415) DBMS Internals- Part IV Lecture 15, March 13, 2016 Mohammad Hammoud.
Database Applications (15-415) DBMS Internals- Part III Lecture 13, March 06, 2016 Mohammad Hammoud.
Database Management 7. course. Reminder Disk and RAM RAID Levels Disk space management Buffering Heap files Page formats Record formats.
COP Introduction to Database Structures
External Sorting Chapter 13
Database Applications (15-415) DBMS Internals- Part VII Lecture 16, October 25, 2016 Mohammad Hammoud.
Hash-Based Indexes Chapter 11
Hashing CENG 351.
Database Management Systems (CS 564)
Database Applications (15-415) DBMS Internals- Part III Lecture 15, March 11, 2018 Mohammad Hammoud.
Database Applications (15-415) DBMS Internals- Part V Lecture 17, March 20, 2018 Mohammad Hammoud.
Database Applications (15-415) DBMS Internals- Part VI Lecture 18, Mar 25, 2018 Mohammad Hammoud.
Introduction to Database Systems
Database Management Systems (CS 564)
Lecture#12: External Sorting (R&G, Ch13)
Database Applications (15-415) DBMS Internals- Part VII Lecture 19, March 27, 2018 Mohammad Hammoud.
Database Applications (15-415) DBMS Internals- Part VI Lecture 15, Oct 23, 2016 Mohammad Hammoud.
External Sorting The slides for this text are organized into chapters. This lecture covers Chapter 11. Chapter 1: Introduction to Database Systems Chapter.
B+-Trees and Static Hashing
External Memory Hashing
Tree- and Hash-Structured Indexes
CS222: Principles of Data Management Notes #8 Static Hashing, Extendible Hashing, Linear Hashing Instructor: Chen Li.
Hash-Based Indexes R&G Chapter 10 Lecture 18
Hash-Based Indexes Chapter 10
External Sorting Chapter 13
Selected Topics: External Sorting, Join Algorithms, …
CS222P: Principles of Data Management Notes #8 Static Hashing, Extendible Hashing, Linear Hashing Instructor: Chen Li.
Hashing.
Hash-Based Indexes Chapter 11
Tree-Structured Indexes
Index tuning Hash Index.
Lecture 2- Query Processing (continued)
Advance Database System
Database Systems (資料庫系統)
LINEAR HASHING E0 261 Jayant Haritsa Computer Science and Automation
External Sorting.
Index tuning Hash Index.
Database Systems (資料庫系統)
Hash-Based Indexes Chapter 11
Database Systems (資料庫系統)
Chapter 11 Instructor: Xin Zhang
Tree- and Hash-Structured Indexes
External Sorting Chapter 13
CS222/CS122C: Principles of Data Management UCI, Fall 2018 Notes #07 Static Hashing, Extendible Hashing, Linear Hashing Instructor: Chen Li.
Presentation transcript:

Database Applications (15-415) DBMS Internals- Part V Lecture 14, Oct 18, 2016 Mohammad Hammoud

Today… Last Session: DBMS Internals- Part IV Today’s Session: Tree-based (i.e., B+ Tree– cont’d) Today’s Session: DBMS Internals- Part V Hash-based indexes and External Sorting Announcements: Mid-semester grades are out PS3 is out. It is due on Nov 1st Project 2 is due on Oct 27

DBMS Layers Queries Query Optimization and Execution Continue… Relational Operators Transaction Manager Files and Access Methods Recovery Manager Buffer Management Lock Manager Disk Space Management DB

Outline B+ Trees with Duplicates B+ Trees with Key Compression Bulk Loading of a B+ Tree A Primer on Hash-Based Indexing Static Hashing Extendible Hashing

Hash-Based Indexing What indexing technique can we use to support range searches (e.g., “Find s_name where gpa >= 3.0)? Tree-Based Indexing What about equality selections (e.g., “Find s_name where sid = 102”? Hash-Based Indexing (cannot support range searches!) Hash-based indexing, however, proves to be very useful in implementing relational operators (e.g., joins)

Outline B+ Trees with Duplicates B+ Trees with Key Compression Bulk Loading of a B+ Tree A Primer on Hash-Based Indexing Static Hashing Extendible Hashing

Static Hashing A hash structure (or table or file) is a generalization of the simpler notion of an ordinary array In an array, an arbitrary position can be examined in O(1) A hash function h is used to map keys into a range of bucket numbers h(key) mod N 2 key With Static Hashing, allocated (as needed) when corresponding buckets become full h With Static Hashing, allocated sequentially and never de-allocated N-1 Primary bucket pages Overflow pages

Static Hashing Data entries can be any of the three alternatives (A (1), A (2) or A (3)- see previous lecture) Data entries can be sorted in buckets to speed up searches The hash function h is used to identify the bucket to which a given key belongs and subsequently insert, delete or locate a respective data record A hash function of the form h(key) = (a * key + b) works well in practice A search ideally requires 1 disk I/O, while an insertion or a deletion necessitates 2 disk I/Os

Static Hashing: Some Issues Similar to ISAM, the number of buckets is fixed! Cannot deal with insertions and deletions gracefully Long overflow chains can develop easily and degrade performance! Pages can be initially kept only 80% full Dynamic hashing techniques can be used to fix the problem Extendible Hashing (EH) Liner Hashing (LH)

Outline B+ Trees with Duplicates B+ Trees with Key Compression Bulk Loading of a B+ Tree A Primer on Hash-Based Indexing Static Hashing Extendible Hashing

Directory of Pointers How else (as opposed to overflow pages) can we add a data record to a full bucket in a static hash file? Reorganize the table (e.g., by doubling the number of buckets and redistributing the entries across the new set of buckets) But, reading and writing all pages is expensive! In contrast, we can use a directory of pointers to buckets Buckets number can be doubled by doubling just the directory and splitting “only” the bucket that overflowed The trick lies on how the hash function can be adjusted!

Extendible Hashing Extendible Hashing uses a directory of pointers to buckets The result of applying a hash function h is treated as a binary number and the last d bits are interpreted as an offset into the directory d is referred to as the global depth of the hash file and is kept as part of the header of the file GLOBAL DEPTH 4* 12* 32* 16* Bucket A 2 00 1* 5* 21* Bucket B 01 10 10* Bucket C 11 DIRECTORY 15* 7* 19* Bucket D DATA PAGES

Extendible Hashing: Searching for Entries To search for a data entry, apply a hash function h to the key and take the last d bits of its binary representation to get the bucket number Example: search for 5* 2 00 Bucket A 4* 12* 32* 16* 01 Bucket B 5 = 101 1* 5* 21* 10 Bucket C 10* 11 Bucket D 15* 7* 19* DIRECTORY DATA PAGES

Extendible Hashing: Inserting Entries An entry can be inserted as follows: Find the appropriate bucket (as in search) Split the bucket if full and redistribute contents (including the new entry to be inserted) across the old bucket and its “split image” Double the directory if necessary Insert the given entry

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 13* 2 00 Bucket A 4* 12* 32* 16* 01 Bucket B 13 = 1101 1* 5* 21* 13* 10 Bucket C 10* 11 Bucket D 15* 7* 19* DIRECTORY

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 20* FULL, hence, split and redistribute! 2 00 Bucket A 4* 12* 32* 16* 01 Bucket B 20 = 10100 1* 5* 21* 13* 10 Bucket C 10* 11 Bucket D 15* 7* 19* DIRECTORY

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 20* Bucket A 32* 16* 2 00 Bucket B 1* 5* 21* 13* 01 10 20 = 10100 Bucket C 11 10* DIRECTORY Bucket D 15* 7* 19* Is this enough? Bucket A2 4* 12* 20* (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 20* Bucket A 32* 16* 2 00 Bucket B 1* 5* 21* 13* 01 10 20 = 10100 Bucket C 11 10* DIRECTORY Bucket D 15* 7* 19* Double the directory and increase the global depth Bucket A2 4* 12* 20* (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 20* 32* 16* Bucket A GLOBAL DEPTH 3 0 00 1* 5* 21* 13* Bucket B 001 These two bits indicate a data entry that belongs to one of these two buckets 010 011 10* Bucket C 1 00 101 The third bit distinguishes between these two buckets! 15* 7* 19* 110 Bucket D 111 But, is it necessary always to double the directory? DIRECTORY 4* 12* 20* Bucket A2 (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 9* GLOBAL DEPTH 32* 16* Bucket A FULL, hence, split! 3 000 1* 5* 21* 13* Bucket B 001 010 9 = 1001 011 10* Bucket C 100 101 15* 7* 19* 110 Bucket D 111 DIRECTORY 4* 12* 20* Bucket A2 (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 9* GLOBAL DEPTH 32* 16* Bucket A 3 000 1* 9* Bucket B 001 010 10* Bucket C 9 = 1001 011 100 101 15* 7* 19* Bucket D Almost there… 110 111 4* 12* 20* Bucket A2 (`split image‘ of A) DIRECTORY 5* 21* 13* Bucket B2 (`split image‘ of B)

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 9* GLOBAL DEPTH 32* 16* Bucket A 3 000 1* 9* Bucket B 001 010 10* Bucket C 9 = 1001 011 100 There was no need to double the directory! 101 15* 7* 19* Bucket D 110 111 4* 12* 20* Bucket A2 When NOT to double the directory? (`split image‘ of A) DIRECTORY 5* 21* 13* Bucket A2 (`split image‘ of A)

Extendible Hashing: Inserting Entries Find the appropriate bucket (as in search), split the bucket if full, double the directory if necessary and insert the given entry Example: insert 9* LOCAL DEPTH 3 GLOBAL DEPTH 32* 16* Bucket A 3 3 000 1* 9* Bucket B 001 2 010 10* Bucket C 9 = 1001 011 100 2 If a bucket whose local depth equals to the global depth is split, the directory must be doubled 101 15* 7* 19* Bucket D 110 3 111 4* 12* 20* Bucket A2 (`split image‘ of A) DIRECTORY 3 5* 21* 13* Bucket A2 (`split image‘ of A)

Extendible Hashing: Inserting Entries Example: insert 9* 3 Repeat… LOCAL DEPTH 32* 16* Bucket A GLOBAL DEPTH FULL, hence, split! 3 2 000 1* 5* 21* 13* Bucket B 001 2 010 9 = 1001 011 10* Bucket C 100 2 101 Because the local depth (i.e., 2) is less than the global depth (i.e., 3), NO need to double the directory 15* 7* 19* 110 Bucket D 111 3 DIRECTORY 4* 12* 20* Bucket A2 (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Example: insert 9* 3 Repeat… LOCAL DEPTH 32* 16* Bucket A GLOBAL DEPTH 3 3 000 1* 9* Bucket B 001 2 010 10* Bucket C 9 = 1001 011 100 2 101 15* 7* 19* Bucket D 110 3 111 4* 12* 20* Bucket A2 (`split image‘ of A) DIRECTORY 3 5* 21* 13* Bucket B2 (`split image‘ of B)

Extendible Hashing: Inserting Entries Example: insert 9* 3 Repeat… LOCAL DEPTH 32* 16* Bucket A GLOBAL DEPTH 3 3 000 1* 9* Bucket B 001 2 010 10* Bucket C 9 = 1001 011 100 2 101 15* 7* 19* Bucket D FINAL STATE! 110 3 111 4* 12* 20* Bucket A2 (`split image‘ of A) DIRECTORY 3 5* 21* 13* Bucket B2 (`split image‘ of B)

Extendible Hashing: Inserting Entries Example: insert 20* FULL, hence, split! Repeat… LOCAL DEPTH 2 Bucket A 4* 12* 32* 16* GLOBAL DEPTH 2 2 Bucket B 00 1* 5* 21* 13* 01 20 = 10100 10 2 Bucket C 11 10* Because the local depth and the global depth are both 2, we should double the directory! 2 DIRECTORY Bucket D 15* 7* 19* DATA PAGES

Extendible Hashing: Inserting Entries Example: insert 20* Repeat… 2 LOCAL DEPTH Bucket A 32* 16* GLOBAL DEPTH 2 2 Bucket B 00 1* 5* 21* 13* 01 10 2 20 = 10100 Bucket C 11 10* 2 Is this enough? DIRECTORY Bucket D 15* 7* 19* 2 Bucket A2 4* 12* 20* (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Example: insert 20* Repeat… 2 LOCAL DEPTH 32* 16* Bucket A GLOBAL DEPTH 3 2 000 1* 5* 21* 13* Bucket B 001 2 010 011 10* Bucket C 100 2 101 Is this enough? 15* 7* 19* 110 Bucket D 111 2 DIRECTORY 4* 12* 20* Bucket A2 (`split image' of Bucket A)

Extendible Hashing: Inserting Entries Example: insert 20* Repeat… 3 LOCAL DEPTH 32* 16* Bucket A GLOBAL DEPTH 3 2 000 1* 5* 21* 13* Bucket B 001 FINAL STATE! 2 010 011 10* Bucket C 100 2 101 15* 7* 19* 110 Bucket D 111 3 DIRECTORY 4* 12* 20* Bucket A2 (`split image' of Bucket A)

Outline Linear Hashing Why Sorting? In-Memory vs. External Sorting A Simple 2-Way External Merge Sorting General External Merge Sorting Optimizations: Replacement Sorting, Blocked I/O and Double Buffering

Linear Hashing Another way of adapting gracefully to insertions and deletions (i.e., pursuing dynamic hashing) is to use Linear Hashing (LH) In contrast to Extendible Hashing, LH Does not require a directory Deals naturally with collisions Offers a lot of flexibility w.r.t the timing of bucket split (allowing trading off greater overflow chains for higher average space utilization)

How Linear Hashing Works? LH uses a family of hash functions h0, h1, h2, ... hi(key) = h(key) mod(2iN); N = initial # buckets h is some hash function (range is not 0 to N-1) If N = 2d0, for some d0, hi consists of applying h and looking at the last di bits, where di = d0 + i hi+1 doubles the range of hi (similar to directory doubling)

How Linear Hashing Works? (Cont’d) LH uses overflow pages, and chooses buckets to split in a round-robin fashion Splitting proceeds in “rounds” A round ends when all NR (for round R) initial buckets are split Buckets 0 to Next-1 have been split; Next to NR yet to be split Current round number is referred to as Level Buckets split in this round Next Buckets that existed at the beginning of this round: this is the range of h Level ‘split image’ buckets created in this round

Linear Hashing: Searching For Entries To find a bucket for data entry r, find hLevel(r): If hLevel(r) in range `Next to NR’ , r belongs there Else, r could belong to bucket hLevel(r) or bucket hLevel(r) + NR; must apply hLevel+1(r) to find out Example: search for 5* Level=0, N=4 h h PRIMARY 1 Next=0 PAGES 32* 44* 36* Level = 0  h0 5* = 101  01 000 00 Data entry r 9* 25* 5* 001 01 with h(r)=5 14* 18* 10* 30* 010 10 Primary bucket page 31* 35* 7* 11* 011 11

Linear Hashing: Inserting Entries Find bucket as in search If the bucket to insert the data entry into is full: Add an overflow page and insert data entry (Maybe) Split Next bucket and increment Next Some points to Keep in mind: Unlike Extendible Hashing, when an insert triggers a split, the bucket into which the data entry is inserted is not necessarily the bucket that is split As in Static Hashing, an overflow page is added to store the newly inserted data entry However, since the bucket to split is chosen in a round-robin fashion, eventually all buckets will be split

Linear Hashing: Inserting Entries Example: insert 43* Level = 0  h0 43* = 101011  11 Level=0, N=4 h h PRIMARY 1 Next=0 PAGES 32* 44* 36* 000 00 9* 25* 5* 001 01 14* 18* 10* 30* 010 10 31* 35* 7* 11* 011 11 Add an overflow page and insert data entry

Linear Hashing: Inserting Entries Example: insert 43* Level = 0  h0 43* = 101011  11 Level=0, N=4 h h PRIMARY OVERFLOW 1 Next=0 PAGES PAGES 32* 44* 36* 000 00 9* 25* 5* 001 01 14* 18* 10* 30* 010 10 Split Next bucket and increment Next 31* 35* 7* 11* 011 11 43*

Linear Hashing: Inserting Entries Example: insert 43* Level = 0  h0 43* = 101011  11 Level=0, N=4 PRIMARY OVERFLOW h h PAGES PAGES 1 Next=0 000 00 32* 001 01 9* 25* 5* 010 10 14* 18* 10* 30* Almost there… 011 11 31* 35* 7* 11* 43* 100 00 44* 36*

Linear Hashing: Inserting Entries Example: insert 43* Level = 0  h0 43* = 101011  11 Level=0, N=4 PRIMARY OVERFLOW h h PAGES PAGES 1 000 00 32* Next=1 001 01 9* 25* 5* 010 10 14* 18* 10* 30* FINAL STATE! 011 11 31* 35* 7* 11* 43* 100 00 44* 36*

Linear Hashing: Inserting Entries Another Example: insert 50* Level=0, N= 4 PRIMARY OVERFLOW Level = 0  h0 50* = 110010  10 h PAGES 1 h PAGES 000 00 32* 001 01 9* 25* 010 10 66* 18* 10* 34* Next=3 011 11 31* 35* 7* 11* 43* 100 00 44* 36* 101 01 5* 37* 29* Add an overflow page and insert data entry 14* 30* 22* 110 10

Linear Hashing: Inserting Entries Another Example: insert 50* Level=0, N= 4 PRIMARY OVERFLOW Level = 0  h0 50* = 110010  10 h h PAGES 1 PAGES 000 00 32* 001 01 9* 25* 010 10 66* 18* 10* 34* 50* Next=3 011 11 31* 35* 7* 11* 43* 100 00 44* 36* Split Next bucket and increment Next 101 01 5* 37* 29* 14* 30* 22* 110 10

Linear Hashing: Inserting Entries Another Example: insert 50* Level=0 PRIMARY OVERFLOW h 1 h PAGES PAGES Level = 0  h0 50* = 110010  10 000 00 32* 001 01 9* 25* 010 10 66* 18* 10* 34* 50* Next=3 011 11 43* 35* 11* 100 00 44* 36* Almost there… 101 11 5* 37* 29* 110 10 14* 30* 22* 111 11 31* 7*

Linear Hashing: Inserting Entries Another Example: insert 50* Level=0 PRIMARY OVERFLOW h 1 h PAGES PAGES Next=0 Level = 0  h0 50* = 110010  10 000 00 32* 001 01 9* 25* 010 10 66* 18* 10* 34* 50* 011 11 43* 35* 11* 100 00 44* 36* Almost there… 101 11 5* 37* 29* 110 10 14* 30* 22* 111 11 31* 7*

Linear Hashing: Inserting Entries Another Example: insert 50* Level=1 PRIMARY OVERFLOW h 1 h PAGES PAGES Next=0 Level = 0  h0 50* = 110010  10 000 00 32* 001 01 9* 25* 010 10 66* 18* 10* 34* 50* 011 11 43* 35* 11* 100 00 44* 36* FINAL STATE! 101 11 5* 37* 29* 110 10 14* 30* 22* 111 11 31* 7*

Linear Hashing: Deleting Entries Deletion is essentially the inverse of insertion If the last bucket in the file is empty, it can be removed and Next can be decremented If Next is zero and the last bucket becomes empty Next is made to point to bucket M/2 -1 (where M is the current number of buckets) Level is decremented The empty bucket is removed The insertion examples can be worked out backwards as examples of deletions!

DBMS Layers Queries Query Optimization and Execution But, before we will discuss “Sorting” Relational Operators Transaction Manager Files and Access Methods Recovery Manager Buffer Management Lock Manager Disk Space Management DB

Outline Linear Hashing Why Sorting? In-Memory vs. External Sorting A Simple 2-Way External Merge Sorting General External Merge Sorting Optimizations: Replacement Sorting, Blocked I/O and Double Buffering

When Does A DBMS Sort Data? Users may want answers in some order SELECT FROM student ORDER BY name SELECT S.rating, MIN (S.age) FROM Sailors S GROUP BY S.rating Bulk loading a B+ tree index involves sorting Sorting is useful in eliminating duplicates records The Sort-Merge Join algorithm involves sorting (next session!)

Outline Linear Hashing Why Sorting? In-Memory vs. External Sorting A Simple 2-Way External Merge Sorting General External Merge Sorting Optimizations: Replacement Sorting, Blocked I/O and Double Buffering

In-Memory vs. External Sorting Assume we want to sort 60GB of data on a machine with only 8GB of RAM In-Memory Sort (e.g., Quicksort) ? Yes, but data do not fit in memory What about relying on virtual memory? In this case, external sorting is needed In-memory sorting is orthogonal to external sorting!

Outline Linear Hashing Why Sorting? In-Memory vs. External Sorting A Simple 2-Way External Merge Sorting General External Merge Sorting Optimizations: Replacement Sorting, Blocked I/O and Double Buffering

A Simple Two-Way Merge Sort IDEA: Sort sub-files that can fit in memory and merge Let us refer to each sorted sub-file as a run Algorithm: Pass 1: Read a page into memory, sort it, write it 1-page runs are produced Passes 2, 3, etc.,: Merge pairs (hence, 2-way) of runs to produce longer runs until only one run is left

A Simple Two-Way Merge Sort Algorithm: Pass 1: Read a page into memory, sort it, write it How many buffer pages are needed? Passes 2, 3, etc.,: Merge pairs (hence, 2-way) of runs to produce longer runs until only one run is left ONE THREE INPUT 1 OUTPUT INPUT 2 Main memory buffers Disk Disk

2-Way Merge Sort: An Example 3,4 6,2 9,4 8,7 5,6 3,1 2 Input File PASS 0 3,4 2,6 4,9 7,8 5,6 1,3 2 1-Page Runs PASS 1 2,3 4,7 1,3 2-Page Runs 4,6 8,9 5,6 2 PASS 2 2,3 4,4 1,2 4-Page Runs 6,7 3,5 8,9 6 Read each run into an input buffer, ONE page at a time PASS 3 1,2 2,3 3,4 8-Page Runs 4,5 6,6 7,8 9

2-Way Merge Sort: I/O Cost Analysis If the number of pages in the input file is 2k How many runs are produced in pass 0 and of what size? 2k 1-page runs How many runs are produced in pass 1 and of what size? 2k-1 2-page runs How many runs are produced in pass 2 and of what size? 2k-2 4-page runs How many runs are produced in pass k and of what size? 2k-k 2k-page runs (or 1 run of size 2k) For N number of pages, how many passes are incurred? How many pages do we read and write in each pass? 2N What is the overall cost?

2-Way Merge Sort: An Example 3,4 6,2 9,4 8,7 5,6 3,1 2 Input File PASS 0 3,4 2,6 4,9 7,8 5,6 1,3 2 1-Page Runs PASS 1 2,3 4,7 1,3 2-Page Runs 4,6 8,9 5,6 2 PASS 2 = 4 passes 2,3 4,4 1,2 4-Page Runs 6,7 3,5 8,9 6 PASS 3 Formula Check: = (2 × 8) × (3 + 1) = 64 I/Os Correct! 1,2 2,3 3,4 8-Page Runs 4,5 6,6 7,8 9

Outline Linear Hashing Why Sorting? In-Memory vs. External Sorting A Simple 2-Way External Merge Sorting General External Merge Sorting Optimizations: Replacement Sorting, Blocked I/O and Double Buffering

B-Way Merge Sort How can we sort a file with N pages using B buffer pages? Pass 0: use B buffer pages and sort internally This will produce sorted B-page runs Passes 1, 2, …: use B – 1 buffer pages for input and the remaining page for output; do (B-1)-way merge in each run INPUT 1 . . . INPUT 2 . . . . . . OUTPUT INPUT B-1 Disk Disk B Main memory buffers

B-Way Merge Sort: I/O Cost Analysis I/O cost = 2N × Number of passes Number of passes = Assume the previous example (i.e., 8 pages), but using 5 buffer pages (instead of 2) I/O cost = 32 (as opposed to 64) Therefore, increasing the number of buffer pages minimizes the number of passes and accordingly the I/O cost!

Number of Passes of B-Way Sort High Fan-in during merging is crucial! How else can we minimize I/O cost?

Outline Linear Hashing Why Sorting? In-Memory vs. External Sorting A Simple 2-Way External Merge Sorting General External Merge Sorting Optimizations: Replacement Sorting, Blocked I/O and Double Buffering

Replacement Sort With a more aggressive implementation of B-way sort, we can write out runs of 2×B (on average) internally sorted pages This is referred to as replacement sort 2 12 8 3 4 10 5 INPUT CURRENT SET OUTPUT IDEA: Pick the tuple in the current set with the smallest value that is greater than the largest value in the output buffer and append it to the output buffer

When do we terminate the current run and start a new one? Replacement Sort With a more aggressive implementation of B-way sort, we can write out runs of 2×B (on average) internally sorted pages This is referred to as replacement sort 2 12 8 3 4 10 5 INPUT CURRENT SET OUTPUT When do we terminate the current run and start a new one?

Blocked I/O and Double Buffering So far, we assumed random disk accesses Would cost change if we assume that reads and writes are done sequentially? Yes How can we incorporate this fact into our cost model? Use bigger units (this is referred to as Blocked I/O) Mask I/O delays through pre-fetching (this is referred to as double buffering) We can minimize I/O cost by maximizing run sizes!

Blocked I/O Normally, we go with‘B’ buffers of size (say) 1 page INPUT 1 . . . INPUT 2 . . . . . . OUTPUT INPUT 5 Disk Disk

Blocked I/O Normally, we go with‘B’ buffers of size (say) 1 page INSTEAD: let us go with B/b buffers, of size ‘b’ pages INPUT 1 OUTPUT . . . INPUT 2 . . . Disk Disk 3 Main memory buffers

Blocked I/O Normally, we go with‘B’ buffers of size (say) 1 page INSTEAD: let us go with B/b buffers, of size ‘b’ pages What is the main advantage? Fewer random accesses (as some of the pages will be arranged sequentially!) What is the main disadvantage? Smaller fan-in and accordingly larger number of passes!

Double Buffering Normally, when, say ‘INPUT1’ is exhausted We issue a ‘read’ request and We wait … INPUT 1 . . . INPUT 2 . . . . . . OUTPUT INPUT B-1 Disk Disk B Main memory buffers

Double Buffering INSTEAD: pre-fetch INPUT1’ into a `shadow block’ When INPUT1 is exhausted, issue a ‘read’ BUT, also proceed with INPUT1’ Thus, the CPU can never go idle! INPUT 1 INPUT 1' INPUT 2 OUTPUT INPUT 2' OUTPUT' b block size Disk INPUT k Disk INPUT k' B main memory buffers, k-way merge

Next Class Queries Query Optimization and Execution Continue… Relational Operators Transaction Manager Files and Access Methods Recovery Manager Buffer Management Lock Manager Disk Space Management DB