File Organizations and Indexing

Slides:

Advertisements

Similar presentations

Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 How index-learning turns no student pale Yet holds.

Advertisements

File Organizations and Indexing Chapter 8

File Organizations and Indexing Lecture 4 R&G Chapter 8 "If you don't find it in the index, look very carefully through the entire catalogue." -- Sears,

Indexes An index on a file speeds up selections on the search key fields for the index. Any subset of the fields of a relation can be the search key for.

Tree-Structured Indexes

B+-Trees and Hashing Techniques for Storage and Index Structures

Introduction to Database Systems1 Records and Files Storage Technology: Topic 3.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8 “How index-learning turns no student pale Yet.

File Organizations and Indexing Lecture 5 R&G Chapter 8 "If you don't find it in the index, look very carefully through the entire catalogue." -- Sears,

1 Overview of Storage and Indexing Chapter 8 (part 1)

1 File Organizations and Indexing Module 4, Lecture 2 “How index-learning turns no student pale Yet holds the eel of science by the tail.” -- Alexander.

1 Overview of Storage and Indexing Yanlei Diao UMass Amherst Feb 13, 2007 Slides Courtesy of R. Ramakrishnan and J. Gehrke.

File Organizations and Indexing R&G Chapter 8 "If you don't find it in the index, look very carefully through the entire catalogue." -- Sears, Roebuck,

1 v es/SIGMOD98.asp.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8 “How index-learning turns no student pale Yet.

1 Overview of Storage and Indexing Chapter 8 1. Basics about file management 2. Introduction to indexing 3. First glimpse at indices and workloads.

DBMS Internals: Storage February 27th, Representing Data Elements Relational database elements: A tuple is represented as a record CREATE TABLE.

Storage and Indexing February 26 th, 2003 Lecture 19.

Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8.

Indexing - revisited CS 186, Fall 2012 R & G Chapter 8.

Overview of File Organizations and Indexing Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY courtesy of Joe Hellerstein for some slides.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.

Database Management 8. course. Query types Equality query – Each field has to be equal to a constant Range query – Not all the fields have to be equal.

File Organizations and Indexing Lecture 4 R&G Chapter 8 "If you don't find it in the index, look very carefully through the entire catalogue." -- Sears,

Database Systems An Overview of Storage and Indexing

1 IT420: Database Management and Organization Storage and Indexing 14 April 2006 Adina Crăiniceanu

Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 “How index-learning turns no student pale Yet holds.

1 Overview of Storage and Indexing Chapter 8 (part 1)

Storage and Indexing1 Overview of Storage and Indexing.

1 Overview of Storage and Indexing Chapter 8 “How index-learning turns no student pale Yet holds the eel of science by the tail.” -- Alexander Pope ( )

Implementation of Relational Operators/Estimated Cost 1.Select 2.Join.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.

1 Overview of Storage and Indexing Chapter 8. 2 Data on External Storage  Disks: Can retrieve random page at fixed cost  But reading several consecutive.

Overview of Storage and Indexing Content based on Chapter 4 Database Management Systems, (Third Edition), by Raghu Ramakrishnan and Johannes Gehrke. McGraw.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8 “How index-learning turns no student pale Yet.

1 Database Systems ( 資料庫系統 ) October 4, 2004 Lecture #4 By Hao-hua Chu ( 朱浩華 )

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8 “If you don’t find it in the index, look very.

Layers of a DBMS Query optimization Execution engine Files and access methods Buffer management Disk space management Query Processor Query execution plan.

1 Database Systems ( 資料庫系統 ) October 29/31, 2007 Lecture #6.

Storage and Indexing. How do we store efficiently large amounts of data? The appropriate storage depends on what kind of accesses we expect to have to.

B+-Trees and Hashing, R. Ramakrishnan and J. Gehrke; extended and significantly revised by Ch. Eick 1 B+-Trees and Hashing Techniques for Storage and Index.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.

CS4432: Database Systems II

Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 Jianping Fan Dept of Computer Science UNC-Charlotte.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.

1 Overview of Storage and Indexing Chapter 8. 2 Review: Architecture of a DBMS  A typical DBMS has a layered architecture.  The figure does not show.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8 “If you don’t find it in the index, look very.

CS522 Advanced database Systems

Pertemuan <<6>> Tempat Penyimpanan Data dan Indeks

Storage and Indexes Chapter 8 & 9

File Organizations and Indexes

CS222P: Principles of Data Management Notes #6 Index Overview and ISAM Tree Index Instructor: Chen Li.

Lecture 12 Lecture 12: Indexing.

Introduction to Database Systems File Organization and Indexing

Overview of Storage and Indexing

File Organizations and Indexing

File Organizations and Indexing

Overview of Storage and Indexing

Overview of Storage and Indexing

Storage and Indexing May 17th, 2002.

CSE 544: Lecture 11 Storing Data, Indexes

CS222/CS122C: Principles of Data Management Notes #6 Index Overview and ISAM Tree Index Instructor: Chen Li.

Storage and Indexing.

General External Merge Sort

Files and access methods

Indexing February 28th, 2003 Lecture 20.

Overview of Storage and Indexing

CS222/CS122C: Principles of Data Management UCI, Fall 2018 Notes #05 Index Overview and ISAM Tree Index Instructor: Chen Li.

Presentation transcript:

File Organizations and Indexing Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 – DB Applications Faloutsos & Pavlo Lecture #10 (R&G ch8) File Organizations and Indexing 1

Overview Review Index classification Cost estimation 2 Faloutsos - Pavlo Faloutsos SCS 15-415 SCS 15-415/615 Overview Review Index classification Cost estimation 2 Faloutsos - Pavlo CMU SCS 15-415 2

Alternative File Organizations Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Alternative File Organizations Many alternatives exist, each good for some situations, and not so good in others: Heap files: Suitable when typical access is a file scan retrieving all records. Sorted Files: Best for retrieval in some order, or for retrieving a `range’ of records. Index File Organizations: (ISAM, or B+ trees) Faloutsos - Pavlo 3

How to find records quickly? Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 How to find records quickly? E.g., student.gpa = ‘3’ Q: On a heap organization, with B blocks, how many disk accesses? Faloutsos - Pavlo 4

Heap File Implemented Using Lists Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Heap File Implemented Using Lists Data Page Data Page Data Page Full Pages Header Page Free Page Free Page Free Page Pages with Free Space The header page id and Heap file name must be stored someplace. Each page contains 2 `pointers’ plus data. Faloutsos - Pavlo 5

How to find records quickly? Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 How to find records quickly? E.g., student.gpa = ‘3’ Q: On a heap organization, with B blocks, how many disk accesses? A: B Faloutsos - Pavlo 6

How to accelerate searches? Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 How to accelerate searches? A: Indices, like: Faloutsos - Pavlo 7

Example: Simple Index on GPA Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Example: Simple Index on GPA Directory 2 2.5 3 3.5 Data entries: 1.2* 1.7* 1.8* 1.9* 2.2* 2.4* 2.7* 2.7* 2.9* 3.2* 3.3* 3.3* 3.6* 3.8* 3.9* 4.0* (Index File) (Data file) Data Records An index contains a collection of data entries, and supports efficient retrieval of records matching a given search condition Faloutsos - Pavlo 10 8

Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Indexes Sometimes, we want to retrieve records by specifying the values in one or more fields, e.g., Find all students in the “CS” department Find all students with a gpa > 3 An index on a file speeds up selections on the search key fields for the index. Any subset of the fields of a relation can be the search key for an index on the relation. Search key is not the same as key (e.g., doesn’t have to be unique). Faloutsos - Pavlo 9

Index Search Conditions Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Index Search Conditions Search condition = <search key, comparison operator> Examples… (1) Condition: Department = “CS” Search key: “CS” Comparison operator: equality (=) (2) Condition: GPA > 3 Search key: 3 Comparison operator: greater-than (>) Faloutsos - Pavlo 10

Overview Review Index classification Cost estimation Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation of data entries in index Clustered vs. Unclustered Primary vs. Secondary Dense vs. Sparse Single Key vs. Composite Indexing technique Cost estimation 11 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 11

Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Details ‘data entries’ == what we store at the bottom of the index pages what would you use as data entries? (3 alternatives here) Faloutsos - Pavlo 12

Example: Simple Index on GPA Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Example: Simple Index on GPA Directory 2 2.5 3 3.5 Data entries: 1.2* 1.7* 1.8* 1.9* 2.2* 2.4* 2.7* 2.7* 2.9* 3.2* 3.3* 3.3* 3.6* 3.8* 3.9* 4.0* (Index File) (Data file) Data Records An index contains a collection of data entries, and supports efficient retrieval of records matching a given search condition Faloutsos - Pavlo 10 13

Alternatives for Data Entry k* in Index Faloutsos - Pavlo Faloutsos SCS 15-415/615 SCS 15-415 Alternatives for Data Entry k* in Index Actual data record (with key value k) 123 Smith; Main str; 412-999.9999 2. <k, rid of matching data record> $40 Rid-1 Rid-2 … 3. <k, list of rids of matching data records> $40 Rid-1 Rid-2 … Faloutsos - Pavlo 14

Alternatives for Data Entry k* in Index Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Alternatives for Data Entry k* in Index 1. Actual data record (with key value k) 2. <k, rid of matching data record> 3. <k, list of rids of matching data records> Choice is orthogonal to the indexing technique. Examples of indexing techniques: B+ trees, hash-based structures, R trees, … Typically, index contains auxiliary info that directs searches to the desired data entries Can have multiple (different) indexes per file. E.g. file sorted on age, with a hash index on name and a B+tree index on salary. Faloutsos - Pavlo 15

Alternatives for Data Entries (Contd.) Faloutsos - Pavlo Faloutsos SCS 15-415 SCS 15-415/615 Alternatives for Data Entries (Contd.) 123 Smith; Main str; 412-999.9999 Alternative 1: Actual data record (with key value k) Then, this is a clustering/sparse index, and constitutes a file organization (like Heap files or sorted files). At most one index on a given collection of data records can use Alternative 1. Saves pointer lookups but can be expensive to maintain with insertions and deletions. Faloutsos - Pavlo 16

Alternatives for Data Entries (Contd.) Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Alternatives for Data Entries (Contd.) Alternative 2 <k, rid of matching data record> and Alternative 3 <k, list of rids of matching data records> Easier to maintain than Alternative 1. If more than one index is required on a given file, at most one index can use Alternative 1; rest must use Alternatives 2 or 3. Alternative 3 more compact than Alternative 2, but leads to variable sized data entries even if search keys are of fixed length. Even worse, for large rid lists the data entry would have to span multiple pages! $40 Rid-1 Rid-2 $40 Rid-1 Rid-2 … Faloutsos - Pavlo 17

Overview Review Index classification Cost estimation Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation of data entries in index Clustered vs. Unclustered Primary vs. Secondary Dense vs. Sparse Single Key vs. Composite Indexing technique Cost estimation 18 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 18

Indexing - clustered index example Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Indexing - clustered index example Clustering/sparse index on ssn >=123 >=456 Faloutsos - Pavlo 19

Indexing - non-clustered Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Indexing - non-clustered Non-clustering / dense index Faloutsos - Pavlo 20

Index Classification - clustered Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Index Classification - clustered Clustered vs. unclustered: If order of data records is the same as, or `close to’, order of index data entries, then called clustered index. Index entries UNCLUSTERED CLUSTERED direct search for data entries Data entries Data entries (Index File) (Data file) Data Records Data Records Faloutsos - Pavlo 21

Index Classification - clustered Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Index Classification - clustered A file can have a clustered index on at most one search key. Cost of retrieving data records through index varies greatly based on whether index is clustered! Note: Alternative 1 implies clustered, but not vice-versa. But, for simplicity, you may think of them as equivalent.. Faloutsos - Pavlo 22

Clustered vs. Unclustered Index Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Clustered vs. Unclustered Index Cost of retrieving records found in range scan: Clustered: cost = Unclustered: cost ≈ What are the tradeoffs???? Faloutsos - Pavlo 23

Clustered vs. Unclustered Index Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Clustered vs. Unclustered Index Cost of retrieving records found in range scan: Clustered: cost = # pages in file w/matching records Unclustered: cost ≈ # of matching index data entries What are the tradeoffs???? Faloutsos - Pavlo 24

Clustered vs. Unclustered Index Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Clustered vs. Unclustered Index Cost of retrieving records found in range scan: Clustered: cost = # pages in file w/matching records Unclustered: cost ≈ # of matching index data entries What are the tradeoffs???? Clustered Pros: Efficient for range searches May be able to do some types of compression Clustered Cons: Expensive to maintain (on the fly or sloppy with reorganization) Faloutsos - Pavlo 25

Overview Review Index classification Cost estimation Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation of data entries in index Clustered vs. Unclustered Primary vs. Secondary Dense vs. Sparse Single Key vs. Composite Indexing technique Cost estimation 26 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 26

Primary vs. Secondary Index Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Primary vs. Secondary Index Primary: index key includes the file’s primary key Secondary: any other index Sometimes confused with Alt. 1 vs. Alt. 2/3 Primary index never contains duplicates Secondary index may contain duplicates If index key contains a candidate key, no duplicates => unique index Faloutsos - Pavlo 27

Overview Review Index classification Cost estimation Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation of data entries in index Clustered vs. Unclustered Primary vs. Secondary Dense vs. Sparse Single Key vs. Composite Indexing technique Cost estimation 28 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 28

Dense vs. Sparse Index Dense: at least one data entry per key value Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Dense vs. Sparse Index Dense: at least one data entry per key value Sparse: an entry per data page in file Every sparse index is clustered! Sparse indexes are smaller; however, some useful optimizations are based on dense indexes. Ashby, 25, 3000 22 Basu, 33, 4003 25 Bristow, 30, 2007 30 Ashby 33 Cass Cass, 50, 5004 Smith Daniels, 22, 6003 40 Jones, 40, 6003 44 44 Smith, 44, 3000 50 Tracy, 44, 5004 Sparse Index Dense Index on on Data File Name Age Faloutsos - Pavlo 29

Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Dense vs. Sparse Index Sparse <-> Clustering <-> Alt#1 (full record) Ashby, 25, 3000 22 Basu, 33, 4003 25 Bristow, 30, 2007 30 Ashby 33 Cass Cass, 50, 5004 Smith Daniels, 22, 6003 40 Jones, 40, 6003 44 44 Smith, 44, 3000 50 Tracy, 44, 5004 Sparse Index Dense Index on on Data File Name Age Faloutsos - Pavlo 30

Overview Review Index classification Cost estimation Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation of data entries in index Clustered vs. Unclustered Primary vs. Secondary Dense vs. Sparse Single Key vs. Composite Indexing technique Cost estimation 31 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 31

Composite Search Keys Search on combination of fields. Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Composite Search Keys Search on combination of fields. Equality query: Every field is equal to a constant value. E.g. wrt <sal,age> index: age=12 and sal =75 Range query: Some field value is not a constant. E.g.: age =12; or age=12 and sal > 20 Data entries in index sorted by search key for range queries. “Lexicographic” order. Examples of composite key indexes using lexicographic order. <age, sal> 12,20 12,10 11,80 13,75 <age> 11 12 13 name age sal <sal, age> 20,12 10,12 75,13 80,11 bob 12 10 <sal> 10 20 75 80 cal 11 80 joe 12 20 sue 13 75 Data records sorted by name Faloutsos - Pavlo 32

Overview Review Index classification Cost estimation Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation of data entries in index Clustered vs. Unclustered Primary vs. Secondary Dense vs. Sparse Single Key vs. Composite Indexing technique Cost estimation 33 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 33

Tree vs. Hash-based index Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Tree vs. Hash-based index Hash-based index Good for equality selections. File = a collection of buckets. Bucket = primary page plus 0 or more overflow pages. Hash function h: h(r.search_key) = bucket in which record r belongs. Tree-based index Good for range selections. Hierarchical structure (Tree) directs searches Leaves contain data entries sorted by search key value B+ tree: all root->leaf paths have equal length (height) Faloutsos - Pavlo 34

Overview Review Index classification Cost estimation Representation … Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Overview Review Index classification Representation … Cost estimation 35 Faloutsos - Pavlo Faloutsos CMU SCS 15-415 35

Cost estimation Methods Operations(?) Heap file Sorted Clustered Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Cost estimation Heap file Sorted Clustered Unclustured tree index Unclustered hash index Methods Operations(?) Faloutsos - Pavlo 36

Cost estimation Methods Operations Consider only I/O cost; Faloutsos - Pavlo Faloutsos SCS 15-415 SCS 15-415/615 Cost estimation Heap file Sorted Clustered Unclustured tree index Unclustered hash index scan equality search range search insertion deletion Methods Operations Consider only I/O cost; suppose file spans B pages Faloutsos - Pavlo 37

Cost estimation Assume that: Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Cost estimation Assume that: Clustered index spans 1.5B pages (due to empty space) Data entry= 1/10 of data record Faloutsos - Pavlo 38

Cost estimation Faloutsos - Pavlo Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Cost estimation Faloutsos - Pavlo 39

Cost estimation heap: seq. scan sorted: binary search index search #1 Faloutsos Faloutsos - Pavlo SCS 15-415/615 SCS 15-415 Cost estimation heap: seq. scan sorted: binary search index search #1 #2 … #B Faloutsos - Pavlo 40

Cost estimation index – cost? In general #1 .. #2 Faloutsos - Pavlo Faloutsos SCS 15-415/615 SCS 15-415 Cost estimation index – cost? In general levels of index + blocks w/ qual. tuples #1 .. #2 for primary key – cost: h for clustering index h’+1 for non-clustering … h’ #B Faloutsos - Pavlo 41

Cost estimation h= height of btree ~ logF (1.5B) Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Cost estimation h= height of btree ~ logF (1.5B) h’= height of unclustered index btree ~ logF (1.5B) Faloutsos - Pavlo 42

Cost estimation index – cost? #1 #2 sec. key – clustering index Faloutsos - Pavlo Faloutsos SCS 15-415/615 SCS 15-415 Cost estimation index – cost? levels of index + blocks w/ qual. tuples #1 #2 sec. key – clustering index h + #qual-pages … #B h Faloutsos - Pavlo 43

Cost estimation index – cost? #1 ... #2 sec. key – non-clust. index Faloutsos - Pavlo Faloutsos SCS 15-415/615 SCS 15-415 Cost estimation index – cost? levels of index + blocks w/ qual. tuples #1 ... #2 sec. key – non-clust. index h’ + #qual-records (actually, a bit less...) … #B h’ Faloutsos - Pavlo 44

Cost estimation m: # of qualifying pages m’: # of qualifying records Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Cost estimation m: # of qualifying pages m’: # of qualifying records Faloutsos - Pavlo 45

Cost estimation Faloutsos - Pavlo Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Cost estimation Faloutsos - Pavlo 46

Cost estimation - big-O notation: Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Cost estimation - big-O notation: Faloutsos - Pavlo 47

Index specification in SQL:1999 Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Index specification in SQL:1999 CREATE INDEX IndAgeRating ON Students WITH STRUCTURE=BTREE, KEY = (age, gpa) Faloutsos - Pavlo 48

Summary To speed up selection queries: index. Terminology: Faloutsos Faloutsos - Pavlo SCS 15-415 SCS 15-415/615 Summary To speed up selection queries: index. Terminology: Clustered / non-clustered index Clustered = sparse = alt#1 primary / secondary index Typically, B-tree index hashing is only good for equality search At most one clustered index per table many non-clustered ones are possible composite indexes are possible Faloutsos - Pavlo 49