Chapter 11: Indexing and Hashing - PowerPoint PPT Presentation

About This Presentation

Title:

Chapter 11: Indexing and Hashing

Description:

Title: Chapter 7: Relational Database Design Author: Marilyn Turnamian Last modified by: Huang, Yan Created Date: 2/23/2000 6:58:38 PM Document presentation format – PowerPoint PPT presentation

Number of Views:74

Avg rating:3.0/5.0

Slides: 35

Provided by: MarilynT78

Learn more at: https://computerscience.engineering.unt.edu

Category:

more less

Transcript and Presenter's Notes

Title: Chapter 11: Indexing and Hashing

1
Chapter 11 Indexing and Hashing

Indexing
Basic Concepts
Ordered Indices
B-Tree Index Files
Hashing
Static
Dynamic Hashing

2
Basic Concepts

Value
Search Key - set of attributes used to look up
records in a file.

record
?
value
3
Index Evaluation Metrics

Access types supported efficiently. E.g.,
Point query find Tom
Range query find students whose age is between
20-40
Access time
Update time
Space overhead

4
Ordered Indices

In an ordered index, index entries are stored
sorted on the search key value. E.g., author
catalog in library.

5
Search key

Primary index
Also called clustering index
The search key of a primary index is usually but
not necessarily the primary key.

6
Search key

Secondary index
non-clustering index.

7
Sequential File
Dense Index
Dense Index contains index records for every
search-key values.
8
Sequential File
Sparse Index
Sparse Index contains index records for only
some search-key values. Applicable when records
are sequentially ordered on search-key
9
Secondary indexes
Sequence field

Sparse index

10
Multilevel Index
Sequential File
Sparse 2nd level
11
Multilevel Index
Secondary indexes
Sequence field

Lowest level is dense
Other levels are sparse

12
Conventional indexes

Advantage
- Simple
- Index is sequential file good for
scans

Disadvantage - Inserts expensive
13
Outline

Conventional indexes
B-Tree ? NEXT

NEXT Another type of index
Give up on sequentiality of index
Try to get balance

15
BTree Example n4

Root

100
120 150 180
30
3 5 11
120 130
180 200
100 101 110
150 156 179
30 35
16
Sample non-leaf
57 81 95
to keys to keys to keys to keys lt 57 57?
klt81 81?klt95 ?95
Key is moved (not copied) from lower level
non-leaf node to upper level non-leaf node
17
Sample leaf node

From non-leaf node
to next leaf
in sequence

57 81 95
To record with key 57 To record with key
81 To record with key 85
Key is copied (not moved) from leaf node to
non-leaf node
18
n4
30 35

Leaf
Non-leaf

30
35
30
30
19

Size of nodes
n pointers
n-1 keys

20
Dont want nodes to be too empty

Use at least
Root 2 pointers
Non-leaf ?n/2? pointers
Leaf ?(n-1)/2? keys

21
n4

Full node min. node
Non-leaf
Leaf

120 150 180
30
3 5 11
30 35
counts even if null
22
Btree rules tree of order n

(1) All leaves at same lowest level (balanced
tree)
(2) Pointers in leaves point to records
except for sequence pointer

(3) Number of pointers/keys for Btree

Max Max Min Min ptrs keys
ptrs?data keys
Non-leaf (non-root)
n
n-1
?n/2?
?n/2?- 1
Leaf (non-root)
n
n-1
?(n-1)/2?
?(n-1)/2?
Root
n
n-1
2
1
24
Insert into Btree