Title: Reading and Review Chapter 12: Indexing and Hashing
1. Reading and Review: Chapter 12 Indexing and Hashing
- 12.1 Basic Concepts
- 12.2 Ordered Indices
- 12.3 B+-Tree Index Files
- lots of stuff
- 12.4 B-tree Index Files
- 12.5 Static Hashing
- lots of stuff
- 12.6 Dynamic Hashing (Extendable Hashing)
- lots of stuff
- 12.7 Comparison of Ordered Indexing and Hashing
- 12.9 Multiple-Key Access
- Multiple-key indices, Grid files, Bitmap Indices
- 12.10 Summary
2. Reading and Review: Chapter 13 Query Processing
- 13.1 Overview
- 13.2 Measures of Query Cost
- 13.3 Selection Operation
- 13.4 Sorting (Sort-Merge Algorithm)
- 13.5 Join Operation
- Nested-loop Join (regular, block, indexed)
- Merge Join
- Hash Join
- Complex Joins
- 13.6 Other Operations
- 13.7 Evaluation of Expressions
- 13.8 Summary
3. Reading and Review: Chapter 14 Query Optimization
- 14.1 Overview
- 14.2 Estimating Statistics of Expression Results
- Catalog information
- Selection size estimation
- Join size estimation
- Size estimation for other operations
- Estimation of number of distinct values
- 14.3 Transformation of Relational Expressions
- Equivalence rules and examples
- Join ordering, enumeration of equivalent expressions
- 14.4 Choice of Evaluation Plans
- from 14.4 on, ignore any section with a ** at the end of the section title
- 14.6 Summary
4. Reading and Review: Chapter 15 Transaction Management
- 15.1 Transaction Concept
- ACID properties
- 15.2 Transaction State
- state diagram of a transaction
- (15.3 Implementation of Atomicity and Durability)
- not important -- we covered this material better in section 17.4
- 15.4 Concurrent Execution and Scheduling
- 15.5 Serializability (Conflict Serializability)
- (ignore 15.5.2 View Serializability)
- 15.6 Recoverability
- (15.7 Isolation -- not important, see 16.1 on locking instead)
- 15.9 Testing for Serializability (Precedence Graphs)
- 15.10 Summary
5. Reading and Review: Chapters 16 and 17
- 16.1 Lock-Based Protocols
- granting locks
- deadlocks
- two-phase locking protocol
- ignore 16.1.4 and 16.1.5
- 16.6.3 Deadlock Detection and Recovery
- Wait-for Graph
- 17.1, 17.2.1 -- skim these sections. The material should be familiar to you as background, but I won't be testing on definitions or memorization of it
- 17.4 Log-Based Recovery
- logs, redo/undo, basic concepts
- checkpoints
6. Indexing and Hashing: Motivation
- Query response speed is a major issue in database design
- Some queries only need to access a very small proportion of the records in a database
- Find all accounts at the Perryridge bank branch
- Find the balance of account number A-101
- Find all accounts owned by Zephraim Cochrane
- Checking every single record for the queries above is very inefficient and slow
- To allow fast access for those sorts of queries, we create additional structures that we associate with files: indices (index files)
7. Basic Concepts
- Indexing methods are used to speed up access to desired data
- e.g., the author card catalog in a library
- Search key -- an attribute or set of attributes used to look up records in a file. This use of the word key differs from that used before in class.
- An index file consists of records (index entries) of the form (search-key, pointer)
- where the pointer is a link to a location in the original file
- Index files are typically much smaller than the original file
- Two basic types of index
- ordered indices: search keys are stored in sorted order
- hash indices: search keys are distributed uniformly across buckets using a hash function
8. Index Evaluation Metrics
- We will look at a number of techniques for both ordered indexing and hashing. No one technique is best in all circumstances -- each has its advantages and disadvantages. The factors that can be used to evaluate different indices are
- Access types supported efficiently
- Finding records with a specified attribute value
- Finding records with attribute values within a specified range of values
- Access (retrieval) time: finding a single desired tuple in the file
- Insertion time
- Deletion time
- Update time
- Space overhead for the index
9. Index Evaluation Metrics (2)
- Speed issues are
- Access time
- Insertion time
- Deletion time
- Update time
- Access is the operation that occurs the most frequently, because it is also used for insert, delete, and update
- For insert, delete, and update operations we must consider not only the time for the operation itself (inserting a new record into the file) but also any time required to update the index structure to reflect changes
- We will often want to have more than one index for a file
- e.g., card catalogs for author, subject, and title in a library
10. Ordered Indices
- An ordered index stores the values of the search keys in sorted order
- records in the original file may themselves be stored in some sorted order (or not)
- the original file may have several indices, on different search keys
- when there are multiple indices, a primary index is an index whose search key also determines the sort order of the original file. Primary indices are also called clustering indices
- secondary indices are indices whose search key specifies an order different from the sequential order of the file. Also called non-clustering indices
- an index-sequential file is an ordered sequential file with a primary index
11. Dense and Sparse Indices
- A dense index is one where an index record appears for every search-key value in the file
- A sparse index contains index records for only some search-key values
- applicable when records are sequentially ordered on the search key
- to locate a record with search-key value K we must (see the sketch below)
- find the index record with the largest search-key value ≤ K
- search the file sequentially starting at the location pointed to
- stop (fail) when we hit a record with search-key value > K
- less space and less maintenance overhead for insert, delete
- generally slower than a dense index for locating records
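A minimal sketch of that sparse-index lookup, assuming the index is a sorted list of (search-key, block-number) pairs and the data file is a list of blocks sorted on the search key (all names and structures here are illustrative, not from the text):

```python
import bisect

def sparse_index_lookup(index, blocks, key):
    """index: sorted list of (search_key, block_no); blocks: list of blocks,
    each a list of (record_key, record) pairs sorted on the search key.
    Returns the matching records, or [] if the search fails."""
    keys = [k for k, _ in index]
    # index record with the largest search-key value <= key
    pos = bisect.bisect_right(keys, key) - 1
    if pos < 0:
        return []                       # key is smaller than every indexed value
    _, block_no = index[pos]
    matches = []
    # search the file sequentially starting at the block pointed to
    for block in blocks[block_no:]:
        for record_key, record in block:
            if record_key == key:
                matches.append(record)
            elif record_key > key:      # stop (fail) once we pass the key
                return matches
    return matches
```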
12. Example of a Dense Index
13. Example of a Sparse Index
14. Problems with Index-Sequential Files
- Retrieval: search until the key value or a larger key value is found
- individual key access: BAD
- scanning the file in order of the key: GOOD
- Insert is hard -- all higher-key records must be shifted to place the new record
- Delete may leave holes
- Update is equivalent to a combined delete and insert
- updating a search-key field may cause the combined disadvantages of an insert and a delete by shifting the record's location in the sorted file
- differential files are often used to hold the recent updates until the database can be reorganized off-line
15. Multi-level Index
- If the primary index does not fit in memory, access becomes expensive (disk reads cost)
- To reduce the number of disk accesses to index records, we treat the primary index kept on disk as a sequential file and construct a sparse index on it
- If the outer index is still too large, we can create another level of index to index it, and so on
- Indices at all levels must be updated on insertion or deletion from the file
16. Index Update: Deletion
- When deletions occur in the primary file, the index will sometimes need to change
- If the deleted record was the only record in the file with its particular search-key value, the search-key is deleted from the index also
- Single-level index deletion
- dense indices -- deletion of the search-key is similar to record deletion
- sparse indices -- if an entry for the search key exists in the index, it is replaced by the next search-key value in the original file (taken in search-key order)
- Multi-level index deletion is a simple extension of the above
17. Index Update: Insertion
- As with deletions, when insertions occur in the primary file, the index will sometimes need to change
- Single-level index insertion
- first perform a lookup using the search-key value appearing in the record to be inserted
- dense indices -- if the search-key value does not appear in the index, insert it
- sparse indices -- depends upon the design of the sparse index. If the index is designed to store an entry for each block of the file, then no change is necessary to the index unless the insertion creates a new block (if the old block overflows). If that happens, the first search-key value in the new block is inserted in the index
- multi-level insertion is a simple extension of the above
18. Secondary Index: Motivation
- Good database design is to have an index to handle all common (frequently requested) queries
- Queries based upon values of the primary key can use the primary index
- Queries based upon values of other attributes will require other (secondary) indices
19. Secondary Indices vs. Primary Indices
- Secondary indices must be dense, with an index entry for every search-key value, and a pointer to every record in the file. A primary index may be dense or sparse. (Why?)
- A secondary index on a candidate key looks just like a dense primary index, except that the records pointed to by successive values of the index are not sequential
- when a primary index is not on a candidate key it suffices if the index points to the first record with a particular value for the search key, as subsequent records can be found with a sequential scan from that point
20. Secondary Indices vs. Primary Indices (2)
- When the search key of a secondary index is not a candidate key (i.e., there may be more than one tuple in the relation with a given search-key value) it isn't enough to just point to the first record with that value -- other records with that value may be scattered throughout the file. A secondary index must contain pointers to all the records
- so an index record points to a bucket that contains pointers to every record in the file with that particular search-key value
21. Secondary Index on balance
22. Primary and Secondary Indices
- As mentioned earlier
- secondary indices must be dense
- Indices offer substantial benefits when searching for records
- When a file is modified, every index on the file must be updated
- this overhead imposes limits on the number of indices
- relatively static files will reasonably permit more indices
- relatively dynamic files make index maintenance quite expensive
- Sequential scan using a primary index is efficient; sequential scan using secondary indices is very expensive
- each record access may fetch a new block from disk (oy!)
23. Disadvantages of Index-Sequential Files
- The main disadvantage of the index-sequential file organization is that performance degrades as the file grows, both for index lookups and for sequential scans through the data. Although this degradation can be remedied through file reorganization, frequent reorgs are expensive and therefore undesirable.
- Next we'll start examining index structures that maintain their efficiency despite insertion and deletion of data: the B+-tree (section 12.3)
24. B+-Tree Index Files
- The main disadvantage of ISAM files is that performance degrades as the file grows, creating many overflow blocks and the need for periodic reorganization of the entire file
- B+-trees are an alternative to indexed-sequential files
- used for both primary and secondary indexing
- B+-trees are a multi-level index
- B+-tree index files automatically reorganize themselves with small local changes on insertion and deletion
- No reorganization of the entire file is required to maintain performance
- disadvantages: extra insertion, deletion, and space overhead
- advantages outweigh disadvantages: B+-trees are used extensively
25. B+-Tree Index Files (2)
- Definition: a B+-tree of order n has
- All leaves at the same level
- balanced tree (the B in the name stands for balanced)
- logarithmic performance
- root has between 1 and n-1 keys
- all other nodes have between n/2 and n-1 keys (> 50% space utilization)
- we construct the tree with order n such that one node corresponds to one disk block I/O (in other words, each disk page read brings up one full tree node)
26. B+-Tree Index Files (3)
- A B+-tree is a rooted tree satisfying the following properties
- All paths from the root to a leaf are the same length
- Search for an index value takes time according to the height of the tree (whether successful or unsuccessful)
27. B+-Tree Node Structure
- The B+-tree is constructed so that each node (when full) fits on a single disk page
- parameters: B = size of a block in bytes (e.g., 4096)
- K = size of a key in bytes (e.g., 8)
- P = size of a pointer in bytes (e.g., 4)
- an internal node must have n such that (n-1)·K + n·P ≤ B
- n ≤ (B+K)/(K+P)
- with the example values above, this becomes
- n ≤ (4096+8)/(8+4) = 4104/12, so n ≤ 342 (see the check below)
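The arithmetic can be checked in a couple of lines; the block, key, and pointer sizes are just the example values from this slide:

```python
B, K, P = 4096, 8, 4           # block, key, and pointer sizes in bytes
# A node with n pointers and n-1 keys must fit in one block: (n-1)*K + n*P <= B
n_max = (B + K) // (K + P)     # largest n satisfying the inequality
print(n_max)                   # 342: each node can hold up to 342 pointers
```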
28. B+-Tree Node Structure (2)
- Typical B+-tree node
- the Ki are the search-key values
- the Pi are the pointers to children (for non-leaf nodes) or pointers to records or buckets of records (for leaf nodes)
- the search keys in a node are ordered
- K1 < K2 < K3 < ... < Kn-1
29. Non-Leaf Nodes in B+-Trees
- Non-leaf nodes form a multi-level sparse index on the leaf nodes. For a non-leaf node with n pointers
- all the search keys in the subtree to which P1 points are less than K1
- for 2 ≤ i ≤ n-1, all the search keys in the subtree to which Pi points have values greater than or equal to Ki-1 and less than Ki
30. Leaf Nodes in B+-Trees
- As mentioned last class, primary indices may be sparse indices. So B+-trees constructed on a primary key (that is, where the search-key order corresponds to the sort order of the original file) can have the pointers of their leaf nodes point to an appropriate position in the original file that represents the first occurrence of that key value.
- Secondary indices must be dense indices. B+-trees constructed as a secondary index must have the pointers of their leaf nodes point to a bucket storing all locations where a given search-key value occurs; this set of buckets is often called an occurrence file
31. Example of a B+-Tree
- B+-tree for the account file (n = 3)
32. Another Example of a B+-Tree
- B+-tree for the account file (n = 5)
- Leaf nodes must have between 2 and 4 values
- (⌈(n-1)/2⌉ and n-1, with n = 5)
- Non-leaf nodes other than the root must have between 3 and 5 children
- (⌈n/2⌉ and n, with n = 5)
- Root must have at least 2 children
33. Observations about B+-Trees
- Since the inter-node connections are done by pointers, logically close blocks need not be physically close
- The non-leaf levels of the B+-tree form a hierarchy of sparse indices
- The B+-tree contains a relatively small number of levels (logarithmic in the size of the main file), thus searches can be conducted efficiently
- Insertions and deletions to the main file can be handled efficiently, as the index can be restructured in logarithmic time (as we shall examine later in class)
34. Queries on B+-Trees
- Find all records with a search-key value of k (see the sketch below)
- start with the root node (assume it has m pointers)
- examine the node for the smallest search-key value > k
- if we find such a value, say at Kj, follow the pointer Pj to its child node
- if no such value exists, then k ≥ Km-1, so follow Pm
- if the node reached is not a leaf node, repeat the procedure above and follow the corresponding pointer
- eventually we reach a leaf node. If we find a matching key value (our search value k = Ki for some i) then we follow Pi to the desired record or bucket. If we find no matching value, the search is unsuccessful and we are done.
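A sketch of that lookup procedure. The node layout (a sorted key list plus a pointer list holding children for internal nodes and record/bucket pointers for leaves) and the field names are assumptions for illustration:

```python
from bisect import bisect_right

class Node:
    def __init__(self, keys, pointers, is_leaf):
        self.keys = keys          # sorted search-key values K1 < K2 < ...
        self.pointers = pointers  # children (internal) or record/bucket ptrs (leaf)
        self.is_leaf = is_leaf

def bplus_lookup(root, k):
    """Trace one root-to-leaf path and return the record/bucket pointer for k,
    or None if the search is unsuccessful."""
    node = root
    while not node.is_leaf:
        # position of the smallest key > k, which is also the child to follow:
        # keys < K1 go to P1, keys in [Ki-1, Ki) go to Pi, keys >= Km-1 go to Pm
        i = bisect_right(node.keys, k)
        node = node.pointers[i]
    for i, key in enumerate(node.keys):   # exact-match check in the leaf
        if key == k:
            return node.pointers[i]
    return None
```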
35. Queries on B+-Trees (2)
- Processing a query traces a path from the root node to a leaf node
- If there are K search-key values in the file, the path is no longer than ⌈log⌈n/2⌉(K)⌉
- A node is generally the same size as a disk block, typically 4 kilobytes, and n is typically around 100 (40 bytes per index entry)
- With 1 million search-key values and n = 100, at most ⌈log50(1,000,000)⌉ = 4 nodes are accessed in a lookup
- In a balanced binary tree with 1 million search-key values, around 20 nodes are accessed in a lookup
- the difference is significant, since every node access might need a disk I/O, costing around 20 milliseconds
36. Insertion on B+-Trees
- Find the leaf node in which the search-key value would appear
- If the search-key value is already present, add the record to the main file and (if necessary) add a pointer to the record to the appropriate occurrence-file bucket
- If the search-key value is not there, add the record to the main file as above (including creating a new occurrence-file bucket if necessary). Then
- if there is room in the leaf node, insert (key-value, pointer) in the leaf node
- otherwise, overflow: split the leaf node (along with the new entry)
37. Insertion on B+-Trees (2)
- Splitting a node (see the sketch below)
- take the n (search-key-value, pointer) pairs, including the one being inserted, in sorted order. Place half in the original node, and the other half in a new node.
- Let the new node be p, and let k be the least key value in p. Insert (k, p) in the parent of the node being split.
- If the parent becomes full by this new insertion, split it as described above, and propagate the split as far up as necessary
- The splitting of nodes proceeds upwards until a node that is not full is found. In the worst case the root node may be split, increasing the height of the tree by 1.
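A sketch of just the leaf-split step, operating on plain lists of (key, pointer) pairs; updating the parent (and any cascading splits) is left to the caller, and the function name is illustrative:

```python
def split_leaf(entries, new_entry):
    """entries: the full leaf's (key, pointer) pairs; new_entry: the pair being
    inserted.  Returns (left_entries, right_entries, separator_key): the
    separator is the least key in the new right node and is the value that
    gets inserted into the parent."""
    all_entries = sorted(entries + [new_entry], key=lambda e: e[0])
    mid = (len(all_entries) + 1) // 2      # half stay, half move to the new node
    left, right = all_entries[:mid], all_entries[mid:]
    return left, right, right[0][0]

# e.g. a full leaf of 4 entries plus one new entry splits 3 / 2
print(split_leaf([(5, "p5"), (10, "p10"), (20, "p20"), (30, "p30")], (12, "p12")))
```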
38. Insertion on B+-Trees: Example
39. Deletion on B+-Trees
- Find the record to be deleted, and remove it from the main file and the bucket (if necessary)
- If there is no occurrence-file bucket, or if the deletion caused the bucket to become empty, then delete (key-value, pointer) from the B+-tree leaf node
- If the leaf node now has too few entries, underflow has occurred. If the active leaf node has a sibling with few enough entries that the combined entries can fit in a single node, then
- combine all the entries of both nodes in a single one
- delete the (K, P) pair pointing to the deleted node from the parent. Follow this procedure recursively if the parent node underflows.
40. Deletion on B+-Trees (2)
- Otherwise, if no sibling node is small enough to combine with the active node without causing overflow, then
- Redistribute the pointers between the active node and the sibling so that both of them have sufficient pointers to avoid underflow
- Update the corresponding search-key value in the parent node
- No deletion occurs in the parent node, so no further recursion is necessary in this case
- Deletions may cascade upwards until a node with ⌈n/2⌉ or more pointers is found. If the root node has only one pointer after deletion, it is removed and the sole child becomes the root (reducing the height of the tree by 1)
41. Deletion on B+-Trees: Example 1
42. Deletion on B+-Trees: Example 2
43. Deletion on B+-Trees: Example 3
44. B+-Tree File Organization
- B+-tree indices solve the problem of index-file degradation. The original data file will still degrade upon a stream of insert/delete operations.
- Solve data-file degradation by using a B+-tree file organization
- Leaf nodes in a B+-tree file organization store records, not pointers into a separate original data file
- since records are larger than pointers, the maximum number of records that can be stored in a leaf node is less than the number of pointers in a non-leaf node
- leaf nodes must still be maintained at least half full
- insert and delete are handled in the same way as insert and delete for entries in a B+-tree index
45. B+-Tree File Organization: Example
- Records are much bigger than pointers, so good space usage is important
- To improve space usage, involve more sibling nodes in redistribution during splits and merges (to avoid a split/merge when possible)
- involving one sibling guarantees 50% space use
- involving two guarantees at least 2/3 space use, etc.
46. B-Tree Index Files
- B-trees are similar to B+-trees, but search-key values appear only once in the index (eliminating redundant storage of key values)
- search keys in non-leaf nodes don't appear in the leaf nodes, so an additional pointer field for each search key in a non-leaf node must be stored to point to the bucket or record for that key value
- leaf nodes look like B+-tree leaf nodes
- (P1, K1, P2, K2, ..., Pn)
- non-leaf nodes look like so
- (P1, B1, K1, P2, B2, K2, ..., Pn)
- where the Bi are pointers to buckets or file records
47. B-Tree Index File: Example
48. B-Tree Index Files (cont.)
- Advantages of B-tree indices (vs. B+-trees)
- May use fewer tree nodes than a B+-tree on the same data
- Sometimes possible to find a specific key value before reaching a leaf node
- Disadvantages of B-tree indices
- Only a small fraction of key values are found early
- Non-leaf nodes are larger, so fanout is reduced, and B-trees may be slightly taller than B+-trees on the same data
- Insertion and deletion are more complicated than on B+-trees
- Implementation is more difficult than for B+-trees
- In general, the advantages don't outweigh the disadvantages
49. Hashing
- We've examined ordered indices (designs based upon sorting or ordering search-key values); the other major type of indexing technique is hashing
- The underlying concept is very simple
- observation: small files don't require indices or complicated search methods
- use some clever method, based upon the search key, to split a large file into a lot of little buckets
- each bucket is sufficiently small
- use the same method to find the bucket for a given search key
50. Hashing Basics
- A bucket is a unit of storage containing one or more records (typically a bucket is one disk block in size)
- In a hash file organization we find the bucket for a record directly from its search-key value using a hash function
- A hash function is a function that maps from the set of all search-key values K to the set of all bucket addresses B
- The hash function is used to locate records for access, insertion, and deletion
- Records with different search-key values may be mapped to the same bucket
- the entire bucket must be searched to find a record
- buckets are designed to be small, so this task is usually not onerous
51. Hashed File Example
- So we
- divide the set of disk blocks that make up the file into buckets
- devise a hash function that maps each key value into a bucket
- V = set of key values
- B = number of buckets
- H = hashing function, H: V → {0, 1, 2, 3, ..., B-1}
- Example: V = 9-digit SSN, B = 1000, H = key modulo 1000
52. Hash Functions
- To search/insert/delete/modify a key, do
- compute H(k) to get the bucket number
- search sequentially in the bucket (heap organization within each bucket)
- Choosing H: almost any function that generates random numbers in the range [0, B-1]
- try to distribute the keys evenly into the B buckets
- one rule of thumb when using MOD -- use a prime number of buckets (see the sketch below)
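A sketch of the bucket computation and the per-bucket sequential search, using the modulo rule of thumb just mentioned (the bucket count and structure are illustrative assumptions):

```python
B = 997                                   # number of buckets; a prime near 1000

def h(key: int) -> int:
    """Map a numeric search-key value to a bucket number in 0..B-1."""
    return key % B

def lookup(buckets, key):
    """buckets: list of B lists of (key, record) pairs (heap organization
    within each bucket).  Only the single bucket H(key) is searched."""
    return [rec for k, rec in buckets[h(key)] if k == key]

buckets = [[] for _ in range(B)]
buckets[h(123456789)].append((123456789, "account A-101"))
print(lookup(buckets, 123456789))
```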
53. Hash Functions (2)
- A collision is when two or more key values go to the same bucket
- too many collisions increases search time and degrades performance
- no or few collisions means that each bucket has only one (or very few) key(s)
- Worst-case hash functions map all search keys to the same bucket
54. Hash Functions (3)
- Ideal hash functions are uniform
- each bucket is assigned the same number of search-key values from the set of all possible values
- Ideal hash functions are random
- each bucket has approximately the same number of records assigned to it irrespective of the actual distribution of search-key values in the file
- Finding a good hash function is not always easy
55. Examples of Hash Functions
- Given 26 buckets and a string-valued search key, consider the following possible hash functions
- Hash based upon the first letter of the string
- Hash based upon the last letter of the string
- Hash based upon the middle letter of the string
- Hash based upon the most common letter in the string
- Hash based upon the average letter in the string: the sum of the letters (using A=0, B=1, etc.) divided by the number of letters
- Hash based upon the length of the string (modulo 26)
- Typical hash functions perform computation on the internal binary representation of the search key
- example: searching on a string value, hash based upon the binary sum of the characters in the string, modulo the number of buckets (sketched below)
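The last example, as a sketch: sum the character codes of the string and take the result modulo the number of buckets (26 here, to match the slide's setup):

```python
NUM_BUCKETS = 26

def string_hash(s: str) -> int:
    """Sum of the character codes, modulo the number of buckets.
    Simple, but anagrams collide and short strings cluster in low buckets,
    which is why real systems use better mixing functions."""
    return sum(ord(c) for c in s) % NUM_BUCKETS

print(string_hash("Perryridge"), string_hash("Brighton"))
```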
56. Overflow
- Overflow is when an insertion into a bucket can't occur because the bucket is full
- Overflow can occur for the following reasons
- too many records (not enough buckets)
- poor hash function
- skewed data
- multiple records might have the same search key
- multiple search keys might be assigned the same bucket
57. Overflow (2)
- Overflow is handled by one of two methods
- chaining of multiple blocks in a bucket, by attaching a number of overflow buckets together in a linked list
- double hashing: use a second hash function to find another (hopefully non-full) bucket
- in theory we could use the next bucket that had space; this is often called open hashing or linear probing, and is often used to construct symbol tables for compilers
- useful where deletion does not occur
- deletion is very awkward with linear probing, so it isn't useful in most database applications
58. Hashed File Performance Metrics
- An important performance measure is the loading factor
- (number of records)/(B · f)
- B is the number of buckets
- f is the number of records that will fit in a single bucket
- when the loading factor is too high (the file becomes too full), double the number of buckets and rehash
59. Hashed File Performance
- (Assume that the hash table is in main memory)
- Successful search: best case 1 block; worst case every chained bucket; average case half of the worst case
- Unsuccessful search: always hits every chained bucket (best case, worst case, average case)
- With a loading factor around 90% and a good hashing function, the average is about 1.2 blocks
- Advantage of hashing: very fast for exact queries
- Disadvantage: records are not sorted in any order. As a result, it is effectively impossible to do range queries
60. Hash Indices
- Hashing can be used for index-structure creation as well as for file organization
- A hash index organizes the search keys (and their record pointers) into a hash file structure
- strictly speaking, a hash index is always a secondary index
- if the primary file was stored using the same hash function, an additional, separate primary hash index would be unnecessary
- We use the term hash index to refer both to secondary hash indices and to files organized using hashing file structures
61. Example of a Hash Index
- Hash index into the file account, on search key account-number
- The hash function computes the sum of the digits in the account number, modulo 7
- Bucket size is 2
62. Static Hashing
- We've been discussing static hashing: the hash function maps search-key values to a fixed set of buckets. This has some disadvantages
- databases grow with time. Once buckets start to overflow, performance will degrade
- if we attempt to anticipate some future file size and allocate sufficient buckets for that expected size when we build the database initially, we will waste lots of space
- if the database ever shrinks, space will be wasted
- periodic reorganization avoids these problems, but is very expensive
- By using techniques that allow us to modify the number of buckets dynamically (dynamic hashing) we can avoid these problems
- Good for databases that grow and shrink in size
- Allows the hash function to be modified dynamically
63. Dynamic Hashing
- One form of dynamic hashing is extendable hashing
- the hash function generates values over a large range -- typically b-bit integers, with b being something like 32
- At any given moment, only a prefix of the hash value is used to index into a table of bucket addresses
- With the prefix length at a given moment being j, with 0 ≤ j ≤ 32, the bucket address table size is 2^j
- The value of j grows and shrinks as the size of the database grows and shrinks
- Multiple entries in the bucket address table may point to the same bucket
- Thus the actual number of buckets is ≤ 2^j
- the number of buckets also changes dynamically due to coalescing and splitting of buckets
64. General Extendable Hash Structure
65. Use of Extendable Hash Structure
- Each bucket j stores a value i_j; all the entries that point to the same bucket have the same values on the first i_j bits
- To locate the bucket containing search key Kj
- compute H(Kj) = X
- use the first i high-order bits of X as a displacement into the bucket address table and follow the pointer to the appropriate bucket
- To insert a record with search-key value Kj
- follow the lookup procedure to locate the bucket, say j
- if there is room in bucket j, insert the record
- otherwise the bucket must be split and the insertion reattempted
- in some cases we use overflow buckets instead (as explained shortly)
66. Splitting in Extendable Hash Structure
- To split a bucket j when inserting a record with search-key value Kj:
- if i > i_j (more than one pointer into bucket j)
- allocate a new bucket z
- set i_j and i_z to the old value of i_j incremented by one
- update the bucket address table (change the second half of the set of entries pointing to j so that they now point to z)
- remove all the entries in j and rehash them so that they fall in either z or j
- reattempt the insert of Kj. If the bucket is still full, repeat the above.
67. Splitting in Extendable Hash Structure (2)
- To split a bucket j when inserting a record with search-key value Kj:
- if i = i_j (only one pointer into bucket j)
- increment i and double the size of the bucket address table
- replace each entry in the bucket address table with two entries that point to the same bucket
- recompute the bucket address table entry for Kj
- now i > i_j, so use the first case described earlier
- When inserting a value, if the bucket is still full after several splits (that is, i reaches some preset value b), give up and create an overflow bucket rather than splitting the bucket address table further
- how might this occur? (A toy version of both split cases is sketched below.)
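A toy sketch of extendable hashing covering both split cases. It differs from the slides in two assumed simplifications: it indexes the directory with the low-order bits of the hash rather than a high-order prefix (which makes directory doubling a one-liner), and it omits overflow buckets and deletion. All names are illustrative:

```python
class ExtendableHash:
    """Toy extendable hash table: a directory of 2**i entries, each pointing
    to a bucket that remembers its local depth i_j."""
    def __init__(self, bucket_capacity=2):
        self.capacity = bucket_capacity
        self.i = 0                                    # global depth (prefix length)
        self.directory = [{"depth": 0, "items": {}}]  # 2**i entries

    def _index(self, key):
        # use the low-order self.i bits of the hash as the directory index
        return hash(key) & ((1 << self.i) - 1)

    def lookup(self, key):
        return self.directory[self._index(key)]["items"].get(key)

    def insert(self, key, value):
        bucket = self.directory[self._index(key)]
        bucket["items"][key] = value
        while len(bucket["items"]) > self.capacity:   # overflow: split
            bucket = self._split(bucket, key)

    def _split(self, bucket, key):
        if bucket["depth"] == self.i:          # case i = i_j: only one pointer,
            self.directory = self.directory * 2   # so double the directory
            self.i += 1                            # and increment global depth
        # case i > i_j: allocate bucket z with local depth i_j + 1
        bucket["depth"] += 1
        new_bucket = {"depth": bucket["depth"], "items": {}}
        high_bit = 1 << (bucket["depth"] - 1)
        # redirect the half of the entries pointing to j whose new bit is 1
        for idx, b in enumerate(self.directory):
            if b is bucket and idx & high_bit:
                self.directory[idx] = new_bucket
        # remove all entries from j and rehash them into j or z
        items, bucket["items"] = bucket["items"], {}
        for k, v in items.items():
            self.directory[self._index(k)]["items"][k] = v
        return self.directory[self._index(key)]   # bucket now holding the key

ht = ExtendableHash(bucket_capacity=2)
for k in range(8):                 # integer keys keep the demo deterministic
    ht.insert(k, f"rec{k}")
print(ht.i, ht.lookup(5))          # 2 rec5
```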
68. Deletion in Extendable Hash Structure
- To delete a key value Kj
- locate it in its bucket and remove it
- the bucket itself can be removed if it becomes empty (with appropriate updates to the bucket address table)
- coalescing of buckets is possible
- can only coalesce with a buddy bucket having the same value of i_j and the same i_j - 1 bit prefix, if one such bucket exists
- decreasing the bucket address table size is also possible
- very expensive
- should only be done if the number of buckets becomes much smaller than the size of the table
69. Extendable Hash Structure: Example
- Hash function on branch-name
- Initial hash table (empty)
70. Extendable Hash Structure: Example (2)
- Hash structure after insertion of one Brighton and two Downtown records
71. Extendable Hash Structure: Example (3)
- Hash structure after insertion of the Mianus record
72. Extendable Hash Structure: Example (4)
- Hash structure after insertion of three Perryridge records
73. Extendable Hash Structure: Example (5)
- Hash structure after insertion of the Redwood and Round Hill records
74. Extendable Hashing vs. Other Hashing
- Benefits of extendable hashing
- hash performance doesn't degrade with growth of the file
- minimal space overhead
- Disadvantages of extendable hashing
- extra level of indirection (the bucket address table) to find the desired record
- the bucket address table may itself become very big (larger than memory)
- then we need a tree structure to locate the desired record in the structure!
- changing the size of the bucket address table is an expensive operation
- Linear hashing is an alternative mechanism which avoids these disadvantages at the possible cost of more bucket overflows
75. Comparison: Ordered Indexing vs. Hashing
- Each scheme has advantages for some operations and situations. To choose wisely between different schemes we need to consider
- cost of periodic reorganization
- relative frequency of insertions and deletions
- is it desirable to optimize average access time at the expense of worst-case access time?
- what types of queries do we expect?
- Hashing is generally better at retrieving records for a specific key value
- Ordered indices are better for range queries
76. Index Definition in SQL
- Create an index
- create index <index-name> on <relation-name> (<attribute-list>)
- e.g.,
- create index br-index on branch(branch-name)
- Use create unique index to indirectly specify and enforce the condition that the search key is a candidate key
- not really required if the SQL unique integrity constraint is supported
- To drop an index
- drop index <index-name>
77. Multiple-Key Access
- With some queries we can use multiple indices
- Example:
- select account-number
- from account
- where branch-name = 'Perryridge' and balance = 1000
- Possible strategies for processing this query using indices on single attributes
- use the index on balance to find accounts with balance = 1000, then test them individually to see if branch-name = 'Perryridge'
- use the index on branch-name to find accounts with branch-name = 'Perryridge', then test them individually to see if balance = 1000
- use the branch-name index to find pointers to all records of the Perryridge branch, use the balance index similarly, then take the intersection of both sets of pointers
78. Multiple-Key Access (2)
- With some queries, using a single-attribute index is unnecessarily expensive
- with methods (1) and (2) from the earlier slide, we might have the index we use return a very large set, even though the final result is quite small
- Even with method (3) (use both indices and then find the intersection) we might have both indices return a large set, which will make for a lot of unneeded work if the final result (the intersection) is small
- An alternative strategy is to create and use an index on more than one attribute -- in this example, an index on (branch-name, balance) (both attributes)
79. Indices on Multiple Attributes
- Suppose we have an ordered index on the combined search key (branch-name, balance)
- Examining the earlier query with the clause
- where branch-name = 'Perryridge' and balance = 1000
- Our new index will fetch only records that satisfy both conditions -- much more efficient than trying to answer the query with separate single-valued indices
- We can also handle range queries like
- where branch-name = 'Perryridge' and balance < 1000
- But our index will not work well for
- where branch-name < 'Perryridge' and balance = 1000
80. Multi-Attribute Indexing
- Example: EMP(eno, ename, age, sal)
- lots of ways to handle this problem
- separate indices: lots of false hits
- combined index based on a composite key
- key = sal * 100 + age
- a search for 30 < sal < 40 translates into 3000 < key < 4000 (easy)
- a search for 60 < age < 80 is difficult (see the sketch below)
- Grid files (up next)
- R-tree (B-tree generalization)
- Quad-trees, K-d trees, etc.
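The composite-key trick in code form, to make the asymmetry concrete (the EMP tuples and values are made up for illustration):

```python
def composite_key(sal: int, age: int) -> int:
    # key = sal * 100 + age, so sal occupies the high-order digits
    return sal * 100 + age

emps = [(30, 25), (35, 70), (45, 65), (62, 41)]        # (sal, age) pairs
index = sorted(composite_key(s, a) for s, a in emps)   # ordered index on the key

# 30 < sal < 40 becomes one contiguous key range: easy with an ordered index
sal_range = [k for k in index if 3000 < k < 4000]
# 60 < age < 80 is scattered across the key space: every sal value contributes
# its own small range, so the ordered index on the composite key barely helps
age_range = [k for k in index if 60 < k % 100 < 80]
print(sal_range, age_range)
```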
81. Grid Files
- A structure used to speed processing of general multiple search-key queries involving one or more comparison operators
- The grid file has
- a single grid array
- the array has a number of dimensions equal to the number of search-key attributes
- one linear scale for each search-key attribute
- Multiple cells of the grid array can point to the same bucket
- To find the bucket for a search-key value, locate the row and column of the cell using the linear scales to get the grid location, then follow the pointer in that grid location to the bucket
82. Example Grid File for account
83. Queries on a Grid File
- A grid file on two attributes A and B can handle queries of all the following forms with reasonable efficiency
- (a1 ≤ A ≤ a2)
- (b1 ≤ B ≤ b2)
- (a1 ≤ A ≤ a2 and b1 ≤ B ≤ b2)
- For example, to answer (a1 ≤ A ≤ a2 and b1 ≤ B ≤ b2), use the linear scales to find the corresponding candidate grid-array cells, and look up all the buckets pointed to from those cells
84. Grid Files (cont.)
- During insertion, if a bucket becomes full, a new bucket can be created if more than one cell points to it
- the idea is similar to extendable hashing, but in multiple dimensions
- if only one cell points to the bucket, either an overflow bucket must be created or the grid array size must be increased
- Linear scales can be chosen to uniformly distribute records across buckets
- it is not necessary to have the scale uniform across the domain -- if records are distributed in some other pattern, the linear scale can mirror it (as shown in the example earlier)
85. Grid Files (end)
- Periodic reorganization to increase the grid array size helps maintain good performance
- reorganization can be very expensive, though
- Space overhead of the grid array can be high
- R-trees (chapter 23) are an alternative
86. Bitmap Indices
- Bitmap indices are a special type of index designed for efficient queries on multiple keys
- Records in a relation are assumed to be numbered sequentially from 0
- given a number n, the objective is for it to be easy to retrieve record n
- very easy to achieve if we're looking at fixed-length records
- Bitmap indices are applicable on attributes that take on a relatively small number of distinct values
- e.g., gender, country, state, hockey team
- or an arbitrary mapping of a wider spectrum of values into a small number of categories (e.g., income level: divide income into a small number of levels such as 0-9999, 10K-19999, 20K-49999, 50K and greater)
- A bitmap is simply an array of bits
87. Bitmap Indices (cont.)
- In the simplest form, a bitmap index on an attribute has a bitmap for each value of the attribute
- the bitmap has as many bits as there are records
- in the bitmap for value v, the bit for a record is 1 if the record has the value v for the attribute, and 0 otherwise
88. Bitmap Indices (cont.)
- Bitmap indices are useful for queries on multiple attributes
- not particularly useful for single-attribute queries
- Queries are answered using bitmap operations
- intersection (and)
- union (or)
- complementation (not)
- Each operation takes two bitmaps of the same size and applies the operation on corresponding bits to get the result bitmap (see the sketch below)
- e.g., 100110 AND 110011 = 100010
- 100110 OR 110011 = 110111
- NOT 100110 = 011001
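The three operations, sketched with Python integers as bit vectors; the example bitmaps are the ones from the bullets above, and the fixed width mask makes complementation behave like the slide's 6-bit example:

```python
num_records = 6
mask = (1 << num_records) - 1          # keeps results num_records bits wide

a = 0b100110
b = 0b110011

b_and = a & b                          # intersection: 100010
b_or  = a | b                          # union:        110111
b_not = ~a & mask                      # complement:   011001

print(format(b_and, "06b"), format(b_or, "06b"), format(b_not, "06b"))
```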
89. Bitmap Indices (cont.)
- Every 1 bit in the result marks a desired tuple
- we can compute its location (since records are numbered sequentially and are all of the same size) and retrieve the records
- counting the number of tuples in the result (the SQL count aggregate) is even faster
- Bitmap indices are generally very small compared to the relation size
- e.g., if a record is 100 bytes, the space for a single bitmap on all tuples of the relation is 1/800 of the size of the relation itself
- if the bitmap index covers 8 distinct values, it is still only 1% of the size of the whole relation
90. Bitmap Indices (cont.)
- Deletion needs to be handled properly
- can't just store zero values at deleted locations (why?)
- we need an existence bitmap to note whether the record at location X is valid or not
- the existence bitmap is necessary for complementation
- not(A = v): (NOT bitmap-A-v) AND ExistenceBitmap
- Should keep bitmaps for all values, even null
- to correctly handle SQL null semantics for not(A = v) we must compute
- (NOT bitmap-A-null) AND (NOT bitmap-A-v) AND ExistenceBitmap
91. Efficient Implementation of Bitmap Operations
- Bitmaps are packed into words; a single word AND is a basic CPU instruction that computes the and of 32 or 64 bits at once
- e.g., two one-million-bit bitmaps can be ANDed with just 31,250 instructions
- Counting the number of 1s can be done fast by a trick (sketched below)
- use each byte to index into a precomputed array of 256 elements, each storing the count of 1s in the binary representation of that byte
- add up the retrieved counts
- can use pairs of bytes to speed this up further in a similar way, but at a higher memory cost (64K values in the precomputed array for 2 bytes)
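The byte-lookup counting trick as a sketch: precompute the number of 1s in every possible byte once, then count a long packed bitmap one byte at a time:

```python
# precomputed array of 256 elements: count of 1 bits in each possible byte value
POPCOUNT = [bin(i).count("1") for i in range(256)]

def count_ones(bitmap: bytes) -> int:
    """Count set bits in a packed bitmap by summing per-byte lookups."""
    return sum(POPCOUNT[byte] for byte in bitmap)

# e.g. a one-million-bit bitmap occupies 125,000 bytes -> 125,000 lookups
bitmap = bytes([0b10011010]) * 125_000
print(count_ones(bitmap))              # 4 ones per byte -> 500,000
```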
92. Bitmaps and B+-Trees
- Bitmaps can be used instead of tuple-ID lists at the leaf levels of B+-trees, for values that have a large number of matching records
- assuming a tuple-ID is 64 bits, this becomes worthwhile if > 1/64 of the records have a given value
- This technique merges the benefits of bitmap and B+-tree indices
- useful if some values are uncommon and some are quite common
- why is this not useful if there are only a small number of different values total (16, 20, something less than 64)?
- That covers all of chapter 12.
93. Query Processing
- SQL is good for humans, but not as an internal (machine) representation of how to calculate a result
- Processing an SQL (or other) query requires these steps
- parsing and translation
- turning the query into a useful internal representation in the extended relational algebra
- optimization
- manipulating the relational algebra query into the most efficient form (one that gets results the fastest)
- evaluation
- actually computing the results of the query
94. Query Processing: Diagram
95. Query Processing Steps
- 1. parsing and translation
- details of parsing are covered in other places (texts and courses on compilers). We've already covered SQL and relational algebra; translating between the two should be relatively familiar ground
- 2. optimization
- This is the meat of chapter 13: how to figure out which plan, among many, is the best way to execute a query
- 3. evaluation
- actually computing the results of the query is mostly mechanical (doesn't require much cleverness) once a good plan is in place
96. Query Processing: Example
- Initial query:
- select balance
- from account
- where balance < 2500
- Two different relational algebra expressions could represent this query
- σ_balance<2500(Π_balance(account))
- Π_balance(σ_balance<2500(account))
- which choice is better? It depends upon metadata (data about the data) and what indices are available for use in these operations.
97. Query Processing: Metadata
- Cost parameters (some are easy to maintain, some are very hard -- this is statistical info maintained in the system catalog)
- n(r) = number of tuples in relation r
- b(r) = number of disk blocks containing tuples of relation r
- s(r) = average size of a tuple of relation r
- f(r) = blocking factor of r: how many tuples fit in a disk block
- V(A,r) = number of distinct values of attribute A in r (V(A,r) = n(r) if A is a candidate key)
- SC(A,r) = average selection cardinality (selectivity) for attribute A of r, equivalent to n(r)/V(A,r) (1 if A is a key)
- min(A,r) = minimum value of attribute A in r
- max(A,r) = maximum value of attribute A in r
98. Query Processing: Metadata (2)
- Cost parameters are used in two important computations
- the I/O cost of an operation
- the size of the result
- In the following examination we'll find it useful to differentiate three important operations
- Selection (search) for equality (R.A1 = c)
- Selection (search) for inequality (R.A1 > c) (range queries)
- Projection on attribute A1
99. Selection for Equality (no indices)
- Selection (search) for equality (R.A1 = c)
- cost (sequential search on a sorted relation):
- b(r)/2 on average, unsuccessful
- b(r)/2 + SC(A1,r) - 1 on average, successful
- cost (binary search on a sorted relation):
- ⌈log2 b(r)⌉ on average, unsuccessful
- ⌈log2 b(r)⌉ + SC(A1,r) - 1 on average, successful
- size of the result, n(σ_A1=c(R)):
- SC(A1,r) = n(r)/V(A1,r) (a small worked estimate is sketched below)
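A small sketch of how these catalog statistics turn into an estimate, using the names defined on the metadata slide; the numeric values are made up for illustration:

```python
import math

def equality_selection_estimate(n_r, b_r, V_A, sorted_file=True):
    """Return (estimated result size, estimated I/O cost in blocks) for a
    selection R.A1 = c, following the formulas on this slide."""
    sc = n_r / V_A                                    # SC(A1,r): expected matches
    if sorted_file:
        cost = math.ceil(math.log2(b_r)) + sc - 1     # binary search, successful
    else:
        cost = b_r                                    # full scan of the file
    return sc, cost

# e.g. 10,000 tuples in 400 blocks, 50 distinct values of the attribute
print(equality_selection_estimate(n_r=10_000, b_r=400, V_A=50))
```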
100. Selection for Inequality (no indices)
- Selection (search) for inequality (R.A1 > c)
- cost (file unsorted):
- b(r)
- cost (file sorted on A1):
- b(r)/2 + b(r)/2 (if we assume that half the tuples qualify)
- b(r) in general
- (regardless of the number of tuples that qualify; why?)
- size of the result
- depends upon the query; unpredictable
101. Projection on A1
- Projection on attribute A1
- cost:
- b(r)
- size of the result, n(Π_A1(R)):
- V(A1,r)
102. Selection (Indexed Scan) for Equality
- Primary index on a key:
- cost = (height + 1), unsuccessful
- cost = (height + 1) + 1, successful
- Primary (clustering) index on a non-key:
- cost = (height + 1) + SC(A1,r)/f(r)
- all tuples with the same value are clustered
- Secondary index:
- cost = (height + 1) + SC(A1,r)
- tuples with the same value are scattered
103. Selection (Indexed Scan) for Inequality
- Primary index on a key: search for the first value and then pick tuples > value
- cost = (height + 1) + 1 + size of the result (in disk pages)
- = height + 2 + n(r) · (max(A,r) - c)/(max(A,r) - min(A,r)) / f(r)
- Primary (clustering) index on a non-key:
- cost as above (all tuples with the same value are clustered)
- Secondary (non-clustering) index:
- cost = (height + 1) + (number of B+-tree leaves)/2 + size of the result (in tuples)
- = height + 1 + (number of B+-tree leaves)/2 + n(r) · (max(A,r) - c)/(max(A,r) - min(A,r))
104. Complex Selections
- Conjunction (select where θ1 and θ2)
- (s1 = number of tuples satisfying selection condition θ1)
- combined SC = (s1/n(r)) · (s2/n(r)) = s1·s2/n(r)^2
- assuming independence of the predicates
- Disjunction (select where θ1 or θ2)
- combined SC = 1 - (1 - s1/n(r)) · (1 - s2/n(r))
- = s1/n(r) + s2/n(r) - s1·s2/n(r)^2
- Negation (select where not θ1)
- n(¬θ1) = n(r) - n(θ1)
- (these estimates are sketched in code below)
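The same estimates written out as functions, so the independence assumption is explicit; the sample numbers are invented for illustration:

```python
def conjunction(n_r, s1, s2):
    """Estimated result size of theta1 AND theta2, assuming independence."""
    return n_r * (s1 / n_r) * (s2 / n_r)

def disjunction(n_r, s1, s2):
    """Estimated result size of theta1 OR theta2 (inclusion-exclusion)."""
    return n_r * (1 - (1 - s1 / n_r) * (1 - s2 / n_r))

def negation(n_r, s1):
    """Estimated result size of NOT theta1."""
    return n_r - s1

# e.g. 10,000 tuples; theta1 matches 200 of them, theta2 matches 500
print(conjunction(10_000, 200, 500),   # 10
      disjunction(10_000, 200, 500),   # 690
      negation(10_000, 200))           # 9800
```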
105. Complex Selections with Indices
- GOAL: apply the most restrictive condition first, and combine the use of multiple indices, to reduce the intermediate results as early as possible
- Why? No index will be available on intermediate results!
- Conjunctive selection using one index B
- select using B and then apply the remaining predicates to the intermediate results
- Conjunctive selection using a composite-key index on (R.A1, R.A2)
- create a composite key or range from the query values and search directly (range search on the first attribute (the MSB of the composite key) only)
- Conjunctive selection using two indices B1 and B2
- search each separately and intersect the tuple identifiers (TIDs)
- Disjunctive selection using two indices B1 and B2
- search each separately and union the tuple identifiers (TIDs)
106. Sorting?
- What does sorting have to do with query processing?
- SQL queries can specify that the output be sorted
- several relational operations (such as joins) can be implemented very efficiently if the input data is sorted first
- as a result, query processing is often concerned with sorting temporary (intermediate) and final results
- creating a secondary index on the active relation (logical sorting) isn't sufficient -- sequential scans through the data on secondary indices are very inefficient. We often need to sort the data physically into order
107. Sorting
- We differentiate two types of sorting
- internal sorting: the entire relation fits in memory
- external sorting: the relation is too large to fit in memory
- Internal sorting can use any of a large range of well-established sorting algorithms (e.g., Quicksort)
- In databases, the most commonly used method for external sorting is the sort-merge algorithm (based upon Mergesort)
108. Sort-Merge Algorithm
- Create-runs phase:
- Load in M consecutive blocks of the relation (M is the number of blocks that will fit easily in main memory)
- Use some internal sorting algorithm to sort the tuples in those M blocks
- Write the sorted run to disk
- Continue with the next M blocks, etcetera, until finished
- Merge-runs phase (assuming that the number of runs, N, is less than M):
- load the first block of each run into memory
- grab the first tuple (lowest value) among all the runs and write it to an output buffer page
- when the last tuple of a block is read, grab a new block from that run
- when the output buffer page is full, write it to disk and start a new one
- continue until all buffer pages are empty
- (a sketch of both phases appears below)
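A sketch of the two phases, with Python lists standing in for disk blocks and runs; M plays the role described above, and heapq.merge does the "take the lowest tuple among all runs" step:

```python
import heapq

def sort_merge(blocks, M):
    """blocks: list of lists of records (one list per disk block).
    Returns all records in sorted order, assuming the number of runs N <= M."""
    # create-runs phase: sort M blocks at a time and "write" each run out
    runs = []
    for i in range(0, len(blocks), M):
        run = sorted(rec for block in blocks[i:i + M] for rec in block)
        runs.append(run)
    # merge-runs phase: repeatedly emit the smallest record among all runs
    return list(heapq.merge(*runs))

blocks = [[27, 3], [99, 14], [8, 56], [41, 2]]
print(sort_merge(blocks, M=2))   # [2, 3, 8, 14, 27, 41, 56, 99]
```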
109. Sort-Merge Algorithm (2)
- Merge-runs phase (N > M):
- operate on M runs at a time, creating runs of length M^2, and continue in multiple passes of the merge operation
- Cost of sorting (b(r) is the number of blocks occupied by relation r):
- the runs phase does one read and one write on each block of r: cost 2·b(r)
- total number of runs: N = ⌈b(r)/M⌉
- number of passes in the merge operation: 1 if N < M
- otherwise ⌈log_(M-1)(b(r)/M)⌉
- during each pass of the merge phase we read the whole relation and write it all out again: cost 2·b(r) per pass
- the total cost (runs phase plus merge passes) is therefore 2·b(r) · (⌈log_(M-1)(b(r)/M)⌉ + 1)
- if only one merge pass is required (N < M) the cost is 4·b(r)
- if M > b(r) then there is only one run (internal sorting) and the cost is b(r)
110. Join Operation
- Join is perhaps the most important operation combining two relations
- Algorithms for computing the join efficiently are crucial to the optimization phase of query processing
- We will examine a number of algorithms for computing joins