Chap8: Trends in DBMS - PowerPoint PPT Presentation

1 / 42

About This Presentation

Title:

Chap8: Trends in DBMS

Description:

Video, i.e. time series of images. Audio data. Focus: Primarily images, ... 2. Do-it-yourself. Divide a raster data-item into smaller slices. Q? ... – PowerPoint PPT presentation

Number of Views:56

Avg rating:3.0/5.0

Slides: 43

Provided by: spatia

Learn more at: https://www.spatial.cs.umn.edu

Category:

more less

Transcript and Presenter's Notes

Title: Chap8: Trends in DBMS

1
Chap8 Trends in DBMS
8.1 Database support for Field Entities 8.2
Content-based retrieval 8.3 Introduction to
spatial data warehouses 8.4 Summary
2
Learning Objectives

Learning Objectives (LO)
LO1 Learn about field data
Why learn about field data type?
What is field data type? How is represented in
SDBMS?
What are common operations on fit?
LO2 Learn about storage and retrieval of field
data
LO3 Learn about spatial data warehouses
Mapping Sections to learning objectives
LO1 - 8.1.1
LO2 - 8.1.2, 8.2
LO3 - 8.3

3
Why learn about Field data-sets?

Field data is timely and abundant
Sensors (e.g. satellite based ones) provide
periodic snapshot of Earth
Most up-to-date data about current events (e.g.
fires, flood)
Field data are useful
in creating, revising and evaluating vector data
sets
digital archival of fragile historical paper maps
to manually get details not captured in vector
interpretations
Example Location selection for a facility (e.g.
a grocery store)
Consider a set of Aerial photographs of different
locations
Vector interpretation includes roads, water
bodies, elevation
What other information can aerial imagery reveal
for construction planning?
Trees (types and location), buildings,

4
What are Field data-sets?

Field data set examples
Satellite images, aerial photographs
Digitized paper maps
Earth Science data-sets, e.g. rainfall,
temperature maps
Data types of Spatial field data sets
Images
Satellite based, e.g. www.terradata.com
Aerial photographs
Measurements from a Geo-registered sensor
networks, e.g. weather
Video, i.e. time series of images
Audio data
Focus Primarily images,
though some discussion will apply to other data
types

5
Fields and Rasters An Sampling of Field values

Definitions
Field a mapping from a spatial domain to a
value domain
Image a mapping from a rectangular grid to a
value domain
A rectangular grid is a collection of cells
called pixels
Raster is geo-registered image, i.e. grid axis
have absolute spatial locations
Fields are often approximated as rasters
Example Figure 8.1
Identify spatial domain, field, rectangular
grid, raster approximation
Fields can be approximated as images if relative
spatial locations are adequate

Fig 8.1
6
Computing with field data

Field data manipulated using operations of
map algebra
image algebra
An Algebra is a mathematical structure consisting
of
Operands and Operations.
Map Algebra
Operand rasters
Operations Can be classified into four groups
Local, Focal, Zonal and Global
Image Algebra
Operand images
Operations crop, zoom, rotate

7
Local Operation
A local operation maps a raster into another
raster such that the value of a cell in the new
raster depends only on the value of that cell in
the original raster. Examples unary operation
thresholding binary operation point wise
addition
Fig 8.2
8
Focal Operation
In a focal operation, the value of a cell in the
new raster is dependent on the values of the cell
and its neighboring cells in the original
raster. Examples unary operations focal sum,
gradient,
Neighborhoods Rook, Bishop and Queen
Fig 8.3
9
Zonal Operation
In a global operation, the value of a cell in the
new raster is a function of the location or
values of all cells in the original or another
raster. Examples zonal sum, zonal average, ...
Fig 8.4
10
Global Operation
In a zonal operation, the value of a cell in the
new raster is a function of the value of that
cell in the original layer and the values of
other cells which appear in the same zone
specified in another raster. Example distance
from nearest facility
Fig 8.5
11
Image OperationsTrim

Image Operations
ignore the absolute locations of pixels.
come from image processing literature
Ex. smoothing, low pass filter, high pass filter,
Example A trim operation extracts an
axis-aligned subset of the original raster.

Fig 8.6
12
Learning Objectives

Learning Objectives (LO)
LO1 Learn about field data
LO2 Learn about storage and retrieval of raster
data
How is raster data stored on secondary storage?
What query families are used for retrieval?
What is content based retrieval (CBR)? Why is it
interesting?
How is CBR computationally approached?
LO3 Learn about spatial data warehouses
Mapping Sections to learning objectives
LO1 - 8.1
LO2 - 8.1.2, 8.2
LO3 - 8.3

13
Storage and Retrieval of Raster Data - 1

Traditional Approach
store raster data in a file system
use custom software to retrieve data-items of
interest
Example personal photographs stored on MS
Windows
Q? What attributes can one attach to digital
photographs ?
Q? Is there an easy way to retrieve all pictures
taken in San Francisco?
Limitations
Rigid schema
Limited ability to add and manage additional
attributes
Canned Queries only
Limited ability to support ad-hoc queries
Data quality
Limited ability to identify duplicates or similar
data-items

14
Storage and Retrieval of Raster Data in a SDBMS

A database approach
Database tables store
raster data items
attributes (i.e. meta-data), e.g. creation date,
geo-location, subject, ...
use SQL like query language to retrieve desired
data-items
retrieve all raster data-items overlapping with
city of San Francisco (Q1)
retrieve latest raster data-item within city of
Paris (Q2)
retrieve raster data-items similar to a given
image (Q3)
Pros
table schema definition allows user defined
attributes
improve ability to pose ad-hoc queries (Ex. Q1,
Q2)
improve data reliability and quality
Example Query Q3 may be used for duplicate
reduction

15
Storage and Retrieval of Raster Data - Challenges

Challenges in database based approach
storage size( raster data item) gt size (disk
blocks)
retrieval raster has rich content
A picture is worth a thousand word!
Approaches to storage challenge
1. Delegate storage to DBMS
Use Binary Large Object (BLOB) data-type
create table my_picture(
image BLOB
creation_date date
place point
)
2. Do-it-yourself
Divide a raster data-item into smaller slices
Q? Which way of slicing reduce disk I/Os for
common queries?

16
8.1.2 How is raster data stored on secondary
storage?

Slicing approaches
Linear, e.g. one row per disk block (see Fig.
8.8(b))
Tiling - see Fig. 8.8(c )
Tiling is preferred
for queries extracting rectangular sub-images
Example - terraserver.com

Fig 8.8
17
8.2 How is raster data queried?

Retrieval challenge of rich content
A. Meta-data approach
B. Content based retrieval
Meta-data approach
select a set of descriptive attributes
simpler SQL data types, e.g. numeric, string,
date, ...
Example source, location, time stamp, subject,
resolution, ...
Store values of descriptive attributes for each
raster data-item
Allow SQL queries on the descriptive attributes
Limitation of meta-data approach
Restricts queries to content captured by
descriptive attributes
Does not support Similarity based queries
Ex. Find all raster data-items similar to a given
raster data item.

18
8.2 Content Based Retrieval (CBR)

Examples
Q1. Find all raster data-items similar to a given
raster data item
Q2. Locate a photograph of a river in Minnesota
with trees nearby.
Q3. Find all images of state parks which have a
lake within them, are within a radius of one
hundred miles from Chicago, and are southwest of
Chicago.
State of the Art
However, few robust implementations of CBR are
available as of 2002
Several research prototypes address similarity
query Q1
Result quality is similar to those of web
searches (e.g. www.google.com)
Some of the retrieved raster data-item are
useful.
Many similar data item are not retrieved in the
result
Usable in application domains such as publishing
Our goal is to understand a current approach to
similarity queries
involving spatial similarities

19
8.2 Content Based Retrieval (CBR)

Spatial Similarity
Consider a pair of raster images with common
objects (e.g. parks, lakes)
Spatial similarity between raster images can be
defined based on
similarity of spatial relationships (e.g.
topological, directional)
Q? Which pairs exhibit higher similarity?
P1 (inside, disjoint) or P2 (inside, covered
by)
P3 (disjoint, touch) or P4 (disjoint, inside)
P5 (north west, north) or P6 (west, east)
A graph framework for comparing spatial
relationships
Nodes spatial relationships Edges connect
most similar nodes
Similarity metric number of edge on shortest
path between 2 nodes
See Figures 8.9 and 8.10

20
8.2.1 Topological Relationship Similarity

Study Fig. 8.9, pp. 234
Nodes topological relationships
Edges most similar
Similarity measure path length
Inference from Model
P2 (inside, covered by) more similar than P1
(inside, disjoint)
Do you agree?
Review Figure 2.3 (pp. 30)

Fig 8.9
21
8.2.2 Direction Relationship Similarity

Study Fig. 8.10, pp. 235
Nodes topological relationships Edges most
similar
Similarity measure path length
Inference P5 (north-west, north) more similar
than P6 (west, east)

Fig 8.10
22
8.2.3 Distance Similarity

Distance similarity is based on
Euclidean distance between the centroids of the
objects.
Example Image R is more similar to P than Q in
Fig. 8.11 (pp. 235)

Fig 8.11
23
8.2.4 A Computational Approach to CBR

Attribute Relation Graph (ARG)
Node objects in a raster
Edges relationships
Ex. Raster of Fig. 8.12(a)
ARG in Fig. Fig. 8.12(b)
Point object O3
Rectangles O1, O2
Edge (O1, O2) shows that they are disjoint, at
61 degree direction and 5.2 units distant.
Vector representation of ARG
Lists objects and edge properties
Ex. In Fig. 8.12

Fig 8.12
24
A Computational Approach to CBR

Steps
1. Represent each raster data item by its ARG
vector
2. Map query raster data item by its ARG vector
3. Find most similar raster data-items in the
database by comparing ARG vector representations.
Use a distance metric
Use a multi-dim. Index
Comment Result quality is similar to those of
web searches. Some of the retrieved raster
data-item are useful.

Fig 8.13
25
Learning Objectives