Data Warehousing - PowerPoint PPT Presentation

1 / 10
About This Presentation
Title:

Data Warehousing

Description:

In some instances, processes might involve multiple conditional ... Outrigger: a foreign key for the split table is located in the slowly changing dimension ... – PowerPoint PPT presentation

Number of Views:196
Avg rating:3.0/5.0
Slides: 11
Provided by: brianem4
Category:

less

Transcript and Presenter's Notes

Title: Data Warehousing


1
Data Warehousing
  • DSCI 4103
  • Dr. Mennecke

Chapter 4
2
Complex, multi-stage transactions
  • In some instances, processes might involve
    multiple conditional transactions (e.g., one type
    of payment requires a credit check, another
    requires specialized invoicing, etc.)
  • A choice needs to be made about whether to have
    multiple fact tables associated with each process

3
Multiple fact tables
  • Criteria in deciding
  • What are the users analytical requirements
  • Will separate processes be examined together or
    separately?
  • Are there really multiple separate business
    processes?
  • If so, then two or more fact tables are probably
    more appropriate since these processes may
    operate independently
  • Are multiple source systems involved?
  • If yes, use separate fact tables
  • How do the processes relate to dimensions?
  • If separate processes have different dimensions
    that they interact with, then separate fact
    tables may be implied

4
Slowly changing dimensions
  • What do we do when dimension information changes
    over time? (e.g., a product category changes)

5
Dealing with slowly changing dimensions
  • Overwrite the old attribute value with the new
    attribute value
  • Advantage easy to manage
  • Disadvantage lose historical information

6
Dealing with slowly changing dimensions
  • Add a new, unique dimension row (e.g., product
    description 2 will be added to product dimension
    1)
  • Advantage this maintains the historical
    information and is more in line with the goal of
    the data warehouse
  • Disadvantage
  • Requires the creation and management of a new
    surrogate key for the new dimension record
  • May result in the creation of very large
    dimension tables
  • Tracking across the dimension change may be
    difficult

7
Dealing with slowly changing dimensions
  • Add a new column to the dimension table to hold
    information about the old dimension information
    (e.g., the old product category)
  • Advantage allows tracking of dimensions over the
    change in dimensional information
  • Disadvantage becomes unwieldy when multiple
    changes occur

8
Dealing with slowly changing dimensions
  • Predictable changes with multiple version
    overlays creates columns representing the
    dimension attributes value for each time period
  • Advantage allows tracking of dimensions over
    time
  • Disadvantage only handles periodic and
    predictable changes

9
Dealing with rapidly changing dimensions
  • Quickly changing dimensions cannot easily be
    handled using the previous techniques
  • Split quickly changing attributes into a separate
    dimension table
  • Minidimension A foreign key for the split table
    is located in the fact table (e.g., a demographic
    category such as age would be applied to the
    customer when the purchase was made)
  • Outrigger a foreign key for the split table is
    located in the slowly changing dimension

10
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com