The Data Warehouse

1 / 48
About This Presentation
Title:

The Data Warehouse

Description:

What a data warehouse is and how data for it are prepared ... and Adding Value with Data Warehouse-Based Solutions (continued) ... 13. The Data Warehouse ... – PowerPoint PPT presentation

Number of Views:338
Avg rating:3.0/5.0

less

Transcript and Presenter's Notes

Title: The Data Warehouse


1
Chapter 12
  • The Data Warehouse
  • Database Systems Design, Implementation, and
    Management, Sixth Edition, Rob and Coronel

2
In this chapter, you will learn
  • How operational data and decision support data
    differ
  • What a data warehouse is and how data for it are
    prepared
  • What star schemas are and how they are
    constructed
  • What steps are required to successfully implement
    a data warehouse
  • What data mining is and what role it plays in
    decision support

3
The Need for Data Analysis
  • Managers must be able to track daily transactions
    to evaluate how the business is performing
  • By tapping into the operational database,
    management can develop strategies to meet
    organizational goals
  • Data analysis can provide information about
    short-term tactical evaluations and strategies

4
Solving Business Problems and Adding Value with
Data Warehouse-Based Solutions
5
Solving Business Problems and Adding Value with
Data Warehouse-Based Solutions (continued)
6
Decision Support Systems
  • Methodology (or series of methodologies) designed
    to extract information from data and to use such
    information as a basis for decision making
  • Decision support system (DSS)
  • Arrangement of computerized tools used to assist
    managerial decision making within a business
  • Usually requires extensive data massaging to
    produce information
  • Used at all levels within an organization
  • Often tailored to focus on specific business
    areas
  • Provides ad hoc query tools to retrieve data and
    to display data in different formats

7
Decision Support Systems (continued)
  • Composed of four main components
  • Data store component
  • Basically a DSS database
  • Data extraction and filtering component
  • Used to extract and validate data taken from
    operational database and external data sources
  • End-user query tool
  • Used to create queries that access database
  • End-user presentation tool
  • Used to organize and present data

8
Main Components of a Decision Support System
(DSS)
9
Transforming Operational Data Into Decision
Support Data
10
Contrasting Operational and DSS Data
Characteristics
11
Ten-Year Sales History for a Single Department,
in Millions of Dollars
12
Yearly Sales Summaries, Two Stores and Two
Departments per Store,in Millions of Dollars
13
The Data Warehouse
  • Integrated, subject-oriented, time-variant,
    nonvolatile database that provides support for
    decision making

14
A Comparison of Data Warehouse and Operational
Database Characteristics
15
Creating a Data Warehouse
16
DSS Architectural Styles
17
Online Analytical Processing
  • Advanced data analysis environment that supports
    decision making, business modeling, and
    operations research
  • OLAP systems share four main characteristics
  • Use multidimensional data analysis techniques
  • Provide advanced database support
  • Provide easy-to-use end-user interfaces
  • Support client/server architecture

18
Operational vs. Multidimensional View of Sales
19
Integration of OLAP with a Spreadsheet Program
20
OLAP Client/Server Architecture
21
OLAP Server Arrangement
22
OLAP Server with Multidimensional Data Store
Arrangement
23
OLAP Server With Local Mini Data Marts
24
Bitmap Representation of REGION Values
25
Typical ROLAP Client/Server Architecture
26
MOLAP Client/Server Architecture
27
Relational vs. Multidimensional OLAP
28
Star Schemas
  • Data modeling technique used to map
    multidimensional decision support data into a
    relational database
  • Creates the near equivalent of a multidimensional
    database schema from the existing relational
    database
  • Yield an easily implemented model for
    multidimensional data analysis, while still
    preserving the relational structures on which the
    operational database is built
  • Has four components facts, dimensions,
    attributes, and attribute hierarchies

29
Simple Star Schema
30
Possible Attributes for Sales Dimensions
31
Three-Dimensional View of Sales

32
Slice and Dice View of Sales
33
Location Attribute Hierarchy
34
Attribute Hierarchies In Multidimensional Analysis
35
Star Schema for Sales
36
Orders Star Schema
37
Normalized Dimension tables
38
Multiple Fact Tables
39
Implementing a Data Warehouse
  • Numerous constraints
  • Available funding
  • Managements view of the role played by an IS
    department and of the extent and depth of the
    information requirements
  • Corporate culture
  • No single formula can describe perfect data
    warehouse development

40
Factors Common to Data Warehousing
  • Data warehouse is not a static database
  • Dynamic framework for decision support that is
    always a work in progress
  • Data warehouse data cross departmental lines and
    geographical boundaries
  • Must satisfy
  • Data integration and loading criteria
  • Data analysis capabilities with acceptable query
    performance
  • End-user data analysis needs
  • Apply database design procedures

41
Data Warehouse Implementation Road Map
42
Data Mining
  • Tools that
  • analyze data
  • uncover problems or opportunities hidden in data
    relationships,
  • form computer models based on their findings, and
    then
  • use the models to predict business behavior
  • Require minimal end-user intervention

43
Extraction of Knowledge From Data
44
Data-Mining Phases
45
A Sample of Current Data Warehousing and
Data-Mining Vendors
46
Summary
  • Data analysis is used to derive and interpret
    information from data
  • Decision support is a methodology designed to
    extract information from data and to use such
    information as a basis for decision making
  • Decision support system is an arrangement of
    computerized tools used to assist managerial
    decision making within a business
  • Data warehouse is an integrated,
    subject-oriented, time-variant, nonvolatile
    database that provides support for decision
    making

47
Summary (continued)
  • Online analytical processing is an advanced data
    analysis environment that supports decision
    making, business modeling, and operations
    research
  • Star schema is a data-modeling technique used to
    map multidimensional decision support data into a
    relational database
  • The implementation of any company-wide
    information system is subject to conflicting
    organizational and behavioral factors

48
Summary (continued)
  • Data mining automates analysis of operational
    data with the intention of finding previously
    unknown data characteristics, relationships,
    dependencies, and/or trends
  • Data warehouse is storage location for decision
    support data
Write a Comment
User Comments (0)