Introduction to Data Warehousing - PowerPoint PPT Presentation

1 / 34
About This Presentation
Title:

Introduction to Data Warehousing

Description:

The week that represents the corporate calendar. ... analyze data and expose interesting information for analysis by decision makers ... – PowerPoint PPT presentation

Number of Views:76
Avg rating:3.0/5.0
Slides: 35
Provided by: eceU5
Category:

less

Transcript and Presenter's Notes

Title: Introduction to Data Warehousing


1
Introduction to Data Warehousing
  • Ivan Otero
  • BSEMBAPMP

2
Topics
  • Operational Data Stores
  • Why Warehousing
  • Data Warehouse defined
  • Data Warehouse basic elements
  • Data Warehouse concepts
  • Further info

3
Operational Data Store
  • subject-oriented, integrated, volatile,
    current-valued data store containing only
    corporate detailed data1

Bank
Manufacturing
Telecomm
ATM Transactions
Work in Process Transactions
Cellular Phone Calls
1. Building the Operational Data Store (Wiley
1996), Bill Inmon, et. all
4
Operational Data Stores Characteristics
  • High Performance
  • Quick Response Time (acceptable)
  • Multi-User
  • Pre-defined transactions
  • Data is archived
  • Most are Relational Databases
  • OLTP (On-Line Transaction Processing)

5
(No Transcript)
6
(No Transcript)
7
Data Warehouse Definition
  • A structured, extensible environment designed for
    the analysis of non-volatile data, logically and
    physically transformed from multiple source
    applications to align with business structure,
    updated and maintained for a long period of time,
    expressed in simple business terms, and
    summarized for quick analysis.

8
Basic Elements of a Data Warehouse
DATA STAGING AREA
Source Systems
End User Data Access
Data Warehouse
Ad Hoc
Reporting
End User
Data Marts
General Data Flow
9
Source Systems
  • Legacy Applications (Mainframe)
  • Flat Files
  • Spreadsheets
  • OLTP databases (Oracle, Informix, SQL Server)
  • Enterprise Systems

10
Data Staging Area
  • Temporary Storage for cleaning, pruning,
  • combining, remove duplicates, standardize.
  • export
  • ETL (Extract, Transform and Load) Tools are
    tools used to perform this task
  • Example Informatica, Data Stage, AbInitio

11
(No Transcript)
12
(No Transcript)
13
(No Transcript)
14
Data Marts
  • Subject-Area Data Warehouses

Production
Planning
Finance
15
End User Applications
  • Graphical Representation of Multidimensional data
    (2D and 3D)
  • Excel (2D)
  • Web Applications
  • Ad-hoc Reporting Applications

16
(No Transcript)
17
Graphical Representation of Multidimensional
Structure
18
Pivot Tables
19
Pivot Charts
20
Why build a Warehouse?
21
What does the business gain?
22
Data Warehouse Definition
  • A structured, extensible environment designed for
    the analysis of Non-Volatile data, logically and
    physically transformed from multiple source
    applications to align with business structure,
    updated and maintained for a long period of time,
    expressed in simple business terms, and
    summarized for quick analysis.

23
Typical Architecture
24
Data WareHouse Concepts
  • Fact Table
  • Surrogate Keys
  • Metric Facts
  • Dimensions
  • Star Schema
  • Snowflake Schema
  • OLAP
  • Data Cube
  • MetaData
  • Data Mining

25
What is a Fact Table?
Primary Keys
Metric Facts
26
What is a Dimension?
  • Surrogate Key
  • Should not encode any information about the
    contents of records
  • Automatically increasing integers make good
    surrogate keys.
  • Maintain data warehouse information when
    dimensions change

I can query ANY of these data items
Other Concepts Slow or Fast Changing Dimensions
27
(No Transcript)
28
(No Transcript)
29
OLAP
  • On-Line Analytical Processing
  • Fast Analysis of Shared Multidimensional
    Information

30
What is a Data Cube?
  • Non-relational data structure that stores a
    multidimensional data set
  • Some vendors implement this with tables in a
    relational database (ROLAP)

31
What is Meta Data?
  • Data about Data

32
Star Schema Real Examples
33
Data Mining
  • Is a technology that applies sophisticated and
    complex algorithms to analyze data and expose
    interesting information for analysis by decision
    makers
  • Used in Marketing Research, Fraud Prevention

34
Further Information
  • www.olapreport.com
  • www.datawarehousingonline.com
  • www.datawarehousing.com
  • www.tdwi.org
  • www.datawarehousing.org
  • May search for
  • OLAP
  • Business Intelligence
  • Decision Support Systems
  • Data Mining
  • Vendors Ascential Software, Hyperion, Cognos,
    ProClarity, Sagent Technology, and Business
    Objects
Write a Comment
User Comments (0)
About PowerShow.com