Title: Introduction to Data Warehousing
1????? ??????
- ????? 2 ????? ?????? ?? ?????
2Reminder
- The purpose of a data warehouse is to expose
business information for use by decision makers
in the organization.
- A data warehouse containing hundreds of millions
of pieces of data is of no use to a decision
maker without tools to assist in the analysis and
evaluation process.
- These analysis tools can vary from simple reports
to sophisticated data mining algorithms.
3??????? ???????
?????? Data
?????? ?? ???? ?? ?????? ??????? ???? ??????
?????????? ?????? ???????. ??????? ?????? ???????
????? ????????? ?? ?????? ?????? (????? ?????)
????? ???? ??????? ???? ?????? ????????? ??
?????? ??? ???? ??????.
4??????? ???????
?????? Data
???? Information
???? ????? ?????? ????? ????? ????? ?????
?????. ???? ???? ????? ?? ????"?, ?????
??????. ?????? ????? ??????? ???? ??? Total Lost
??? ??? ?????? ???? ??????.
5??????? ???????
?????? Data
???? Information
??? Knowledge
??? ???? ?????? ???? ?????. ?????? ????
??????? ????? ????? ?????? ????? ??????? ????.
6??????? ???????
?????? Data
???? Information
??? Knowledge
??? ???? ?????? ???? ?????. ?????? ????
??????? ????? ????? ?????? ????? ??????? ????,
???? ???? ??????? ????? ?????? ???? ????? ?????
??????? ????.
7??????? ???????
?????? Data
???? Information
??? Knowledge
????? Decision
?????. ?????? ?????? ?? ?????? ????? ???????
????? ????? ?????? ?????? ?????? ??????? ?????
??????? ????? ??????.
8What is Business Intelligence?
- The processes, technologies, and tools needed to
turn data into information, information into
knowledge, and knowledge into plans that drive
profitable business action.
9????? ?????? ?????? ???? ??????
- ????? ?????? ?????? ???? ?????? ?????? ???
????????? ?????? ???? ??????? ??? ????? ??????
??????? ?????? ?????. - ?????? ??? ??????? ????? ???????? ???? ??????
????"?, ???? ?????? ????? ??????, ??? ????? ??
?????, ???? ????????, ???? ?????? ?????? ???'.
10????? ?????? ????? ?????
- ????? ???? ????? ??? ?? ????? ?? ??????? ???
????? ??? ???? ???? ???????. ?? ?????? ?????
????? ?????? ?? ????? ???? -gt ????? ????"?. - ????? ???? ??????? ??? ?? ???? ????? ???? ???
?????? ????? ????? ?? ?????? ??? ????? ??????
????? ????? ?????? ????? -gt ????? ?? ?????. - ????? ???? ?????? ??? ????? ??????? ????? ??"?
?? ?????? ??? ?????? ????? ????? -gt ??? ?????
??????.
11????????
12Statisticians
- There are typically only a handful of
sophisticated analysts (statisticians and
operations research types) in any organization.
- Though few in number, they are some of the best
users of the data warehouse: those whose work can
contribute to the profitability of the company.
- It is vital that these users come to love the
data warehouse.
- Usually that is not difficult. These people are
often very self-sufficient and need only to be
pointed to the database and given some simple
instructions about how to get to the data and
what times of day are best for running large
queries to retrieve data, which they then analyze
with their own sophisticated tools.
13Knowledge Workers
- A relatively small number of analysts perform the
bulk of new queries and analyses against the data
warehouse.
- Their queries and reports typically get published
for the benefit of the Information Consumers.
- Knowledge Workers are often deeply engaged with
the data warehouse design and place the greatest
demands on the ongoing data warehouse operations
team for training and support.
14Information Consumers
- Most users of the data warehouse are Information
Consumers.
- They will probably never compose a true ad hoc
query. They use static or simple interactive
reports that others have developed.
- Do not neglect these users! This group includes a
large number of people.
15Executives
- Executives are a special case of the Information
Consumers group.
- Few executives actually issue their own queries,
but an executive's slightest musing can generate
a flurry of activity among the other types of
users.
16????? ????? ????? ?????? ?????
- ????? ????? (Verification) ????? ???? ????
????? ?????? ???????? ????? ??????? ????? ??
????? ?? ?????. - ????? ????? (Discovery) ????? ???? ???? ?????
????? ??? ????? ????? ???????? ????? ??? ???????
?????? ?? ??????? ???????? ???.
17???? ?????? ??????
- ?????? ??"???
- ??? ????? ?? ?????
- ??? ????? ??????
18Simple predefined summary reports
- Simple predefined summary reports can provide
managers with periodic or on-demand snapshots of
the state of the business at a point in time.
More sophisticated reports can display trends of
predetermined business variables.
- Such reports are useful and have historically
been produced from online transaction processing
(OLTP) systems. To capture up-to-the-minute
status, snapshot detail and summary reports must
continue to be produced from the data source
systems.
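As a rough illustration of the two kinds of report described above, the sketch below computes a predefined monthly summary and a simple trend from a handful of transaction rows. The `orders` rows, the field names, and the helper names `monthly_summary` and `trend` are all hypothetical and are not taken from any particular reporting tool.

```python
from collections import defaultdict

# Hypothetical transaction rows as they might come from a source (OLTP) system.
orders = [
    {"date": "2024-01-15", "amount": 200.0},
    {"date": "2024-01-28", "amount": 100.0},
    {"date": "2024-02-03", "amount": 450.0},
    {"date": "2024-03-10", "amount": 300.0},
]

def monthly_summary(rows):
    """Predefined summary: total sales per month, in chronological order."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["date"][:7]] += row["amount"]  # 'YYYY-MM' prefix of the date
    return dict(sorted(totals.items()))

def trend(summary):
    """Trend of one predetermined variable: change versus the previous month."""
    months = list(summary)
    return {m: summary[m] - summary[prev] for prev, m in zip(months, months[1:])}

summary = monthly_summary(orders)
print(summary)
print(trend(summary))
```

The report answers exactly the questions it was built for (totals per month, change per month) and nothing else, which is the limitation the next slide discusses.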
19Reports - Disadvantages
- Managers cannot produce the reports on their own.
- Producing a report takes time.
- Predefined reports serve their specific purposes
well but are not suited to exploratory analysis.
Analysts want to discover trends and anomalies in
the data and explore various areas of the data to
find the sources of these trends and anomalies.
20????? ?????? ?? ?????
- ????? ????? ??????? ????? ?? ??????? ?????? ?????
???????? ??? ?????. - ?????? ??? ????? ????? ?? ?????????? ??? ??????
???????
21?????? ???????
- ????? ?? ??? ???"? ?????????
?????
??????
22??????
23??????? ???????
24The Cube - Properties
- A data warehouse cube is defined by any number of
dimensions (it is not limited to three).
- Dimensions can be organized into any number of
levels.
- The cells are facts. Each fact corresponds to one
element of the cross product of the dimensions of
the cube, that is, one member from each dimension.
- A fact can show a measure of something, like 120
units sold or 960.00 in sales.
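The properties above can be sketched with a plain dictionary, assuming a cube is stored as a mapping from one member per dimension to a set of measures. The dimension names, members, and the `lookup` helper are hypothetical; only the measures "120 units sold" and "960.00 in sales" come from the slide.

```python
from itertools import product

# Hypothetical dimensions; a cube may have any number of them, not just three.
dimensions = {
    "product": ["bread", "milk"],
    "store": ["north", "south"],
    "month": ["2024-01", "2024-02"],
    "channel": ["web", "shop"],
}

# Every possible cell address is one element of the cross product of the dimensions.
cell_addresses = list(product(*dimensions.values()))

# Facts are stored sparsely: only cells where something happened hold measures.
facts = {
    ("bread", "north", "2024-01", "shop"): {"units_sold": 120, "sales": 960.00},
    ("milk", "south", "2024-02", "web"): {"units_sold": 45, "sales": 180.00},
}

def lookup(product_name, store, month, channel):
    """Return the measures stored in one cell, or None if the cell is empty."""
    return facts.get((product_name, store, month, channel))

print(len(cell_addresses))  # 2 * 2 * 2 * 2 = 16 possible cells
print(lookup("bread", "north", "2024-01", "shop"))
```

Storing only the non-empty cells is a design choice of this sketch; real cubes are usually sparse, since most combinations of members never occur.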
25??? ?? ????? ?? ??????
- ?????? ????? ??? ???? ??????? ??? ??? ?????? ????
?? ????? ??? ?? ????? ???? ???? ????? ????? ???
???? ???? ?? ?????. - ????? ???? ???? ?????? ????? ?????? ??? ?????
?????? - ??????
- ????
- ??????
- ??????
26?????? 1 ???? ?????? ?? ?? ??? ?????? ?? ?????
??????? ?????? ????? ?????
27?????? 2 ?????? ???????
28?????? ?? ?????
- Roll up (drill-up): summarize data
  - by climbing up a hierarchy or by dimension
    reduction
- Drill down (roll down): reverse of roll-up
  - from higher-level summary to lower-level summary
    or detailed data, or by introducing new dimensions
- Slice and dice
  - project and select
- Pivot (rotate)
  - reorient the cube
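The operations listed above can be sketched on a tiny in-memory cube. This is a minimal illustration under stated assumptions, not how an OLAP server implements them: the cube is a dictionary keyed by (product, city, month), the single measure is units sold, and every name below (`DIMS`, `quarter`, `roll_up`, `drop_dimension`, `slice_`, `dice`, `pivot`) is hypothetical.

```python
from collections import defaultdict

# Hypothetical detailed facts: (product, city, month) -> units sold.
facts = {
    ("bread", "Haifa", "2024-01"): 10,
    ("bread", "Haifa", "2024-02"): 20,
    ("bread", "Eilat", "2024-01"): 5,
    ("milk", "Haifa", "2024-01"): 7,
    ("milk", "Eilat", "2024-04"): 3,
}
DIMS = ("product", "city", "month")  # axis order of the detailed cube

def quarter(month):
    """Map 'YYYY-MM' to 'YYYY-Qn', one level up the time hierarchy."""
    year, mm = month.split("-")
    return f"{year}-Q{(int(mm) - 1) // 3 + 1}"

def roll_up(cube, dim, to_parent):
    """Roll up by climbing one hierarchy level on `dim`, summing the measure."""
    i = DIMS.index(dim)
    out = defaultdict(int)
    for key, value in cube.items():
        out[key[:i] + (to_parent(key[i]),) + key[i + 1:]] += value
    return dict(out)

def drop_dimension(cube, dim):
    """Roll up by dimension reduction: aggregate `dim` away entirely."""
    i = DIMS.index(dim)
    out = defaultdict(int)
    for key, value in cube.items():
        out[key[:i] + key[i + 1:]] += value
    return dict(out)

def slice_(cube, dim, member):
    """Slice: select the cells with one fixed member on one dimension."""
    i = DIMS.index(dim)
    return {k: v for k, v in cube.items() if k[i] == member}

def dice(cube, allowed):
    """Dice: select the cells whose members fall in the allowed sets."""
    return {k: v for k, v in cube.items()
            if all(k[DIMS.index(d)] in members for d, members in allowed.items())}

def pivot(cube, order):
    """Pivot: reorder the axes; the cell values are unchanged."""
    idx = [DIMS.index(d) for d in order]
    return {tuple(k[i] for i in idx): v for k, v in cube.items()}

by_quarter = roll_up(facts, "month", quarter)
print(by_quarter)
print(drop_dimension(facts, "month"))
print(slice_(facts, "city", "Eilat"))
print(dice(facts, {"product": {"bread"}, "city": {"Haifa"}}))
print(pivot(facts, ("month", "product", "city")))
```

Drill-down has no function of its own here because it cannot be computed from a summary: going from `by_quarter` back to months means returning to the detailed `facts`, which is why the detailed data must be kept.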
29Operations
30Drill-Down and Roll-up
31Tool 1 Pivot Table of Excel
32Tool 2 Data Analyzer
33???? ?????
34USING THE WEB