Download presentation
Presentation is loading. Please wait.
1
The Data Warehouse Environment
2
The Structure of the Data Warehouse There are different levels of detail in the data warehouse. Older level of detail (usually on alternate, bulk storage) A Current level of detail A level o f lightly summarized data (the data mart level) A level of highly summarized data.
3
Subject Orientation The data warehouse is oriented to the major subject areas of the corporation that have been defined in the high level corporate data model. Typical subject areas include the following : Customer Product Transaction or activity Policy Claim Account
4
Day 1-Day n Phenomenon On day 1, there is a polyglot of legacy systems essentially doing operational, transactional processing On day 2, the first few tables of the first subject area of the data warehouse are populated. At this point, a certain amount of curiosity is raised, and the users start to discover data warehouses and analytical processing On day 3, more of the data warehouse is populated, and the population of more data comes more users.
5
Day 1-Day n Phenomenon (continue...) On day 4, as more of the warehouse becomes populated, some of the data that had resided in the operational environment becomes properly placed in the data warehouse. And the data warehouse is now discovered as a source for doing analytical processing On day 5, departmental database (data mart or OLAP) start to blossom. Departmental find that it is cheaper and easier to get their processing done by bringing data from the data warehouse into their own departmental processing environment.
6
On day 6, the land rush to departmental systems takes place. It is cheaper, faster, and easier to get departmental data that it is to get data from the data warehouse. Soon end users are weaned from the detail of data warehouse to departmental processing. On the day n, the architecture is fully developed. Day 1-Day n Phenomenon (continue...)
7
Granularity What is granularity ? The Benefit of granularity Granularity Example Dual levels of granularity
8
Exploration and Data Mining Granular data found in the data warehouse supports more than data marts. It also supports the processes of exploration and data mining What is Data mining ?
9
Living Sample Database The other way of changing the granularity of data How ?
10
Partitioning as a Design approach What is Partitioning ? How to do a Partitioning ? The benefit Loading data Accessing data Archiving data Deleting data Monitoring data Storing data Problem doing partitioning
11
Structuring Data in the Data Warehouse The most common way to structure data within the data warehouse Simple cumulative Rolling summary Simple direct Continuous
12
Data Warehouse : The Standard Manual The kinds of things the publication should contain are the following : A description of what a data warehouse is A description of source systems feeding the warehouse How to use the data warehouse How to get help if there is a problem Who is responsible for what The migration plan for the warehouse How warehouse data relates to operational data How to use warehouse data for DSS When not to add data to the warehouse What kind of data is not in the warehouse A guide to the meta data that is available What the system of record is
13
Auditing and the Data Warehouse The primary reasons for not doing auditing from data warehouse Data that otherwise would not find its way into the warehouse suddenly has to be there The timing of data entry into the warehouse changes dramatically when auditing capability is required The backup and recovery restrictions for the data warehouse change drastically when auditing capability is required Auditing data at the warehouse forces the granularity of data in the warehouse to be at the very lowest level. In short, it is possible to audit from the data warehouse environment, but due to the complications involved, it makes much more sense to audit elsewhere
14
Cost Justification Why not using ROI ? Justifying your data warehouse Cost of running reports Cost of building the data warehouse
15
Data Homogeneity/Heterogeneity Data homogeneity ? Data heterogeneity? “Heterogeneities are everywhere” Personal Databases Digital Libraries Scientific Databases World Wide Web
16
Purging Warehouse Data How data is purged or the detail of data is transformed ?
17
Reporting and the Architected Environment The differences between the two types of reporting Operational Reporting The line item is of the essence; the summary is of little or no importance once used Of interest to the clerical community Operational Reporting The line item is of little or no use once used;the summary or other calculation is of primary importance Of interest to the managerial community
18
The Operational Window of Opportunity Sample of opportunity
19
Incorrect Data in the Data Warehouse How should the architect handle incorrect data in the data warehouse ?
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.