Download presentation
Presentation is loading. Please wait.
Published byIsabella Miller Modified over 9 years ago
1
Business Intelligence Instructor: Bajuna Salehe Email: bajunar@yahoo.combajunar@yahoo.com Web: http://www.ifm.ac.tz/staff/bajuna/courses http://www.ifm.ac.tz/staff/bajuna/courses Design the Data Warehouse Schema
2
Data Warehouse Schema The entity-relationship data model (ERD) is commonly used in the design of relational databases, where a database schema consists of a set of entities and the relationships between them. Such a data model is appropriate for on- line transaction processing (OLTP)
3
Data Warehouse Schema A data warehouse, however, requires a concise, subject-oriented schema that facilitates on-line data analysis (OLAP). The most popular data model for a data warehouse is a multidimensional model.
4
Data Warehouse Schema Such a model can exist in the following forms –a star schema –a snowflake schema –a fact constellation schema. The major focus will be on the star schema which is commonly used in the design of many data warehouse.
5
Star Schema This is the most common modeling paradigm for designing data warehouse. In this model a data warehouse consists of: –a large central table (fact table) containing the bulk of the data, with no redundancy –a set of smaller attendant tables (dimension tables), one for each dimension. The diagram below show an example of star schema
6
Star Schema Star schema of a data warehouse for sales.
7
Star Schema A star schema for AllElectronics sales is shown in Figure in the above slide. Sales are considered along four dimensions namely, time, item, branch, and location. The schema contains a central fact table for sales that contains keys to each of the four dimensions, along with two measures: dollars sold and units sold. To minimize the size of the fact table, dimension identifiers (such as time key and item key) are system-generated identifiers.
8
Star Schema Notice that in the star schema, each dimension is represented by only one table, and each table contains a set of attributes. For example, the location dimension table contains the attribute set {location key, street, city, province or state, country}
9
Issues With Data Warehousing Underestimation of resources for data loading Hidden problems with source systems Required data not captured Increased end-user demands Data homogenization High demand for resources Data ownership High maintenance Long duration projects Complexity of integration
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.