Data Warehousing: Architecture, Components and The Building Blocks

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

Database Architectures and the Web
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehousing Concepts Transparencies
1 Minggu 13, Pertemuan 25 Data Warehousing and Data Mining Concepts (Ch , 30.5, ; 3rd ed.) Matakuliah: T0206-Sistem Basisdata.
Chapter 3 Database Management
Designing the Data Warehouse and Data Mart Methodologies and Techniques.
Components and Architecture CS 543 – Data Warehousing.
Organizing Data & Information
Data Warehousing Dale-Marie Wilson, Ph.D..
Chapter 14 The Second Component: The Database.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Business Intelligence Dr. Mahdi Esmaeili 1. Technical Infrastructure Evaluation Hardware Network Middleware Database Management Systems Tools and Standards.
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Designing a Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Data Warehousing Concepts
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
L/O/G/O Metadata Business Intelligence Erwin Moeyaert.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA ebay
Understanding Data Warehousing
Database Systems – Data Warehousing
Data Warehouse Concepts Transparencies
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
AN OVERVIEW OF DATA WAREHOUSING
Datawarehouse Objectives
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS CHAPTER 3
1 Data Warehouses BUAD/American University Data Warehouses.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Information Builders : SmartMart Seon-Min Rhee Visualization & Simulation Lab Dept. of Computer Science & Engineering Ewha Womans University.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Sachin Goel (68) Manav Mudgal (69) Piyush Samsukha (76) Rachit Singhal (82) Richa Somvanshi (85) Sahar ( )
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
CISB594 – Business Intelligence Data Warehousing Part I.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
DATA RESOURCE MANAGEMENT
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
7 Strategies for Extracting, Transforming, and Loading.
By N.Gopinath AP/CSE.  The data warehouse architecture is based on a relational database management system server that functions as the central repository.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Defining Data Warehouse Concepts and Terminology
Chapter 13 The Data Warehouse
Data Warehouse.
Defining Data Warehouse Concepts and Terminology
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehouse.
Data Warehousing Concepts
Presentation transcript:

Data Warehousing: Architecture, Components and The Building Blocks Chapter Name September 98 Data Warehouse Fundamentals Chapter 2 Data Warehousing: Architecture, Components and The Building Blocks Paul K Chen 1

Summary of Topics The Nature of the Data in the Data Warehousing Operational Data Store vs. Data Warehouse Technology Typical Architecture of A Data Warehouse Major Building Blocks (Components) of the Data Warehouse Data Warehouse Information Flows Data Warehousing Tools and Technologies Business Issues for Middleware Decision Processing—Four Tasks Reasons for Creating a Data Mart

The Nature of the Data in the Data Warehousing A subject-oriented, integrated, time-variant, and non-volatile collection of data in support of management’s decision-making process (Inmon, 1993). 9

Subject-Oriented Data The warehouse is organized around the major subjects of the enterprise (e.g. customers, products, and sales) rather than the major application areas (e.g. customer invoicing, stock control, and product sales). This is reflected in the need to store decision-support data rather than application-oriented data. 10

Integrated Data The data warehouse integrates corporate application-oriented data from different source systems, which often includes data that is inconsistent. The integrated data source must be made consistent to present a unified view of the data to the users. 11

Time-Variant Data Data in the warehouse is only accurate and valid at some point in time or over some time interval. Time-variance is also shown in the extended time that the data is held, the implicit or explicit association of time with all data, and the fact that the data represents a series of snapshots. 12

Non-Volatile Data Data in the warehouse is not updated in real-time but is refreshed from operational systems on a regular basis. New data is always added as a supplement to the database, rather than a replacement. 13

Data Granularity Data in the warehouse is summarized at different levels. Granularity levels are based on the data types and the expected system performance for queries.

Data Warehouse Data The data should be well-defined, consistent, and nonvolatile in nature. The quantity of data should be large enough to support data analysis, querying, reporting, and comparisons of historical data over a longer period of time. The data warehouse must be user driven.

Operational Data Store vs. Data Warehouse Technology Issue Operational Data Warehouse How built Critical to Data access Data volume One application at a time in the legacy environment or one subject area at time in the ODS Daily business operation Smaller numbers of rows retrieved in a single call Volume needed for daily operation One or more subject areas at a time Management decisions that may affect profitability Large sets of data scanned to retrieve results Larger volume needed to support statistical analysis, forecasting, ad hoc reporting, and querying

Operational Data Store vs. Data Warehouse Technology Issue Operational Data Warehouse Data retention Data currency Data Availability Data retained to meet daily requirements Must be up to minute High availability may be needed Data retained longer to support historical reporting, comparison, analysis, etc. Usually represents a static point in time; usually important that data does not change minute by minute Usually does not require as high availability as the production environment unless worldwide access is necessary

Comparison of OLTP Systems and Data Warehousing From Data Contents 15

Typical Data Warehouse Queries Which type of property sells for prices above the average selling price for properties in the main cities of Great Britain and how does this correlate to demographic data? What are the three most popular areas in each city for renting property in 1997 and how does this compare with the figures for the previous two years? What is the current monthly revenue for property sales at each branch office, compared with rolling 12-monthly prior figures? What is the relationship between the total annual revenue generated by each branch office and the total number of sales staff assigned to each branch office? 16

Typical Architecture of A Data Warehouse 18

Architecture In Three Major Areas The structure that brings all the components of a data warehouse together is known as the architecture. Data Acquisition Data Storage Information Delivery

A Typical Data Warehousing System Architecture-Bill of Material Data Warehouse End-user Access tools Load Manager Warehouse manager Subject Data Convert Data Maintain Data Change Inf BOM Verified Data BOM Application Subject Data Data Warehouse data Query Results Bill of material Data Update Data Update Access Data Maintain On-line Update User User Query Request System Security Data Manage Security Applications Manage System Query manager Meta data manager

Major Building Blocks (Components) of the Data Warehouse Source data component (operation data store) Data staging component Data storage component Information delivery component Metadata component Management and control component

1. Source Data Component - Operational Data Store An operational data store (ODS) provides the basis for operational processing and may be used to feed the data warehouse. It consists of the following: Production data Internal Data Archived data External Data

Operational Data Sources (Structure & Environment) Mainframe first generation hierarchical and network databases. Departmental propriety file systems (e.g. VSAM, RMS) and relational DBMSs (e.g. Informix, Oracle). Private workstations and servers. External systems such as the internet, commercially available databases, or databases associated with an organization’s suppliers or customers. 19

2. Source Staging Component Three major functions need to be performed for getting the data ready. You have to extract the data, transform the data, and then load the data into the data warehouse storage. Data staging provides a place and an area with a set of functions to clean, change, combine, convert, duplicate, prepare source data for storage and use in the data warehouse.

Extraction, Cleansing, and Transformation Tools Tasks of capturing data from source systems, cleansing and transforming it, and loading the results into a target system can be carried out either by separate products, or by a single integrated solution. Integrated solutions include Code Generators Database Data Replication Tools Dynamic Transformation Engines

EAI & ETI EAI (Enterprise Application Integrator) tools provide a foundation for these models that address an enterprise’s tactical data requirements by efficiently moving data between applications with few integration challenges. EAI also preprocess and stage targeted data for enterprise data warehousing ETL stands for Extract-transform-load.

Load Manager Performs all the operations associated with the extraction and loading of data into the warehouse. Size and complexity will vary between data warehouses and may be constructed using a combination of vendor data loading tools and custom-built programs. 20

3. Data Storage Component- Detailed Data The foundation of the warehouse consists of detailed data at its most basic level. Stores all the detailed data in the database schema. In most cases, the detailed data is not stored online but aggregated to the next level of detail. On a regular basis, detailed data is added to the warehouse to supplement the aggregated data.

Data Warehouse Data Storage External Data Archived Data Multidimensional Data Summary 1 Level 1 Summary 4 Level 2 Detail Data Summary 2 Level 1 Summary 5 Level 2 Summary 3 Level 1 Summary 6 Level 3

Data Storage Component- Lightly and Highly Summarized Data Stores all the pre-defined lightly and highly aggregated data generated by the warehouse manager. Transient as it will be subject to change on an on-going basis in order to respond to changing query profiles.

Data Storage Component- Lightly and Highly Summarized Data (cont’d) The purpose of summary information is to speed up the performance of queries. Removes the requirement to continually perform summary operations (such as sort or group by) in answering user queries. The summary data is updated continuously as new data is loaded into the warehouse.

Data Storage Component- Archive / Backup Data Stores detailed and summarized data for the purposes of archiving and backup. May be necessary to backup online summary data if this data is kept beyond the retention period for detailed data. The data is transferred to storage archives such as magnetic tape or optical disk.

Warehouse Manager Performs all the operations associated with the management of the data in the warehouse. Constructed using vendor data management tools and custom-built programs.

Warehouse Manager - Operations Analysis of data to ensure consistency. Transformation and merging of source data from temporary storage into data warehouse tables. Creation of indexes and views on base tables. Generation of denormalizations, (if necessary). Generation of aggregations, (if necessary). Backing-up and archiving data.

Warehouse Manager In some cases, also generates query profiles to determine which indexes and aggregations are appropriate. A query profile can be generated for each user, group of users, or the data warehouse and is based on information that describes the characteristics of the queries such as frequency, target table(s), and size of results set.

4. Information Delivery Component Functionality: Provide information to the wide community of data warehouse users via Online access Intranet Internet E-mail For Ad hoc reports, complex queries, MD (multi-dimension) analysis, Statistical analysis, EIS feed and Data Mining.

End-User Access Tools The principal purpose of data warehousing is to provide information to business users for strategic decision-making. These users interact with the warehouse using end-user access tools. The data warehouse must efficiently support ad hoc and routine analysis.

End-User Access Tools High performance is achieved by pre-planning the requirements for joins, summations, and periodic reports by end-users (where possible). There are five main groups of access tools Data reporting and query tools (crystal reporting) Application development tools Executive Information System (EIS) tools Online Analytical Processing (OLAP) tools Data mining tools

Query Manager Performs all the operations associated with the management of user queries. Typically constructed using vendor end-user data access tools, data warehouse monitoring tools, database facilities, and custom-built programs. Complexity determined by the facilities provided by the end-user access tools and the database.

Query Manager (cont’d) The operations performed by this component include directing queries to the appropriate tables and scheduling the execution of queries. In some cases, the query manager also generates query profiles to allow the warehouse manager to determine which indexes and aggregations are appropriate.

5. Metadata Component This area of the warehouse stores all the metadata (data about data) definitions used by all the processes in the warehouse.

What’s Metadata THE DATA WAREHOUSE PROVIDES A MEANS FOR IMPLEMENTING AN EFFECTIVE DECISION SUPPORT ENVIRONMENT BY BUILDING EXISTING DATA FROM DISPARATE SOURCES SCATTERED ALL OVER AN ORGANIZATION. METADATA (META MODEL) COULD BE COMPARED TO AN INFORMATION DIRECTORY, CONTAINING THE “YELLOW PAGES,” ROAD MAP FOR NAVIGATING A DATA WAREHOUSE.

What’s Metadata METADATA IS DEFINED AS DATA ABOUT DATA. FOR EXAMPLE: 6.33 HAS LITTLE MEANING. 6.33 % MEANS MORE. 6.33 % BIRTH REDUCTION RATE FROM A NATIONAL CAMPAIGN.

Why Metadata –(cont’d) THE DATA WAREHOUSING IS GROWING PHENOMENON. (THE WAREHOUSE SOFTWARE PRODUCTS ARE EXPECTED TO GROW AT AN ANNUAL RATE OF 24% TO REACH A $2.2 BILLION MARKET BY 1998). WITHOUT METADATA, INFORMATION IS REDUCED TO A MEANINGLESS DATA REPOSITORY.

Types of Metadata Extraction and Transformation Metadata--Extraction and loading processes - metadata is used to map data sources to a common view of information within the warehouse. Operational Metadata-- Warehouse management process - metadata is used to automate the production of summary tables. End-User Metadata -- Query management process - metadata is used to direct a query to the most appropriate data source.

Metadata Views BUSINESS USER’S VIEW FROM A BUSINESS USER’S VIEW, METADATA SHOULD CONTAIN THE FOLLOWING SIX ELEMENTS: 1. TABLE OF CONTENTS 2. ORIGIN OF THE DATA FOR THE WAREHOUSE 3. TRANSFORMATION SEQUENCE 4. ACCESS LEVEL 5. TIMELINE OF THE JOURNEY 6. ACCESS ESTIMATES

Metadata Views DATA WAREHOUSE ADMINISTRATOR'S VIEW 1. VERSION CONTROL 2. PROFILE AND GROWTH METRICS – FOR PURGING PURPOSE

Metadata Views DSS (DECISION SUPPORT SYSTEM) DEVELOPER’S VIEW 1. TRANSFORMATION AND BUSINESS RULES 2. DATA MODELS 3. AVAILABLE OPERATION DATA

Metadata Views CORPORATE VIEW METADATA IS A LOGICAL COLLECTION OF METADATA FROM VARIOUS SOURCES, INCLUDING THE FOLLOWING SIX PLACES:

Metadata Views 1. LEGACY SYSTEM METADATA CONSISTING OF A DATA DICTIONARY CONTAINING INFORMATION ABOUT PROGRAM LIBRARIES, DATABASE CATALOGS AND FILE LAYOUTS. 2. OPERATIONAL CLIENT/SERVER SYSTEMS – CONSISTING OF DISTRIBUTED SOFTWARE COMPONENTS FROM A VARIETY OF VENDORS. 3. ENTERPRISE MODELS –THEY ARE THE FIRST STAGE IN THE ULTIMATE GOAL OF BUILDING CORPORATE METADATA.

Metadata Example Assume your user wants to know about the table or entity called Customer in your data warehouse before running any queries on the customer data. What’s the information content about Customer in your metadata repository? Let’s review the metadata elements for the Customer entity as shown on next slide.

Entity Name: Customer; Alias Names: Account, Client Definition: A person or an organization that purchases goods or services from the company. Remarks: Customer entity includes regular, current, and past customers. Source Systems: Finished goods orders; Maintenance contracts; Online sales. Create Date January 15,1999 Last update date January 21,2001 Update Cycle weekly Last full refresh date December 20, 2000 Data quality review January 25, 2001 Planned archival Every six months Responsible user Jim Brown

6. Management and Control Component --Warehouse Manager Performs all the operations associated with the management of the data in the warehouse. Constructed using vendor data management tools and custom-built programs.

Warehouse Manager - Operations Analysis of data to ensure consistency. Transformation and merging of source data from temporary storage into data warehouse tables. Creation of indexes and views on base tables. Generation of denormalizations, (if necessary). Generation of aggregations, (if necessary). Backing-up and archiving data.

Warehouse Manager In some cases, also generates query profiles to determine which indexes and aggregations are appropriate. A query profile can be generated for each user, group of users, or the data warehouse and is based on information that describes the characteristics of the queries such as frequency, target table(s), and size of results set.

Data Warehouse Information Flows 36

Data Warehouse Information Flows Inflow - Processes associated with the extraction, cleansing, and loading of the data from the source systems into the data warehouse. Upflow - Processes associated with adding value to the data in the warehouse through summarizing, packaging, and distribution of the data. 37

Data Warehouse Information Flows Downflow - Processes associated with archiving and backing-up/recovery of data in the warehouse. Outflow - Processes associated with making the data available to the end-users. Metaflow - Processes associated with the management of the metadata. 38

Data Flow Across the Corporation Personal Data Warehouse Production Systems Data Marts Operational Data Store Extract,transform & Load Processing Data Warehouse Metadata/Data Dictionary

Data Warehousing Tools and Technologies Building a data warehouse is a complex task because there is no vendor that provides an ‘end-to-end’ set of tools. Necessitates that a data warehouse is built using multiple products from different vendors. Ensuring that these products work well together and are fully integrated is a major challenge. 39

Tools for your Data Warehouse Data Acquisition Data Storage Information Delivery OLAP Source Systems Data Modeling DW/ Data Marts Extraction Report Writer Data Loading Transformation Staging Ara Quality Assurance Load Image Creation Alert Systems Data Mining

Front End Tools Production queries Access for existing tools Ad hoc queries “Intelligent” global optimization Query governor – preset limit Predictive governor – estimates cost (CPU, I/O) Tool connectivity to all databases

Accessing DW Databases Heterogeneous DBs, linking data marts Gateways Database gateway (requires DBMS) Independent gateway Aspects Point-to-point, point-to-many-points Data location transparency Global metadata catalog Access to distributed databases Heterogeneous joins Global optimizer SMP

Data Warehouse DBMS Requirements Load performance Load processing Data quality management Query performance Terabyte scalability Mass user scalability Networked data warehouse Warehouse administration Integrated dimensional analysis Advanced query functionality 41

Components of a DBMS Query processor Database manager (DM) File manager DML preprocessor DDL compiler Catalog manager

Components of a DBMS

Components of Database Manager (DM)

Administration and Management Tools Monitoring data loading from multiple sources. Data quality and integrity checks. Managing and updating metadata. Monitoring database performance to ensure efficient query response times and resource utilization. Auditing data warehouse usage to provide user chargeback information. 46

Administration and Management Tools Replicating, subsetting, and distributing data. Maintaining efficient data storage management. Purging data. Archiving and backing-up data. Implementing recovery following failure. Security management. 47

Gluing the Warehouse Together Middleware Gluing the Warehouse Together

Business Issues for Middleware Definition: software that shields users and developers from differences in services and resources used by applications Data warehouses often have heterogeneous databases, operating systems, networks, hardware, applications

Business Issues for Middleware Role of middleware Assist developer in data extraction/transformation and populating DW Assist business user in accessing DW Therefore needed at different points in life cycle Types Copy management: data extraction, transformation, replication, and propagation Gateways: DB and independent gateways Program-to program: RPCs, TP monitors, ORBs Message-oriented

Populating the Data Warehouse Connect tool to data (networking or communications protocol) Access to databases (access method for connection and update) Data sources Populating DW databases Maintenance (replication)

Connectivity and Interoperability Communications gateways Protocols – e.g., TCP/IP, DECnet, NetBIOS, ODBC, SPC/IX, Async, OBDC, DRDA (LU0, LU2, LU6.2) NOSs – e.g., SNA, Windows, OS/2, UNIX, MVS, VMS, Netware, LAN Server, Banyan DBs – SQL dialects Feasibility Multi-platform, multi-vendor Versions, upgrades Solutions Single vendor Multi-vendor DBMS-independent vendors Architecture standards, technical skills

Decision Processing—Four Tasks Capturing data This involves capturing data from operational systems, transforming it into business information, and loading into a data warehouse information store. Current extract templates on the market are primarily at capturing data from ERP (Enterprise Resource Planning) transaction processing systems –for example: SAP Business Information Warehouse and Peoplesoft BPM data warehouse)

Decision Processing—Four Tasks (Cont’d) Managing information This task encompasses the maintenance of business information in information stores, and how these information stores are processed by business intelligence tools and analytic applications. The cornerstone of decision processing is data warehousing, and warehouse information stores should be organized and modeled into relational and multidimensional database products.

Decision Processing—Four Tasks (Cont’d) Analyzing and modeling information The traditional approach to decision processing is to build a data warehouse and supply business users with a set of business intelligence tools (query, reporting, OLAP and data mining, for example) to process information in data warehouse information stores. A better approach is employ turn-key and web-based analytic application packages that are designed to provide comprehensive analyses for the business area being researched. Key business metrics (ex. Revenue dollars per sales rep per day) are useful.

Decision Processing—Four Tasks (Cont’d) Distributing information Business intelligence tools and analytic applications distribute information and the results of analysis operations to business users via standard graphical and Web interfaces. To help users uncover and organize this range of business information, an enterprise information portal (EIP) is required. An EIP provides a single point of entry to any piece of business information, no matter where it resides. The main components of an EIP are information assistant (Web browser interface) , an information directory and a subscription facility.

The Decision Processing Information Supply Chain Business Metrics Operational Systems External Data Analytic Applications E-Business Applications DW Collaborative & Office Systems Back-Office Transaction Applications Business Intelligence Tools Information Staging Area Front-Office Applications Business Decisions

Reasons for Creating a Data Mart To give users access to the data they need to analyze most often. To provide data in a form that matches the collective view of the data by a group of users in a department or business function area. To improve end-user response time due to the reduction in the volume of data to be accessed.

Reasons for Creating a Data Mart To provide appropriately structured data as dictated by the requirements of the end-user access tools. Building a data mart is simpler compared with establishing a corporate data warehouse. The cost of implementing data marts is normally less than that required to establish a data warehouse.

Reasons for Creating a Data Mart The potential users of a data mart are more clearly defined and can be more easily targeted to obtain support for a data mart project rather than a corporate data warehouse project.

Data Warehouse vs. Data Mart –In Terms of Data Granularity Departmental A single business process Star-join (facts & dimensions) Technology optimal for data access and analysis Structure to suit the departmental view of data Corporate/Enterprise-wide Union of all data marts Data received from staging area Queries on presentation source Structure for corporate view of data Organized on E-R Model

Data Mart –From Data Granularity A subset of a data warehouse that supports the requirements of a particular department or business function. Characteristics include Focuses on only the requirements of one department or business function. Do not normally contain detailed operational data unlike data warehouses. More easily understood and navigated.

Typical Data Warehouse and Data Mart Architecture