Atlanta Microsoft Database Forum: Introduction to Data Warehousing Concepts. Presented by Brian Thomas, Solution Builders, Inc., March 8, 2004


What is a Data Warehouse?
Data collected from one or many systems that exist within and outside the organization. The data is structured to reduce the time it takes to produce reliable information.

Why Build a Data Warehouse?
To Provide a Consistent Common Source for Corporate Information
To Store Large Volumes of Historical Detail Data from Mission Critical Applications
To Improve the Ability to Access, Report Against, and Analyze Information
To Solve or Improve Upon Business Processes

Turning Data into Information: Functional Data Warehouse
Diagram labels: Sales System; System Generated Reports.
Sales Analysis is extrapolated from the System Reports.

Turning Data into Information: Functional Data Warehouse
Diagram labels: Sales System; Functional Data Warehouse of Sales Information.
Sales Information is available to a wider audience of decision makers.

Turning Data into Information: Cross Organizational Functional Data Warehouse
Diagram labels: Sales Systems for Division A, Division B, and Division C; Centralized Data Warehouse of Sales Data from across the Organization.
Analysis is performed and Decisions are drawn from the Cross Organizational Sales Data.

Turning Data into Information: Cross Functional Data Warehouse
Diagram labels: Sales System; Production Systems; Marketing System; System Generated Reports.
Corporate Performance Analysis is extrapolated from the System Reports.

Turning Data into Information: Cross Functional Data Warehouse
Diagram labels: Sales System; Production Systems; Marketing System; Cross Functional Data Warehouse of Information.
Corporate Performance Analysis is available to a wider audience.

Turning Data into Information: Cross Organizational & Cross Functional Data Warehouse
Diagram labels: Division A; Division B; Division C; Centralized Cross Functional Data Warehouse of Information.
Analysis is performed and Decisions are made from the Cross Functional Organizational Performance Data.

Data Warehouse Architecture
Diagram columns: Source Systems; Data Warehouse Components; Access Methods.
Source Systems: Division A; Division B; Division C; External Data.
Data Warehouse Components: Extraction, Transformation, Load (ETL); Enterprise Data Warehouse; DW / DM; DM; DW / DM; Data Access & Query Management Services.
Levels: Corporate Level; Business Group Level; Divisional Level.
Scale labels: Increased Level of Standardization; Increased Local Specifications.
Other labels: Planning & Forecasting; Performance Management; Scorecards & Dashboards; Analytics & Modeling; Query & Reporting; Management Systems.
Access Methods: Portal / Web Interface; Desktop Applications; Printed Reports; Mobile Devices.

Data Warehouse Architecture
Diagram labels: Source Systems (Division A, Division B, Division C, External Data); Data Staging Area; Data Warehouse Repository; Extract, Transformation and Load (ETL).

Data Warehouse Architecture: Data Staging Area
Subject Area Oriented
Data Structure more closely mirrors Operational System Data Layouts
Supports Identification of Changed Data
Acts as a Working Area to Support the Transformation Process
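
One way to picture "Identification of Changed Data" in a staging area is to compare the previous staged snapshot with the current one. The sketch below is only an illustration under assumptions: the snapshot layout (a dict keyed by a business key), the function names, and the sample rows are hypothetical and do not come from the presentation.

```python
import hashlib


def row_fingerprint(row: dict) -> str:
    """Hash a row's column values in a stable column order."""
    payload = "|".join(f"{column}={row[column]}" for column in sorted(row))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def detect_changes(previous: dict, current: dict) -> dict:
    """Classify business keys as inserted, updated, or deleted between two snapshots."""
    inserted = [key for key in current if key not in previous]
    deleted = [key for key in previous if key not in current]
    updated = [
        key
        for key in current
        if key in previous
        and row_fingerprint(current[key]) != row_fingerprint(previous[key])
    ]
    return {"inserted": inserted, "updated": updated, "deleted": deleted}


# Hypothetical staged snapshots from two consecutive extracts.
previous_snapshot = {
    101: {"customer": "Acme", "city": "Atlanta"},
    102: {"customer": "Globex", "city": "Denver"},
}
current_snapshot = {
    101: {"customer": "Acme", "city": "Dallas"},
    103: {"customer": "Initech", "city": "Chicago"},
}

changes = detect_changes(previous_snapshot, current_snapshot)
print(changes)
```

Source systems that expose timestamps or change logs may make a full comparison like this unnecessary; the snapshot comparison is simply the most self-contained variant to show.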

Data Warehouse Architecture: Extraction, Transformation & Load (ETL)
Perform Attribute Standardization and Cleansing
Apply Business Rules and Calculations
Consolidate using Matching and Merge / Purge Logic
Ensure Proper Linking and Tracking of History
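
The "Matching and Merge / Purge" step can be sketched roughly as follows. This is a minimal illustration, assuming that records match when a normalized name and postal code agree and that the most recently updated non-empty value survives; the field names, matching rule, and sample records are all hypothetical, and real matching logic is usually far more tolerant of variation.

```python
def match_key(record: dict) -> tuple:
    """Build a normalized key used to decide that two records describe the same entity."""
    name = " ".join(record["name"].lower().split())
    return (name, record["postal_code"].strip())


def merge_purge(records: list) -> list:
    """Merge matching records, letting newer non-empty values overwrite older ones."""
    merged = {}
    # Process oldest first so that later (newer) records win field by field.
    for record in sorted(records, key=lambda r: r["updated"]):
        survivor = merged.setdefault(match_key(record), {})
        for field, value in record.items():
            if value not in ("", None):
                survivor[field] = value
    return list(merged.values())


# Hypothetical customer records arriving from different source systems.
source_records = [
    {"name": "Acme  Corp", "postal_code": "30301", "phone": "", "updated": "2004-01-15"},
    {"name": "acme corp", "postal_code": "30301 ", "phone": "404-555-0100", "updated": "2003-11-02"},
    {"name": "Globex", "postal_code": "80202", "phone": "303-555-0199", "updated": "2004-02-01"},
]

golden_records = merge_purge(source_records)
for record in golden_records:
    print(record)
```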

Data Warehouse Architecture: Extraction, Transformation & Load (ETL)
App. A: Male, Female | App. B: 1, 0 | App. C: x, y | App. D: m, f -> Male, Female (Lookup Function)
App. A: pipeline (cm) | App. B: pipeline (inches) | App. C: pipeline (mcf) | App. D: pipeline (yds) -> pipeline (cm) (Conversion Function)
App. A: Date (julian) | App. B: Date (yyyymmdd) | App. C: Date (mm/dd/yyyy) | App. D: Date (absolute) -> Date (julian) (Formatting Function)
App. A: Description | App. B: Description | App. C: Description | App. D: Description -> Description (Merging Function)
App. A: balance on hand | App. B: current balance | App. C: cash in house | App. D: balance -> Balance (Mapping Function)
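
The lookup, conversion, and formatting functions on this slide might look roughly like the sketch below. Several details are assumptions made only for illustration: which of 1/0 and x/y means Male or Female, the reading of "julian" as a YYYYDDD ordinal date, and every function and constant name. The mcf unit is left out because it is normally a volume measure and has no fixed conversion to centimeters.

```python
import math
from datetime import date, datetime

# Lookup function: map each application's code to the warehouse standard.
# The meaning of 1/0 and x/y is assumed here, not taken from the slide.
GENDER_LOOKUP = {
    "male": "Male", "female": "Female",  # App. A
    "1": "Male", "0": "Female",          # App. B (assumed)
    "x": "Male", "y": "Female",          # App. C (assumed)
    "m": "Male", "f": "Female",          # App. D
}


def standardize_gender(raw) -> str:
    """Return the standard gender label for any source-system code."""
    return GENDER_LOOKUP[str(raw).strip().lower()]


# Conversion function: express every pipeline length in centimeters.
CM_PER_UNIT = {"cm": 1.0, "inches": 2.54, "yds": 91.44}


def to_centimeters(value: float, unit: str) -> float:
    """Convert a length in the given unit to centimeters."""
    return value * CM_PER_UNIT[unit]


# Formatting function: parse a source date, then emit one standard format.
def parse_source_date(raw: str, source_format: str) -> date:
    """Parse a source date string using a strptime format."""
    return datetime.strptime(raw, source_format).date()


def to_ordinal_date(d: date) -> str:
    """Format as YYYYDDD (year plus day of year), one common meaning of 'julian'."""
    return d.strftime("%Y%j")


print(standardize_gender("m"), standardize_gender(0))
print(to_centimeters(10, "inches"))
print(to_ordinal_date(parse_source_date("20040308", "%Y%m%d")))
print(to_ordinal_date(parse_source_date("03/08/2004", "%m/%d/%Y")))
```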

Data Warehouse Architecture: Data Warehouse Repository
Organized around Conformed Dimensions and Facts
Promotes Usability and Intuitiveness
Consolidated and Cross-Functional
Historical and Atomic Representation of Data
Insulated from Source System Modifications and Additions

Data Warehouse Repository: Star Schema Concepts
Fact Table
This table is the core of the Star Schema structure and contains the Facts or Measures available through the Data Warehouse. These Facts answer the questions of “What”, “How Much”, or “How Many”.
Some examples: Sales Dollars, Units Sold, Gross Profit, Expense Amount, Net Income, Unit Cost, Number of Employees, Turnover, Salary, Tenure, etc.

Data Warehouse Repository: Star Schema Concepts
Dimension Tables
These tables describe the Facts or Measures. They contain the Attributes and may also be Hierarchical. These Dimensions answer the questions of “Who”, “What”, “When”, or “Where”.
Some examples:
Day, Week, Month, Quarter, Year
Sales Person, Sales Manager, VP of Sales
Product, Product Category, Product Line
Cost Center, Unit, Segment, Business, Company
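
As an illustration of a hierarchical dimension, the sketch below derives the Day, Week, Month, Quarter, Year levels for one row of a time dimension from a calendar date. The column names, the YYYYMMDD integer key, and the use of ISO week numbers are assumptions chosen for the example, not something the slides specify.

```python
from datetime import date


def time_dimension_row(d: date) -> dict:
    """Derive the hierarchy levels of a time dimension row from a calendar date."""
    return {
        "TimeKey": int(d.strftime("%Y%m%d")),      # assumed key convention
        "TheDate": d.isoformat(),
        "Week": d.isocalendar()[1],                # ISO week number (a simplification)
        "Month": d.month,
        "Quarter": f"Q{(d.month - 1) // 3 + 1}",   # calendar quarters assumed
        "Year": d.year,
    }


row = time_dimension_row(date(2004, 3, 8))
print(row)
```

Because each level is stored as an attribute on the row, a query can group by Quarter or Year without recomputing anything from the date.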

Data Warehouse Repository: Star Schema Concepts
Sales_Fact: TimeKey, EmployeeKey, ProductKey, CustomerKey, ShipperKey, Required Data (Business Metrics) or (Measures)
Time_Dim: TimeKey, TheDate, ...
Employee_Dim: EmployeeKey, EmployeeID, ...
Product_Dim: ProductKey, ProductID, ...
Customer_Dim: CustomerKey, CustomerID, ...
Shipper_Dim: ShipperKey, ShipperID, ...
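
A reduced version of this star schema can be tried out with an in-memory SQLite database. The sketch keeps only two of the five dimensions, and the Quarter, ProductName, SalesDollars, and UnitsSold columns as well as all sample rows are assumptions added so the query has something to return.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE Time_Dim (
        TimeKey INTEGER PRIMARY KEY,
        TheDate TEXT,
        Quarter TEXT              -- assumed attribute
    );
    CREATE TABLE Product_Dim (
        ProductKey INTEGER PRIMARY KEY,
        ProductID TEXT,
        ProductName TEXT          -- assumed attribute
    );
    CREATE TABLE Sales_Fact (
        TimeKey INTEGER REFERENCES Time_Dim (TimeKey),
        ProductKey INTEGER REFERENCES Product_Dim (ProductKey),
        SalesDollars REAL,        -- assumed measure
        UnitsSold INTEGER         -- assumed measure
    );
    INSERT INTO Time_Dim VALUES (1, '2004-01-15', 'Q1'), (2, '2004-04-20', 'Q2');
    INSERT INTO Product_Dim VALUES (10, 'P-APL', 'Apples'), (11, 'P-CHR', 'Cherries');
    INSERT INTO Sales_Fact VALUES (1, 10, 100.0, 40), (1, 11, 250.0, 50), (2, 10, 75.0, 30);
    """
)

# The fact table supplies the measures; the dimensions supply the labels to group by.
rows = conn.execute(
    """
    SELECT t.Quarter, p.ProductName, SUM(f.SalesDollars), SUM(f.UnitsSold)
    FROM Sales_Fact AS f
    JOIN Time_Dim AS t ON t.TimeKey = f.TimeKey
    JOIN Product_Dim AS p ON p.ProductKey = f.ProductKey
    GROUP BY t.Quarter, p.ProductName
    ORDER BY t.Quarter, p.ProductName
    """
).fetchall()

for row in rows:
    print(row)
```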

Data Warehouse Repository: Cube Concepts
Product Dimension: Apples, Cherries, Grapes, Melons
Time Dimension: Q1, Q2, Q3, Q4
Markets Dimension: Atlanta, Chicago, Denver, Dallas

Data Warehouse Repository: Cube Concepts
Product Dimension: Apples, Cherries, Grapes, Melons
Time Dimension: Q1, Q2, Q3, Q4
Markets Dimension: Atlanta, Chicago, Denver, Dallas
Cube contents: Sales Fact
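
A cube like this can be thought of as a mapping from one member of each dimension to a fact value. The sketch below uses a plain dictionary keyed by (product, market, quarter); the sales figures and the helper names are invented purely for illustration.

```python
from collections import defaultdict

DIMENSIONS = ("product", "market", "quarter")

# Each cell holds the Sales fact for one product, market, and quarter (made-up values).
sales_cube = {
    ("Apples", "Atlanta", "Q1"): 120.0,
    ("Apples", "Atlanta", "Q2"): 80.0,
    ("Apples", "Dallas", "Q1"): 60.0,
    ("Cherries", "Atlanta", "Q1"): 200.0,
    ("Melons", "Denver", "Q3"): 45.0,
}


def slice_cube(cube: dict, **fixed) -> dict:
    """Keep only the cells whose members match the fixed dimension values."""
    positions = {DIMENSIONS.index(name): value for name, value in fixed.items()}
    return {
        cell: amount
        for cell, amount in cube.items()
        if all(cell[index] == value for index, value in positions.items())
    }


def roll_up(cube: dict, keep: list) -> dict:
    """Sum the fact over every dimension that is not listed in keep."""
    indexes = [DIMENSIONS.index(name) for name in keep]
    totals = defaultdict(float)
    for cell, amount in cube.items():
        totals[tuple(cell[index] for index in indexes)] += amount
    return dict(totals)


atlanta_cells = slice_cube(sales_cube, market="Atlanta")
sales_by_product = roll_up(sales_cube, ["product"])
print(atlanta_cells)
print(sales_by_product)
```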

Data Warehouse Repository: Storage Concepts
Relational On-Line Analytical Processing (ROLAP): The information stored in the Data Warehouse is held in a relational structure. Aggregations are performed on the fly, either by the database or in the analysis tool.
Multidimensional On-Line Analytical Processing (MOLAP): The information is aggregated in a predefined manner based on the characteristics of the Measures and the defined hierarchy of the Dimensions. Since the data is pre-aggregated, navigating through the hierarchies is nearly instantaneous. The user is simply navigating to a point within the Multidimensional Cube and not performing any on-the-fly aggregations.
Hybrid On-Line Analytical Processing (HOLAP): This is a combination of MOLAP and ROLAP. A portion of the data is predefined and aggregated, typically the set of information that is accessed most frequently. Additional detail can be held in a ROLAP structure, allowing a user to drill through the MOLAP structure into the ROLAP structure.
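
The difference between aggregating on the fly and navigating to a pre-aggregated point can be sketched in a few lines. This is a toy model, not how any OLAP engine is implemented: the detail rows are invented, "*" is an assumed marker meaning "all members", and the two functions only mimic the ROLAP and MOLAP access patterns.

```python
from collections import defaultdict
from itertools import product

ALL = "*"  # assumed marker for "all members of this dimension"

# Hypothetical detail rows: (product, market, sales amount).
DETAIL_ROWS = [
    ("Apples", "Atlanta", 120.0),
    ("Apples", "Dallas", 60.0),
    ("Cherries", "Atlanta", 200.0),
    ("Apples", "Atlanta", 80.0),
]


def rolap_total(rows, product_name=ALL, market=ALL) -> float:
    """ROLAP style: scan the detail rows and aggregate at query time."""
    return sum(
        amount
        for p, m, amount in rows
        if product_name in (ALL, p) and market in (ALL, m)
    )


def build_molap(rows) -> dict:
    """MOLAP style: pre-aggregate every combination of member and ALL up front."""
    aggregates = defaultdict(float)
    for p, m, amount in rows:
        for key in product((p, ALL), (m, ALL)):
            aggregates[key] += amount
    return dict(aggregates)


molap_cube = build_molap(DETAIL_ROWS)

# Same answer either way; MOLAP answers with a single lookup.
print(rolap_total(DETAIL_ROWS, product_name="Apples"))
print(molap_cube[("Apples", ALL)])
print(len(DETAIL_ROWS), "detail rows versus", len(molap_cube), "stored aggregates")
```

The last line hints at the trade-off: the pre-aggregated structure answers queries with a lookup but stores more cells than there are detail rows.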

Data Warehouse Repository: Cube Concepts
Client perspective    MOLAP    HOLAP    ROLAP
Query performance     Fastest  Faster   Fast
Storage consumption   High     Medium   Low

Where does Microsoft fit in?
Diagram columns: Source Systems; Data Warehouse Components; Access Methods.
Source Systems: Division A; Division B; Division C; External Data.
Data Warehouse Components: Extraction, Transformation, Load (ETL); Enterprise Data Warehouse; DW / DM; DM; DW / DM; Data Access & Query Management Services.
Levels: Corporate Level; Business Group Level; Divisional Level.
Scale labels: Increased Level of Standardization; Increased Local Specifications.
Other labels: Planning & Forecasting; Performance Management; Scorecards & Dashboards; Analytics & Modeling; Query & Reporting; Management Systems.
Access Methods: Portal / Web Interface; Desktop Applications; Printed Reports; Mobile Devices.
Microsoft products: SQL Server DTS; SQL Server Relational Database and Analysis Services; SQL Stored Procedures, SQL Views, MDX, and .NET Web Services; Microsoft Office, Reporting Services and .NET Framework; SharePoint Portal, Exchange, and .NET Framework.

Q & A