Data Warehousing M R BRAHMAM.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Supervisor : Prof . Abbdolahzadeh
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
Data Warehousing – An Introductory Perspective
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Data Warehouse IMS5024 – presented by Eder Tsang.
Exploiting the DW data DW is a platform for creating a wide array of reports It solves data feed problems, but does not lead to specific decision support.
Data Warehouse Models and OLAP Operations
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) The Data Warehouse Lifecycle Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Chapter 2: Data Warehousing
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
CS346: Advanced Databases
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Datawarehousing Concepts | 7.0 9/7/2015 Datawarehousing Concepts.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Data Warehouse & Data Mining
Bab 3 Data Warehousing. Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai, Delhi, Chennai and Banglore. The Sales Manager.
Introduction to the Orion Star Data
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
AN OVERVIEW OF DATA WAREHOUSING
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
1 On-Line Analytic Processing Warehousing Data Cubes.
Data Warehousing Multidimensional Analysis
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Data Warehousing.
Advanced Database Concepts
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
Introduction to Data Warehousing. Why Data Warehouse? Necessity is the mother of invention.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
An Overview of Data Warehousing and OLAP Technology
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
Introduction to Data Warehousing. Subject: Data Warehousing.
Business Intelligence Overview
Supervisor : Prof . Abbdolahzadeh
Intro to MIS – MGS351 Databases and Data Warehouses
Advanced Applied IT for Business 2
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
On-Line Analytic Processing
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Data Warehouse and OLAP
Data Warehousing and OLAP
An Introduction to Data Warehousing
Data Warehousing: Data Models and OLAP operations
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Data Warehousing Concepts
Data Warehouse and OLAP
Data Warehousing.
Presentation transcript:

Data Warehousing M R BRAHMAM

Data Warehousing - Architecture Data and Metadata Repository Layer Presentation Layer Source Systems ETL Layer Execution Systems CRM ERP Legacy e-Commerce Extract, Transformation, and Load (ETL) Layer Cleanse Data Filter Records Standardize Values Decode Values Apply Business Rules Householding Dedupe Records Merge Records Enterprise Data Warehouse ODS Reporting Tools OLAP Tools Ad Hoc Query Tools Data Mining Tools Data Mart External Data Purchased Market Data Spreadsheets Metadata Repository Data Mart Data Mart Custom Tools HTML Reports Cognos Business Objects MicroStrategy Oracle Discoverer Brio Data Mining Tools Portals Sample Technologies: PeopleSoft SAP Siebel Oracle Applications Manugistics Custom Systems ETL Tools: Informatica PowerMart ETI Oracle Warehouse Builder Custom programs SQL scripts Oracle SQL Server Teradata DB2

OLTP vs DW OLTP DW Data dependencies (E-R) model Dimensional model Microscopic data consistency Global data consistency Millions of transactions per day One transaction per day Mostly does not keep history Keeping history is necessary Gets loaded in the day Gets loaded in the night

Dimensional Data Modeling E-R model Symmetric Divides data into many entities Describes entities and relationships Seeks to eliminate data redundancy Good for high transaction performance Dimensional model Asymmetric Divides data into dimensions and facts Describes dimensions and measures Encourages data redundancy Good for high query performance

Facts/Dimensions Fact Central, dominant table Multi-part primary key Holds millions & billions of records Links directly to dimensions Stores business measures Constantly varying data

Facts/Dimensions (contd.) Single join to the fact table (single primary key) Stores business attributes Attributes are textual in nature Organized into hierarchies More or less constant data E.g. Time, Product, Customer, Store, etc.

Star/Snowflake schema Star schema Fact surrounded by 4-15 dimensions Dimensions are de-normalized Snowflake schema Star schema with secondary dimensions Don’t snowflake for saving space Snowflake if secondary dimensions have many attributes

Star schema

Star schema example

Snowflake schema example Store Dimension District_ID Region_ID STORE KEY District Desc. Region_ID Region Desc. Regional Mgr. Store Description City State District ID District Desc. Region_ID Region Desc. Regional Mgr. Store Fact Table STORE KEY PRODUCT KEY PERIOD KEY Dollars Units Price

DM , DW & ODS DM Organized around a single business process Represents small part of the organization’s business Logical subset of the complete data warehouse Faster roll out, but complex integration in the long run

DM , DW & ODS (contd.) DW ODS Union of its constituent data marts Queryable source of data in the organization Requires extensive business modeling (may take years to design and build) ODS Point of integration for operational systems Low-level decision support Can store integrated data, but at detailed level

OLAP Element of decision support systems (DSS) Support (almost) ad-hoc querying for business analyst Helps the knowledge worker (executive, manager, analyst) make faster & better decisions ROLAP - extended RDBMS that maps operations on multidimensional data to standard relational operators MOLAP - Special-purpose server that directly implements multidimensional data and operations

Others Additive, semi-additive & non-additive facts Factless facts Slowly changing dimensions Conformed facts and dimensions Cubes Drill down / Drill up Slice and dice