An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.

Slides:



Advertisements
Similar presentations
Author: Graeme C. Simsion and Graham C. Witt Chapter 11 Logical Database Design.
Advertisements

Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Dimensional Modeling.
Database Modeling Past and Present
Chapter 10: Designing Databases
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Chapter 13 The Data Warehouse
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
C6 Databases.
Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
April 30, Data Warehousing and OLAP Technology: An Overview  What is a data warehouse?  Data warehouse architecture  From data warehousing to.
Management Information Systems, Sixth Edition
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Data Warehousing M R BRAHMAM.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
DATA WAREHOUSE (Muscat, Oman).
CS346: Advanced Databases
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
An overview of Data Warehousing and OLAP Technology
Data Warehousing.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
OnLine Analytical Processing (OLAP)
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Data Mining Data Warehouses.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Advanced Database Concepts
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
An Overview of Data Warehousing and OLAP Technology
Data Warehousing COMP3017 Advanced Databases Dr Nicholas Gibbins –
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Presented By: Pedel Oppong-Abebrese,Pedel Oppong-Abebrese Michael Boadi, William Osei, Nana Amoa OforiMichael BoadiWilliam OseiNana Amoa Ofori DATA WAREHOUSING.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
11/20/ :11 AMData Mining 1 Data Mining – CSE 9033 Chapter – 1; Data Warehousing Dr. Goutam Sarker, B.E., M.E., Ph.D.(Engineering), Fellow: IE(I),
Overview of Data Warehousing (DW) and OLAP
Business Intelligence Overview
Advanced Applied IT for Business 2
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Data Warehouse and OLAP
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing: Data Models and OLAP operations
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Data Warehousing Concepts
Data Warehouse and OLAP
Presentation transcript:

An overview of Data Warehousing and OLAP Technology Presented By Manish Desai

2 Introduction What is data warehouse ? Explanation of definition Data warehouse Vs. Operational Database Data warehouse architecture Back end tools Conceptual model Database design Warehouse servers Index structures Meta data Conclusion References

3 Introduction Essential elements of decision support Enables The Knowledge Worker to make better and faster decisions Used in many industries like: –Manufacturing (for order shipment) –Retail (for inventory management) –Financial Services (claims and risk analysis) Every major database vendor offers product in this area

4 What is Data Warehouse ? A data warehouse is a subject-oriented, integrated, time-varying, non-volatile collection of data that is used primarily in organizational decision making Typically maintained separately from operational databases

5 Explanation of definition Subject-Oriented: –Designed around subject such as customer, vendor, product and activity –Does not includes data that are not needed for Decision support system (DSS) Integrated: –Most important feature –Consistent naming convention, measurement of variables and so forth –The data should be stored in single globally acceptable fashion

6 Explanation (continues…) Time Varying: –All data in the warehouse should be accurate as of some moment in time –Data stored over a long time horizon (5 –10 years) –Key structure contains element of time (implicitly or explicitly) –Data once correctly recorded cant be updated Non Volatile: –No Update of data allowed – only loading and access of data operations

7 Data Warehouse Vs. Operational Database Data WarehouseOperational Database userKnowledge workerClerk, IT professional FunctionDecision supportDay to day operations DataHistorical,summarized, multidimensional, integrated Current, up-to-date, detailed Unit of workComplex queryShort, simple transaction metricQuery throughout, response Transaction throughput

8 Architecture Data sourcing,migration,cleanup tools Meta data repository Data marts Data query, reporting, analysis and mining tools Data warehouse administration and management

9 Architecture (continues…) Distributed Data warehouse –Load balancing, scalability,higher availability –Meta data replicated and centrally administrated –Too expansive Data marts –Departmental subset focused on selected subjects –example: marketing department includes customer, sales and product tabels –Has own repository and administration –May lead to complex integration problems if not designed properly

10 Back end tools and Utilities Data cleaning, loading, refreshing tools Cleaning –Multiple source, possibility of errors –Example: replace string sex by gender Loading –Building indices, sorting and making access paths –Large amount of data Incremental loading Only updated tuples are inserted,Process hard to manage Refresh –Propagating updates –When to refresh ? –Set by administrator depending on user needs and traffic

11 Conceptual Model and front end tools Multi dimensional view –Dimensions together uniquely determine the measure –Example: Sales can be represented as city,product, data –Each dimension is described by set of attribute –Example: product consist of Category of product Industry of product Year of introduction Front end tools –Multi dimensional spreadsheet Supports Pivoting-reorientation Roll_up - summarized data Drill_down - go from high level to low level summary

12 Database design Two ways to represent Multi dimensional model –Star schema Database consist of single fact table and single table for each dimension Each tuples in fact table consist of pointer to each of dimension –Snowflake schema Refinement over star schema Dimensional hierarchy is explicitly represented by normalizing dimension tables

13 Warehouse Servers Specialized SQL servers –Provides advanced query language and query processing support for SQL queries over star and snowflake schemas –Example: Redbrick ROLAP –Between relational back end and client front end tools –Extend traditional relational servers to support multidimensional queries –Example: Microstratergy MOLAP –Multidimensional storage engine –Direct mapping –Example: Essbase from Arbor Inc.

14 Index structures Bit map indices –Use single bit to indicate specific value of attribute –Example: instead of storing eight characters to record engineer as skill of employee use single bit id# Name Skill 1000 John 1 Join indices –Maintains the relationship between foreign key with its matching primary keys

15 Meta data and warehouse management Its data about data Used for building, maintain, managing and using data warehouse Administrative meta data –Information about setting up and using warehouse Business meta data –Business terms and definition Operational meta data –Information collected during operation of warehouse

16 Conclusion Data warehouse is the technology for the future. data warehouse enables knowledge worker to make faster and better decisions

17 References Inmon W. H.,Building the data warehouse Kimball, R. The data warehouse toolkit.