An Analysis of the Publication "An Overview of Data Warehousing and OLAP Technology” by Surajit Chaudhuri, Umeshwar Dayal Michael Goshey University of.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Supervisor : Prof . Abbdolahzadeh
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
Technical BI Project Lifecycle
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Data Warehousing M R BRAHMAM.
Jennifer Widom On-Line Analytical Processing (OLAP) Introduction.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
IST722 Data Warehousing Technical Architecture Michael A. Fudge, Jr. * Figures taken from Kimball Ch. 4.
Chapter 13 The Data Warehouse
DATA WAREHOUSE (Muscat, Oman).
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Components of the Data Warehouse Michael A. Fudge, Jr.
1 Data Warehousing – CG124 Dr. Akhtar Ali School of Computing, Engineering and Information Sciences Computing.unn.ac.uk/staff/CGMA2/CG124.
Jeremy Brinkman Director of Administrative Systems University of Northwestern Ohio Great Lakes Users’ Group Conference August 10-11,
ETL Design and Development Michael A. Fudge, Jr.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Understanding Data Warehousing
A Paradigm Shift in Database Optimization: From Indices to Aggregates Presented to: The Data Warehousing & Data Mining mini-track – AMCIS 2002 as Research-in-Progress.
Introduction to the Orion Star Data
Vidas Matelis, Toronto SQL Server User Group November 13, 2008.
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
1 Cube Computation and Indexes for Data Warehouses CPS Notes 7.
OnLine Analytical Processing (OLAP)
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
Business Intelligence Zamaneh Jahed. What is Business Intelligence? Business Intelligence (BI) is a broad category of applications and technologies for.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-4 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Data Mining: A KDD Process Data mining: the core of knowledge discovery process. Data Cleaning Data Integration Databases Data Warehouse Task-relevant.
Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.
Data Warehousing.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
IST722 Data Warehousing Master Data Management and Data Governance Michael A. Fudge, Jr.
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
Indexes and Views Unit 7.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Chapter 11: Data Warehousing Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
Data Warehousing.
Advanced Database Concepts
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
An Overview of Data Warehousing and OLAP Technology
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Or How I Learned to Love the Cube…. Alexander P. Nykolaiszyn BLOG:
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Overview of Data Warehousing (DW) and OLAP
Supervisor : Prof . Abbdolahzadeh
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Summarized from various resources Modern Database Management
Star Schema.
Dimensional Model January 14, 2003
On-Line Analytical Processing (OLAP)
Data Warehouse and OLAP
Introduction of Week 9 Return assignment 5-2
Technical Architecture
Data Warehouse and OLAP
Presentation transcript:

An Analysis of the Publication "An Overview of Data Warehousing and OLAP Technology” by Surajit Chaudhuri, Umeshwar Dayal Michael Goshey University of Minnesota, Fall 2006 CSci 8701: Overview of Database Research

Michael Goshey: 9/19/20062 Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/20063 Introduction Selected paper  S. Chaudhuri and U. Dayal, An Overview of Data Warehousing and OLAP Technology, SIGMOD Record 26(1): 65-74(1997). Motivation Personal Interest

Michael Goshey: 9/19/20064 Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/20065 Problem Addressed Problem Statement  Survey: organizing the data warehousing space  Differing requirements between OLTP and OLAP Significance  Growth area  Reference work establishing consensus on terms, architectures and issues

Michael Goshey: 9/19/20066 Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/20067 Major Contributions Bridging the gulf between industry and academia OLTP vs. OLAP: clarifying the differences Concise survey of relevant issues, architectures and tools Concrete list of data warehouse design and build steps

Michael Goshey: 9/19/20068 Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/20069 Key Concepts Data warehouses and data marts OLTP, OLAP, ROLAP vs. MOLAP) Relational and dimensional data models Bitmap Index ETL Metadata Managed query vs. ad hoc environments Materialized views SQL extensions (cube, rollup, rank, percentile, etc.)

Michael Goshey: 9/19/ Data Warehouse, Data Mart

Michael Goshey: 9/19/ Relational or Dimensional?

Michael Goshey: 9/19/ Relational or Dimensional? (image from

Michael Goshey: 9/19/ Bitmap Indices customerage 0-10age 11-20age 21-30age Mary1000 John0100 Steve0010 Tom0001 Lisa0010 cardinality: unique values/total rows B-Tree vs. bitmap: 1% rule, uniqueness Boolean algebra directly on indices

Michael Goshey: 9/19/ Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/ Validation Methodology Survey paper goals Academic and industry citations Referencing tools, vendors Case studies

Michael Goshey: 9/19/ Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/ Assumptions Read-only environments Shortcomings  (occasional) transactional commitments  the data revision problem

Michael Goshey: 9/19/ Outline 1. Introduction 2. Problem Addressed 3. Major Contributions 4. Key Concepts 5. Validation Methodology 6. Assumptions Rewrite

Michael Goshey: 9/19/ Rewrite Changes in terminology, tools, vendors  Fact constellations -> conformed dimensions  Decision support -> BI  Vendors and tools in BI, ETL, OLAP Multiple user constituencies Data history difficulties petabyte databases -> very large warehouses common data expiry challenges slowly changing dimensions

Michael Goshey: 9/19/ Slowly Changing Dimensions CustomerIDNameStatus 001Mary JohnsonGold CustomerIDNameStatus 001Mary JohnsonPlatinum CustomerIDNameStatus 001Mary JohnsonGold 001Mary JohnsonPlatinum CustomerIDNameOriginal StatusCurrent StatusEffective Date 001Mary JohnsonGoldPlatinum10/1/2006 Before After: Type 1 After: Type 2 After: Type 3 CustomerIDNameStatus 001Mary JohnsonPlatinum

Michael Goshey: 9/19/ Questions?