1 9 Adv. DBMS Data Warehouse CSC5301 Review Hachim Haddouti.

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

April 30, Data Warehousing and OLAP Technology: An Overview  What is a data warehouse?  Data warehouse architecture  From data warehousing to.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Data Warehousing M R BRAHMAM.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
Dr. M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2010 COMP207: Data Mining Data Warehousing COMP207: Data Mining.
1 9 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 9 Database Management Approaches.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Chapter 2: Data Warehousing
The University of Akron Dept of Business Technology Computer Information Systems Database Management Approaches 2440: 180 Database Concepts Instructor:
1 9 Data Warehouse CSC5301 Hachim Haddouti. 2 9 About Me u Hachim Haddouti, born in 1969, married, one baby 9 weeks u Ph.D. in Computer Science (Database.
Chapter 13 The Data Warehouse
DATA WAREHOUSE (Muscat, Oman).
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
CS346: Advanced Databases
Designing a Data Warehouse
Components of the Data Warehouse Michael A. Fudge, Jr.
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Data Warehouse & Data Mining
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
AN OVERVIEW OF DATA WAREHOUSING
Datawarehouse Objectives
Data Warehousing Xintao Wu. Can You Easily Answer These Questions? What are Personnel Services costs across all departments for all funding sources? What.
1 Data Warehouses BUAD/American University Data Warehouses.
13 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management 4th Edition Peter Rob & Carlos Coronel.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
UNIT-II Principles of dimensional modeling
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
Data Mining Data Warehouses.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
What is OLAP?.
Advanced Database Concepts
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Data Warehousing COMP3017 Advanced Databases Dr Nicholas Gibbins –
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Data Warehouse and OLAP
Data Warehousing: Data Models and OLAP operations
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Data Warehouse and OLAP
Presentation transcript:

1 9 Adv. DBMS Data Warehouse CSC5301 Review Hachim Haddouti

2 9 Do You Remember? OLTP DSS MD drill down RollUp Slice/dice MOLAP ROLAP Star schema Data mining Data cube Data extraction Fact table

3 9 Data Warehouses  “Subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management’s decision-making process” Inmon ( AP = analytical processing is missing) u Used for analysis of existing data u Resolves performance issues suffered by operational RDBMSs and OLTPs

4 9 Sizing DW? Mining of mobile phone calls: (Caller, Callee, Time, Duration, Geogr. Location) ~ 100 B/tuple In Germany 10 7 users * 10 calls/(day*user) * 100 B/call = = B/day ~ 3*10 12 B/year = 3 TB/year Scanning data at 10 7 B/s takes 3*10 12 /10 7 = 3*10 5 s > 3 days

5 9 Data Warehouse Architecture

6 9 Data model ER Model u a disaster for querying a huge amount of data (time) u not understandable for users and they can not be navigated usefully by DBMS software. u hard to visualize; many possible connections between tables, u To avoid redundancy MD Model u better performance u Better data organisation u Better visualization u Business queries (why, what if)

7 9 Typical DWH Analyses/Queries u What are the consequences of new orders for production capacity w.r. to investment, personnel, maintenance, extra hours,... u Seasonal adaptions, e.g. when to produce how many skis, bikinis, convertibles,... u Influence of external financing on profits

8 9 Operations: aggregation slice dice (cube) rollup to coarser level drill down to more detailed level grouping sorting

9 9 Data Cube Representation

10 9 Steps to build a DWH u Acquisition of data u Data cleansing u Storage u Processing: AP u Maintenance,... Not possible with classical DB-technology alone

11 9 On-Line Analytical Processing u OLTP (online transaction processing) for operational data of enterprise, e.g. in relational DBMS, IMS, SAP/R3,... u DSS: Decision Support System to store data/information for strategic management decisions: aggregations, summaries, etc. u Optimized to work with data warehouses u Used to answer questions u Allows users to perceive data as a multidimensional data cube u Data mining

12 9 OLTP versus OLAP Thematic focus u OLTP: many small transactions (microscopic view of business processes, individual steps at lowest level, single order, delivery) u OLAP: finances in general, personnel in general,... u OLAP requires integration and unification of many detailed data into big picture u Time orientation u Durability: data extracted once, no updates

13 9 Technical Comparison OLTP vs OLAP u OLTP: high rate of updates, several thousand t/s u OLAP: read only transactions, very complex, DWH is loaded at certain time intervals, e.g. after the end of the month, quarter l Compute intensive l Special systems with new access methods, e.g. multidimensional data organization and access methods l Special OLAP systems necessary to offload OLTP systems

14 9 ROLAP and MOLAP Solution 1: ROLAP relational online analytical processing, built on top of relational DBS, additional middleware or client front end (star schema) Solution 2: MOLAP: multidimensional online analytical processing u new model u new data organizations u new algorithms u new query languages u new optimization techniques

15 9 DW Review degenerate dimension big dimensions hierarchies snow falcking Slowly changing dimensions dirty dimensions Hetegrogeneous prodcuts (core and custom) Factless Fact table