Chapter 13 The Data Warehouse

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

Intro to Data Mining: Extracting Information and Knowledge from Data.
Data Warehousing M R BRAHMAM.
Chapter 13 The Data Warehouse.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Introduction to Data Warehouse and Data Mining MIS 2502 Data Analytics
Chapter 13 Business Intelligence and Data Warehouses
Database Systems: Design, Implementation, and Management Tenth Edition
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Chapter 12 The Data Warehouse
13 Chapter 13 The Data Warehouse Hachim Haddouti.
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
DATA WAREHOUSE (Muscat, Oman).
Designing a Data Warehouse
Components of the Data Warehouse Michael A. Fudge, Jr.
Chapter 13 – Data Warehousing. Databases  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information,
ITEC 3220A Using and Designing Database Systems
Data Warehousing/Mining 1 Data Warehousing/Mining Comp 150 Additional Information Instructor: Dan Hebert.
Chapter 13 The Data Warehouse
12 The Data Warehouse and Data Mining MIS 304 Winter 2006.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Data Warehouse & Data Mining
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Datawarehouse Objectives
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
1 Data Warehouses BUAD/American University Data Warehouses.
13 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management 4th Edition Peter Rob & Carlos Coronel.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 13 Business Intelligence and Data Warehouses.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Ayyat IT Group Murad Faridi Roll NO#2492 Muhammad Waqas Roll NO#2803 Salman Raza Roll NO#2473 Junaid Pervaiz Roll NO#2468 Instructor :- “ Madam Sana Saeed”
Fox MIS Spring 2011 Data Warehouse Week 8 Introduction of Data Warehouse Multidimensional Analysis: OLAP.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 13 Business Intelligence and Data Warehouses.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Managing Data for DSS II. Managing Data for DS Data Warehouse Common characteristics : –Database designed to meet analytical tasks comprising of data.
What is OLAP?.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Advanced Database Concepts
1 Database Systems, 8 th Edition 1 Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: –How business intelligence.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
12 1 Database Systems: Design, Implementation, & Management, 6 th Edition, Rob & Coronel 12.4 Online Analytical Processing OLAP creates an advanced data.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
ITEC 3220M Using and Designing Database Systems Instructor: Prof. Z.Yang Course Website: c3220m.htm Office: TEL.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Managing Data Resources File Organization and databases for business information systems.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Data warehouse.
Chapter 13 Business Intelligence and Data Warehouses
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Chapter 13 – Data Warehousing
MANAGING DATA RESOURCES
Data Warehouse and OLAP
Introduction of Week 9 Return assignment 5-2
Chapter 13 The Data Warehouse
Chapter 13 The Data Warehouse
Data Warehouse and OLAP
Presentation transcript:

Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel

In this chapter, you will learn: How operational data and decision support differ What a data warehouse is and how its data are prepared What star schemas are and how they are constructed What steps are required to implement a data warehouse successfully What data mining is and what role it plays in decision support Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

The Need for Data Analysis External and internal forces require tactical and strategic decisions Search for competitive advantage Business environments are dynamic Decision-making cycle time is reduced Different managers require different decision support systems (DSS) Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Decision Support Systems Is a methodology Extracts information from data Uses information as basis for decision making Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Decision Support Systems Decision support system (DSS) Arrangement of computerized tools Used to assist managerial decision Extensive data “massaging” to produce information Used at all levels in organization Tailored to focus on specific areas and needs Interactive Provides ad hoc query tools Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

DSS Components Figure 13.1 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Operational vs. Decision Support Data Operational data Relational, normalized database Optimized to support transactions Real time updates DSS Snapshot of operational data Summarized Large amounts of data Data analyst viewpoint Timespan Granularity Dimensionality Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

The DSS Database Requirements Database schema Support complex (non-normalized) data Extract multidimensional time slices Data extraction and filtering End-user analytical interface Database size Very large databases (VLDBs) Contains redundant and duplicated data Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Data Warehouse Integrated Subject-Oriented Time Variant Non-Volatile Centralized Holds data retrieved from entire organization Subject-Oriented Optimized to give answers to diverse questions Used by all functional areas Time Variant Flow of data through time Projected data Non-Volatile Data never removed Always growing Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Creating a Data Warehouse Figure 13.3 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Data Marts Single-subject data warehouse subset Decision support to small group Can be test for exploring potential benefits of Data warehouses Address local or departmental problems Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

DSS Architectural Styles Traditional mainframe-based OLTP Managerial information system (MIS) with 3GL First-generation departmental DSS First-generation enterprise data warehouse using RDMS Second-generation data warehouse using MDBMS Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Twelve Data Warehouse Rules 1. Separated from operational environment 2. Data are integrated 3. Contains historical data over long time horizon 4. Snapshot data captured at given time 5. Subject-oriented data 6. Mainly read-only data with periodic batch updates from operational source, no online updates 7. Development life cycle differs from classical one, data driven not process driven Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Twelve Data Warehouse Rules (Con’t.) 8. Contains different levels of data detail Current and old detail Lightly and highly summarized 9. Characterized by read-only transactions to large data sets 10. Environment has system to trace data resources, transformation, and storage 11. Metadata critical components Identify and define data elements Provide the source, transformation, integration, storage, usage, relationships, and history of data elements 12. Contains charge-back mechanism for usage Enforces optimal use of data Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Online Analytical Processing (OLAP) Advanced data analysis environment Supports decision making, business modeling, and operations research activities Characteristics of OLAP Use multidimensional data analysis techniques Provide advanced database support Provide easy-to-use end-user interfaces Support client/server architecture Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

OLAP Client/Server Architecture Figure 13.6 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

OLAP Server Arrangement Figure 13.7 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

OLAP Server with Multidimensional Data Store Arrangement Figure 13.8 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

OLAP Server with Local Mini-Data-Marts Figure 13.9 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Relational OLAP (ROLAP) OLAP functionality Uses relational DB query tools Extensions to RDBMS Multidimensional data schema support Data access language and query performance optimized for multidimensional data Support for very large databases (VLDBs) Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Typical ROLAP Client/Server Architecture Figure 13.10 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Multidimensional OLAP (MOLAP) OLAP functionality to multidimensional databases (MDBMS) Stored data in multidimensional data cube N-dimensional cubes called hypercubes Cube cache memory speeds processing Affected by how the database system handles density of data cube called sparsity Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

MOLAP Client/Server Architecture Figure 13.11 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Star Schema Data-modeling technique Maps multidimensional decision support into relational database Yield model for multidimensional data analysis while preserving relational structure of operational DB Four Components: Facts Dimensions Attributes Attribute hierarchies Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Simple Star Schema Figure 13.12 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Slice and Dice View of Sales Figure 13.14 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Star Schema Representation Facts and dimensions represented by physical tables in data warehouse DB Fact table related to each dimension table (M:1) Fact and dimension tables related by foreign keys Subject to the primary/foreign key constraints Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Star Schema for Sales Figure 13.17 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Performance-Improving Techniques for Star Schema Normalization of dimensional tables Multiple fact tables representing different aggregation levels Denormalization of the fact tables Table partitioning and replication Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Data Warehouse Implementation Road Map Figure 13.21 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Data Mining Seeks to discover unknown data characteristics Automatically searches data for anomalies and relationships Data mining tools Analyze data Uncover problems or opportunities Form computer models based on findings Predict business behavior with models Require minimal end-user intervention Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Extraction of Knowledge from Data Figure 13.22 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel

Data Mining Process Figure 13.23 Database Systems: Design, Implementation, & Management, 5th Edition, Rob & Coronel