Chap8: Trends in DBMS 8.1 Database support for Field Entities 8.2 Content-based retrieval 8.3 Introduction to spatial data warehouses 8.4 Summary.

Slides:



Advertisements
Similar presentations
Relational Algebra, Join and QBE Yong Choi School of Business CSUB, Bakersfield.
Advertisements

5.1Database System Concepts - 6 th Edition Chapter 5: Advanced SQL Advanced Aggregation Features OLAP.
The Relational Database Model
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Chapter 11 Group Functions
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Group 3 Akash Agrawal and Atanu Roy 1 Raster Database.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA
OLAP. Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming, analytic queries.
Chap8: Trends in DBMS 8.1 Database support for Field Entities
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Organizing Data & Information
Data Cube and OLAP Server
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Introduction to Databases Transparencies
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
The Relational Database Model. 2 Objectives How relational database model takes a logical view of data Understand how the relational model’s basic components.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS CHAPTER 3
3 1 Chapter 3 The Relational Database Model Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Advanced Databases 5841 DATA CUBE. Index of Content 1. The “ALL” value and ALL() function 2. The New Features added in CUBE 3. Computing the CUBE and.
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals Presenter : Parminder Jeet Kaur Discussion Lead : Kailang.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
1 CUBE: A Relational Aggregate Operator Generalizing Group By By Ata İsmet Özçelik.
Chapter 8: Trends in DBMS 8.1 Database Support for Field Entities 8.2 Content-based Retrieval 8.3 Introduction to Spatial Data Warehouses 8.4 Summary.
Multi-Dimensional Databases & Online Analytical Processing This presentation uses some materials from: “ An Introduction to Multidimensional Database Technology,
2005 SPRING CSMUIntroduction to Information Management1 Organizing Data John Sum Institute of Technology Management National Chung Hsing University.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
Partitioning – A Uniform Model for Data Mining Anne Denton, Qin Ding, William Jockheck, Qiang Ding and William Perrizo.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS CHAPTER 3
The Relational Database Model
Prof. Bayer, DWH, Ch.4, SS Chapter 4: Dimensions, Hierarchies, Operations, Modeling.
Data Warehousing.
BI Terminologies.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
3 1 Chapter 3 The Relational Database Model Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Some OLAP Issues CMPT 455/826 - Week 9, Day 2 Jan-Apr 2009 – w9d21.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Access 2007 ® Use Databases How can Microsoft Access 2007 help you structure your database?
3 1 Chapter 3 The Relational Database Model Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Presented By: Solutions Delivery Managing Reports in CRMnext.
DATA RESOURCE MANAGEMENT
1 2 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 2 The Relational Model 1: Introduction, QBE, and Relational Algebra.
Trends in DBMS. Learning Objectives After this segment, students will be able to Describe why learn about field data type Describe what field data type.
3 1 Database Systems The Relational Database Model.
IMS 4212: Database Implementation 1 Dr. Lawrence West, Management Dept., University of Central Florida Physical Database Implementation—Topics.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
Week 2 Lecture The Relational Database Model Samuel ConnSamuel Conn, Faculty Suggestions for using the Lecture Slides.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Data Analysis Decision Support Systems Data Analysis and OLAP Data Warehousing.
Data Analysis and OLAP Dr. Ms. Pratibha S. Yalagi Topic Title
The Relational Database Model
Relational Algebra - Part 1
Chapter 5: Advanced SQL Database System concepts,6th Ed.
Data Warehouse.
Relational Algebra Chapter 4, Part A
DATA CUBE Advanced Databases 584.
Relational Algebra 461 The slides for this text are organized into chapters. This lecture covers relational algebra, from Chapter 4. The relational calculus.
Data warehouse Design Using Oracle
Chapter 4: Dimensions, Hierarchies, Operations, Modeling
Relational Algebra Chapter 4, Sections 4.1 – 4.2
The Relational Database Model
DATA CUBES E0 261 Jayant Haritsa Computer Science and Automation
Evaluation of Relational Operations: Other Techniques
Slides based on those originally by : Parminder Jeet Kaur
Presentation transcript:

Chap8: Trends in DBMS 8.1 Database support for Field Entities 8.2 Content-based retrieval 8.3 Introduction to spatial data warehouses 8.4 Summary

Learning Objectives Learning Objectives (LO) LO1: Learn about field data LO2 : Learn about storage and retrieval of field data LO3: Learn about spatial data warehouses What are data warehouses? Why are they interesting? What are aggregate functions? Which ones are easy to compute? Mapping Sections to learning objectives LO1-8.1 LO , 8.2 LO3-8.3

8.3 Why are Data Warehouses Interesting? Data Warehouse facilitate group decision making Consider a dataset 1 measure (i.e. Sales) 3 dimensions (e.g. Company, Year, Region) Analysis questions Q1. Rank Regions by total sales. Q2. Rank years by total sales. Q3. Where are sales consistently growing? Cross tabulates summaries reports used to analyze the trends Example:

8.3 Generating cross-tabulation summaries Traditional Approach Use custom software pulling data out of a DBMS Limitations: redundant of work, inefficient use of resources Data Warehouse approach Cross-tab. Can be generated using a set of simple report Each report is generated from a SQL “Select... group by” statement Example: Fig (pp. 244) and Table 8.3 (pp. 245) Cross-tab example in last slide is a union of SALES-L0-A, SALES-L1-A, SALES-L1-B and SALES-L2 Table 8.3 shows SQL queries to compute each part Advantage Rest of SQL is available for pre/post processing of data Performance gains by eliminating unnecessary copying of data

Example Data Warehouse (Fig. 8.19) Fig 8.19

8.3.4 Cross-tabulation vs.report hierarchy Spreadsheet view of a report Views a report a N-dim. Spreadsheet N = number of dimension attributes Each cell contains value of “measure” Cross-tabulation view of a Report hierarchy Example: report hierarchy for SALES-L0-A, SALES-L2-A, SALES-L1-B, SALES-L2, Fig (pp. 244)

8.3 What is a Data Warehouse? Data Warehouse is a special purpose database Primarily used for specialized data analysis purposes Facilitates generation and navigation of a hierarchy of reports Special purpose data-sets and queries Data consists of a few measure attributes a set of dimension attributes The measure attribute depends on dimension attributes Queries generate reports Report measure for selected values of dimensions Aggregate measure for given subset of dimensions What is a spatial data warehouse? Data warehouses with spatial measures or dimensions Example: census data - census tract is a spatial dimension Example: logistics data - route is a spatial dimension

8.3.4 Data Warehouse Operations Operations on a data warehouse Roll-up, Drill-down Slice, Dice Pivot Roll-up Inputs: A report R, A subset S of dimensions in R Output: A sequence of reports summarizing R Example 1: R = SALES-Base, S = (Year, Region) in Fig (pp. 244) Output consists of reports SALES-L0-A, SALES-L1-B, SALES-L2 Example 2: R = SALES-Base, S = (Region, Year) Output consists of reports SALES-L0-A, SALES-L1-A, SALES-L2 Drill-down Inputs: A report R, A dimension D not in R Output: A reports detailing R on D Example: R = SALES-L1-B, D = Region in Fig (pp. 244) Output : report SALES-L0-A

8.3.4 Data Warehouse Operations Slice, Dice Reduce dimensions in a table- (Fig. 8.7, pp 232). Inputs: A report R, A value V for a dimension D in R Output: A subset of R where D =V Example: R = SALES-L0-A, D = Year, V = 1994 in Fig (pp. 244) Output: Table 8.5 (pp. 246) includes tuple (ALL, 1994, America, 35) Fig 8.7

8.3.4 Data Warehouse Operations Pivot For a spreadsheet view of reports Transposes a spreadsheet Example Inputs: A spreadsheet view of a report R Output: A transposed spreadsheet Ex.: R= SALES-L0-A, Fig (pp. 244)

Logical Data Model of a DWH Purpose of a logical data model Specify a framework to specify computational structure Allow extension of SQL to model new needs Cube operation Input : A fact table Output: A set of summary reports covering all subsets of dimension columns Equivalent to union of all tables and reports in Fig (pp. 244) Ex. Fig. 8.18, pp. 243 SELECT Company, Year, Region, Sum(Sales) AS Sales FROMSALES GROUP BY CUBE Company, Year, Region

Fig 8.18

Physical Data Model of a DWH Purpose: Computationally efficient implementation Ideas: Pre-computation - pre-compute some of reports and use those to compute other reports New indexing methods, e.g. bit-map index Query Processing Strategies Strategies for aggregate functions New strategies for multi-table joins Let us look at strategies for aggregate functions

DWH Physical Model: Aggregate function strategies Aggregate Functions Compute summary statistics for a given set of values Examples: sum, average, centroid (Table 8.1, pp. 238) Strategies for efficient computation Characterize easy to compute aggregate functions 3 categories Distributive Algebraic Holistic First 2 categories can be computed easily in one scan of the dataset

Definitions of Aggregate Function Categories Notation: F, G, G1, G2, … Gn are aggregate functions where n is small S is a set of values, e.g. S = (1, 2, 3, 4) P = (S1, S2, …, Sp) is a partition of S, e.g. P = (S1, S2), S1 = (1, 2), S2 = (3, 4) Distributive( F ) if there exists a G such that F( S ) = G ( F(S1), F(S2), …, F(Sn) ) Example: sum is distributive Illustration: sum(1, 2, 3, 4) = sum ( sum(1, 2), sum(3, 4) Algebraic( F ) if there exists G1, …, Gn, (where n is small) and F( S ) = G ( G1(S1), …, Gn(S1), G2(S1), …, Gn(S2), …, G1(Sp), …, Gn(Sp) ) Example: average is distributive Illustration: average(1, 2, 3, 4) = { count(1, 2) * average(1, 2) + count(3, 4) * average } / { count(1,2) + count(3,4) }

Example: Distributive Aggregate Function Fig 8.14 Examples in cross-tabulation scenario (Fig. 8.14, pp.238): Example 1. Min is distributive Example 2. Count is distributive

Examples: Algebraic Aggregate Functions Fig 8.15 Examples in cross-tabulation scenario (Fig. 8.15, pp.239): Average and Variance are algebraic

Discussion - Spatial Data Warehouse Example Consider the example in Fig. 8.16, pp. 241 A map interpretation may be attached to each report Each row has a spatial footprint, which can be aggregated by geometric-union The collection of maps may be called a mapcube Issues: What is needed in OGIS standard to support map-cube operation? Hierarchical collection of maps in mapcube What is an appropriate cartography to convey the relationship among maps?

Spatial Data Warehouses and Mapcube Fig 8.16

Fig 8.17

Summary Field data useful in many applications due to rich content Represented as raster or image Operations can be categorized into local, focal, zonal, and global Field data storage and retrieval Tiling is a preferred way to divide raster data into disk blocks Meta-data based query is often used for retrieval Content based retrieval may be used for similarity searches Data warehouses support analysis e.g. cross-tabulation reports SQL CUBE operator support generation of DWH reports Distributive and Algebraic aggregate functions can be computed easily