Presentation is loading. Please wait.

Presentation is loading. Please wait.

Database Processing for Business Intelligence Systems

Similar presentations


Presentation on theme: "Database Processing for Business Intelligence Systems"— Presentation transcript:

1 Database Processing for Business Intelligence Systems
DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4th Edition Chapter Eight Database Processing for Business Intelligence Systems

2 Chapter Objectives Learn the basic concepts of data warehouses and data marts Learn the basic concepts of dimensional databases Learn the basic concepts of business intelligence (BI) systems Learn the basic concepts of OLAP and data mining KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

3 Heather Sweeney Designs: Database Design
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

4 Heather Sweeney Designs: HSD Database Diagram in SQL Server 2005
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

5 Business Intelligence Systems
Business intelligence (BI) systems are information systems that Assist managers and other professionals in the analysis of current and past activities and in the prediction of future events Do not support operational activities, such as the recording and processing of orders These are supported by transaction processing systems Support management assessment, analysis, planning and control BI systems fall into two broad categories Reporting systems that sort, filter, group, and make elementary calculations on operational data Data mining applications that perform sophisticated analyses on data, analyses that usually involve complex statistical and mathematical processing KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

6 The Relationship Among Operational and BI Applications
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

7 Characteristics of Business Intelligence Applications
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

8 Characteristics of a Data Warehouse
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

9 Problems with Operational Data
“Dirty Data” Example – “G” for Gender Example – “213” for Age Missing Values Inconsistent Data Example – data that have changed, such as a customer’s phone number KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

10 Problems with Operational Data (Continued)
Nonintegrated Data Example – data from two or more sources that need to be combined Incorrect Format Example – time data in hours when needed in minutes Too Much Data Example – An excess number of columns KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

11 ETL Data Transformation
Data may need to be transformed for use in a data warehouse Example {CountryCode  CountryName} “US”  “United States” address to domain  “somewhere.com” KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

12 Characteristics of a Data Mart
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

13 Enterprise Data Warehouse (EDW) Architecture
Combines the data warehouse structure and the data mart structures shown above Expensive to create, staff and operate Smaller organizations use subsets of the EDW architecture KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

14 Dimensional Databases
A non-normalized database structure used for data warehouses May use slowly changing dimensions Values change infrequently Phone Number Address Use a Date or Time dimension KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

15 Star Schema KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

16 HSD-DW Star Schema KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

17 Two-Dimensional Matrix
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

18 Three-Dimensional Matrix
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

19 Conformed Dimensions and the Extended HSD-DW Schema
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

20 Reporting Systems KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

21 Reporting Systems: RFM Analysis
RFM Analysis analyzes and ranks customers according to purchasing patterns: R = Recent (most recent order) F = Frequent (how often an order is made) M = Money (dollar amount of orders) Customers are sorted into five groups, each containing 20% of the customers. Each group is given a numerical value: 1 = Top 20% 2, 3, 4 = Each 20% in between top and bottom 20% 5 = Bottom 20% KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

22 The RFM Score Report KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

23 Reporting Systems: Report Characteristics
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

24 Reporting Systems: Report System Functions
Report Authoring: Connect to data sources Create the report structure Format the report Report Management: Defines who receives what reports when and by what means Report Delivery: Push reports or allow them to be pulled KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

25 OLAP and Data Mining OnLine Analytical Processing (OLAP) is a technique for dynamically examining database data OLAP uses arithmetic functions such as Sum and Average Data Mining is a mathematically sophisticated technique for analyzing database data Data mining uses mathematical and statistical techniques KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

26 OLAP OLAP systems produce an OLAP report, also know as an OLAP cube
The OLAP report uses inputs called dimensions The OLAP report calculates outputs called measures Excel PivotTables can be used to create OLAP reports KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

27 Excel PivotTable OLAP Report I
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

28 Excel PivotTable OLAP Report II
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

29 Excel PivotTable OLAP Report III
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

30 Data Mining Applications: The Convergence of the Disciplines
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

31 Data Mining Applications: Popular Data Mining Techniques
Cluster analysis – Identifies groups of entities that have similar characteristics Decision tree analysis – Classifies entities into groups based on past history Logistic regression – Produces equations that offer probabilities that certain events will occur Neural Networks – Complex statistical prediction techniques Market Basket Analysis – Determines patterns of associated buying behavior KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

32 Data Mining Applications: Cluster Analysis I
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

33 Data Mining Applications: Cluster Analysis II
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

34 Data Mining Applications: Cluster Analysis III
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

35 Data Mining Applications: Market Basket Analysis
KROENKE and AUER - DATABASE CONCEPTS (4th Edition) © 2010, 2008 Pearson Prentice Hall

36 Database Processing for Business Intelligence Systems
DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4th Edition End of Presentation on Chapter Eight Database Processing for Business Intelligence Systems

37 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America. Copyright © 2010 Pearson Education, Inc.   Publishing as Prentice Hall


Download ppt "Database Processing for Business Intelligence Systems"

Similar presentations


Ads by Google