University of Manitoba Asper School of Business 3500 DBMS Bob Travica Business Analytics and Decision Making OLTP, OLAP & SAP Chapter 9 & SAP Materials.

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 15-1 David M. Kroenke Database Processing Chapter 15 Business Intelligence.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Data Warehousing M R BRAHMAM.
Jennifer Widom On-Line Analytical Processing (OLAP) Introduction.
Chapter 9 Business Intelligence Systems
Data Warehousing - 2 ISYS 650. Data Warehouse Design - Star Schema - Dimension tables – contain descriptions about the subjects of the business such as.
Decision Support and Data Warehouse. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
DATABASE 1 Data Warehouse OLTP Database 3NF tables Operations data Predefined reports Data warehouse Star configuration Daily data transfer Interactive.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Organizing Data & Information
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall COS 346 Day 26.
Business Intelligence. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views.
Database Processing for Business Intelligence Systems
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
Data Warehousing ISYS 650. What is a data warehouse? A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
CS346: Advanced Databases
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Business Intelligence. Topics Chart Online Analytical Process, OLAP – Excel’s Pivot table – Data visualization with dashboard Data warehousing Data Mining.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Chapter 8 Newer Database Topics Based on G. Post, DBMS: Designing & Building Business Applications University of Manitoba Asper School of Business 3500.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Succeeding with Technology Database Systems Basic Data Management Concepts Organizing Data in a Database Database Management Systems Using Database Systems.
1 Data Warehouses BUAD/American University Data Warehouses.
Data Warehousing.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Data Warehousing. Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Business Intelligence - 2 BUS 782. Topics Data warehousing Data Mining.
Business Intelligence. Topics Chart Online Analytical Process, OLAP – Excel’s Pivot table – Data visualization with dashboard Scenario Management Data.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
OLAP On Line Analytic Processing. OLTP On Line Transaction Processing –support for ‘real-time’ processing of orders, bookings, sales –typically access.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Data Warehousing.
Advanced Database Concepts
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 6 The Data Warehouse Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
MIS2502: Data Analytics Advanced Analytics - Introduction
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data Warehouse.
On-Line Analytical Processing (OLAP)
Database Management Systems
Data Warehousing.
Presentation transcript:

University of Manitoba Asper School of Business 3500 DBMS Bob Travica Business Analytics and Decision Making OLTP, OLAP & SAP Chapter 9 & SAP Materials Updated 2015

DBSYSTEMS 2 of 20 OLTP vs. OLAP Online Transaction Processing (OLTP) = relational database systems Online Analytical Processing (OLAP)

DBSYSTEMS 3 of 20 OLAP via Data Warehousing Online Transaction Processing (OLTP): Querying Databases with 3NF tables Operations’ data Predefined reports Online Analytical Processing (OLAP); Data warehousing; Data Mining. Usually de-normalized data. Periodical transfers Interactive data analysis Flat files MIS 3500

DBSYSTEMS OLTP & OLAP in Enterprise Systems  Enterprise Systems (Enterprise Resource Planning Systems) support both.  Example: An SAP-based system can be a TPS, MIS and DSS for the entire organization. DSS capability draws on data warehousing & cubing.  Process approach to organization with data flowing smoothly end-to-end. Process link up horizontally (department-to- department) and vertically (process-sub-process).  Business process* is for the most part the system process.  More... More... 4 of 20

DBSYSTEMS 5 of 20 Date Warehousing Goals Data warehouse (DW) yields multidimensional view of data (cubing):  Integrate data from different sources to get a larger picture of business  Data aggregations (summaries on different dimensions) DW expands scope of data analysis:  Statistical analysis (test hypotheses on relationships between pieces of data)  Discover new relationships by querying cubes or applying data mining software.

DBSYSTEMS 6 of 20 Extraction, Transformation, and Loading Data warehouse: All data must be consistent. Customers Convert “Client” to “Customer” Apply standard product numbers Convert currencies Fix region codes Transaction data from diverse systems. Preparations performed on data – ETL process Extract Transform (Transport) Load

DBSYSTEMS 7 of 20 Three-Dimensional View of Data: Cube Product Sales at Location Sales Date Logic similar to crosstab query And pivot table. Created in a datawarehouse Days in year P5 P1 P2 P4 P3

DBSYSTEMS 8 of 20 Data Hierarchy Year Quarter Month Week Day Levels Roll-up To get higher-level totals Drill-down To get lower-level details

DBSYSTEMS Location LocationID Detail 9 of 20 Datawarehouse Tables: Star Design Revenue=Price*Quantity Fact Table * Sale SaleDate Quantity Discount Dimension Inputted from Product and Sale Revenue broken down by product, sales location, and desired time period (time column/s – day of year, or even smaller; basis for rollup). Product ProductID Price Customer CustomerID Detail Dimension Design is: - Hierarchical (dimension tables have no direct association) - De-normalized (fact table): Price & Quantity inputted to Fact table; New keys usually used in the fact table (e.g., SaleTbl#-Row#). Sale SaleDate Quantity Discount StoreID Calculated fact

DBSYSTEMS 10 of 20 Datawarehouse Tables: Snowflake Design MerchTblRow SaleTblRow Price Quantity OLAPItems ItemID Description Price Category Product SaleID SaleDate CustomerID Discount SalesTax Sale CustomerID Phone FirstName LastName Address ZipCode CityID Customer CityID ZipCode City State City Design is: -Network-like (dimension tables can connect directly) -Still partly normalized (Sale- Customer-City) Fact Table Dimension Tables Advantage: Design simplified in the Fact table part.

DBSYSTEMS SAP Datawarehouse 11 of 20 More on SAP Datawarehouse Datawarehouse Cube Details Can also be Dimensions 

DBSYSTEMS 12 of 20 Multidimensional View of Data – Precursors to DW: Excel Pivot Table Can place data in rows or columns. By grouping months, can instantly get quarterly or monthly totals. Time hierarchy “Dimensions” “Facts” (Measures)

DBSYSTEMS 13 of 20 Multidimensional View of Data: CUBE Option in SQL 99 Bird Bird … Bird(null) Bird(null) Cat Cat … Cat(null) (null) (null) (null) … (null)(null) CategoryMonthAmountGcGm SELECT Category, Month, Sum, GROUPING (Category) AS Gc, GROUPING (Month) AS Gm FROM … GROUP BY CUBE (Category, Month...)

DBSYSTEMS 14 of 20 GROUPING SETS: Hiding Details Bird(null) Cat(null) … (null) (null) (null) … (null)(null) CategoryMonthAmount SELECT Category, Month, Sum FROM … GROUP BY GROUPING SETS (ROLLUP (Category), ROLLUP (Month), ( ) )

DBSYSTEMS 15 of 20 SQL RANK Functions SELECT Employee, SalesValue RANK() OVER (ORDER BY SalesValue DESC) AS rank DENSE_RANK() OVER (ORDER BY SalesValue DESC) AS dense FROM Sales ORDER BY SalesValue DESC, Employee; EmployeeSalesValuerankdense Jones18,00011 Smith16,00022 Black16,00022 White14,00043 DENSE_RANK does not skip numbers Therefore, advances in SQL motivate DBMS vendors to support OLAP and data warehousing.

DBSYSTEMS 16 of 20 Broader Data Analysis with Data Mining  Goal: To discover unknown relationships in the data that can be used to make better decisions.  Exploratory analysis.  A bottom-up approach that scans the data to find relationships  Some statistical routines, but they are not sufficient  Statistics relies on averages  Sometimes the important data lies in more detailed pairs  Supervised by developer vs. unsupervised (self-organizing artificial neural networks)

DBSYSTEMS 17 of 20 Common Techniques  1. Classification/Prediction  2. Association Rules/Market Basket Analysis  3. Clustering

DBSYSTEMS 18 of Classification (Prediction)  Purpose: “Classify” things that are causes and those that are effects.  Examples  Which borrowers/loans are most likely to be successful?  Which customers are most likely to want a new item?  Which companies are likely to file bankruptcy?  Which workers are likely to quit in the next six months?  Which startup companies are likely to succeed?  Which tax returns are fraudulent?

DBSYSTEMS 19 of 20 Classification Process  Clearly identify the outcome/dependent variable.  Identify potential variables that might affect the outcome.  Use sample data to test and validate the model.  Regression/correlation analysis, decision tables and trees, etc. IncomeCredit HistoryJob StabilityCredit Success 50000Good Yes 75000MixedBadNo

DBSYSTEMS 20 of Association/Market Basket  Purpose: Determine what events or items go together/co-occur.  Examples:  What items are customers likely to buy together? (Business use: Consider putting the two together to increase cross-selling.)

DBSYSTEMS 21 of 20 Association Challenges  If an item is rarely purchased, any other item bought with it seems important. So combine items into categories.  Some relationships are obvious.  Burger and fries.  Some relationships are puzzling/meaningless.  Hardware store found that toilet rings sell well only when a new store first opened. But what does it mean?

DBSYSTEMS 22 of Cluster Analysis  Purpose: Determine groups of people or some entities.  Examples  Are there groups of customers? (If so, we could target them; market segmentation)  Do the locations for our stores have elements in common? (If so, we can search for similar clusters for new locations.)  Do employees have common characteristics? (If so, we can hire similar, or dissimilar, people.) Small intra- cluster distance Large inter- cluster distance