Data Warehousing Willem Visser RW334. Somebody is watching! Everybody seems to be recording your every move Loyalty cards Cookies – Facebook, Twitter,…

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

Chapter 13 The Data Warehouse
Outline What is a data warehouse? A multi-dimensional data model Data warehouse architecture Data warehouse implementation Further development of data.
April 30, Data Warehousing and OLAP Technology: An Overview  What is a data warehouse?  Data warehouse architecture  From data warehousing to.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Data Warehousing Xintao Wu. Evolution of Database Technology (See Fig. 1.1) 1960s: Data collection, database creation, IMS and network DBMS 1970s: Relational.
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Dr. M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2010 COMP207: Data Mining Data Warehousing COMP207: Data Mining.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
Chapter 13 The Data Warehouse
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
DATA WAREHOUSE (Muscat, Oman).
1 Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously.  A decision support database that is maintained.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
Components of the Data Warehouse Michael A. Fudge, Jr.
Dr. Bernard Chen Ph.D. University of Central Arkansas
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehousing Xintao Wu. Can You Easily Answer These Questions? What are Personnel Services costs across all departments for all funding sources? What.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Data Mining Data Warehouses.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2009 This is the full course notes, but not quite complete. You.
Advanced Database Concepts
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
COMP 430 Intro. to Database Systems Denormalization & Dimensional Modeling.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
An Overview of Data Warehousing and OLAP Technology
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Data Mining: Data Warehousing
Data Warehousing Data warehousing provides architectures & tools for business executives to systematically organize, understand & use their data to make.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data storage is growing Future Prediction through historical data
OLAP Concepts and Techniques
Data Warehouse.
Chapter 13 – Data Warehousing
Data Warehouse and OLAP
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Data Warehousing Concepts
Analytics, BI & Data Integration
Data Warehouse and OLAP
Presentation transcript:

Data Warehousing Willem Visser RW334

Somebody is watching! Everybody seems to be recording your every move Loyalty cards Cookies – Facebook, Twitter,… – Check out Collusion plug-in for Firefox They want to know how to market to you Same is true in business – Know your data, know your business

Data Warehousing Integrated repository of data to understand your business Separate from the Operational Database Supports decision making Subject oriented Time variant Non-volatile

Features Only necessary data to allow modeling and decision making Coming from potentially many sources Time component – Even though operationally there might not be Data doesn’t change after loading – No operational updates – Periodic refresh

Operational vs Warehouse Operational – Optimized for on-line transactional processing – OLTP Warehouse – Optimized for online analytic processing – OLAP – Complex queries – Very large volumes

Data cube Multi dimensional data – Not just 3D (but mostly shown as such)

Lattice of Cuboids all product date country product,dateproduct,countrydate, country product, date, country 0-D(apex) cuboid 1-D cuboids 2-D cuboids 3-D(base) cuboid Slide by Dr. Hany Saleeb How much of the cube is materialized before the query: Full (complete cuboid) None (materialized on the fly) Partial

OLAP “Querying and presenting text and numeric data from data warehouses in a dimensional cube-style” – Slicing a dimension: Per region, per product, per period – Drill-down: Country  Region  Town  Suburb – Drill-up, drill-around, etc. – Visualization Slide by Cor Winkler

Multi-Tiered Architecture Data Warehouse Extract Transform Load Refresh OLAP Engine Analysis Query Reports Data mining Monitor & Integrator Metadata Data Sources Front-End Tools Serve Data Marts Operational DBs other sources Data Storage OLAP Server Slide by Dr. Hany Saleeb

Steps Data extraction: – get data from multiple, heterogeneous, and external sources Data cleaning: – detect errors in the data and rectify them when possible Data transformation: – convert data from legacy or host format to warehouse format Load: – sort, summarize, consolidate, compute views, check integrity, and build indices and partitions Refresh – propagate the updates from the data sources to the warehouse

PRODUCT time_key (PK)‏ SQL_date day_of_week week_number month time_key (PK)‏ SQL_date day_of_week week_number month time_key (FK)‏ product_key (FK)‏ store_key (FK)‏ promo_key (FK)‏ dollars units cost time_key (FK)‏ product_key (FK)‏ store_key (FK)‏ promo_key (FK)‏ dollars units cost product_key (PK)‏ SKU description brand category package_type size flavor product_key (PK)‏ SKU description brand category package_type size flavor promotion_key (PK)‏ promotion_name promotion_type price_treatment ad_treatment display_treatment coupon_type promotion_key (PK)‏ promotion_name promotion_type price_treatment ad_treatment display_treatment coupon_type store_key (PK)‏ store_ID store_name address district region store_key (PK)‏ store_ID store_name address district region STORE DIMENSION PROMOTION DIMENSION TIME DIMENSION SALES FACT TABLEPRODUCT DIMENSION Total Cost $ 1,058 $ 2,200 $ 650 $ 1,848 $ 2,350 $ 580 Brand Clean Fast More Power Zippy Clean Fast More Power Zippy District Atherton Belmont Total Dollars $ 1,233 $ 2,239 $ 848 $ 2,097 $ 2,428 $ 633 Gross Profit $ 175 $ 39 $ 198 $ 249 $ 78 $ 53 Star-Schema Example Slide by Cor Winkler

Data Warehouse Usage Three kinds of data warehouse applications – Information processing supports querying, basic statistical analysis, and reporting using crosstabs, tables, charts and graphs – Analytical processing multidimensional analysis of data warehouse data supports basic OLAP operations, slice-dice, drilling, pivoting – Data mining knowledge discovery from hidden patterns supports associations, constructing analytical models, performing classification and prediction, and presenting the mining results using visualization tools. Differences among the three tasks Slide by Dr. Hany Saleeb

Information Dashboards Slide by Cor Winkler

Information Exploitation Business Intelligence (BI) R Value of decision Complexity → Data Mining What might happen? Obscure data relationships and trends Analysis Why it happened? Dynamic slice&dice Reporting What happened? Historical info Intelligent agents Make it happen! Automatic response to business triggers Complexity → ← # of users Slide by Cor Winkler