1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Warehouse Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.

Slides:



Advertisements
Similar presentations
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
Advertisements

BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Data Warehousing M R BRAHMAM.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Data Warehouse IMS5024 – presented by Eder Tsang.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
IS Consulting Process (IS 6005) Masters in Business Information Systems 2009 / 2010 Fergal Carton Business Information Systems.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Data Staging Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) From Information Management to Knowledge Management Olivia R. Liu Sheng, Ph.D.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) The Data Warehouse Lifecycle Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
1 9 Adv. DBMS Data Warehouse CSC5301 Review Hachim Haddouti.
Chapter 13 The Data Warehouse
Data Warehousing ISYS 650. What is a data warehouse? A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data.
DATA WAREHOUSE (Muscat, Oman).
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
CS346: Advanced Databases
Designing a Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Business Intelligence Instructor: Bajuna Salehe Web:
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Data Warehouse & Data Mining
Database Systems – Data Warehousing
Data Warehouse Concepts Transparencies
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
AN OVERVIEW OF DATA WAREHOUSING
Datawarehouse Objectives
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Sachin Goel (68) Manav Mudgal (69) Piyush Samsukha (76) Rachit Singhal (82) Richa Somvanshi (85) Sahar ( )
Dimensional Modeling Primer Chapter 1 Kimball & Ross.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
An Overview of Data Warehousing and OLAP Technology
Data Warehousing COMP3017 Advanced Databases Dr Nicholas Gibbins –
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
Chapter 1 Overview of Databases and Transaction Processing.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Data Warehouse and OLAP
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing Data Model –Part 1
Introduction of Week 9 Return assignment 5-2
Data Warehousing Concepts
Data Warehouse and OLAP
Presentation transcript:

1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Warehouse Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of Business

2 Outline Why Data Warehouse? –Problems, causes and data warehouse solutions What is Data Warehouse? –Characteristics and components Current Practices of Data Warehouse

3 Why Data Warehouse? Knowledge Management Problems (Drowning in data, starving for knowledge) 1.Can’t access data (easily) E.g., data from different branches, years, functional areas, etc. 2.Give me only what’s important (knowledge) E.g., Regions and products that have upward sales trends over the last five years. 3.I need to reduce data to what’s important by slicing and dicing. E.g., by branch, product, year, etc.

4 Why Data Warehouse? 4.Data inconsistency and poor data quality E.g., the 2001 PC sales amount in SLC from the CFO and the SLC Account Manager are not the same. 5.Need to improve the practices of making informed decisions. E.g., Did the VP for Marketing decide on the advertising budgets for branches in the SW region based on their sales performances over the last five years? 6.Hard and slow to query the database? E.g., VP for Marketing, CFO and Account Manager had to wait for the MIS Department to generate sales performance reports and analyses.

5 Why Data Warehouse? ROI Problems 7.Can I get more value out of my data? Ans: Make informed, potent decisions using knowledge extracted from integrated and consistent data over a long period of time. 8.Can I do this cost-effectively? Options: federated (interoperable) databases vs. a data warehouse 9.Can I easily scale up or change how I get knowledge out of my data? E.g., Add more regions, functional areas or years in sales performance analyses.

6 Causes for the Problems Cause 1: Isolated databases distributed in an enterprise Sales CRM Inventory A Root cause for problems 1, 4, 5, 6, 7, 8 and 9

7 Why Data Warehouse Cause 1: Isolated databases distributed in an enterprise SalesCRM Inventory Ad hoc access solutions cannot alleviate the problems

8 Why Data Warehouse Cause 2: Historical data is archived in offline storage systems Sales Another Root cause for problems 1, 4, 5, 6, 7, 8 and 9 Archive Historical Sales Data

9 Why Data Warehouse Cause 2: Historical data is archived in offline storage systems Sales Ad hoc accesses are slow and inconvenient Archive Historical Sales Data

10 Cause 3: Metadata for Transaction DB systems is Not User Friendly Student Course InstructorDependent Under- graduate Graduate IS-A Take Has SSN Address Name Phone Major Minor SSN Rank Name C-Name C-No Name Relation Sex Grade MM M 1 1

11

12 Why Data Warehouse Cause 4: Query and programming languages are even less user friendly –DESB students’ academic grades and GPAs since the freshman year –Sales amount distribution by product category, customer state and year –Slicing and dicing –SQL statements??? –Report/screen interface codes???

13 Why Data Warehouse Cause 5: Transaction databases are optimized (normalized) to process transactions but not to answer decision support queries –Bad query performance to join the normalized tables –Heavy transaction processing workload

14 What is Data Warehouse Designed to solve problems associated with current database practices: Isolated, distributed databases SalesCRM Inventory Extract, replicate, integrate, cleanse & load Data Warehouse

15 Why Data Warehouse Historical data is archived in offline storage systems Sales Archive Historical Sales Data Data Warehouse Integrate Historical Data with Current Data

16 What is Data Warehouse Causes 3, 4 and 5: Hard-to-understand metadata, and query and programming languages; poor decision support query performances Solution: In data warehouse, organize data in subject –oriented way rather than process- oriented way – dimensional modeling.

17 Dimensional Modeling (Star Schema) Academic Performance. Grade. Name. Rank Instructor. Name. UG/PG. Major Student Course. Number. Title Semester. Year. Length. Start date

18 Dimensional Modeling (Star Schema) Sales. Qty. Amt. Name. State. City Branch. Name. Category Product Customer. Name. State. City Time. Year. Quarter. Month

19 One System for Multiple Uses Database Management System (DBMS) Database Application Program Interactive Queries/ Transactions Database System Application Program Metadata

20 Two Worlds -> Two Systems Operational DSS Operational Application Operational Application Operational Application Data warehouse Executive Information System Decision Support System (DSS) Reporting OLTP DBs

21 What is Data Warehouse Data Warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management’s decision making process. 1. Subject-oriented means the data warehouse focuses on the high-level entities of business such as sales, products, and customers. This is in contrast to database systems, which deals with processes such as placing an order.

22 What is Data Warehouse  2. Integrated means the data is integrated from distributed data sources and historical data sources and stored in a consistent format. 3. Time-variant means the data associates with a point in time (i.e., semester, fiscal year and pay period) 4. Non-volatile means the data doesn’t change once it gets into the warehouse.

23 Characteristics of Data Warehouse

24 Data Warehouse and Data Mart Data warehouse – defined by its decision support purpose and other characteristics –Other characteristics: subject-oriented, integrated Data mart – a data warehouse for a more limited business scope (e.g., a department, etc.) A data warehouse may be built from several data marts

25 Source System (Legacy) extract Storage: Flat files (fastest); RDBMS; Other Processing: Clean; Prune; Combine; Remove duplicates; households; standardize; conform dimensions; store awaiting replications; archive; export to data marts No user query services Populate, replicate, recover Data Mart #1: OLAP ( ROLAP and/or MOLAP) query services; dimensional! Subject oriented; locally implemented; user group driven; may store atomic data; may be frequently refreshed; conform to DW Bus Data Mart #2 Data Mart #3 Populate, replicate, recover Ad Hoc Query Tools Report Writers End User Applications feed Models: forecasting; scoring; allocating; data mining; other downstream Systems; other parameters; special UI Data Staging Area The Data Warehouse Presentation Servers End User Data Access Uploaded cleaned dimensions Uploaded model results Basic Elements of a Data Warehouse System DW BUS Conformed dimensions and facts Relational Flat files Spreadsheets ERP Legacy

26 Current Practice of DW * Expected DW market value in 2002 was projected to have grown to $113.5 billion. Average DW development cost is $1.5 million and average maintenance cost is $0.5 million. DW development time ranges from 1 to 3 yrs. * Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001

27 Current Practice of DW * Sponsorship for the DW project SponsorPercentage VP of a business unit39.8 CIO26.9 Business unit manager16.7 CEO11.1 Other25.0 * Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001

28 Current Practice of DW * DW Benefits – Less effort to produce better information – Better decisions – Improvement of business processes – Support for accomplishments of strategic business objectives Return on Investments and Cost of Ownership? * Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001