Data Warehouse Prerequisites Familiarity with Microsoft SQL Server Familiarity with Microsoft SQL Server System Administration for Microsoft SQL Server.

Slides:



Advertisements
Similar presentations
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
Advertisements

Business Information Warehouse Business Information Warehouse.
Chapter 13 The Data Warehouse
C6 Databases.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
Chapter 3 Database Management
The Data Warehouse Environment. The Structure of the Data Warehouse  There are different levels of detail in the data warehouse.  Older level of detail.
L The Difference Between Logical and Physical Views of Information l Databases and Database Management Systems l How You Can Develop Database Applications.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Introduction to Data Warehousing Enrico Franconi CS 636.
Chapter 13 The Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Defining Data Warehouse Concepts and Terminology.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
Chapter 1 Overview of Databases and Transaction Processing.
Data Resource Management Chapter 5 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
Database Systems – Data Warehousing
The McGraw-Hill Companies, Inc Information Technology & Management Thompson Cats-Baril Chapter 3 Content Management.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
I Information Systems Technology Ross Malaga 4 "Part I Understanding Information Systems Technology" Copyright © 2005 Prentice Hall, Inc. 4-1 DATABASE.
1099 Why Use InterBase? Bill Todd The Database Group, Inc.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS CHAPTER 3
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Operational vs. Informational System. Operational System Operational systems maintain records of daily business transactions whereas a Data Warehouse.
12/6/05 The Data Warehouse from William H. Inmon, Building the Data Warehouse (4 th ed)
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Master Data Management & Microsoft Master Data Services Presented By: Jeff Prom Data Architect MCTS - Business Intelligence (2008), Admin (2008), Developer.
© 2006 Pearson Education Canada Inc. 3-1 Chapter 3 Database Management PowerPoint Presentation Jack Van Deventer Ward M. Eagen.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Advanced Database Concepts
Oracle 8i Data Warehousing (chapter 1, 2) Data Warehousing Lab. 석사 1 학기 HyunSuk Jung.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Data Warehouse Data Mart Elahe Soroush. Agenda  Data Warehouse definition  Concepts  Logical transformation  Physical transformation  DW components.
Chapter 1 Overview of Databases and Transaction Processing.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Intro to MIS – MGS351 Databases and Data Warehouses
Defining Data Warehouse Concepts and Terminology
Data warehouse.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Defining Data Warehouse Concepts and Terminology
Basic Concepts in Data Management
Data Warehouse and OLAP
DATA WAREHOUSE: THE BUILDING BLOCKS
An Introduction to Data Warehousing
Introduction to Data Warehousing
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Chapter 17 Designing Databases
Data Warehousing Concepts
Data Warehouse and OLAP
Data Warehouse and OLAP Technology
Presentation transcript:

Data Warehouse Prerequisites Familiarity with Microsoft SQL Server Familiarity with Microsoft SQL Server System Administration for Microsoft SQL Server 7.0 and Implementing a Database on Microsoft SQL Server 7.0 System Administration for Microsoft SQL Server 7.0 and Implementing a Database on Microsoft SQL Server 7.0 Knowledge of Transact-SQL Usage in Developing OLTP Systems Knowledge of Transact-SQL Usage in Developing OLTP Systems Basic Understanding of Programming Principles and Experience with a Scripting Language Basic Understanding of Programming Principles and Experience with a Scripting Language Understanding of Basic Database Design, Administration, and Implementation Concepts Understanding of Basic Database Design, Administration, and Implementation Concepts

What is Data Warehousing? Special Thanks to Bill Inmon, the “grandfather” of data warehousing. Peter Rawsthorne

OLTP vs. DSS Online Transaction Processing (OLTP) Online Transaction Processing (OLTP) Decision Support System (DSS) Decision Support System (DSS) OLTP OLTP –ATM, Bank Teller, Ticket Master, POS… DSS DSS –Marketing, What if?, Inventory, Health (Walmart)… –Click-through analysis

In-class Exercise 1. Break into teams of three 2. Think of a business or subject area 3. Determine three OLTP systems required to support business or subject 4. Determine two DSS systems required to support business or subject 5. Provide one example of how the DSS system could be used to predict the future

Exercise Example Business: Yacht Club Business: Yacht Club OLTP OLTP –Membership System –Accounting System –Yacht Racing Results System DSS DSS –Quarterly and Yearly Expenses –Race Results How much beer will we need for next years regatta for the male non-members who are crew on yachts over 40 feet? How much beer will we need for next years regatta for the male non-members who are crew on yachts over 40 feet?

What then is a data warehouse? A data warehouse is a: subject oriented, integrated, time variant, non volatile collection of data in support of management's decision making process.

Subject Orientation Data is organized via subject rather than process or business function. The application world is concerned both with data base design and process design. The data warehouse world focuses on data modeling and database design exclusively.

Integration Easily the most important aspect of the data warehouse environment is that data found within the data warehouse is integrated. ALWAYS. WITH NO EXCEPTIONS. consistent naming conventions, consistent measurement of variables, consistent encoding structures, consistent physical attributes of data, and so forth.

Time Variant OPERATIONAL Current valued data Current valued data Time horizon: 60 – 90 days Time horizon: 60 – 90 days Key fields may or may not have an element of time Key fields may or may not have an element of time Data can be updated Data can be updated DATA WAREHOUSE Snapshot data Time horizon: 5 – 10 days Keys do not have an element of time Once snapshot is made, records cannot be updated

non volatile Inserts, deletes, and changes - are done regularly to the operational environment on a record by record basis. There are only two kinds of operations that occur in the data warehouse - the initial loading of data, and the access of data.

The structure of the warehouse The different components of the data warehouse are: metadata, current detail data, old detail data, lightly summarized data, and highly summarized data.

Current Detail Data Most recent happenings Most recent happenings Voluminous Voluminous Lowest level of granularity Lowest level of granularity Almost always stored on disk storage Almost always stored on disk storage Fast to access Fast to access Expensive and complex to manage Expensive and complex to manage

Older detail data Stored on some form of mass storage Stored on some form of mass storage Infrequently accessed Infrequently accessed Stored at a level of detail consistent with current detailed data Stored at a level of detail consistent with current detailed data Often stored on an alternate storage medium Often stored on an alternate storage medium Anticipated large volume Anticipated large volume

Lightly summarized data Distilled from the low level of detail Distilled from the low level of detail Almost always stored on disk Almost always stored on disk Design issues facing the data architect are; Design issues facing the data architect are; –what unit of time –what contents – attributes Frequently mined data, a lot of “what if?” Frequently mined data, a lot of “what if?”

Highly summarized data Compact and easily accessible Compact and easily accessible Sometimes found in the data warehouse Sometimes found in the data warehouse Sometimes found outside the data warehouse Sometimes found outside the data warehouse In any case, the highly summarized data is part of the data warehouse In any case, the highly summarized data is part of the data warehouse Yearly or multi year summaries Yearly or multi year summaries

Metadata Sits in a different dimension Sits in a different dimension Contains no data directly taken from the operational environment Contains no data directly taken from the operational environment Special and very important role Special and very important role Metadata is used as: Metadata is used as: –a directory to locate the contents –a guide to the mapping of data –a guide to the algorithms used for summarization

Metadata – levels of summarization

Flow of Data data enters from the operational environment, it is transformed data goes into the current detail level of detail It resides there and is used there until one of three events occurs: –it is purged, –it is summarized, and/or –it is archived.

Using the Data Warehouse

Example

Summary A data warehouse is a A data warehouse is a subject oriented, integrated, time variant, non volatile collection of data in support of management's decision needs. Four levels of data warehouse data: Four levels of data warehouse data: old detail, current detail, lightly summarized data, and highly summarized data. Metadata is a very important part Metadata is a very important part

Lab deliverables W2KS Install W2KS Install SQL7.0 Install SQL7.0 Install SQL7.0 OLAP Services Install SQL7.0 OLAP Services Install MSPress install MSPress install Complete MSPress Chapter 1 Complete MSPress Chapter 1

Contact Information Peter Rawsthorne, B.Tech, MCSD, MCT, CCR President, Eclectic Endeavours Inc. 559A Artisan Lane PO Box 281 Bowen Island, BC CANADA V0N 1G0 Phone: Fax: web: