1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.

Slides:



Advertisements
Similar presentations
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Advertisements

Chapter 11: Data Warehousing
Chapter 1: The Database Environment
Chapter 7 System Models.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
Author: Graeme C. Simsion and Graham C. Witt Chapter 11 Logical Database Design.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
UNITED NATIONS Shipment Details Report – January 2006.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Year 6 mental test 5 second questions
Year 6 mental test 10 second questions
Database Systems: Design, Implementation, and Management
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Week 2 The Object-Oriented Approach to Requirements
Data Warehousing Design Transparencies
Data Warehousing – A Technology Marvel -by Swati Chawla.
Information Systems Today: Managing in the Digital World
Microsoft Confidential. We look at the world... with our own eyes...
Microsoft Access.
Chapter Information Systems Database Management.
Chapter 6 Data Design.
Supervisor : Prof . Abbdolahzadeh
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
VOORBLAD.
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
© 2012 National Heart Foundation of Australia. Slide 2.
Page 1 of 43 To the ETS – Bidding Query by Map Online Training Course Welcome This training module provides the procedures for using Query by Map for a.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
CHAPTER 8 INFORMATION IN ACTION
Executional Architecture
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Chapter 10: The Traditional Approach to Design
Systems Analysis and Design in a Changing World, Fifth Edition
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 15-1 David M. Kroenke Database Processing Chapter 15 Business Intelligence.
Database Administration
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
Chapter 13 The Data Warehouse
Management Information Systems, 10/e
© 2007 by Prentice Hall Management Information Systems, 10/e Raymond McLeod and George Schell 1 Management Information Systems, 10/e Raymond McLeod Jr.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Components and Architecture CS 543 – Data Warehousing.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Data Warehouse Toolkit Introduction. Data Warehouse Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
1 Brett Hanes 30 March 2007 Data Warehousing & Business Intelligence 30 March 2007 Brett Hanes.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Advanced Applied IT for Business 2
Building Data ware House
Chapter 13 The Data Warehouse
Data Warehouse.
An Introduction to Data Warehousing
Data Warehousing Concepts
Data Warehouse and OLAP Technology
Presentation transcript:

1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data Warehousing Presented by Joseph M. Wilson EPA

2 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. In the Beginning, life was simple…

3 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. But…

4 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Our information needs…

5 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Kept growing. (The Spider web) SOURCE: William H. Inmon

6 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Purpose To explore and discuss the purpose and principles of data warehousing.

7 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Briefing Contents

8 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. So What Is a Data Warehouse? u Definition: A data warehouse is the data repository of an enterprise. It is generally used for research and decision support. u By comparison: an OLTP (on-line transaction processor) or operational system is used to deal with the everyday running of one aspect of an enterprise. u OLTP systems are usually designed independently of each other and it is difficult for them to share information.

9 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Why Do We Need Data Warehouses? u Consolidation of information resources u Improved query performance u Separate research and decision support functions from the operational systems u Foundation for data mining, data visualization, advanced reporting and OLAP tools

10 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. What Is a Data Warehouse Used for? u Knowledge discovery l Making consolidated reports l Finding relationships and correlations l Data mining l Examples n Banks identifying credit risks n Insurance companies searching for fraud n Medical research

11 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. u Goals u Structure u Size u Performance optimization u Technologies used How Do Data Warehouses Differ From Operational Systems?

12 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Comparison Chart of Database Types Data warehouseOperational system Subject orientedTransaction oriented Large (hundreds of GB up to several TB) Small (MB up to several GB) Historic dataCurrent data De-normalized table structure (few tables, many columns per table) Normalized table structure (many tables, few columns per table) Batch updatesContinuous updates Usually very complex queriesSimple to complex queries

13 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Design Differences Star Schema Data Warehouse Operational System ER Diagram

14 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Supporting a Complete Solution Operational System- Data Entry Data Warehouse- Data Retrieval

15 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Data Warehouses, Data Marts, and Operational Data Stores u Data Warehouse – The queryable source of data in the enterprise. It is comprised of the union of all of its constituent data marts. u Data Mart – A logical subset of the complete data warehouse. Often viewed as a restriction of the data warehouse to a single business process or to a group of related business processes targeted toward a particular business group. u Operational Data Store (ODS) – A point of integration for operational systems that developed independent of each other. Since an ODS supports day to day operations, it needs to be continually updated. SOURCE: Ralph Kimball

16 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Briefing Contents

17 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Building a Data Warehouse l Analysis l Design l Import data l Install front-end tools l Test and deploy Data Warehouse Lifecycle

18 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Stage 1: Analysis u Identify: l Target Questions l Data needs l Timeliness of data l Granularity u Create an enterprise-level data dictionary u Dimensional analysis l Identify facts and dimensions Analysis –Design –Import data –Install front-end tools –Test and deploy

19 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Stage 2: Design u Star schema u Data Transformation u Aggregates u Pre-calculated Values u HW/SW Architecture –Analysis Design –Import data –Install front-end tools –Test and deploy Dimensional Modeling

20 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Dimensional Modeling u Fact Table – The primary table in a dimensional model that is meant to contain measurements of the business. u Dimension Table – One of a set of companion tables to a fact table. Most dimension tables contain many textual attributes that are the basis for constraining and grouping within data warehouse queries. SOURCE: Ralph Kimball

21 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Stage 3: Import Data u Identify data sources u Extract the needed data from existing systems to a data staging area u Transform and Clean the data l Resolve data type conflicts l Resolve naming and key conflicts l Remove, correct, or flag bad data l Conform Dimensions u Load the data into the warehouse –Analysis –Design Import data –Install front-end tools –Test and deploy

22 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Importing Data Into the Warehouse Operational Systems (source systems)

23 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Stage 4: Install Front-end Tools u Reporting tools u Data mining tools u GIS u Etc. –Analysis –Design –Import data Install front-end tools –Test and deploy

24 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Stage 5: Test and Deploy u Usability tests u Software installation u User training u Performance tweaking based on usage –Analysis –Design –Import data –Install front-end tools Test and deploy

25 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Special Concerns u Time and expense u Managing the complexity u Update procedures and maintenance u Changes to source systems over time u Changes to data needs over time

26 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Briefing Contents

27 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Goals of the STORET Central Warehouse u Improved performance and faster data retrieval u Ability to produce larger reports u Ability to provide more data query options u Streamlined application navigation

28 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Old Web Application Flow

29 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Central Warehouse Application Flow Search Criteria Selection Report Size Feedback/ Report Customization Report Generation

30 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. STORET Central Warehouse: Web Application Demo

31 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. STORET Central Warehouse – Potential Future Enhancements u More query functionality u Additional report types u Web Services u Additional source systems?

32 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Data Warehouse Components SOURCE: Ralph Kimball

33 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Data Warehouse Components – Detailed SOURCE: Ralph Kimball

34 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. Briefing Contents