An Introduction to Data Warehousing

Slides:



Advertisements
Similar presentations
Supervisor : Prof . Abbdolahzadeh
Advertisements

1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Data Warehousing M R BRAHMAM.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Chapter 3 Database Management
An Introduction to Dimensional Data Warehouse Design Presented by Joseph J. Sarna Jr. JJS Systems, LLC.
Components and Architecture CS 543 – Data Warehousing.
Chapter 13 The Data Warehouse
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Data Warehouse Toolkit Introduction. Data Warehouse Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
A Comparsion of Databases and Data Warehouses Name: Liliana Livorová Subject: Distributed Data Processing.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
1 Brett Hanes 30 March 2007 Data Warehousing & Business Intelligence 30 March 2007 Brett Hanes.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Architecture. Inmon’s Corporate Information Factory The enterprise data warehouse is not intended to be queried directly by analytic applications,
© 2007 by Prentice Hall 1 Introduction to databases.
I Information Systems Technology Ross Malaga 4 "Part I Understanding Information Systems Technology" Copyright © 2005 Prentice Hall, Inc. 4-1 DATABASE.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
Data Warehousing.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Business Intelligence Overview
Supervisor : Prof . Abbdolahzadeh
Intro to MIS – MGS351 Databases and Data Warehouses
Advanced Applied IT for Business 2
Building Data ware House
Data Warehouse.
Data warehouse and OLAP
Fundamentals & Ethics of Information Systems IS 201
Manajemen Data (2) PTI Pertemuan 6.
Chapter 13 The Data Warehouse
What is an attribute? How is it related to an entity?
Data Warehouse—Subject‐Oriented
Data Warehouse.
Databases and Data Warehouses Chapter 3
Chapter 13 – Data Warehousing
CMPE 226 Database Systems April 11 Class Meeting
Basic Concepts in Data Management
MANAGING DATA RESOURCES
Data Warehouse and OLAP
C.U.SHAH COLLEGE OF ENG. & TECH.
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Data warehouse.
VIEWS / TSS Overview.
Data Warehousing Concepts
Analytics, BI & Data Integration
Data Warehouse and OLAP
Data Warehouse and OLAP Technology
Presentation transcript:

An Introduction to Data Warehousing Presented by Joseph M. Wilson EPA

In the Beginning, life was simple…

But…

Our information needs…

Kept growing. (The Spider web) SOURCE: William H. Inmon

To explore and discuss the purpose and principles of data warehousing.

Briefing Contents

So What Is a Data Warehouse? Definition: A data warehouse is the data repository of an enterprise. It is generally used for research and decision support. By comparison: an OLTP (on-line transaction processor) or operational system is used to deal with the everyday running of one aspect of an enterprise. OLTP systems are usually designed independently of each other and it is difficult for them to share information.

Why Do We Need Data Warehouses? Consolidation of information resources Improved query performance Separate research and decision support functions from the operational systems Foundation for data mining, data visualization, advanced reporting and OLAP tools

What Is a Data Warehouse Used for? Knowledge discovery Making consolidated reports Finding relationships and correlations Data mining Examples Banks identifying credit risks Insurance companies searching for fraud Medical research

Performance optimization Technologies used How Do Data Warehouses Differ From Operational Systems? Goals Structure Size Performance optimization Technologies used

Comparison Chart of Database Types Data warehouse Operational system Subject oriented Transaction oriented Large (hundreds of GB up to several TB) Small (MB up to several GB) Historic data Current data De-normalized table structure (few tables, many columns per table) Normalized table structure (many tables, few columns per table) Batch updates Continuous updates Usually very complex queries Simple to complex queries

Design Differences Operational System Data Warehouse ER Diagram Star Schema

Supporting a Complete Solution Operational System- Data Entry Data Warehouse- Data Retrieval

Data Warehouses, Data Marts, and Operational Data Stores Data Warehouse – The queryable source of data in the enterprise. It is comprised of the union of all of its constituent data marts. Data Mart – A logical subset of the complete data warehouse. Often viewed as a restriction of the data warehouse to a single business process or to a group of related business processes targeted toward a particular business group. Operational Data Store (ODS) – A point of integration for operational systems that developed independent of each other. Since an ODS supports day to day operations, it needs to be continually updated. SOURCE: Ralph Kimball

Briefing Contents

Building a Data Warehouse Data Warehouse Lifecycle Analysis Design Import data Install front-end tools Test and deploy

Create an enterprise-level data dictionary Dimensional analysis Stage 1: Analysis Analysis Design Import data Install front-end tools Test and deploy Identify: Target Questions Data needs Timeliness of data Granularity Create an enterprise-level data dictionary Dimensional analysis Identify facts and dimensions

Pre-calculated Values HW/SW Architecture Stage 2: Design Analysis Design Import data Install front-end tools Test and deploy Star schema Data Transformation Aggregates Pre-calculated Values HW/SW Architecture Dimensional Modeling

Dimensional Modeling Fact Table – The primary table in a dimensional model that is meant to contain measurements of the business. Dimension Table – One of a set of companion tables to a fact table. Most dimension tables contain many textual attributes that are the basis for constraining and grouping within data warehouse queries. SOURCE: Ralph Kimball

Stage 3: Import Data Identify data sources Analysis Design Import data Install front-end tools Test and deploy Identify data sources Extract the needed data from existing systems to a data staging area Transform and Clean the data Resolve data type conflicts Resolve naming and key conflicts Remove, correct, or flag bad data Conform Dimensions Load the data into the warehouse

Importing Data Into the Warehouse Operational Systems (source systems)

Stage 4: Install Front-end Tools Analysis Design Import data Install front-end tools Test and deploy Reporting tools Data mining tools GIS Etc.

Software installation User training Stage 5: Test and Deploy Analysis Design Import data Install front-end tools Test and deploy Usability tests Software installation User training Performance tweaking based on usage

Managing the complexity Update procedures and maintenance Special Concerns Time and expense Managing the complexity Update procedures and maintenance Changes to source systems over time Changes to data needs over time

Briefing Contents

Goals of the STORET Central Warehouse Improved performance and faster data retrieval Ability to produce larger reports Ability to provide more data query options Streamlined application navigation

Old Web Application Flow

Central Warehouse Application Flow Search Criteria Selection Report Size Feedback/ Report Customization Report Generation

Web Application Demo STORET Central Warehouse: http://epa.gov/storet/dw_home.html

STORET Central Warehouse – Potential Future Enhancements More query functionality Additional report types Web Services Additional source systems?

Data Warehouse Components SOURCE: Ralph Kimball

Data Warehouse Components – Detailed SOURCE: Ralph Kimball

Briefing Contents