DATA WAREHOUSE CONCEPTS. A Definition · A Data Warehouse: Is a repository for collecting, standardizing, and summarizing snapshots of transactional data.

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

The Database Environment
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Sharing Enterprise Data Data administration Data administration Data downloading Data downloading Data warehousing Data warehousing.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Chapter 3 Database Management
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Introduction to Data Warehousing Enrico Franconi CS 636.
1 1 File Systems and Databases Chapter 1 Prof. Sin-Min Lee Dept. of Computer Science.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Data Warehouse Concepts & Architecture.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
A Comparsion of Databases and Data Warehouses Name: Liliana Livorová Subject: Distributed Data Processing.
Defining Data Warehouse Concepts and Terminology.
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
© 2007 by Prentice Hall 1 Introduction to databases.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer,
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Operational vs. Informational System. Operational System Operational systems maintain records of daily business transactions whereas a Data Warehouse.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
1 Database Systems Instructor: Nasir Minhas Assistant Professor UIIT PMAS-AAUR
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
DATA RESOURCE MANAGEMENT
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Data Warehousing 4 Definition of Data Warehouse 4 Architecture of Data Warehouse 4 Different Data Warehousing Tools 4 Summary.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
Data Warehousing/Mining 1 Data Warehousing/Mining Introduction.
Chapter 1 Overview of Databases and Transaction Processing.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Building a Data Warehouse: Understanding Why & How
The Database Environment
Defining Data Warehouse Concepts and Terminology
Data warehouse.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehousing and Data Mining By N.Gopinath AP/CSE
Data Warehouse.
Defining Data Warehouse Concepts and Terminology
Data Warehouse and OLAP
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Introduction to Data Warehousing
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
C.U.SHAH COLLEGE OF ENG. & TECH.
Chapter 1: The Database Environment
Data Warehouse.
Data Warehousing Concepts
The Database Environment
Data Warehouse and OLAP
Data Warehouse and OLAP Technology
Data Warehousing & DATA MINING (SE-409) Lecture-1 Introduction and Background Huma Ayub Software Engineering department University of Engineering and Technology,
Presentation transcript:

DATA WAREHOUSE CONCEPTS

A Definition · A Data Warehouse: Is a repository for collecting, standardizing, and summarizing snapshots of transactional data contained in an organization’s operations or production systems provides a historical perspective of information is most often, but not exclusively, used for decision support applications and business information queries can be more than one database Is not a new concept

Another Definition · Decision Support: is a set of tools to easily access data is becoming a critical business tool is usually graphically oriented is empowering end users with tools to access vital business information is moving lots of data down to the end user workstation is a rapidly expanding area because of data warehousing efforts and projects

Why a warehouse? · For analysis and decision support, end users require access to data captured and stored in an organization’s operational or production systems · This data is stored in multiple formats, on multiple platforms, in multiple data structures, with multiple names, and probably created using different business rules

Why do we want a central data store

Interesting Statistics · 85% of the Fortune 1000 companies have, are implementing, or are looking at, data warehouses (Meta Group) · 90% of all information processing organizations will be pursuing a data warehouse strategy in the next three years (Meta Group) · The Decision Support industry will be a $1 Billion industry by 1997 (IDC & Forrester)

Data Warehouse Evolution - Stage 0 No end user access to production files “What we print” is “what you get”

Data Warehouse Evolution - Stage 1 End users denied direct access to production files Snapshots or copies of production files are made available instead Solution: Provide end users access to production systems

No Integration Between Systems

Data Warehouse Evolution - Stage 2

Data Characteristics TypeProductionWarehouse · Data UseOperationalMgt Reporting · Level of detailDetailedSummary · CurrencyReal time, Multiple Latest valuegenerations · LongevityRelatively briefForever · StabilityDynamicStatic · Scope of definitionApplication wideEnterprise wide · Data OperationsCapture/updateRead only · Data valuesCodedDecoded

Transforming the Logical Model

Key Differences - Part 1 · Key differences between “data jails” (operational database) & warehouses Subject orientation - operational systems are application- segmented (i.e. banks = auto loan, demand deposit accounting or mortgages). Subject areas for banks would be customer and each financial product Level of integration - warehouses resolve years of application inconsistency in encoding/decoding, data name rationalization, etc Update volatility - record at a time updates in operational database vs bulk loads in data warehouse Time variance norms include: days of transactions for operational system, 1-10 years for data warehouses

Key Differences - Part 2 CharacteristicOperationalWarehouse Transaction volumeHighLow to huge Response timeVery fastReasonable UpdatingHigh volumeVery Low Time PeriodCurrent PeriodPast to Future ScopeInternalExternal ActivitiesFocused, clericalExploratory, operationalanalytical, managerial QueriesPredictable, Can be Unpredictable, periodicAd hoc

Types of Warehouse Configurations · Enterprise · Division · Functional Financial Personnel Engineering/Product · Departmental · Special Project

What’s Really Involved?

Typical Users of a Data Warehouse · Decision Support Analysts, Business Analysts Marketing, Actuaries, Financial, Sales, Executive · Grocery Store attitudes Going to the store, not knowing what they want Close proximity says give me “everything” · Explorers Don’t know what they want Search on a random basis, non-repetitively Frequently finds nothing, but when they do, there are huge rewards · Farmers Know what they want Non random searches, finds frequent “flakes of gold” Finds small amounts of data

Advanced Warehouse Topics · Metadata repositories Information about the data in the warehouse Like a library card catalog Data about when the information was created, what files accessed, how much data Data about changes in business rules, processes Context versus Content “What does it mean?” · Data Mining Drilling down into databases with tools to find specific anomolies · Online Analysis Processing (OLAP) Really means summary data