Data Warehouse Components

Slides:



Advertisements
Similar presentations
Business process engineering: an overview The goal of business process engineering (BPE) is to define architectures that will enable a business to use.
Advertisements

Supervisor : Prof . Abbdolahzadeh
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
ITEC 423 Data Warehousing and Data Mining Lecture 3.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Designing the Data Warehouse and Data Mart Methodologies and Techniques.
Components and Architecture CS 543 – Data Warehousing.
DATA WAREHOUSING.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Accelerated Access to BW Al Weedman Idea Integration.
Data Warehouse Components
Designing a Data Warehouse
Architecture and Infrastructure Module 2 G.Anuradha.
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
Understanding Data Warehousing
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
More ETL. ETL in a nutshell ETL is an abbreviation of the three words Extract, Transform and Load. It is an ETL process to –extract data, mostly from.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Data Warehouse Development Methodology
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
3. Data Warehouse Architecture
Data Warehouse Fundamentals
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
12/6/05 The Data Warehouse from William H. Inmon, Building the Data Warehouse (4 th ed)
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
CHAPTER 7: ARCHITECTURAL COMPONENTS. CHAPTER OBJECTIVES  Understand data warehouse architecture  Examine how the architectural framework supports the.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
 Understand the basic definitions and concepts of data warehouses  Describe data warehouse architectures (high level).  Describe the processes used.
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
7 Strategies for Extracting, Transforming, and Loading.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Copyright © 2006, Oracle. All rights reserved. Czinkóczki László oktató Using the Oracle Warehouse Builder.
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
Metadata Driven Clinical Data Integration – Integral to Clinical Analytics April 11, 2016 Kalyan Gopalakrishnan, Priya Shetty Intelent Inc. Sudeep Pattnaik,
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Business Intelligence Overview
Supervisor : Prof . Abbdolahzadeh
Building a Data Warehouse
Advanced Applied IT for Business 2
Defining Data Warehouse Concepts and Terminology
PowerMart of Informatica
Data storage is growing Future Prediction through historical data
MIS5101: Extract, Transform, Load (ETL)
Introduction to Data Warehousing
Data Warehousing and Data Mining By N.Gopinath AP/CSE
Defining Data Warehouse Concepts and Terminology
MIS5101: Extract, Transform, Load (ETL)
DATA WAREHOUSE: THE BUILDING BLOCKS
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
An Introduction to Data Warehousing
Metadata Construction in Collaborative Research Networks
THE ARCHITECTURAL COMPONENTS
VIEWS / TSS Overview.
Data Warehouse.
Metadata The metadata contains
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing Concepts
Best Practices in Higher Education Student Data Warehousing Forum
Presentation transcript:

Data Warehouse Components

Overview of the Components Source Data Component Production data Internal data Archive data External data Data staging component Extraction Transformation Cleaning standardization Loading Data storage component Information delivery component Metadata component Management and control component

Architectural Framework

Data Acquisition You are the data analyst on the project team building a DW for an insurance company. List the possible data sources from which you will bring data into DW Production data: data from various operational systems External data: for finding trends and comparisons against other organizations. Internal data: private confidential data important to an organization Archived data: for getting some historical information

Architectural Framework

Data Staging Performs ETL Extraction Transformation Loading Select data sources, determine filters Automatic replicate Create intermediary files Transformation Clean, merge, de-duplicate data Covert data types Calculate derived data Resolve synonyms and homonyms Loading Initial loading Incremental loading

Why is a separate data staging area required? Data is across various operational databases It should be subject-oriented data Data staging is mandatory

Architectural Framework

Characteristics of data storage area Separate repository Data content Read only Integrated High volumes Grouped by business subjects Metadata driven Data from DW is aggregated in MDDBs

Architectural Framework

Information delivery component Depends on the user Novice user: prefabricated reports, preset queries Casual user: once in a while information business analyst: complex analysis Power users: picks up interesting data

Information delivery component

Architectural Framework

Metadata component Data about data in the datawarehouse Metadata can be of 3 types Operational metadata: contains information about operational data sources Extraction and transformation metadata: Details pertaining to extraction frequencies, extraction methods, business rules for data extraction End-user metadata: navigational map of DW

Why is metadata especially important in a data warehouse? It acts as the glue that connects all parts of the data warehouse. It provides information about the contents and structures to the developers. It opens the door to the end-users and makes the contents recognizable in their own terms.

Management and Control Sits on top of all components Coordinates the services and activities within the DW Controls the data transformation and transfer in DW storage

Summing up Data warehouse building blocks or components are: source data, data staging, data storage, information delivery, metadata, and management and control. In a data warehouse, metadata is especially significant because it acts as the glue holding all the components together and serves as a roadmap for the end-users.

Doubts????????????????

Trends in DW

Case study 1 As a senior analyst on DW project of a large retail chain, you are responsible for improving data visualization of the output results. Make a list of recommendations

Parallel processing Performance of DW may be improved using parallel processing with appropriate hardware and software options. Parallel processing options Symmetric multiprocessing Massively parallel processing clusters

DW with ERP packages

Web Enabled configuration