Basic Concepts of Datawarehousing An Overview Prasanth Gurram.

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

April 30, Data Warehousing and OLAP Technology: An Overview  What is a data warehouse?  Data warehouse architecture  From data warehousing to.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
Chapter 3 Database Management
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
DATA WAREHOUSE (Muscat, Oman).
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Data Warehouse & Data Mining
Understanding Data Warehousing
1 Brett Hanes 30 March 2007 Data Warehousing & Business Intelligence 30 March 2007 Brett Hanes.
Database Systems – Data Warehousing
Data Warehouse Concepts Transparencies
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
AN OVERVIEW OF DATA WAREHOUSING
Datawarehouse Objectives
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
1 Data Warehouses BUAD/American University Data Warehouses.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
CISB594 – Business Intelligence
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Sachin Goel (68) Manav Mudgal (69) Piyush Samsukha (76) Rachit Singhal (82) Richa Somvanshi (85) Sahar ( )
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
CISB594 – Business Intelligence Data Warehousing Part I.
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
CISB594 – Business Intelligence Data Warehousing Part I.
CISB594 – Business Intelligence Data Warehousing Part I.
Data Warehouses Kathy S. Schwaig. Outline  Data Explosion  Data Warehouses  Multi-dimensional databases Portions of this presentation are adapted from.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
CISB594 – Business Intelligence Data Warehousing Part I.
Advanced Database Concepts
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
المحاضرة 4 : مستودعات البيانات (Data warehouse)
Data Warehouse and OLAP
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing Data Model
Data Warehousing Data Model –Part 1
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehouse and OLAP
Presentation transcript:

Basic Concepts of Datawarehousing An Overview Prasanth Gurram

What is the sales distribution region wise? What is Defaulter’s Profile? What are the slow movers in my product line? How did my revenue improve in the past 5 years? Which of my Sales Agents are doing better? Who are my profitable customers? Currency Risk, Interest Rate Risk, Liquidity Risk Strategic Planning / Budgeting Which channel costs me more and pays less? How to answer these Business Queries?

Enable users to get a “Business View” of the data Facilitate Data based Decision Making that would drive and improve the Business Discover “Hidden Trends” Decision Support Systems Decision Support Systems (DSS) are interactive computer- based systems intended to help decision makers utilize data and models to identify and solve problems and make decisions. Data Warehouse is the foundation of DSS process. It is a Strategy and a Process for Staging Corporate Data. Decision Support Systems Decision Support Systems (DSS) are interactive computer- based systems intended to help decision makers utilize data and models to identify and solve problems and make decisions. Data Warehouse is the foundation of DSS process. It is a Strategy and a Process for Staging Corporate Data. DSS

Driving Forces for DSS RESULT: Customers Reform Technology Business Speed COMPETITION

 Unavailability of Tools and Techniques for acquisition of data from various sources for answering business questions and making decisions, in earlier days  Intensive efforts in data formatting than data analysis  Static and inflexible report generation  Time-lag in accessing the information at central place Scenario without DSS

OLTP Environment  get data IN  large volumes of simple transaction queries  continuous data changes  low processing time  mode of processing  transaction details  data inconsistency  mostly current data DSS Environment  get information OUT  small number of diverse queries  periodic updates only  high processing time  mode of discovery  subject oriented - summaries  data consistency  historical data is relevant OLTP v/s DSS Environment

OLTP Environment  high concurrent usage  highly normalized data structure  static applications  automates routines DSS Environment  low concurrent usage  fewer tables, but more columns per table  dynamic applications  facilitates creativity OLTP v/s DSS Environment

Benefits for Business User  Flexible Information Access  High Availability  Ease of Use  Quality & Completeness of Data  Focus on Information Processing  Information Base for Knowledge Discovery

 Advances in dbms technology  Data warehousing  On-line analytical processing  Data mining Available line of technology

Datawarehouse  Data warehouses store large volumes of data which are frequently used by DSS.It is maintained separately from the organization’s operational databases  Data warehouse is subject-oriented, integrated, time- variant, and nonvolatile collection of data Subject-oriented : Contains information regarding objects of interest for decision support: Sales by region, by product, etc. Itegrated: Data are typically extracted from multiple, heterogeneous data sources (e.g., from sales, inventory, billing DBs etc.). Time-variant: Contain historical data, longer horizon than operational system. Nonvolatile : Data is not (or rarely) directly updated.

Datawarehouse  Is the enabling technology that facilitates improved business decision-making  It’s a process, not a product  A technique for assembling and managing a wide variety of data from multiple operational systems for decision support and analytical processing It’s a journey not a destination...

Transmission NETWORKNETWORK Metadata Layer Cleansing Transformation Aggregation Summarization Data Mart Population Knowledge Discovery ODS DW OLAP ANALYSIS Extraction DM1 DM2 DMn Legacy System FS1 FS2 FSn STAGINGAREASTAGINGAREA DW Components

 Data extraction  Data Cleansing and Transformation  Data Load and refresh  Build derived data and views  Service queries  Administer the warehouse Operational Process

Data Capturing Process Feed System Application Business Transactions Incremental Data Control Metadata Extract the incremental data from feed system Store the extracted data into a temporary area Extract data from multiple, heterogeneous, and external sources Extraction Process ( Data Capturing )

Network Cloud Transmit the extracted data from Feed system to Staging area Periodicity of transmission ( daily / weekly ) depends upon the feed system Feed System Side Incremental Data Staging area Incremental Data FTP Extraction Process (Data Transmission )

Raw data (Staging Area) Process Metadata Cleansing Rules Control Metadata Cleansing Process Cleansing Reports Good Bad Clean data Detect errors in the data and rectify them when possible Mark it Good/Bad Generate the cleansing Reports and mail to the DWA and Feed System representatives Cleansing Process

Transformation Process Clean Operational Data Operational Data Store Transform the cleaned Operational Data into DSS Data Load the DSS data into ODS ODS contains the current DSS data at the lowest level of granularity Control Metadata Process Metadata Mapping Detail Transformation Rule Transformation Process

Summarization Process ODS WeeklyMonthlyYearly DW Summarize and aggregate ODS data and Populate to the Warehouse Periodicity of Summarization Process depends upon the level of summarization at Warehouse ( weekly, monthly, daily ) Control Metadata Summarization Process

 Data about Data  Used to maintain Datawarehouse  Control data  Static:  Roles, permissions, naming standards, source system names,  Locations, target names, transformation and mapping rules  Dynamic:  Scheduling, scripts, load statistics, space usage,  Backup statistics  Business data  Business rules,Who validates data,Who controls,How they validate Metadata

 Extraction/transformation/load tool (family of tools including data modeling tool, extraction tool, Meta data repository, and DW administration tools)  Meta data exchange architecture (API used to integrate all components of DW with central Meta data)  Target databases (relational, multidimensional, hybrid)  Data access and analysis tools for end users  Database servers, operating systems, networks DW Components/Tools

DW Tools