Dr. Abdul Basit Siddiqui Assistant Professor FUIEMS (Lecture Slides Week # 2)

Slides:



Advertisements
Similar presentations
Lecture-1 Introduction and Background
Advertisements

MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Managing Data Resources
Data Warehouse IMS5024 – presented by Eder Tsang.
DATA WAREHOUSE CONCEPTS. A Definition · A Data Warehouse: Is a repository for collecting, standardizing, and summarizing snapshots of transactional data.
Business Intelligence in Detail What is a Data Warehouse?
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Business Intelligence Michael Gross Tina Larsell Chad Anderson.
DATA WAREHOUSING.
Introduction to Data Warehousing Enrico Franconi CS 636.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
DATA WAREHOUSE (Muscat, Oman).
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Defining Data Warehouse Concepts and Terminology.
By N.Gopinath AP/CSE. Why a Data Warehouse Application – Business Perspectives  There are several reasons why organizations consider Data Warehousing.
D ATABASE S YSTEMS D ATA W AREHOUSING I Asma Ahmad 29 th April, 2011.
Bab 3 Data Warehousing. Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai, Delhi, Chennai and Banglore. The Sales Manager.
Data Warehouse Concepts Transparencies
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
AN OVERVIEW OF DATA WAREHOUSING
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
BUS1MIS Management Information Systems Semester 1, 2012 Week 6 Lecture 1.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Data Warehouses BUAD/American University Data Warehouses.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
Data Warehousing Lecture-2 Introduction and Background 1.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-2 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
CISB594 – Business Intelligence
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
CISB594 – Business Intelligence Data Warehousing Part I.
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
CISB594 – Business Intelligence Data Warehousing Part I.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
CISB594 – Business Intelligence Data Warehousing Part I.
Why BI….? Most companies collect a large amount of data from their business operations. To keep track of that information, a business and would need to.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 1 Database Systems.
Data Warehousing/Mining 1 Data Warehousing/Mining Introduction.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Data Warehouse Data Mart Elahe Soroush. Agenda  Data Warehouse definition  Concepts  Logical transformation  Physical transformation  DW components.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
Introduction to Data Warehousing. Subject: Data Warehousing.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Lecture-3 Introduction and Background
Data warehouse.
Data warehouse and OLAP
Data Warehouse.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Introduction to Data Warehousing
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Data Warehousing Data Model –Part 1
Business Intelligence
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing Concepts
Analytics, BI & Data Integration
Data Warehouse and OLAP Technology
Data Warehousing & DATA MINING (SE-409) Lecture-1 Introduction and Background Huma Ayub Software Engineering department University of Engineering and Technology,
Presentation transcript:

Dr. Abdul Basit Siddiqui Assistant Professor FUIEMS (Lecture Slides Week # 2)

Why a Data Warehouse (DWH)? Data recording and storage is growing: Almost every industry has huge amount of operational data. Careful use/analysis of historic information may result in excellent prediction for the future: Knowledge worker wants to turn available data into useful information. This information is used by them to support strategic decision making. Gives total view of the organization: It is a platform for consolidated historical data for analysis. It stores data of good quality so that knowledge worker can make correct decisions. Intelligent decision-support is required for decision- making. Data Warehouse & Mining- Spring 2012

Why a Data Warehouse? (Contd.) From business perspective: It is latest marketing weapon. Helps to keep customers by learning more about their needs. Valuable tool in today’s competitive fast evolving world. Data Warehouse & Mining- Spring 2012

Reason-I: Why a Data Warehouse (DWH)? Data sets are growing: How Much Data is that? 1 MB2 20 or 10 6 bytes Small novel 3½ Disk. 1 GB2 30 or 10 9 bytes Paper reams that could fill the back of a pickup van. 1 TB2 40 or bytes 50,000 trees chopped and converted into paper and printed. 2 PB1 PB = 2 50 or bytesAcademic research libraries across USA. 5 EB1 EB = 2 60 or bytes All words ever spoken by the Human Beings. Data Warehouse & Mining- Spring 2012

Reason-I: Why a Data Warehouse (DWH)? Size of Data Sets are going up. Cost of Data Storage is coming down. The amount of data average business collects and stores is doubling every year. Total hardware and software cost to store and manage 1 MB of data: 1990: $ : ¢ 15 (down 100 times) 2010: < ¢ 1 (down 150 times) A few examples: Wall Mart: 24+ TB Finance Telecom: 100+ TB CERN: Upto 20 PB by 2006 Stanford Linear Accelerator Center (SLAC): 500 TB Telenor, Ufone, Mobilink, Warid, Zong ??? Data Warehouse & Mining- Spring 2012

Caution! A Warehouse of Data is NOT a Data Warehouse. Data Warehouse & Mining- Spring 2012

Caution! Size is NOT Everything. Data Warehouse & Mining- Spring 2012

Reason-2: Why a Data Warehouse (DWH)? DBMS Approach List of all items that were sold last month? List of all makeup items purchased by Sassi? The total sales of the last month grouped by branch? How many sales transactions occurred during the month of January? Intelligent Enterprise Which items sell together? Which items to stock? Where and how to place the items? What discounts to offer? How best to target customers to increase sales at a branch? Which customers are most likely to respond to my next promotional campaign, and why? Data Warehouse & Mining- Spring 2012 Businesses demand Intelligence (BI). Complex questions from integrated data. “Intelligent Enterprise”

Reason-3: Why a Data Warehouse (DWH)? Businesses want much more … What happened? Why it happened? What will happen? What is happening? What do you want to happen? Data Warehouse & Mining- Spring 2012

What is a Data Warehouse? A complete repository of historical corporate data extracted from transaction systems that is available for ad-hoc access by knowledge workers. Data Warehouse & Mining- Spring 2012

What is a Data Warehouse? Transaction System: Management Information System (MIS) Could be typed sheets (NOT transaction system) Ad-Hoc Access: Does not have a certain access pattern Queries not known in advance Difficult to write SQL in advance Knowledge Workers: Typically NOT IT literate (Executives, Analysts, Managers) NOT clerical workers Decision makers Data Warehouse & Mining- Spring 2012

What is a Data Warehouse? Inmons’s Definition: A Data Warehouse is: Subject-oriented Integrated Time-variant Nonvolatile Collection of data in support of management’s decision making process. Data Warehouse & Mining- Spring 2012

Another View of a DWH Data Warehouse & Mining- Spring 2012 Subject Oriented Integrated Time Variant Non Volatile

Subject-oriented Data Warehouse is organized around subjects such as sales, product, customer. It focuses on modeling and analysis of data for decision makers. Excludes data not useful in decision support process. Data Warehouse & Mining- Spring 2012

Integration Data Warehouse is constructed by integrating multiple heterogeneous sources. Data Preprocessing are applied to ensure consistency. Data Warehouse & Mining- Spring 2012 RDBMS Legacy System Data Warehouse Flat File Data Processing Data Transformation

Time-variant Provides information from historical perspective e.g. past 5-10 years. Every key structure contains either implicitly or explicitly an element of time. Data Warehouse & Mining- Spring 2012

Nonvolatile Data once recorded cannot be updated. Data Warehouse requires two operations in data accessing Initial loading of data Access of data Data Warehouse & Mining- Spring 2012 load access

Summary: What is a Data Warehouse? It is a blend of many technologies, the basic concept being: Take all data from different operational systems If necessary, add relevant data from industry Transform all data and bring into a uniform format Integrate all data as a single entity Store data in a format supporting easy access for decision support Create performance enhancing indices Implement performance enhancement joins Run ad-hoc queries with slow selectivity Data Warehouse & Mining- Spring 2012

Benefits of Data Warehouse High returns on investment. Substantial competitive advantage. Increased productivity of corporate decision-makers. Fast reporting for decision making process. Reduced reporting load on transactional systems. Making institutional data more user-friendly and accessible for knowledge workers. Integrated data from different source systems. Enabled ‘point-in-time’ analysis and trending over time. Helps in identifying and resolving data integrity issues, either in the warehouse itself or in the source systems that collect the data. Data Warehouse & Mining- Spring 2012

Data Warehouse: How is it Different? 1.Decision making is Ad-Hoc Data Warehouse & Mining- Spring 2012

Data Warehouse: How is it Different? 2.Different patterns of hardware utilization Data Warehouse & Mining- Spring 2012 Bus Service vs. Train

Data Warehouse: How is it Different? 3.Combines operational and historic data Don’t do data entry into a DWH. OLTP or ERP are the source systems. OLTP systems don’t keep history, cannot get balance statement more than a year old. DWH keep historical data, even of bygone customers. Why? In the context of bank, want to know why the customer left? What are the events that led to his/her leaving? Why? Customer retention Data Warehouse & Mining- Spring 2012

Data Warehouse: How is it Different? How much history? Depends on: Industry Cost of storing historical data Economic value of historical data Industry and history Telecom calls are much much more as compared to bank transactions 18 months Retailers interested in analyzing yearly seasonal patterns 65 weeks, why? Insurance companies want to do actuary analysis, use the historical data in order to predict risk 7 years Hence NOT a complete repository of data. Data Warehouse & Mining- Spring 2012

Data Warehouse: How is it Different? How much history? Economic value of data vs. storage cost Data Warehouse a complete repository of data? Data Warehouse & Mining- Spring 2012