Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

Supporting End-User Access
C6 Databases.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
DATA WAREHOUSING.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
Chapter 13 – Data Warehousing. Databases  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information,
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Data Warehouse & Data Mining
Understanding Data Warehousing
Database Systems – Data Warehousing
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Warehouse Concepts Transparencies
Datawarehouse Objectives
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Meeting a Business Need Chapter 2. Overview Defining DW Concepts & Terminology Planning For a Successful Warehouse Project Management (Methodology, Maintaining.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
BUSINESS ANALYTICS AND DATA VISUALIZATION
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Ayyat IT Group Murad Faridi Roll NO#2492 Muhammad Waqas Roll NO#2803 Salman Raza Roll NO#2473 Junaid Pervaiz Roll NO#2468 Instructor :- “ Madam Sana Saeed”
MIS2502: Data Analytics Advanced Analytics - Introduction.
Advanced Database Concepts
1 Database Systems, 8 th Edition 1 Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: –How business intelligence.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
An Overview of Data Warehousing and OLAP Technology
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Defining Data Warehouse Concepts and Terminology
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Data Warehouse.
Chapter 13 – Data Warehousing
Defining Data Warehouse Concepts and Terminology
المحاضرة 4 : مستودعات البيانات (Data warehouse)
Data Warehouse and OLAP
An Introduction to Data Warehousing
Introduction of Week 9 Return assignment 5-2
Data Warehousing Concepts
Data Warehouse and OLAP
Data Warehousing.
Presentation transcript:

Introduction to Data Warehousing

From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis, exploration, identification of trends etc. has led to Decision Support Systems. Rapid Growth since mid 70’s DBMSs vendors have answered this trend by adding new features to existing products Rarely enough

DBs for Decision Support Trend towards Data Warehousing Data Warehousing – consolidation of data from several databases which are in turn maintained by individual business units along with historical and summary information

Characteristics of TPSs CharacteristicOLTP Typical operationUpdate Level of analytical requirementsLow ScreensUnchanging Amount of data per transactionSmall Data levelDetailed Age of dataCurrent OrientationRecords

Complex Analysis Historical information to analyze Data needs to be integrated Database design: Denormalized, star schema OLTP Information to support day-to-day service Data stored at transaction level Database design: Normalized TPS vs Decision Support

MIS and Decision Support Operational reports Decision makers Production platforms MIS systems provided business data Reports were developed on request Reports provided little analysis capability no personal ad hoc access to data Ad hoc access

Analyzing Data from Operational Systems Data structures are complex Systems are designed for high performance and throughput Data is not meaningfully represented Data is dispersed TPS systems unsuitable for intensive queries Operational reports Production platforms ERP

End user computing offloaded from the operational environment User’s own data Data Extract Processing Extracts Operational systems Decision makers

Management Issues Extract explosion Duplicated effort Multiple technologies Obsolete reports No metadata Extracts Operational systems Decision makers

Data Quality Issues No common time basis Different calculation algorithms Different levels of extraction Different levels of granularity Different data field names Different data field meanings Missing information No data correction rules No drill-down capability

From Extract to Warehouse DSS Controlled Reliable Quality information Single source of data Data warehouse Internal and external systems Decision makers

Data Warehousing Architecture Metadata respository Serves Extract Clean Transform Load Refresh OLAP Data Warehouse External Data Sources Operational Databases Visualisation Data Mining

Business Motivators Provide superior services and products Know the business New products Invest in customers Retain customers Invest in technology Reinvent to face new challenges

Centralised data warehouse Federated data warehouse

Tiered data warehouse

Data Warehouses Vs Data Marts Data Mart Department Single-subject Few < 100 GB Months Data Mart Data Warehouse Property Scope Subjects Data Source Size (typical) Implementation time Data Warehouse Enterprise Multiple Many 100 GB to > 1 TB Months to years

End-user Access Tools High performance is achieved by pre-planning the requirements for joins, summations, and periodic reports by end-users. There are five main groups of access tools: –Data reporting and query tools –Application development tools –Executive information system (EIS) tools –Online analytical processing (OLAP) tools –Data mining tools

Data Usage - $1000 questions Need to complement RDBMS technology with a flexible, multidimensional view of data

The Functionality of OLAP Rotate and drill down Create and examine calculated data Determine comparative or relative differences. Perform exception and trend analysis. Perform advanced analytical functions

The star structure

Multidimensional Database Model The data is found at the intersection of dimensions. Store Time FINANCE Store Product Time SALES Customer

Data Mining

Data mining functions Associations –85 percent of customers who buy a certain brand of wine also buy a certain type of pasta Sequential patterns –32 percent of female customers who order a red jacket within six months buy a gray skirt Classifying –Frequent customers are those with incomes about $50,000 and having two or more children Clustering –Market segmentation Predicting –predict the revenue value of a new customer based on that personal demographic variables