Data Warehouse and OLAP Technology

Slides:



Advertisements
Similar presentations
Chapter 13 The Data Warehouse
Advertisements

April 30, Data Warehousing and OLAP Technology: An Overview  What is a data warehouse?  Data warehouse architecture  From data warehousing to.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
Overview of Databases and Transaction Processing Chapter 1.
Introduction to Data Warehousing Enrico Franconi CS 636.
Chapter 13 The Data Warehouse
1 Introduction Introduction to database systems Database Management Systems (DBMS) Type of Databases Database Design Database Design Considerations.
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
A Comparsion of Databases and Data Warehouses Name: Liliana Livorová Subject: Distributed Data Processing.
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
Chapter 1 Database and Database Users Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2008.
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
CISB594 – Business Intelligence
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Operational vs. Informational System. Operational System Operational systems maintain records of daily business transactions whereas a Data Warehouse.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
CISB594 – Business Intelligence Data Warehousing Part I.
MAIN BOOKS 1. DATA WAREHOUSING IN THE REAL WORLD : Sam Anshory & Dennis Murray, Pearson 2. DATA MINING CONCEPTS AND TECHNIQUES : Jiawei Han & Micheline.
CISB594 – Business Intelligence Data Warehousing Part I.
DATA RESOURCE MANAGEMENT
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
CISB594 – Business Intelligence Data Warehousing Part I.
Business Intelligence Training Siemens Engineering Pakistan Zeeshan Shah December 07, 2009.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
An Overview of Data Warehousing and OLAP Technology
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Chapter 1 Database and Database Users
Data Warehousing Data warehousing provides architectures & tools for business executives to systematically organize, understand & use their data to make.
Chapter 1: Introduction
Defining Data Warehouse Concepts and Terminology
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
Fundamentals & Ethics of Information Systems IS 201
Chapter 13 The Data Warehouse
Data Warehouse—Subject‐Oriented
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
Defining Data Warehouse Concepts and Terminology
Overview of Databases and Transaction Processing
Data Warehouse and OLAP
Database Vs. Data Warehouse
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
An Introduction to Data Warehousing
Introduction to Data Warehousing
Data Warehousing: Data Models and OLAP operations
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Data Warehousing Data Model –Part 1
Introduction of Week 9 Return assignment 5-2
Data Warehouse.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing Concepts
Chapter 3 Database Management
Module 5 Strategic Enterprise Management and Reporting Tools
Data Warehouse and OLAP
Presentation transcript:

Data Warehouse and OLAP Technology Chapter-2 Data Warehouse and OLAP Technology This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul , Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed under a Creative Commons Attribution 4.0 International License.

What Is a Data Warehouse? Data warehousing provides architectures and tools for business executives to systematically organize, understand, and use their data to make strategic decisions. Loosely speaking, a data warehouse refers to a database that is maintained separately from an organization's operational databases. According to William H. Inmon,, "A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of management's decision making process"

The four keywords are: Subject-oriented Integrated Time-variant Nonvolatile These four keywords distinguish data warehouses from other data repository systems, such as relational database systems, transaction processing systems, and file systems. 1. Subject-oriented: A data warehouse is organized around major subjects, such as customer, supplier, product, and sales. Rather than concentrating on the day-to-day operations and transaction processing of an organization.

2. Integrated: A data warehouse is usually constructed by integrating multiple heterogeneous sources, such as relational databases, flat files, and on-line transaction records. 3. Time-variant: Data are stored to provide information from a historical perspective ( past 5-10 years). 4. Nonvolatile: A data warehouse is always a physically separate store of data transformed from the application data found in the operational environment. Due to this separation, a data warehouse does not require transaction processing, recovery, and concurrency control. It usually requires only two operations : initial loading of data and access of data.  

"How are organizations using the information from data warehouses?" Many organizations use this information to support business decision-making activities (1) increasing customer focus, which includes the analysis of customer buying patterns. (2) repositioning products and managing product portfolios by comparing the performance of sales by quarter, by year, and by geographic regions in order to fine- tune production strategies. (3) analyzing operations and looking for sources of profit. (4)managing the customer relationships, making environmental corrections, and managing the cost of corporate assets.

Differences between Operational Database Systems and Data Warehouses The major task of on-line operational database systems is to perform on-line transaction and query processing. These systems are called on- line transaction processing (OLTP) systems. They cover most of the day-to-day operations of an organization, such as purchasing, inventory, manufacturing, banking, payroll, registration and accounting. Data warehouse systems, on the other hand, serve users in the role of data analysis and decision making. Such systems can organize and present data in various formats in order to accommodate the different needs of the different users. These systems are known as on-line analytical processing (OLAP) systems.

The major distinguishing features between OLTP and OLAP are summarized as follows: Users and system orientation: An OLTP system is customer-oriented and is used for transaction and query processing by clerks, clients, and IT professionals. An OLAP system is market-oriented and is used for data analysis by knowledge workers, including managers, executives, and analysts. Data contents: An OLTP system manages current data that are too detailed to be easily used for decision making. An OLAP system manages large amounts of historical data, provides facilities for summarization, aggregation and stores , manages information at different levels .

Database design: An OLTP system usually adopts an entity-relationship (ER) data model and an application- oriented database design. An OLAP system typically adopts either a Star or Snow Fake model and a subject oriented database design. View: An OLTP system focuses mainly on the current data within an enterprise or department. An OLAP system often spans multiple versions of a database schema, due to the growing process of an organization. OLAP systems also deal with information that originates from different organizations. Access patterns: The access patterns of an OLTP system consist mainly of short, atomic transactions. However, OLAP systems are mostly read-only operations because most data warehouses store historical rather than up-to-date information.

Feature OLTP OLAP Characteristic Orientation User Function DB design Data Summarization View Unit of work Access Focus Operations Number of records accessed Number of users DB size OLTP   operational processing transaction clerk, DBA, database professional day-to-day operations ER based, application-oriented current; guaranteed up-to-date   primitive, highly detailed detailed, flat relational short, simple transaction read/write data in index/hash on primary key tens thousands 100 MB to GB OLAP informational processing analysis knowledge worker (e.g., manager, executive, analyst) long-term informational requirements, decision support star/snowflake, subject-oriented historical; accuracy maintained over time summarized, consolidated summarized, multidimensional complex query mostly read information out lots of scans millions hundreds 100 GB to TB

But, Why Have a Separate Data Warehouse? 1. An operational database is designed and tuned from known tasks and workloads, such as indexing and hashing using primary keys, searching for particular records, and optimizing queries. Data warehouse queries are often complex. They involve the computation of large groups of data at summarized levels, and may require the use of special data organization, access, and implementation methods based on multidimensional views.

2. An operational database supports the concurrent processing of multiple transactions. Concurrency control and recovery mechanisms, such as locking and logging, are required to ensure the consistency and robustness of transactions. An OLAP query often needs read-only access of data records for summarization and aggregation. Concurrency control and recovery mechanisms, if applied for such OLAP operations, may risk the execution of concurrent transactions and thus substantially reduce the throughput of an OLTP system.

3. The separation of operational databases from data warehouses is based on the different structures, contents, and uses of the data in these two systems. Decision support requires historical data, whereas Operational databases do not typically maintain historical data.

Thank you