Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia

Slides:



Advertisements
Similar presentations
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
Advertisements

Navigator Management Partners LLC Business Analysis Professional Development Day – Sep 2014 How to understand and deliver requirements to your Business.
A Fast Growing Market. Interesting New Players Lyzasoft.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
MetaScale is a subsidiary of Sears Holdings Corporation The 3 Ts of Hadoop Wuheng Luo Ankur Gupta
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
Data Warehouse IMS5024 – presented by Eder Tsang.
IS Consulting Process (IS 6005) Masters in Business Information Systems 26 th Feb 2010 Fergal Carton Business Information Systems.
Enterprise Business Processes and Reporting (IS 6214) MBS MIMAS 12 th Jan 2011 Fergal Carton Business Information Systems.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Enterprise Business Processes and Reporting (IS 6214) MBS MIMAS 19 th Jan 2011 Fergal Carton Business Information Systems.
Components and Architecture CS 543 – Data Warehousing.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Page 1Prepared by Sapient for MITVersion 0.1 – August – September 2004 This document represents a snapshot of an evolving set of documents. For information.
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Business Intelligence System September 2013 BI.
Business Intelligence components Introduction. Microsoft® SQL Server™ 2005 is a complete business intelligence (BI) platform that provides the features,
Business Intelligence Hariprit Padam Ramakanth Gajjela.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Components of the Data Warehouse Michael A. Fudge, Jr.
Real-Time Data Warehousing Prof. Navneet Goyal Department of Computer Science & Information Systems BITS, Pilani.
Patrick Seto CS157A Section 3 Data Warehouses Presented by Patrick Seto CS157A Section 3.
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
1 Brett Hanes 30 March 2007 Data Warehousing & Business Intelligence 30 March 2007 Brett Hanes.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
Data Warehousing Seminar Chapter 5. Data Warehouse Design Methodology Data Warehousing Lab. HyeYoung Cho.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Loading Ola Ekdahl IT Mentors 9/12/08.
Ch 5. The Evolution of Analytic Processes
Data Warehouse Concepts Transparencies
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
ETL Overview February 24, DS User Group - ETL - February ETL Overview “ETL is the heart and soul of business intelligence (BI).” -- TDWI ETL.
Enterprise Reporting Solution
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
BUS1MIS Management Information Systems Semester 1, 2012 Week 6 Lecture 1.
April 2004 At A Glance CAT is a highly portable exception monitoring and action agent that automates a set of ground system functions. Benefits Automates.
1 Data Warehouses BUAD/American University Data Warehouses.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Ayyat IT Group Murad Faridi Roll NO#2492 Muhammad Waqas Roll NO#2803 Salman Raza Roll NO#2473 Junaid Pervaiz Roll NO#2468 Instructor :- “ Madam Sana Saeed”
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
7 Strategies for Extracting, Transforming, and Loading.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
An Overview of Data Warehousing and OLAP Technology
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 5: Data Warehousing.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Advanced Applied IT for Business 2
THE COMPELLING NEED FOR DATA WAREHOUSING
Chapter 13 Business Intelligence and Data Warehouses
Chapter 13 The Data Warehouse
Enabling Scalable and HA Ingestion and Real-Time Big Data Insights for the Enterprise OCJUG, 2014.
Data Warehouse.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Typically data is extracted from multiple sources
Data Warehouse.
Data Warehousing Concepts
Analytics, BI & Data Integration
David Gilmore & Richard Blevins Senior Consultants April 17th, 2012
Presentation transcript:

Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia Intelligent Enterprise Feb 7, 2004 Vol. 7, Iss. 2; pg 26

Traditional Data Warehouse Central repository of transactional data spread across heterogeneous platforms and applications Focused on strategic reporting and analysis Loaded periodically (nightly, weekly, monthly) Information latency

Evolution of The Data Warehouse First-generation Reporting Second-generation Analytic processing and data mining Multidimensional tools for drill down New generation Speed information cycle time Minimize latency Information on demand

Why Real Time Data Warehousing? Active decision support Business activity monitoring (BAM) Alerting Efficiently execute business strategy Monitoring is completed in the background Positions information for use by downstream applications Can be built on top of existing data warehouse

Traditional Vs. Real-Time Data Warehouse Traditional Data Warehouse (EDW) Strategic Passive Historical trends Batch Offline analysis Isolated Not interactive Best effort Guarantees neither availability nor performance

Traditional Vs. Real-Time Data Warehouse Real-Time Data Warehouse (RTDW) Tactical Focuses on execution of strategy Real-Time Information on Demand Most up-to-date view of the business Integrated Integrates data warehousing with business processes Guaranteed Guarantees both availability and performance

Real-Time Integration Goal of real-time data extraction, transformation and loading Keep warehouse refreshed Minimal delay Issues How does the system identify what data has been added or changed since the last extract Performance impact of extracts on the source system

Real-Time Data Warehouse – Logical Architecture

Techniques for real-time ETL Simulated real-time feed Increase the frequency of batch runs Most useful when information is not required to be ‘up to the minute’ Requires minimal changes to existing ETL infrastructure Easy to implement

Techniques for real-time ETL Trickle Feed Allows continuous update of the RTDW as the data in the source system changes Messaging infrastructure Perpetually open data pipe Also called streaming Basic elements – Capture, Stage and Apply

Techniques for real-time ETL Trickle feed (cont.) Target and source databases must be configured May require special gateways Source – capture process: automatically capture changes to data or table structure RTDW records changes as logical change records (LCRs) that are kept in a staging partition called the message queue The message queue can be explicitly updated by user applications

Techniques for real-time ETL Trickle feed Role of Target database A process takes the logical change records out of the message queue and applies changes to selected database objects Rules are set in message queues to handle data transformation Require upfront development and can be complex to configure and manage

Trickle Feed Architecture for Real-Time load

Information Delivery Changes to traditional data warehouse Need to accommodate continuous data trickle feeds intermixed with liver user queries Schema design Active partition management Data aggregation

Designing an RTDW - Options Trickle And Flip Copy of fact table is made and given a name that cannot be accessed by queries As new data trickles in, it is appended to copy of the fact table At certain intervals, the trickle is halted, the copy fact table is copied, renamed to the active fact table name, (the active fact table is deleted) and the process starts over Poses scalability problems – may not keep up with the trickle depending on the size of the table

Designing an RTDW - Options Table Partitioning Allows for the creation of large tables that are handled internally by the database as a series of smaller ones, each with its own indexes Can rope off partition so it isn’t visible to active queries Problem: Determining criteria for partitioning

Designing an RTDW - Options Real-Time partitions Create new tables that resemble active fact tables that are designed for quick updates Interval tables – contain data from only the last update Truly real-time Can be accessed by analysts and other BI tools

Real-Time Partition

Conclusion RTDWs have an a distinct advantage for those business utilizing time-sensitive data Call Centers Performance indicators Fraud detection Yield management Certain financial transactions