Enterprise Data Warehousing (EDW) By: Jordan Olp.

Slides:



Advertisements
Similar presentations
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Advertisements

1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Prentice Hall, Database Systems Week 1 Introduction By Zekrullah Popal.
Accessing Organizational Information—Data Warehouse
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Introduction to data warehouses
Data Warehouse IMS5024 – presented by Eder Tsang.
Chapter 3 Database Management
An Introduction to Dimensional Data Warehouse Design Presented by Joseph J. Sarna Jr. JJS Systems, LLC.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Data Warehouse Toolkit Introduction. Data Warehouse Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An.
Designing a Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
1 Components of A Successful Data Warehouse Chris Wheaton, Co-Founder, Client Advocate.
Data Conversion to a Data warehouse Presented By Sanjay Gunasekaran.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
Data Warehouse Tools and Technologies - ETL
By N.Gopinath AP/CSE. Why a Data Warehouse Application – Business Perspectives  There are several reasons why organizations consider Data Warehousing.
Business Intelligence
Understanding Data Warehousing
1 Brett Hanes 30 March 2007 Data Warehousing & Business Intelligence 30 March 2007 Brett Hanes.
Data Warehouse Concepts Transparencies
Data Warehouse Architecture. Inmon’s Corporate Information Factory The enterprise data warehouse is not intended to be queried directly by analytic applications,
ETL Overview February 24, DS User Group - ETL - February ETL Overview “ETL is the heart and soul of business intelligence (BI).” -- TDWI ETL.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
1 Data Warehouses BUAD/American University Data Warehouses.
Data Warehouse Development Methodology
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
CISB594 – Business Intelligence
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
CISB594 – Business Intelligence Data Warehousing Part I.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
 Understand the basic definitions and concepts of data warehouses  Describe data warehouse architectures (high level).  Describe the processes used.
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Operational Data Store
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Recap of Day 1 1 Dr. Chaitali Basu Mukherji. 2 Which are our lowest/highest margin customers ? Who are my customers and what products are they buying?
Zhangxi Lin Texas Tech University
BUILDING THE INFORMATION INFRASTRUCTURE. The Challenge  Information understanding through increased context and consistency of definition.  Information.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
Data Warehouse/Data Mart It’s all about the data.
Business Intelligence Overview
Data Management Capabilities and Past Performance
CHAPTER SIX DATA Business Intelligence
Advanced Applied IT for Business 2
Business Intelligence & Data Warehousing
Manajemen Data (2) PTI Pertemuan 6.
Introduction to Data Warehousing
Data Warehousing and Data Mining By N.Gopinath AP/CSE
Data Warehouse.
Overview and Fundamentals
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
An Introduction to Data Warehousing
CHAPTER SIX OVERVIEW SECTION 6.1 – DATABASE FUNDAMENTALS
Data warehouse.
Data Warehousing Concepts
Business Intelligence
Presentation transcript:

Enterprise Data Warehousing (EDW) By: Jordan Olp

Overview  Definition  Brief History  Data Setup/Structure  Benefits/Downfalls  Future  Conclusion  Questions

Definition  EDW is a central data repository containing data from multiple usually separated areas that store current and historical data that is used for data analysis and reporting.

Brief History  Traces back to the 1960’s  Became prominent in computing industries in ’s  William Inmon  Considered to be the “Father of Data Warehousing”  Wrote numerous front running books on the topic including “Building the Data Warehouse”  Is a leading voice for the Top-Down methodology for Data Warehousing design

Brief History cont.  William Inmon cont.  Founded Prism Solutions, provided one of the first industry Data Warehousing tools in the early 1990s  Ralph Kimball  Also wrote numerous books on the topic including “The Data Warehouse Toolkit”  Provided solid practical modeling information from industry honed examples on modeling and setup  Is a leading voice for the Bottom-Up methodology for Data Warehousing design

Data Setup/Structure  Where the data originates  How the data is processed  Design methodologies on how data is stored

Pre-Warehouse Data Allocation  Data is never sent directly from origin into Data Warehouse  Always stored in separate specific locations usually based on department

Data Allocation cont.  Sales  Inventory  Accounting  Human Resources  Customer Relationship Management  Marketing  Information Technology  Customer Services  Research & Development  …

Extract, Transform, Load (ETL)  Cleansing, Reformatting, Modeling  This is the process of copying the data from its origin, making it more useful, and then loading it into the Data Warehouse  One or multiple tools can be used to complete this process, sometimes even another tool than provided by the Data Warehouse itself

Extract  Data is gathered from the original sources into a staging area where the data is Transformed before entering the Data Warehouse  Metadata – “data about data”  Is created after the Extract process and is also moved into the Data Warehouse with the rest of the data  Cleansed/error checked, sometimes dada is corrupted from older systems and is removed from the valid data – Metadata helps the process

Transform  Here the data is reformatted to make it more usable by the database  Rules/Function/Metrics are applied to meet technical needs of the Data Warehouse  Encoded, derived, joined, sorted…

Load  Simply moves the Extracted, Transformed data from the staging area into the Data Warehouse  Load times vary on the business and their needs

Design Methodologies  Facts  Single piece of data used as a value or measurement  Normalized approach  Facts are stored in tables, that are then grouped by data subject  Dimensional approach  Fact tables store data together, and dimensional tables reference fact tables

Design Methodologies Top-Down Design  Relies heavily on data normalization  Reduces data redundancy  Allows for precise analytics for this design  Data is closely knit together at atomic levels  Data corresponds very closely to the real world events they related to

Design Methodologies Bottom-Up Design  First need a Top-Down understanding of the business and processes you are trying to achieve  Relies on separated access layers within the Data Warehouse – commonly called Data Marts  Because of dimensional capabilities can achieve faster results – however it allows for data redundancy

Benefits  Data bundling  Application specific databases  Business Intelligence - Reporting analytics  Easily accommodates legacy and historical data  Usually improves data quality  Usefulness increases as data is accumulated

Downfalls  Hefty price tags – hardware and software  Usually require its own team to manage  ETL process is staggering  Adding new data sources or changing Warehouse scheme is difficult

Future Usefulness  More data means better analytics  Better predictability  Well established Data Warehouses give companies a competitive edge  Benefits of reporting far outweigh costs  Data Warehousing…Evolved -> Data Activation  Textual Information

Conclusion  High cost – High reward  Allows predictions of trends  Can bring together nearly unrelated data and make amazing use of it  More accurate and relevant data = better analytics = better reporting information = better business responses

Questions/Comments