Best Practices Building a Data Warehouse Quickly October 16, 2009 | Florida Chapter Presented by Raphael Klebanov, WhereScape USA Copyright © 2009 by.

Slides:



Advertisements
Similar presentations
A BPM Framework for KPI-Driven Performance Management
Advertisements

Course: e-Governance Project Lifecycle Day 1
Systems Development Environment
Ch 3 System Development Environment
(Insert Title of Project Here) Kickoff Meeting (Month Date, Year)
Alternate Software Development Methodologies
Delivery Business Solutions April 29, Nashville PMI Symposium April 29, 2013 Stephanie Dedmon, PMP Director, Business Solutions Delivery Department.
CSCU 411 Software Engineering Chapter 2 Introduction to Software Engineering Management.
Roadmap to Continuous Integration Testing and Benefits Gowri Selka, Walgreens Natalie Koltun, Walgreens May 20th, 2014 ©2013 Walgreen Co. All rights reserved.
Chapter 6 SYSTEMS DEVELOPMENT Phases, Tools, and Techniques
Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall Essentials of Systems Analysis and Design Fourth Edition Joseph S. Valacich Joey F.
Chapter 1 The Systems Development Environment
IS 214 Needs Assessment and Evaluation of Information Systems Managing Usability © Copyright 2001 Kevin McBride.
Chapter 8 Managing IT Project Delivery
Iterative development and The Unified process
Pertemuan Matakuliah: A0214/Audit Sistem Informasi Tahun: 2007.
ECM Project Roles and Responsibilities
Business Intelligence Dr. Mahdi Esmaeili 1. Technical Infrastructure Evaluation Hardware Network Middleware Database Management Systems Tools and Standards.
Copyright 2006 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Third Edition Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter.
Systems Development Planning Lifecycle.
CHAPTER 19 Building Software.
Collaborative Business Intelligence Kevin Burrus Brainspire Solutions
Enterprise Architecture
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 1.1.
What is Business Analysis Planning & Monitoring?
Information on Demand in Action Darren Silvester – Design Authority 17 th September 2009.
The Microsoft Office 2007 Enterprise Project Management Solution:
Chapter 3 – Agile Software Development 1Chapter 3 Agile software development.
IT Systems Analysis & Design
Engineering, Operations & Technology | Information TechnologyAPEX | 1 Copyright © 2009 Boeing. All rights reserved. Architecture Concept UG D- DOC UG D-
The Challenge of IT-Business Alignment
CS 360 Lecture 3.  The software process is a structured set of activities required to develop a software system.  Fundamental Assumption:  Good software.
1 ISA&D7‏/8‏/ ISA&D7‏/8‏/2013 Systems Development Life Cycle Phases and Activities in the SDLC Variations of the SDLC models.
Testing Challenges in an Agile Environment Biraj Nakarja Sogeti UK 28 th October 2009.
Certificate IV in Project Management Introduction to Project Management Course Number Qualification Code BSB41507.
Industry SDLCs and Business Climate. Justin Kalicharan Credentials Director and Senior Technology Officer Over 14 years of coding experience in various.
University of Palestine software engineering department Testing of Software Systems Testing throughout the software life cycle instructor: Tasneem Darwish.
Rapid Application Development. What is RAD……..?  Rapid Application Development (RAD) is a software development process.  first developed during the.
Extreme Programming (XP). Agile Software Development Paradigm Values individuals and interactions over processes and tools. Values working software over.
Introduction to Systems Analysis and Design
Microsoft Office Project 2003: Selling EPM in your Organization Matt Wilson Business Solutions Specialist LMR Solutions.
Systems Analysis and Design in a Changing World, Fourth Edition
Slide 1 Systems Analysis and Design With UML 2.0 An Object-Oriented Approach, Second Edition Chapter 1: Introduction to Systems Analysis and Design Alan.
12/4/ OBIEE Technical Conference Getting Started with a Dashboard Development Project Theresa May.
Business Analysis. Business Analysis Concepts Enterprise Analysis ► Identify business opportunities ► Understand the business strategy ► Identify Business.
What Is DevOps? DevOps is "a portmanteau of 'development' and 'operations'" and is "a software development method that stresses communications, collaboration,
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
WATERFALL DEVELOPMENT MODEL. Waterfall model is LINEAR development lifecycle. This means each phase must be completed before moving onto the next!!! WHAT.
Extreme programming (XP) Variant of agile Takes commonsense practices to extreme levels © 2012 by Václav Rajlich1.
Overview of RUP Lunch and Learn. Overview of RUP © 2008 Cardinal Solutions Group 2 Welcome  Introductions  What is your experience with RUP  What is.
Business Intelligence Program 2011 Scorecard. Business Intelligence Program Roadmap 2.
Chapter 10 Information Systems Development. Learning Objectives Upon successful completion of this chapter, you will be able to: Explain the overall process.
Software Testing Process
An Agile Requirements Approach 1. Step 1: Get Organized  Meet with your team and agree on the basic software processes you will employ.  Decide how.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
Business Intelligence Pathway Method 5 th Meeting Course Name: Business Intelligence Year: 2009.
SYSTEM ANALYSIS AND DESIGN SAFAA S.Y. DALLOUL. INTRODUCTION.
44222: Information Systems Development
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Moving To Business Objects XI – Release 2 Presented to Business Objects User Group Rich Strout October 20, 1006.
Analysis and Reporting Toolset (A&RT): Lessons on how to develop a system with an external partner David Smith AstraZeneca.
C_ITIP211 LECTURER: E.DONDO. Unit 1 : The Systems Development Environment.
Methodologies and Algorithms
Continuous Delivery- Complete Guide
BANKING INFORMATION SYSTEMS
IT Systems Analysis & Design
Script-less Automation: An Approach to Shift-Left.
SDLC (Software Development Life Cycle)
(Insert Title of Project Here) Kickoff Meeting
Presentation transcript:

Best Practices Building a Data Warehouse Quickly October 16, 2009 | Florida Chapter Presented by Raphael Klebanov, WhereScape USA Copyright © 2009 by WhereScape Software | Slide # 1 Copyright © 2009 by WhereScape Software

Key factors that influence a successful data warehouse task Implementing the True Development Approach Choosing a Rapid Development Product Ensuring Data Availability Involving Key Users throughout the whole project Relying on a Pragmatic Governance Framework Utilizing experienced Team Members Selecting the right Hardware, Infrastructure Technology Copyright © 2009 by WhereScape Software | Slide # 2 Abstract

Copyright © 2009 by WhereScape Software | Slide # 3 Basic Architecture of a Data Warehouse

…for a intelligent decision-making process? …for data warehouse? Are you ready … Copyright © 2009 by WhereScape Software | Slide # 4

Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 5

Unreliable or unattainable user requirements Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 6

Unreliable or unattainable user requirements Quality of the data that feeds the source system Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 7

Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 8

Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 9

Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity High TCO (Total Cost of Ownership Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 10

Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity High TCO (Total Cost of Ownership) Poor documentation Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 11

Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity High TCO (Total Cost of Ownership) Poor documentation “…over 50% of data warehouse projects fail or go wildly over budget – they blame data quality…” The real problem is project approach. Source: Gartner. Magic Quadrant for Data Integration Tools, 2007 Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide # 12

DW Project Components Copyright © 2009 by WhereScape Software | Slide # 13

Strong sponsorship of the DW from the business DW Project Components Copyright © 2009 by WhereScape Software | Slide # 14

Strong sponsorship of the DW from the business Divide and Conquer approach DW Project Components Copyright © 2009 by WhereScape Software | Slide # 15

Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach DW Project Components Copyright © 2009 by WhereScape Software | Slide # 16

Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools DW Project Components Copyright © 2009 by WhereScape Software | Slide # 17

Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype DW Project Components Copyright © 2009 by WhereScape Software | Slide # 18

Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype Access to SME during development DW Project Components Copyright © 2009 by WhereScape Software | Slide # 19

Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype Access to SME during development Compact teams DW Project Components Copyright © 2009 by WhereScape Software | Slide # 20

Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype Access to SME during development Compact teams Sturdy development hardware DW Project Components Copyright © 2009 by WhereScape Software | Slide # 21

Business Ownership Copyright © 2009 by WhereScape Software | Slide # 22

The data warehouse should be owned by the business – not IT Business Ownership Copyright © 2009 by WhereScape Software | Slide # 23

The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business Business Ownership Copyright © 2009 by WhereScape Software | Slide # 24

The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business Prioritization of project phases or agreement on a data dictionary should be agreed by the business Business Ownership Copyright © 2009 by WhereScape Software | Slide # 25

The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business Prioritization of project phases or agreement on a data dictionary should be agreed by the business Without a strong, high level business sponsor(s) the project is likely to hit problems Business Ownership Copyright © 2009 by WhereScape Software | Slide # 26

The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business prioritization of project phases or agreement on a data dictionary to should be agreed by the business Without a strong, high level business sponsor(s) the project is likely to hit problems If sponsorship is present then the data warehouse project can be broken down into a set of smaller projects Business Ownership Copyright © 2009 by WhereScape Software | Slide # 27

The Data Warehouse lifecycle …as we know it

Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 29

A ‘big bang’ approach to data warehousing has almost always ended in disaster Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 30

A ‘big bang’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 31

A ‘big bang’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 32

A ‘big bang’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Rapid prototyping and tight development cycles are vital for successful warehouse Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 33

A ‘big bang’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Rapid prototyping and tight development cycles are vital for successful warehouse Keep in view the bigger picture Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 34

A ‘big bang’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Rapid prototyping and tight development cycles are vital for successful warehouse Keep in view the bigger picture Use smaller phases to fund the project adequately Divide and Conquer Copyright © 2009 by WhereScape Software | Slide # 35

The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 36

Getting the business reps to use working prototypes to share an understanding of the scope The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 37

Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 38

Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 39

Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype A better place to start is to collect KPIs and source system technical documentation The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 40

Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype A better place to start is to collect KPIs and source system technical documentation OLAP technology and user workshops are key tools in allowing the business to get their hands on the data The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 41

Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype A better place to start is to collect KPIs and source system technical documentation OLAP technology and user workshops are key tools in allowing the business to get their hands on the data Data quality should not be addressed in the DW; problem should be fixed on the source system The True Project Approach Copyright © 2009 by WhereScape Software | Slide # 42

Rapid Development Product Copyright © 2009 by WhereScape Software | Slide # 43

Scrutinize Extract/Transform and Load (ETL) tools when considering building a DW. ETL tools do not provide the ability to build a working prototype and work in short development cycles Rapid Development Product and ETL Copyright © 2009 by WhereScape Software | Slide # 44

Combining processing and design Ability to enable, manage fast iterations of the prototype Environment migration Version control Automatic documentation Rapid Development Product Enables: Copyright © 2009 by WhereScape Software | Slide # 45

The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 46

Single Development Interface The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 47

Single Development Interface Documentation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 48

Single Development Interface Documentation Automated Table Generation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 49

Single Development Interface Documentation Automated Table Generation Automated Code Generation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 50

Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 51

Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 52

Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 53

Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout Leverage Existing Core Skills. The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 54

Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout Leverage Existing Core Skills Consistent Framework The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 55

Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout Leverage Existing Core Skills Consistent Framework Extensibility The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide # 56

Copyright © 2009 by WhereScape Software | Slide # 57 Try to get it right first time: The SDLC approach But: Tools and operators in silos – inflexible Hard to engage business users, no shared understanding Locked in requirements that can’t be met Redevelopment 120 day cycle Risky, expensive, never OTTB & never finished No documentation, so hard to support The Traditional Approach

Copyright © 2009 by WhereScape Software | Slide # 58 Prototype and iterate to prove a design with users Supported by: Integrated toolset and metadata repository – maximum flexibility Continuous business engagement, shared understanding No ambiguity or disagreement about scope Successful phase completion Complete, OTTB, user expectations exceeded Documented solution that is easy to support The Rapid Development Approach 5 day cycle

A DWLC Tool would save a huge amount of development time and effort, and would enable the approach required to deliver a successful outcome The DWLC methodology is a child concept for Agile Development Methodology also known as Systems Development Life Cycle (SDLC) The Features of a DWLC Tool Slide # 59

Ensuring Data Availability Copyright © 2009 by WhereScape Software | Slide # 60

The lack of good quality live data will have a major impact on the success of Iterative project approach The DW’s capacity to answer BI requirements is unworkable, without sufficient data to populate the DW If a new source system is integrated into the data warehouse, the “real” data is quite essential If no “real” data for new source is available, then the significant rework will be required once the source is up and running Ensuring Data Availability Copyright © 2009 by WhereScape Software | Slide # 61

Involving the Business Copyright © 2009 by WhereScape Software | Slide # 62

Representatives from the Business provide the partnership with the DW development team These reps need to be able to articulate the needs of the business to the dev. team These reps have to trust the business department behind them when it comes to making any decisions The partnership during the iterative project approach provides a reliable, successful outcome The main forum for developers to show a working prototype and get user feedback is user workshop The business Involvement for the duration of the DW development will reduce the QA overheads Involving the Business Copyright © 2009 by WhereScape Software | Slide # 63

Copyright © 2009 by WhereScape Software | Slide # 64 Project governance

Governance of the data warehouse project should operate at two levels: an enterprise level and a project level Pragmatic Governance Framework Copyright © 2009 by WhereScape Software | Slide # 65

Governance of the data warehouse project should operate at two levels: an enterprise level and a project level Pragmatic Governance Framework Copyright © 2009 by WhereScape Software | Slide # 66 Business Requirements Technical Constraints

Governance of the data warehouse project should operate at two levels: an enterprise level and a project level Pragmatic Governance Framework Copyright © 2009 by WhereScape Software | Slide # 67 Business Requirements Technical Constraints Shared understanding, Prototype and Iterate, Best possible outcome

Sponsorship is sourced from a highly-placed executive The steering committee provides: + Vision + Visibility + Priorities + Scope + Focus + Terminology Copyright © 2009 by WhereScape Software | Slide # 68 The DW needs to be owned by the business

At a minimum project governance should include: A project plan, detailing (high level) scope and timelines Regular status meetings to share information Change request process documentation Standards and procedures for building a consistent DW Version control and backup procedures Ownership of specific environments and project roles Copyright © 2009 by WhereScape Software | Slide # 69 Project governance

Utilizing Experienced Team Members Copyright © 2009 by WhereScape Software | Slide # 70

Productivity within a data warehouse implementation is dependent on having experienced team members – both on business side and also on the technical side Experienced Subject Matter Experts (SME) provide a thorough understanding of the business and its needs Experienced data warehouse developers can take those requirements and turn them into a functioning data warehouse in a rapid timeframe Utilizing Experienced Team Members Copyright © 2009 by WhereScape Software | Slide # 71

Selecting the Right Infrastructure Copyright © 2009 by WhereScape Software | Slide # 72

Sufficient hardware and technology infrastructure during development Lower productivity can translate into slower development cycles and iterations, which stands the risk of losing project momentum Trade-off between having adequately sized hardware and the cost associated with purchasing that hardware One way to mitigate undersized hardware is to use smaller subsets of data during the prototyping phase Selecting the Right Infrastructure Copyright © 2009 by WhereScape Software | Slide # 73

Treat the Warehousing as a process, not a project Conclusion Copyright © 2009 by WhereScape Software | Slide # 74

This means: focusing on iterative releases and rollouts that follow in quick succession keeping the warehouse in line with the ever changing needs of the business, instead of treating it as a one-time project In order to achieve this, a change in the development approach and tools utilized for building the data warehouse must be adopted Conclusion Copyright © 2009 by WhereScape Software | Slide # 75

The key factors to creating a successful data warehouse are: Implementing the True Development Approach Choosing a Rapid Development Product Ensuring Data Availability Involving Key Users throughout the whole project Relying on a Pragmatic Governance Framework Utilizing experienced Team Members Selecting the right hardware and other related Infrastructure Technology Copyright © 2009 by WhereScape Software | Slide # 76 Conclusion

Raphael Klebanov, Analyst at WhereScape Office Phone: address: Public Profile: My personal information: Copyright © 2009 by WhereScape Software | Slide # 77