Data Warehouse success depends on metadata

Slides:



Advertisements
Similar presentations
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Advertisements

Business Information Warehouse Business Information Warehouse.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Data - Information - Knowledge
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
Information Integration. Modes of Information Integration Applications involved more than one database source Three different modes –Federated Databases.
Data Warehouse/Data Mart Components Concepts Characteristics.
Hyperion EPM Overview & Case Study.
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
Designing the Data Warehouse and Data Mart Methodologies and Techniques.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Components and Architecture CS 543 – Data Warehousing.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Chapter 13 The Data Warehouse
Data Warehouse Components
Data Warehouse Toolkit Introduction. Data Warehouse Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An.
Designing a Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
© 2003, Prentice-Hall Chapter Chapter 2: The Data Warehouse Modern Data Warehousing, Mining, and Visualization: Core Concepts by George M. Marakas.
ACL Solutions for Continuous Auditing and Monitoring John Verver CA, CISA, CMC Vice President, Professional Services & Product Strategy ACL Services Ltd.
Leaving a Metadata Trail Chapter 14. Defining Warehouse Metadata Data about warehouse data and processing Vital to the warehouse Used by everyone Metadata.
ETL By Dr. Gabriel.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
L/O/G/O Metadata Business Intelligence Erwin Moeyaert.
Understanding Data Warehousing
Database Systems – Data Warehousing
Database Design - Lecture 1
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Warehousing Seminar Chapter 5. Data Warehouse Design Methodology Data Warehousing Lab. HyeYoung Cho.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
PowerMart of Informatica 발표자 : 김수경 (992COG05) 발표일 : March 27 th, 2000.
AN OVERVIEW OF DATA WAREHOUSING
© 2007 by Prentice Hall 1 Introduction to databases.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 10: The Data Warehouse Decision Support Systems in the 21 st.
1 Data Warehouses BUAD/American University Data Warehouses.
FORUM II Best Practices in Data Warehousing in Higher Education: A Framework for Higher Education Reporting April 18, 2005 Slide 1 Cornell University’s.
ETL Extract. Design Logical before Physical Have a plan Identify Data source candidates Analyze source systems with data- profiling tools Receive walk-through.
1 XML Based Networking Method for Connecting Distributed Anthropometric Databases 24 October 2006 Huaining Cheng Dr. Kathleen M. Robinette Human Effectiveness.
Information Builders : SmartMart Seon-Min Rhee Visualization & Simulation Lab Dept. of Computer Science & Engineering Ewha Womans University.
LS Retail BI Information/requirements/deployment steps.
Sachin Goel (68) Manav Mudgal (69) Piyush Samsukha (76) Rachit Singhal (82) Richa Somvanshi (85) Sahar ( )
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Transportation: Refreshing Warehouse Data Chapter 13.
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
7 Strategies for Extracting, Transforming, and Loading.
SAM for SQL Workloads Presenter Name.
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
ARCHIBUS, Inc. COBie Data Connectors Gary Siorek, Technical Applications Engineer 2013 COBie Challenge for Facility Managers112-Mar-2013.
Copyright © 2006, Oracle. All rights reserved. Czinkóczki László oktató Using the Oracle Warehouse Builder.
C Copyright © 2007, Oracle. All rights reserved. Introduction to Data Warehousing Fundamentals.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
1 Copyright © 2007, Oracle. All rights reserved. Installing and Setting Up the Warehouse Builder Environment.
Copyright  Oracle Corporation, All rights reserved Building the Warehouse.
Plan for Populating a DW
Building a Data Warehouse: Understanding Why & How
Advanced Applied IT for Business 2
Overview of MDM Site Hub
Chapter 13 The Data Warehouse
PowerMart of Informatica
Data Warehouse.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
C.U.SHAH COLLEGE OF ENG. & TECH.
Metadata The metadata contains
Data Warehousing Concepts
Presentation transcript:

Data Warehouse success depends on metadata

Overview What is metadata? Why is it needed? Types of metadata Metadata life cycle

Better end user data access and analysis tools can help users figure out how to get information they need out of the warehouse, but only good, easily accessible metadata can help them figure out what is available in the data warehouse and how to ask for it.

Copyright © 1997, Enterprise Group, Ltd. Data Warehouse Process Data Characteristics Raw Detail No/Minimal History Integrated Scrubbed History Summaries Targeted Specialized (OLAP) Source OLTP Systems Data Marts Data Warehouse Design Mapping Extract Scrub Transform Load Index Aggregation Replication Data Set Distribution Access & Analysis Resource Scheduling & Distribution Meta Data System Monitoring Copyright © 1997, Enterprise Group, Ltd.

Meta Data Description Information about the data warehouse system Content Organizational Structural Management Information Scheduling Information Contact Information Technical Information

Why Do You Need Meta Data? Share resources Users Tools Document system Without metadata Not Sustainable Not able to fully utilize resource

Metadata Life Cycle Collection - Identify metadata and capture into repository; automate Maintenance - Put in place processes to synchronize metadata automatically with changing data architecture; automate Deployment - Provide metadata to users in the right form and with the right tools; match metadata offered to specific needs of each audience

Metadata Collection Right metadata at the right time Variety of collection strategies Sources potential sources of data for DW external data data structures Data Models - enterprise data model start point import from CASE tool correlate enterprise and warehouse models

Metadata Collection Warehouse mappings Warehouse usage information map operational data into warehouse data structure Need record of logical connection used for mapping and transformation Warehouse usage information After roll out What tables accessed, by whom and for what What queries written Capture nature of business problem or query

Maintaining Metadata Up to date with reality Capture incremental changes

Metadata Deployment Warehouse developers need: physical structure info for data sources enterprise data model warehouse data model concerned with accuracy, completeness and flexibility of metadata Need access to comprehensive impact analysis capabilities Need to defend against accuracy & integrity questions

Meta Data Types Technical Business / User Levels Core Basic Deluxe

Core Technical Meta Data Source Target Algorithm

Basic Technical Meta Data History of transformation changes Business rules Source program / system name Source program author / owner Extract program name & version Extract program author / owner Extract JCL / Script name Extract JCL / Script author / owner Load JCL / Script name

Basic Technical Meta Data (con’t) Load JCL / Script author / owner Load frequency Extract dependencies Transformation dependencies Load dependencies Load completion date / time stamp Load completion record count Load status

Deluxe Technical Meta Data Source system platform Source system network address Source system support contact Source system support phone / beeper Target system platform Target system network address Target system support contact Target system support phone / beeper Etc.

Core Business Meta Data Field / object description Confidence level Frequency of update

Basic Business Meta Data Source system name Valid entries (i.e. “There are three valid codes: A, B, C”) Formats (i.e. Contract Date: 82/4/30) Business rules used to calculate or derive the data Changes in business rules over time

Deluxe Business Meta Data Data owner Data owner contact information Typical uses Level of summarization Related fields / objects Existing queries / reports using this field / object Estimated size (tables / objects)

Amount of Meta Data How much Meta Data do I need? As much as you can support!

The Meta Data Conundrum Meta Data is absolutely required for success Meta Data is 99% Manual Cold, Hard Reality 5,000 data mart fields 7 manually populated and maintained meta data fields 35,000 total manual meta data fields Are you ready for this, forever? Copyright © 1997, Enterprise Group, Ltd.

The Meta Data Conundrum Can you support 35,000 Meta Data fields? Calculate available ongoing resources Commit only to what you can maintain You MUST deliver core, probably some basic to be viable

Meta Data Functions - Technical Maintenance Troubleshooting Documentation Logging / Metrics

Meta Data Location DB Resident Almost always relational C/S predominantly Normalized design OODB is popular option for proprietary solutions

Repository Specialized databases designed to maintain metadata, together with tools and interfaces that allow a company to collect and distribute its metadata Repository Requirements Logically Common Open Extensible

Multiple Repository Upside Downside Local instance, quick response Local view Users don’t have to wade through other’s material Downside More challenging implementation Advanced replication Requires maintenance resources More susceptible to architecture modification to remote instances

Copyright © 1997, Enterprise Group, Ltd. Multiple Repository Where do I find all the information about sales? Data Mart Meta Data Data Marts Requires multiple access points Requires more system resources Copyright © 1997, Enterprise Group, Ltd.

Common Repository Upside Downside Optimum solution Avoids replication challenges Allows central management/access Downside Requires remote access for remote DM’s More network infrastructure May require gateways

Copyright © 1997, Enterprise Group, Ltd. Common Repository Where do I find all the information about sales? Data Mart Meta Data Data Marts Single access point for all information resources Low system resources required Copyright © 1997, Enterprise Group, Ltd.

Copyright © 1997, Enterprise Group, Ltd. Meta Data Process Integrated with entire process and data flow Populated from beginning to end Begin population at design phase of project Dedicated resources throughout Build Maintain Design Mapping Extract Scrub Transform Load Index Aggregation Replication Data Set Distribution Access & Analysis Resource Scheduling & Distribution Meta Data System Monitoring Copyright © 1997, Enterprise Group, Ltd.

Meta Data Vision vs. Reality Standards OMG standard (June 2000) Common Warehouse Metadata Model XML based Supported by Oracle Designed by Oracle, Unisys, IBM, NCR and Hyperion Industry initiatives just taking hold Proprietary solutions inadequate Who is missing?

Meta Data Challenges The Meta Data conundrum Thin tool support (pairing standards, MSFT coming) Hidden resource trap Absolute requirement for success

Web Sites List of metadata tools http://www.dwinfocenter.org/catalog.html Universal Metadata http://www.eaijournal.com/DataIntegration/HolyGrail.asp Metadata Project http://www.dis.state.ar.us/DIS_Proj/EDWR/Metadata/MD_Home.htm