Data Governance Data & Metadata Standards Antonio Amorin © 2011.

Slides:



Advertisements
Similar presentations
Chapter 1: The Database Environment
Advertisements

C6 Databases.
The Database Environment
Analyzing Systems Using Data Dictionaries Systems Analysis and Design, 7e Kendall & Kendall 8 © 2008 Pearson Prentice Hall.
3/5/2009Computer systems1 Analyzing System Using Data Dictionaries Computer System: 1. Data Dictionary 2. Data Dictionary Categories 3. Creating Data Dictionary.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Managing Data Resources
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 7.1.
ETEC 100 Information Technology
Analyzing Systems Using Data Dictionaries
1 Introduction The Database Environment. 2 Web Links Google General Database Search Database News Access Forums Google Database Books O’Reilly Books Oracle.
Chapter 10: Analyzing Systems Using Data Dictionaries Instructor: Paul K Chen.
The Relational Database Model:
© 2007 by Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Data and Knowledge Management
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 7.1.
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Chapter 1: The Database Environment
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Data warehousing theory and modelling techniques Building Dimensional Models.
Enterprise Business Information Model Enterprise Data Services.
DATA GOVERNANCE: Managing Access Jeremy Singer Suneetha Vaitheswaran.
Chapter 10 Architectural Design
Managing Information Technology
® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Chapter 1: The Database Environment and Development Process
Database Design - Lecture 1
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS (Cont’d) Instructor Ms. Arwa Binsaleh.
ITEC224 Database Programming
Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Managing Information Technology 6 th Edition CHAPTER 5 THE DATA RESOURCE.
- 1 - Roadmap to Re-aligning the Customer Master with Oracle's TCA Northern California OAUG March 7, 2005.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall Essentials of Systems Analysis and Design Fourth Edition Joseph S. Valacich Joey F.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer,
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall Analyzing Systems Using Data Dictionaries Systems Analysis and Design, 8e Kendall.
Database System Introduction to Database Environment October 31, 2009 Software Park, Bangkok Thailand Pree Thiengburanathum College of Arts and Media Chiang.
Chapter 1 Chapter 1: The Database Environment Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden © 2007 by Prentice.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Prepared By Prepared By : VINAY ALEXANDER ( विनय अलेक्सजेंड़र ) PGT(CS),KV JHAGRAKHAND.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Analyzing Systems Using Data Dictionaries Systems Analysis and Design, 8e Kendall & Kendall 8.
1 Database Systems Instructor: Nasir Minhas Assistant Professor UIIT PMAS-AAUR
1 DATABASE TECHNOLOGIES (Part 2) BUS Abdou Illia, Fall 2015 (September 9, 2015)
Chapter 1 © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chapter 1: The Database Environment and Development Process (Contd..) Modern Database.
Data Resource Management Chapter 5 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Introduction to DBMS Purpose of Database Systems View of Data
Software Specification Tools
Chapter 1: The Database Environment
Databases and Information Management
Database Management System (DBMS)
Introduction to Database Systems
CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS
Introduction to DBMS Purpose of Database Systems View of Data
CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS
Chapter 1: The Database Environment
Chapter 1: The Database Environment
The Database Environment
Chapter 1: The Database Environment
The ultimate in data organization
Chapter 1: The Database Environment
Presentation transcript:

Data Governance Data & Metadata Standards Antonio Amorin © 2011

Abstract This data governance presentation focuses on data and metadata standards. The intention of the presentation is to identify new standards or modernize existing standards for both data and metadata. © 2011

Biography Antonio Amorin President, Data Innovations, Inc. –Nineteen years of data modeling experience –Eleven years of data profiling experience –Delivered data modeling and data profiling solutions to numerous clients in the Midwest and East Coast –Presented at national and international conferences, user groups, webcasts, and at client sites –Founded Data Innovations, Inc. in 2002 © 2011

Data Innovations, Inc. Established in 2002 Based in northwest suburbs Professional Services: –Data Modeling –Data Profiling –Data Architecture –Metadata –Database Administration –ETL CA Service Partner in 2004 CA Commercial Reseller in 2006 CA Enterprise Solution Provider in 2007 © 2011

Agenda Data Standards Metadata Standards Recommendations Summary © 2011

Data Standards Documented agreements on representations, formats, and definitions of business data © 2011

Data Standards Benefits –Improved data quality –Improved data compatibility –Improved consistency and efficiency of data collection, use, and sharing –Reduced data redundancy © 2011

Data Standards Data Stewards –Role or position –Responsible for overseeing stewardship of the data and metadata –Likely to be on both the business and IT sides of the organization –Gatekeepers © 2011

Data Standards Council or Board –Data stewards and representatives of the various business areas –Responsible and/or accountable for specific data for the organization © 2011

Data Standards Types of Standards –Data definitions –Data rules –Data values –Data quality –Data standardization –Data security © 2011

Data Standards Data Definitions and Rules –Provide a consistent, clear understanding of what data content is expected –Centralize or publish across the organization –Enterprise data dictionary or metadata repository © 2011

Data Standards Data Values –Valid values lists Static or rarely changed data Codes Indicators –Master reference data Customer Product Etc –Centralize © 2011

Data Standards Data Quality –Leverage data profiling Column/Field –Value analysis –Pattern analysis –Data type analysis Table/File –Validate key structure –Determine dependencies Cross-table –Validate foreign keys –Valid values Cross-system © 2011

Data Standards Data Quality Assessments –Standardize the process through detailed analysis procedures –Identify the different data quality problems using standardized notation –Summarize the analysis in reports to communicate to others –Create detailed examples to coincide with the analysis procedures © 2011

Data Standards Data Standardization –Address Leverage address standardization software –Phone and Leverage data quality software to standardize –Business data Leverage valid values and master reference data to standardize data across the organization © 2011

Data Standards Data Security –Identify sensitive data –Clearly define and publish procedure for requesting access –Identify and maintain lists of users with access rights –Validate regularly that the user still needs access © 2011

Metadata Standards Documented agreements on representations, formats, and definitions of Metadata © 2011

Metadata Standards Metadata Stewards –Generally IT resources fill this role or position –Responsible for overseeing stewardship of the metadata –Standards are generally integrated into the SDLC © 2011

Metadata Standards Metadata Stewards –Generally IT resources fill this role or position –Responsible for overseeing stewardship of the metadata –Standards are generally integrated into the SDLC © 2011

Metadata Categories © 2011

Model Metadata Business metadata –Business requirements –Functional requirements –Data requirements Data profiling metadata –Column profiling –Table profiling –Cross-table profiling –Cross-system profiling Data quality metadata –Data quality statistics Data modeling metadata –Enterprise data models –Logical models –Physical models Mapping metadata –Source-to-target mapping –Data Flow Diagrams Database metadata –Data Definition Language © 2011

Model Metadata Business metadata –Business requirements –Functional requirements –Data requirements Data profiling metadata –Column profiling –Table profiling –Cross-table profiling –Cross-system profiling Data quality metadata –Data quality statistics Data modeling metadata –Enterprise data models –Logical models –Physical models Mapping metadata –Source-to-target mapping –Data Flow Diagrams Database metadata –Data Definition Language © 2011

Metadata Standards Data Requirements –Align with the business requirements –Each business requirement is likely to have matching data requirements –Clearly define the data content to be captured –Profile existing data sources © 2011

Metadata Standards Data Profiling –Identify standards for utilization Create a step-by-step process for preparing the data, profiling the data, and analyzing the results Identify and document the communication method to the business and IT © 2011

Metadata Standards Data Profiling –Column Profiling Identify both valid and invalid –Values –Patterns –Data types –Lengths Standardize notation –Descriptions –Problems © 2011

Metadata Standards Data Profiling –Table Profiling Validate key structure Identify candidate keys Identify natural keys Identify and document exceptions or violations –Cross-Table Profiling Identify redundant data Validate foreign keys Identify orphaned rows © 2011

Metadata Standards Data Profiling –Table Profiling Validate key structure Identify candidate keys Identify natural keys Identify and document exceptions or violations –Cross-Table Profiling Identify redundant data Validate foreign keys Identify orphaned rows © 2011

Metadata Standards Data Profiling –Cross-system Profiling Identify redundant data Identify inconsistent data Identify common matching criteria © 2011

Metadata Standards Data Quality –Consider requiring as part of all profiling initiatives –Capture and store in metadata repository –Establish thresholds –Trend monitoring © 2011

Metadata Standards Data Modeling –Enterprise Data Model Identify high level view of where the data lives across the enterprise Centralize to make accessible across the organization Consider identifying enterprise-level entities for important data © 2011

Metadata Standards Data Modeling –Model Standards Standardized development process Model naming convention Name standards Data type standards Clearly documented review process © 2011

Metadata Standards Data Modeling –Logical/Physical Models Standards Model or project narrative Subject area Entity Relationships Attribute Identifier Derived and BI Elements © 2011

Metadata Standards Data Modeling –Metadata Validation Column level –Values –Patterns –Data types –Lengths Table level –Key validation Cross-table level –Foreign key relationships © 2011

Metadata Standards Mapping –Standardize mapping process –Standardize format of mapping document –Require data profiling as part of the mapping process or to validate mapping © 2011

Recommendations Publish or centralize data and metadata standards Integrate data and metadata standards into the SDLC Include standards review during onboarding Identify and publish the list of stewards Enforce standards with offshore teams © 2011

Summary Data and metadata standards need to be developed and supported by both IT and the business Well defined standards will enhance the development of new applications and simplify the integration of data across the organization © 2011

Questions ? © 2011

Thank You! Antonio C. Amorin –(847) Data Innovations, Inc. – –(888) © 2011