DATA QUALITY PROBLEMS AND THEIR ROOT CAUSES DAMA COLUMBUS, OH CHAPTER MEETING – JANUARY 2015.

Slides:



Advertisements
Similar presentations
Data Quality Considerations
Advertisements

C6 Databases.
©Silberschatz, Korth and Sudarshan4.1Database System Concepts Lecture-1 Database system,CSE-313, P.B. Dr. M. A. Kashem Associate. Professor. CSE, DUET,
Describing Process Specifications and Structured Decisions Systems Analysis and Design, 7e Kendall & Kendall 9 © 2008 Pearson Prentice Hall.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
The Database Environment
Prentice Hall, Database Systems Week 1 Introduction By Zekrullah Popal.
Systems Analysis and Design 9th Edition
Storing Organizational Information—Databases
Kimberley Correia 15 th November, 2005 QuickAddress Pro Web V5.
Managing Data Resources
Supply Chain Management Managing the between all of the parties directly and indirectly involved in the procurement of a product or raw material.
Chapter 9 Describing Process Specifications and Structured Decisions
Chapter 2 Health Care Information Systems: A Practical Approach for Health Care Management 2nd Edition Wager ~ Lee ~ Glaser.
Introduction to Databases Transparencies
CS155b: E-Commerce Lecture 10: Feb. 13, 2003 XML and its relationship to B2B commerce Acknowledgements: R. Glushko, A. Gregory, and V. Ramachandran.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
© 2007 by Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Recording / Financing Fixed Asset Acquisition Human Resources Purchasing Revenue Traditional files approach: separate systems (Legacy Systems) Expenditure.
BUSINESS DRIVEN TECHNOLOGY
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Chapter 4: Managing Information Resources with Databases Copyright © 2013 Pearson Education, Inc. publishing as Prentice Hall Chapter
Enterprise Resource Planning, 1st Edition by Mary Sumner
Dimensions of Data Quality M&E Capacity Strengthening Workshop, Addis Ababa 4 to 8 June 2012 Arif Rashid, TOPS.
Chapter 1: The Database Environment
Hall, Accounting Information Systems, 7e ©2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly.
Systems Analysis and Design 10th Edition
Prof. Mohammad Moizuddin
Data and Process Modeling
1 BTEC HNC Systems Support Castle College 2007/8 Systems Analysis Lecture 9 Introduction to Design.
Customer Relationship Management Wagner & Zubey (2007) 11 Copyright (c) 2006 Prentice-Hall. All rights reserved. Copyright 2007 Thomson Publishing: All.
- 1 - Roadmap to Re-aligning the Customer Master with Oracle's TCA Northern California OAUG March 7, 2005.
The Changing Business Environment: A Manager’s Perspective 18.
7.1 Managing Data Resources Chapter 7 Essentials of Management Information Systems, 6e Chapter 7 Managing Data Resources © 2005 by Prentice Hall.
Emerging Technologies Work Group Master Data Management (MDM) in the Public Sector Don Hoag Manager.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer,
Storing Organizational Information - Databases
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
- Ahmad Al-Ghoul Data design. 2 learning Objectives Explain data design concepts and data structures Explain data design concepts and data structures.
Controlling MRK151 Chapter 6. Controlling Detecting and correcting significant variations in the results obtained from planned activities. Controlling.
Chapter 1 Chapter 1: The Database Environment Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden © 2007 by Prentice.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Chapter 4: Managing Information Resources with Databases Copyright © 2013 Pearson Education, Inc. publishing as Prentice Hall Chapter
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Database Application Design and Data Integrity AIMS 3710 R. Nakatsu.
McGraw-Hill/Irwin ©2009 The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 DATABASES AND DATA WAREHOUSES CHAPTER 6 DATABASES AND DATA WAREHOUSES.
1 Database Systems Instructor: Nasir Minhas Assistant Professor UIIT PMAS-AAUR
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
Copyright 2013 John Wiley & Sons, Inc. Chapter 3 Monitoring and Controlling the Transformation System.
Chapter 4 Automated Tools for Systems Development Modern Systems Analysis and Design Third Edition 4.1.
Proven Solutions for Complex Customer Relationship Structures Roger Hartman, INRANGE Consulting April 16, 2007.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Chapter 1 © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chapter 1: The Database Environment and Development Process (Contd..) Modern Database.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Lecture 12: Data Quality and Integration
Managing Data Resources File Organization and databases for business information systems.
Efficient & Streamlined Processes Using EE Accounts Payable Presented By: Andrew J Vitullo, CPA Partner, Logan Consulting 1.
Introduction To DBMS.
Modern Systems Analysis and Design Third Edition
Overview of MDM Site Hub
Modern Systems Analysis and Design Third Edition
MANAGING DATA RESOURCES
Data Quality By Suparna Kansakar.
Modern Systems Analysis and Design Third Edition
Modern Systems Analysis and Design Third Edition
Chapter 11 Describing Process Specifications and Structured Decisions
Enterprise Resource Planning, 1st Edition by Mary Sumner
Modern Systems Analysis and Design Third Edition
Sample Assessment & Governance Results
Presentation transcript:

DATA QUALITY PROBLEMS AND THEIR ROOT CAUSES DAMA COLUMBUS, OH CHAPTER MEETING – JANUARY 2015

2 Consequences of poor Data Quality  Have multiple names for the same supplier and items in our system, cannot roll up spend by supplier by item correctly –this leads to an inability to shrink our supplier base and negotiate more volume discounts and control spend  Cannot identify if the same customer has auto,property and life insurance with our organization which limits our ability to cross-sell/up- sell to them  We are losing millions of dollars in postage and collateral costs by mailing to customers who live in the same household

3 Consequences of poor Data Quality  Unable to identify active accounts receivables for the same B2B customer across our different lines of business. Customers with an outstanding balance of $500,000 are required to have their credit reviewed per SEC guidelines.  Need to measure risk exposure by top 5 customer accounts across the organization from a credit perspective. The biggest challenge is figuring out parent child relationships across affiliates  Have lost millions of dollars in procuring suppliers because we are not able to correctly identify and reconcile suppliers on a global basis

4 Examples of Data Quality Problems Data glitches  Typos, multiple formats, multiple scales, missing/default values Business logic embedded in natural keys Data values that do not conform to the business rules Information buried in free-form/flex fields Mistakes, misspellings, incorrect data types, lack of standards etc.

5 Traditional Definition of Data Quality Accuracy: Does the data accurately represent reality or a verifiable source? Integrity: Is the structure of data and relationships among entities and attributes maintained consistently?  Policy data that is not tied to a valid customer Consistency: Are data elements consistently defined and understood?

6 Example of Data Inconsistency Accuracy: Does the data accurately represent reality or a verifiable source? Integrity: Is the structure of data and relationships among entities and attributes maintained consistently?  Policy data that is not tied to a valid customer Consistency: Are data elements consistently defined and understood?

7 Traditional Definition of Data Quality Completeness: Is all necessary data present?  Field three is Revenue  In dollars or cents?  In dollars or euros?  Field four is Product Sales  In units or cases?

8 Traditional Definition of Data Quality Validity: Do data values fall within acceptable ranges defined by the business? Timeliness: Is data available when needed? Accessibility: Is the data easily accessible, understandable, and usable? Relevance

9 Another perspective on Data Quality When the quality of data is sufficient to support the business purpose for which it needs to be used, either by people or applications

10 Major Causes for Poor Data Quality Poorly designed business processes No central management of business processes No ownership or stewardship for data Weakness of tool(s) used to manage business processes Multiple instances of a central tool for managing business processes

11 Major Causes for Poor Data Quality Third party interfaces Data conversion from legacy systems Human error Lack of sufficient training on tools used to manage business processes

12 Case Study of a Manufacturing Organization Example of a case study with multiple heterogeneous ERP systems used by different divisions in the organization Centralized ERP system used to share Master Data Master Data published in a “one-to-many” way to other ERP systems Comprehensive documentation on the usage of centralized ERP system and acceptable domain values

13 Analysis of Case Study No specification on “who” had to enter the data “when” Same business task supported differently by divisional ERP instances Considerable lag between the physical presence of incoming goods and their visibility in the ERP system

14 Analysis of Case Study (Continued..) Multiple customizations of divisional ERP systems Weakness of GUI of central ERP system used for master data entry Central ERP system did not prevent duplicate entries form being made

15 Analysis of Case Study (Continued..) Weak search engine in ERP system Difficult to correct errors in ERP system Data conversion from legacy applications

16 Lessons Learnt Processing delays, divisional idiosyncrasies or operational errors cause data quality issues Business processes must be both aligned with the organization and oriented towards the customer

17 Lessons Learnt (Continued..) Multiple instances of the same tool customized by different persons from various institutions with divergent interests or work standards can cause issues Multiple instances of the same tool also cause issues due to lack of an enterprise-wide view

18 How to Prevent Data Quality Issues Best practices for handling separate ways to enter data in an IT system  Declare one application as “reference” or “master” system  Develop a set of common definitions and procedures

19 Example of Sample Governance Process

20 How to Prevent Data Quality Issues (Cont..) Avoid bias in producing data Avoid distributed architectures

21 How to Prevent Data Quality Issues (Cont..) Assign responsibility for data quality issues

22 Typical Data Governance Org Chart

23 How to Prevent Data Quality Issues (Cont..) Design Information Chains First

24 Questions? Contact me at :

25 References Presentation on Data Quality and Data Cleaning from Rutgers University