Understanding Data Quality Issues: Finding Data Inaccuracies Art DeMaio Evoke Software VP Technical Sales Support.

Slides:



Advertisements
Similar presentations
Family Owned and Operated Business Founded in 1919 Headquartered in Rochester, NY – Branch Offices in Albany, Syracuse, Buffalo and Erie, PA Provides.
Advertisements

Facilitated by Joanne Fraser RiverSystems
Information Technologies Page 1 Information Technologies Page 1 Information Technologies Page 1 Information Technologies Page 1Information Technologies.
BUSINESS PLUG-IN B2 Business Process.
What is a major cause of dissatisfaction with the sales job? Lack of training.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Practical Uses of Software Measurement for Process Improvement January 10, V1.0 Larry Dribin, Ph.D
Lecture 5 Themes in this session Building and managing the data warehouse Data extraction and transformation Technical issues.
Projmgmt-1/33 DePaul University Project Management I - Risk Management Instructor: David A. Lash.
ISAS - Strategic Business Systems Group Data Warehouses and Business Information Quality John Shelton Strategic Business Systems Group Manager.
3 Chapter Needs Assessment.
Chapter 3: The Project Management Process Groups
By Saurabh Sardesai October 2014.
Sylnovie Merchant, Ph.D MIS 210 Fall 2004 Lecture 1: The Systems Analyst Project Management MIS 210 Information Systems I.
Lean Six Sigma Executive Introduction. Copyright OpenSourceSixSigma.com Competition Every morning in Africa, a gazelle wakes up; it knows it must run.
Readiness Index – Is your application ready for Production? Jeff Tatelman SQuAD October 2008.
Managing Projects
Introduction to Project Management. What is a Project? “A planned undertaking of related activities to reach an objective that has a beginning and an.
Business Consulting Services Agenda Discussion: Management Reports Discussion: Project Reports Discussion: Engagement Proposal Upcoming Events Review Project.
Chief Learning Officer Webinar Sponsored by Skillsoft September 13, 2012 David Vance.
Change Request Management
Rohit Agarwal. Introduction Types of Profiling When should Data Profiling be done? General Model Methodology Conclusion References.
Strategic Initiatives for Implementing Competitive Advantage Great products—Innovative products Doesn’t matter---Bad processes—no perceived value 1) You.
WELCOME good day Alexandru Doszlop
McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc. All rights reserved. BUSINESS DRIVEN TECHNOLOGY Chapter Twelve: Integrating the Organization from.
Software Project Management
Project Risk Management. The Importance of Project Risk Management Project risk management is the art and science of identifying, analyzing, and responding.
BUSINESS DRIVEN TECHNOLOGY
9 Closing the Project Teaching Strategies
McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc. All rights reserved. BUSINESS DRIVEN TECHNOLOGY Business Plug-In B10 Project Management.
Chapter 14 Managing Projects.
Team Chartering Six Sigma Foundations Continuous Improvement Training Six Sigma Foundations Continuous Improvement Training free six sigma site.
Chapter 10 Contemporary Project Management Kloppenborg
1 Project Name Team Lead Location Month XX, Year.
Business Intelligence Solutions for the Insurance Industry DAT – 13 Data Warehousing Rasool Ahmed.
MARKETING MANAGEMENT 12 th edition 5 Creating Customer Value, Satisfaction, and Loyalty KotlerKeller.
Chapter 6 : Software Metrics
INTRODUCTION TO PROJECT MANAGEMENT. WHAT IS A PROJECT? “A planned undertaking of related activities to reach an objective that has a beginning and an.
Five Things Niel Nickolaisen CIO, Headwaters, Inc. Co-founder, Accelinnova.
How To Build a Testing Project 1 Onyx Gabriel Rodriguez.
IT Project Management, Third Edition Chapter 6 1 Chapter 3: Project Time Management.
Module 4: Systems Development Chapter 12: (IS) Project Management.
Project Management Workshop Overview By Tarek Lahdhiri Region 4 PACE Chair IEEE Region 4 Meeting January 30/31, 2004.
2 FOR INTERNAL USE ONLY Project Chartering  Define the components of a project charter  Develop a project idea into an effective project charter  Review.
The Marketing Research Project. Purposes of the Project 1.Give you practical experience at conducting a marketing research project. 2.Examine some factors.
1 Project Kick Off Briefing Cost Data Integrity Project August 30, 2007.
Chapter 18 Make or Buy, Insourcing and Outsourcing.
SOFTWARE PROJECT MANAGEMENT
Chapter Fourteen Communicating the Research Results and Managing Marketing Research Chapter Fourteen.
Project Risk Management Planning Stage
Information System Project Management Lecture Five
Small Business Information Systems Professor Barry Floyd
Copyright 2009 John Wiley & Sons, Inc. Chapter 12 Project Auditing.
Project Organization Chart Roles & Responsibilities Matrix Add Project Name.
Copyright 2012 John Wiley & Sons, Inc. Chapter 12 Project Auditing.
Copyright © 2003 by The McGraw-Hill Companies, Inc. All rights reserved.
The Feasibility Study The objective of a feasibility study is to find out if an project can be done and if so, how The objective of a feasibility study.
1 Data Warehouse Assessments What, Why, and How Noah Subrin Technical Lead SRA International April 24, 2010.
1 Working with Project Stakeholders in a Statewide Project PMI-SVC PMO Forum Monthly Meeting Dan Conway, PMP October 22, 2008.
CHAPTER 14 MANAGING PROJECTS. Project Management (Introduction) These numbers vary based on the study but… We spend about $1 trillion on IT projects 3.
Service Contract with Periodic Billing
Using ERP to Streamline and Enhance Your Organization
BA Continuum India Pvt Ltd
Software Requirements
Description of Revision
Data Quality By Suparna Kansakar.
Project Management Process Groups
{Project Name} Organizational Chart, Roles and Responsibilities
Presentation transcript:

Understanding Data Quality Issues: Finding Data Inaccuracies Art DeMaio Evoke Software VP Technical Sales Support

Agenda Why is Understanding Data Important Methodology for Assessing Data –Defining –Weighting –Profiling –Revisiting –Finding –Addressing –Maintaining What is Profiling Benefits of the Assessment

What the Experts say… “Information quality is not an esoteric notion;it directly affects the effectiveness and efficiency of business processes. Information quality also plays a major role in customer satisfaction.” - Larry P. English

What the Experts say… “Poor data quality is costly. It lowers customer satisfaction, adds expense, and makes it more difficult to run a business and pursue tactical improvements such as data warehouses and re-engineering.” - Thomas C. Redman

What’s in Your DATA… “…three-quarters (of participating companies) reported significant problems as a result of defective data, with a third failing to bill or collect receivables as a result.” - In a PricewaterhouseCoopers survey of 600 CIOs, IT directors or similar executives

What is Data Quality? Accuracy of Content Structure Completeness Timeliness Presentation

Assessing Your Data 2-Weight /Impact 3-Profile Data 6-Address Source Data 7-Maintain 4-Revisit Definitions, Weights 5-Findings 1-Define Issues

Defining Issues Standard list Key requirements Content Structure Completeness Update list by project or source Source Data 1-Define Issues

Defining Issues-sample Source Data 1-Define Issues

Weight Impact After the issues are initially identified: Some issues are more critical than others Weights are not priorities Assign a weighting factor (1-5) Weighting factors SHOULD change by project 2-Weight /Impact Source Data 1-Define Issues

Profile Data What does Data Profiling mean? 2-Weight /Impact 3-Profile Data Source Data 1-Define Issues

What is Data Profiling? The use of analytical techniques on data for the purpose of developing a thorough knowledge of its content, structure and quality. A process of developing information about data instead of information from data.

Information About Data: (Data Profiling) 30% of entries in SUPPLIER_ID are blank the range of values in UNIT_PRICE is 5.99 to there are 14 ORDER_HEADER rows with no ORDER_DETAIL rows Information FROM Data: (not Data Profiling) Texas auto buyers buy more Cadillacs per capita than any other state The average mortgage amount increased last year by 6% 10% of last year's customers did not buy anything this year What is Data Profiling?

Profile Data This is multi-step process Collect documentation Review the DATA itself Compare data to documentation Identify and detail specific issues 2-Weight /Impact 3-Profile Data Source Data 1-Define Issues

Revisit Review the issues and weights Should there be more or less issues What are they? Are the relative importance of each issue different? 2-Weight /Impact 3-Profile Data Source Data 4-Revisit Definitions, Weights 1-Define Issues

Findings Your findings tell others about the data Documented reports and/or charts Results database Quality Assessment Score 2-Weight /Impact 3-Profile Data Source Data 4-Revisit Definitions, Weights 5-Findings 1-Define Issues

Findings-Chart

 Weighted Issue Rate % Weighted Assessment Score %

Address the Issues Addressing your findings Actual vs. Potential Subject Matter Expertise Cleansing Requirements 2-Weight /Impact 3-Profile Data 6-Address Source Data 4-Revisit Definitions, Weights 5-Findings 1-Define Issues

Maintain Vigilance Maintain Complete the cycle Periodic review Document score changes 2-Weight /Impact 3-Profile Data 6-Address Source Data 7-Maintain 4-Revisit Definitions, Weights 5-Findings 1-Define Issues

Why Do The Assessment? Quantify the quality issues Isolate true problems Proactive review –reduces the cost of resolving issues –reduces the risk of customer dissatisfaction Define the scope of issues Determine the resources required to address issues

Why Do The Assessment? Project Timeline When you find an Issue Cost to Address an Issue Project Costs

Why should it be done TIME Pay me now or Pay me later

When Should It Be Done? Every IT data project –Warehousing –CRM –ERP –EAI –M&A Ongoing based on –Criticality of the system –Current status (score) –Need to re-purpose data

Bibliography Larry P. English: Improving Data Warehouse and Business Information Quality, John Wiley & Sons Inc., 1999 Jack Olson, Data Profiling: The Accuracy Dimension, Morgan Kaufmann, 2002 Thomas C. Redman: Data Quality for the Information Age, Artech House, 1996 PricewaterhouseCoopers, “Global Data Management Survey”, 2001