UNDERSTANDING DATA QUALITY: Philosophical Position and Important Definitions

Presentation transcript:

UNDERSTANDING DATA QUALITY

Philosophical Position and Important Definitions

Data quality dimensions in the literature include accuracy, reliability, importance, consistency, precision, timeliness, understandability, conciseness and usefulness (Wand and Wang, 1996: p92).

Kahn et al. (1997) developed a data quality framework based on product and service quality theory, in the context of delivering quality information to information consumers.

Four levels of information quality were defined:
- sound information
- useful information
- usable information
- effective information

The framework was used to define a process model to help organisations plan to improve data quality.

A more formal approach to data quality is provided in the framework of Wand and Wang (1996), who use Bunge's ontology to define data quality dimensions. They formally define five intrinsic data quality problems: incomplete, meaningless, ambiguous, redundant and incorrect.

Semiotic Theory 7  Semiotic theory concerns the use of symbols to convey knowledge. Stamper (1992) defines six levels for analysing symbols. These are the physical, empirical, syntactic, semantic, pragmatic and social levels.

Data quality can be examined at each of these levels:
- Physical and Empirical: concerned with the physical media used to communicate data
- Syntactic: concerned with the structure of data
- Semantic: concerned with the meaning of data
- Pragmatic: concerned with the usage of data (usability and usefulness)
- Social: concerned with the shared understanding of the meaning of the data and of the information generated from the data

DISCUSSIONS

Discuss the strategies for ensuring quality data in each of the categories listed in the form, according to the levels given.

Semiotic Level | Goal | Dimension | Improvement Strategy
Syntactic | Consistent | Well-defined (perhaps formal) syntax |
Semantic | Complete and Accurate | Comprehensive, Unambiguous, Meaningful, Correct |
Pragmatic | Usable and Useful | Timely, Concise, Easily Accessed, Reputable |
Social | Shared understanding of meaning | Understood, Awareness of Bias |

Semiotic Level | Goal | Dimension | Improvement Strategy
Syntactic | Consistent | Well-defined (perhaps formal) syntax | Corporate data model, Syntax checking, Training for data producers
Semantic | Complete and Accurate | Comprehensive, Unambiguous, Meaningful, Correct | Training for data producers, Minimise data transformations and transcriptions
Pragmatic | Usable and Useful | Timely, Concise, Easily Accessed, Reputable | Monitoring data consumers, Explanation and visualisation, High quality data delivery systems, Data tagging
Social | Shared understanding of meaning | Understood, Awareness of Bias | Viewpoint analysis, Conflict resolution, Cultural Immersion
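The "syntax checking" strategy listed for the syntactic level can be illustrated with a minimal Python sketch. The record layout, the field names (`policy_id`, `start_date`) and the rules themselves are assumptions made for illustration; they do not come from the slides, and a real corporate data model would define its own.

```python
import re
from datetime import datetime

# Hypothetical syntax rule: a policy id is "P" followed by six digits.
POLICY_ID_PATTERN = re.compile(r"P\d{6}")


def check_syntax(record):
    """Return a list of syntactic problems found in one record (a dict)."""
    problems = []

    # Structure check on the identifier field.
    policy_id = str(record.get("policy_id") or "")
    if not POLICY_ID_PATTERN.fullmatch(policy_id):
        problems.append("policy_id is not 'P' followed by six digits")

    # Structure check on the date field; strptime raises ValueError
    # for strings that are not a valid YYYY-MM-DD date.
    try:
        datetime.strptime(str(record.get("start_date") or ""), "%Y-%m-%d")
    except ValueError:
        problems.append("start_date is not a valid YYYY-MM-DD date")

    return problems
```

A check like this only addresses structure. A record can pass it and still be wrong at the semantic level, for example a well-formed date that is not the policy's actual start date.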

4 Common Data Challenges Faced During Modernization

Data is fragmented across multiple source systems. Each system holds its own notion of the policyholder, which makes developing a unified customer-centric view extremely difficult. The situation is further complicated because the level and amount of detail captured differs from system to system.
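A rough Python sketch of what building a unified view involves is given here. It assumes, hypothetically, that every source system already carries a shared `customer_key`; in practice that is often not the case and record matching is a harder problem. The system names and fields are invented for illustration.

```python
from collections import defaultdict


def build_unified_view(source_systems):
    """Merge per-system records into one view per customer.

    source_systems maps a system name to a list of record dicts. Each
    record is assumed to carry a shared 'customer_key' (an assumption).
    """
    unified = defaultdict(dict)
    for system_name, records in source_systems.items():
        for record in records:
            key = record["customer_key"]
            for field, value in record.items():
                if field == "customer_key" or value in (None, ""):
                    continue
                # Keep each system's value so disagreements stay visible.
                unified[key].setdefault(field, {})[system_name] = value
    return dict(unified)


def conflicting_fields(customer_view):
    """List the fields on which the source systems disagree."""
    return sorted(
        field
        for field, by_system in customer_view.items()
        if len(set(by_system.values())) > 1
    )
```

Keeping every system's value, rather than picking a winner, is a deliberate choice in this sketch: it surfaces the differing level of detail between systems instead of hiding it.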

4 Common Data Challenges Faced During Modernization

Data formats across systems are inconsistent. This arises when an organization operates systems from multiple vendors and each vendor has chosen to implement a custom data representation. Responding to evolving business needs has also diluted the meaning and usage of data fields: the same field represents different data, depending on the context.
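As a small illustration of reconciling vendor-specific representations, the Python sketch below converts dates to a single ISO 8601 form. The vendor names and their formats are hypothetical; they stand in for whatever custom representations the real systems use.

```python
from datetime import datetime

# Hypothetical vendor systems and the date format each one emits.
VENDOR_DATE_FORMATS = {
    "vendor_a": "%d/%m/%Y",  # day/month/year
    "vendor_b": "%m-%d-%Y",  # month-day-year
    "vendor_c": "%Y%m%d",    # compact year-month-day
}


def to_iso_date(raw_value, vendor):
    """Convert a vendor-specific date string to ISO 8601 (YYYY-MM-DD).

    Raises KeyError for an unknown vendor and ValueError for a value
    that does not match that vendor's format.
    """
    date_format = VENDOR_DATE_FORMATS[vendor]
    return datetime.strptime(raw_value, date_format).date().isoformat()
```

The same string can mean different dates depending on its source: `to_iso_date("04/03/2021", "vendor_a")` gives `2021-03-04`, while `to_iso_date("04-03-2021", "vendor_b")` gives `2021-04-03`. This is why the source system has to travel with the value.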

4 Common Data Challenges Faced During Modernization (Cont.)

Data is lacking in quality. This arises when an organization's units are organized by line of function. Each unit holds expertise in a specific field and operates fairly autonomously, which has resulted in different data entry practices. In addition, the data models of decades-old systems weren't designed to handle today's business needs.

4 Common Data Challenges Faced During Modernization (Cont.)

Systems are only available in defined windows during the day, not 24/7. This arises when the organization's core systems are batch oriented: updates are not available in the system until batch processing has completed. Furthermore, while batch processing is taking place, the systems are unavailable both for querying and for accepting data. Another aspect affecting availability is the closed nature of the systems: they do not expose functionality for reuse by other systems.

Lack of Centralized Approach Hurting Data Quality

“Data quality is the foundation for any data-driven effort, but the quality of information globally is poor. Organizations need to centralize their approach to data management to ensure information can be accurately collected and effectively utilized in today’s cross-channel environment.” Thomas Schutz, senior vice president, general manager of Experian Data Quality