Quality and Reliability of CRIS data A case for euroCRIS? euroCRIS Membership Meeting November 1 – 2, 2007, Vienna Maximilian Stempfhuber GESIS–IZ Social.

Slides:



Advertisements
Similar presentations
Open repositories: value added services The Socionet example Sergey Parinov, CEMI RAS and euroCRIS.
Advertisements

Access 2007 ® Use Databases How can Microsoft Access 2007 help you structure your database?
System Integration Verification and Validation
Quality Management What is Quality?.
Database Management System MIS 520 – Database Theory Fall 2001 (Day) Lecture 13.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Data - Information - Knowledge
Project Management Quality Management* Dr. Khalid S. Husain * 07/16/96
Measuring the quality of academic library electronic services and resources Jillian R Griffiths Research Associate CERLIM – Centre for Research in Library.
Quality-driven Integration of Heterogeneous Information System by Felix Naumann, et al. (VLDB1999) 17 Feb 2006 Presented by Heasoo Hwang.
Security in Databases. 2 Outline review of databases reliability & integrity protection of sensitive data protection against inference multi-level security.
DSAC (Digital Signature Aggregation and Chaining) Digital Signature Aggregation & Chaining An approach to ensure integrity of outsourced databases.
OHT 2.1 Galin, SQA from theory to implementation © Pearson Education Limited 2004 Software Quality - continued So let’s move on to ‘exactly’ what we mean.
Quality Concept Computer Science Department, Faculty Of Science Prince of Songkhla University Apirada Thadadech.
Commercial Database Applications Testing. Test Plan Testing Strategy Testing Planning Testing Design (covered in other modules) Unit Testing (covered.
Managing Software Quality
What is Software Engineering? the application of a systematic, disciplined, quantifiable approach to the development, operation, and maintenance of software”
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Database Design - Lecture 1
This presentation prepared for MIS 421 / MBA 575 at Western Washington University. Material in this presentation drawn from Richard T. Watson, Data Management:
Sept - Dec w1d11 Beyond Accuracy: What Data Quality Means to Data Consumers CMPT 455/826 - Week 1, Day 1 (based on R.Y. Wang & D.M. Strong)
Topics Covered: Software requirement specification(SRS) Software requirement specification(SRS) Authors of SRS Authors of SRS Need of SRS Need of SRS.
Help Desk System How to Deploy them? Author: Stephen Grabowski.
Quality Control Project Management Unit Credit Value : 4 Essential
1 Chapter 7 Query-By-Example by Monica Chan CS157B Professor Lee.
Microsoft Access 2003 Define some key Access terminology: Field – A single characteristic or attribute of a person, place, object, event, or idea. Record.
Term 2, 2011 Week 6. CONTENTS Validating data Formats and conventions – Text – Numerical information – Graphics Testing techniques – Completeness testing.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
© 2011 Underwriters Laboratories Inc. All rights reserved. This document may not be reproduced or distributed without authorization. ASSET Safety Management.
Project Management Gaafar 2006 / 1 * This Presentation is uses information from PMBOK Guide 2000 Project Management Quality Management* Dr. Lotfi Gaafar.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
Question To know that quality has improved, it would be helpful to be able to measure quality. How can we measure quality?
Software Project Management Lecture # 3. Outline Chapter 22- “Metrics for Process & Projects”  Measurement  Measures  Metrics  Software Metrics Process.
Access Project 3 Notes. Introduction Maintaining the Database  Modifying the data to keep it up-to-date Restructure the Database  To change the database.
ESSnet on microdata linking and data warehousing in statistical production: Metadata Quality in the Statistical Data Warehouse.
SYSTEM TESTING AND DEPLOYMENT CHAPTER 8. Chapter 8: System Testing and Deployment 2 KNOWLEDGE CAPTURE (Creation) KNOWLEDGE TRANSFER KNOWLEDGE SHARING.
CRIS as an Interconnector: IConnectEU - Building a thematic CRIS Maximilian Stempfhuber & Engin Sagbas GESIS-IZ Social Science Information Centre Bonn,
QUALITY DEFINITIONS AND CONCEPTS
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
Information quality in the context of CRIS and CERIF Maximilian Stempfhuber GESIS-IZ Social Science Information Centre Bonn, Germany CRIS 2008, June 5-7,
Service Brokering Yu-sik Park. Index Introduction Brokering system Ontology Services retrieval using ontology Example.
Modeling Security-Relevant Data Semantics Xue Ying Chen Department of Computer Science.
1 Software Testing and Quality Assurance Lecture 17 - Test Analysis & Design Models (Chapter 4, A Practical Guide to Testing Object-Oriented Software)
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 1 Quality management Produced in Collaboration between.
Project Management Quality Management. Introduction Project planning Gantt chart and WBS Project planning Network analysis I Project planning Network.
ISO 9001:2015 Subject: Quality Management System Clause 8 - Operation
Metayogi Increasing the Accessibility of the Semantic Web Karim Tharani Doug Macdonald Rachel Heidecker.
TOTAL QUALITY MANAGEMENT “ MUST KNOW” CONCEPTS FOR ENGINEERS.
Introduction To DBMS.
Software Quality Assurance
Quality Control.
McCall’s Quality Factors
CIS 155 Table Relationship
Tutorial 10 Quality Management.
Big Data Quality the next semantic challenge
Associative Query Answering via Query Feature Similarity
Software engineering.
Quality Management MNGT 420
Data Quality By Suparna Kansakar.
How does the “Iron Triangle” relate to project management?
Big Data Quality the next semantic challenge
Metadata in Digital Preservation: Setting the Scene
Software Requirements Specification (SRS) Template.
Chapter # 1 Overview of Software Quality Assurance
Robin Dale RLG OAIS Functionality Robin Dale RLG
Microsoft Access Validation Rules, Table Relationships And
Big Data Quality the next semantic challenge
Organizational Aspects of Data Management
Introduction to reference metadata and quality reporting
Presentation transcript:

Quality and Reliability of CRIS data A case for euroCRIS? euroCRIS Membership Meeting November 1 – 2, 2007, Vienna Maximilian Stempfhuber GESIS–IZ Social Science Information Centre Bonn, Germany

What to expect No Answers… …only Questions!

Current situation Data within a single CRIS is not up-to-date or correct Data harvested from different sources does not match Coupling of systems and data difficult because of different features, data structures / semantics, invalid references, … What more?

Data errors Single data source –Schema level Value out of range Referential integrity violated … –Data level Missing value Typing errors Wrong values Duplicates …

Data errors (cont.) Multiple data sources –Schema level Structural heterogeneity Semantic heterogeneity … –Data level Contradictory values Different representations Different level of aggregation Duplicates …

Quality of data When is an error an error? Who decides what is correct? How can we correct existing errors? How can we prevent future errors? What is Quality? How can we guarantee it in a CRIS?

What is Quality? Degree to which a set of inherent characteristics fulfills requirements (ISO 9000) Conformance to requirements (Philip B. Crosby) "Fitness for use". Fitness is defined by the customer. (Joseph M. Juran) The quality has two dimensions: "must-be quality" and "attractive quality“ (Noriaki Kano)

What is Quality? A quality is a characteristic that a product or service must have. For example, products must be reliable, useable, and repairable. These are some of the characteristics that a good quality product must have. Similarly, service should be courteous, efficient, and effective. These are some of the characteristics that a good quality service must have. In short, a quality is a desirable characteristic. …

What is Quality? (cont.) However, not all qualities are equal. Some are more important than others. The most important qualities are the ones that customers want. These are the qualities that products and services must have. …

What is Quality? (cont.) So providing quality products and services is all about meeting customer requirements. It's all about meeting the needs and expectations of customers. So a quality product or service is one that meets the needs and expectations of customers.

What is Quality? (cont.) The quality of a product or service refers to the perception of the degree to which the product or service meets the customer's expectations. Quality has no specific meaning unless related to a specific function and/or object. Quality is a perceptual, conditional and somewhat subjective attribute.

Information Quality IQ or data quality denotes the degree of relevance of information in relation to a specific context and information need. –Requirements may be user specific or very general –Total of all requirements towards information or information products ([information]process oriented view) –Information that is fit for use by information consumers (user oriented view)

Information Quality (cont.) Business oriented view: –Creating your own data and information: constructive information quality. –Getting data and information from external sources: receptive information quality

Criteria for IQ Eigenvalue Correctness, objectivity, trustablity, reputation Information context Relevance, added value, timeliness, completeness, amount of information View to information Interpretability, comprehensibility, free of manipulatoin, integrity, free of conflicts Information access Access to the system, Secure access (Wang & Strong)

Criteria for IQ (cont.) User-specific view: Degree of confidence in the correctness of the information Trustability of information on the basis of previous experiences Verifiability of information Precision of information Timeliness of information (Heinrich)

Criteria for IQ (cont.) For electronic media: Internal quality Precision, objectivity, trustability Quality of access Accessibility, Security Quality in context Meaning, added value, timeliness, completeness, information content Quality of display Interpretability, comprehensibility, compactness Quality of metadata (meta information) Existence, adequacy Quality of structure Existence, adequacy, traceability (Königer & Reithmayer)

Quality and CRISs User‘s view (determines categories for CRIS quality) Data producer’s view (initially creates information and (sometimes) has to maintain it) Data provider’s view (has to ensure information quality and quality of service)

Quality and CRISs (cont.) Roles: Data producers/researchers, CRIS/service providers, CRIS users IQ criteria: Precision, objectivity, trustability, timeliness, completeness, added value, accessibility, … Is it going beyond Code of Good Practice? Who is responsible for which quality criteria (in which phase)?

User‘s view Do we know the users‘ information needs (records, statistics,…)? Do we know of canonical needs (to specify pre-structured queries)? Do we know how information should be displayed, how it should be browsable, …? Do we know how information is used at the user‘s site (preferred formats, additional processing)?

CRIS provider’s view What scope and content should the CRIS have (= users‘ information needs)? How can we guarantee completeness How can we guarantee sustainability? How have quality criteria to be defined for local use of a CRIS? How for federated CRISs?

Data producer‘s view What support do I have in entering data? Who helps me in maintaining it? Can I reuse the data I entered in other contexts?

Questions to euroCRIS Do we have Use cases generally accepted? Common set of information quality criteria (beyond what is supported by database mechanisms and CERIF structure)? Do we need end-user testing? How can we establish IQ in the CRIS community? How can we share IQ with other actors?

23 Thank You! Dr. Maximilian Stempfhuber GESIS-IZ Social Science Information Centre Lennéstr. 30, Bonn, Germany