U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 6: Manage Quality CC image by Shane Melaugh on Flickr.

Slides:



Advertisements
Similar presentations
MONITORING OF SUBGRANTEES
Advertisements

Preparing Data for Quantitative Analysis
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
BLM Data Quality. Purpose - after this course you will be able to… Describe why Data Quality matters Describe why Data Quality matters Define what is.
ICS 417: The ethics of ICT 4.2 The Ethics of Information and Communication Technologies (ICT) in Business by Simon Rogerson IMIS Journal May 1998.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Discussion on SA-500 – AUDIT EVIDENCE
Auditing Computer Systems
Auditing Computer-Based Information Systems
Coping with Electronic Records Setting Standards for Private Sector E-records Retention.
Developing a Records & Information Retention & Disposition Program:
Research Integrity: Collaborative Research Michelle Stickler, DEd Office for Research Protections
Security in Databases. 2 Outline review of databases reliability & integrity protection of sensitive data protection against inference multi-level security.
Office of Inspector General (OIG) Internal Audit
OPERATIONS MANAGEMENT for MBAs Fourth Edition
Quality Assurance/Quality Control Policy
Purpose of the Standards
Achieving our mission Presented to Line Staff. INTERNAL CONTROLS What are they?
Dimensions of Data Quality M&E Capacity Strengthening Workshop, Addis Ababa 4 to 8 June 2012 Arif Rashid, TOPS.
U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 5: Preserve CC image by NASA Goddard Space Flight Center on.
Property of Common Sense Privacy - all rights reserved THE DATA PROTECTION ACT 1998 A QUESTION OF PRINCIPLES Sheelagh F M.
Network security policy: best practices
Per Anders Eriksson
Auditing Standards IFTA\IRP Audit Guidance Government Auditing Standards (GAO) Generally Accepted Auditing Standards (GAAS) International Standards on.
Release & Deployment ITIL Version 3
Certification of Market Values STEB PROGRAM Briefing Points 2011 Pennsylvania Department of the Auditor General Thomas E. Marks, CPA Deputy Auditor General.
Why Information Governance….instead of Records & Information Management? Angela Fares, RHIA, CRM, CISA, CGEIT, CRISC, CISM or
Database Applications – Microsoft Access Lesson 2 Modifying a Table and Creating a Form 45 slides in presentation Accessibility check 9/14.
Service Organization Control (SOC) Reporting Options and Information
Other Laws (Primarily for E-Government) COEN 351.
Designing Smart Cities Conference University of Strathclyde, Glasgow 31 st March 2015 “Regulating Smart Cities: Policing & Privacy” Paul Mackie Chief Executive.
1 Quality Assurance In moving information from statistical programs into the hands of users we have to guard against the introduction of error. Quality.
Lecture #9 Project Quality Management Quality Processes- Quality Assurance and Quality Control Ghazala Amin.
The Executive Office of the President (EOP). Office of Management and Budget (OMB)
European Statistical Law – in preparation Kirsten Wismer & Lars Thygesen.
Information Systems Security Operational Control for Information Security.
1 Information Sharing Environment (ISE) Privacy Guidelines Jane Horvath Chief Privacy and Civil Liberties Officer.
Emission Inventory Quality Assurance/Quality Control (QA/QC) Melinda Ronca-Battista ITEP/TAMS Center.
Module 6. Data Management Plans  Definitions ◦ Quality assurance ◦ Quality control ◦ Data contamination ◦ Error Types ◦ Error Handling  QA/QC best practices.
Ensuring the Integrity of Financial Information Ensuring the Integrity of Financial Information C H A P T E R 5.
1 Improving Data Quality. COURSE DESCRIPTION Introduction to Data Quality- Course Outline.
U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 4: Acquire CC image by Cyprien on Flickr.
Strengthening Science Supporting Fishery Management  Standards for Best Available Science  Implementation of OMB’s Peer Review Bulletin  Separation.
M u l t I b e a m III W o r k s h o p M u l t I b e a m III W o r k s h o p National Geophysical Data Center / World Data Centers NOAA Slide 1 End-to-End.
The right item, right place, right time. DLA Privacy Act Code of Fair Information Principles.
Chapter 1 Introduction to Databases. 1-2 Chapter Outline   Common uses of database systems   Meaning of basic terms   Database Applications  
1 Information Quality Act. Purpose- after this course you will be able to… define what is the Information Quality Act define what is the Information Quality.
Copyright © 2007 Pearson Education Canada 1 Chapter 21: Completing the Audit.
DEVA Data Management Workshop Devil’s Hole Pupfish Project Quality Assurance.
Project quality management. Introduction Project quality management includes the process required to ensure that the project satisfies the needs for which.
Tad and Terry Legal Issues in ILP. 28 CFR Part 23 The federal rule that governs or provides guidance for these issues. § 23.3 Applicability: These policy.
ISO MONITORING AND MEASUREMENT. ISO Environmental Management Systems2 Lesson Learning Goals At the end of this lesson you should be.
Data Governance 101. Agenda  Purpose  Presentation (Elijah J. Bell) Data Governance Data Policy Security Privacy Contracts  FERPA—The Law  Q & A.
The Civil Registration and Vital Statistics System in Country Names & Titles of Presenters.
Information Security IBK3IBV01 College 2 Paul J. Cornelisse.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
IS 630 : Accounting Information Systems Auditing Computer-based Information Systems Lecture 10.
Unlocking the Mystery of the Data Quality Act: What it Means for NOAA Research Jamie Krauk, Office of Scientific Support August 6, 2002.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
ISO DOCUMENT CONTROL. ISO Environmental Management Systems2 Lesson Learning Goals At the end of this lesson you should be able to: 
Project Management Strategies Hidden in the CMMI Rick Hefner, Northrop Grumman CMMI Technology Conference & User Group November.
Chapter 3-Auditing Computer-based Information Systems.
Session 6: Data Flow, Data Management, and Data Quality.
Internal Audit Section. Authorized in Section , Florida Statutes Section , Florida Statutes (F.S.), authorizes the Inspector General to review.
Welcome. Contents: 1.Organization’s Policies & Procedure 2.Internal Controls 3.Manager’s Financial Role 4.Procurement Process 5.Monthly Financial Report.
1 Auditing Your Fusion Center Privacy Policy. 22 Recommendations to the program resulting in improvements Updates to privacy documentation Informal discussions.
Water quality data - providing information for management
Privacy principles Individual written policies
The role of metadata in census data dissemination
Presentation transcript:

U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 6: Manage Quality CC image by Shane Melaugh on Flickr

Manage Quality Provided by DataONE Lesson Topics  Definitions  Quality assurance and Quality control  Data contamination  Types of errors  QA/QC best practices  Before data collection  During data collection/entry  After data collection/entry CC image by cobalt123 on Flickr

Manage Quality Provided by DataONE Learning Objectives  After completing this lesson, the participant will be able to:  Define data quality control and data quality assurance  Perform quality control and assurance on their data at all stages of the research cycle CC image by 0xFCAF on Flickr

Manage Quality Provided by DataONE The Data Lifecycle

Manage Quality Provided by DataONE Information Quality Act  Requires federal agencies to publish their "Guidelines for Ensuring and Maximizing the Quality, Objectivity, Utility, and Integrity of Information" that they disseminate Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Information Quality Act  Also requires agencies to establish mechanisms for receiving requests for correction of information  Further requires that agencies report annually on the number and nature of complaints received and the steps that were taken to resolve them Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE What information is covered by the Act?  Information must be disseminated  Either published or publicly available on or after October 1, 2002  Agency Sponsored  Agency creates and uses this data or has decided to use this data  Could include third party data if it is used for agency decision-making and/or can be considered “endorsed” by the agency Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE What information is not covered?  Party-to-Party transactions  Internet hyperlinks and other references  Opinions  Press release, fact sheets, press conferences or similar communications in any medium  Public filings of information  Dissemination of information by an agency employed scientist, grantee, or contractor Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE What information is not covered?  Testimony and other submissions to Congress  Inadvertent or unauthorized disclosure of information intended only for inter-agency and/or intra-agency use of communication  Correspondence with individuals  Records covered by other laws, including FOIA, Privacy Act, etc.  Archived records Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Objectivity, Utility, & Integrity  Objectivity - process under which the data are collected and maintained  Utility - usefulness of the information to the Intended users  Integrity - protections against unauthorized tampering Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE How Is Data Objective ?  Objectivity is achieved by using the same procedures to collect, verify, record, maintain, and archive data for each occurrence  Ensures that no possible bias exists  Procedures must be “Transparent”  Procedures must be “Reproducible” Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE How to Define Transparency  Identify:  the source  assumptions employed  analytical methods employed  statistical procedure involved  Identify/use commonly accepted standards  Document what you did Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE How to Define Reproducibility  If anyone follows the process the agency used, it will get the same result  An agency is not required to duplicate the process if challenged and is not compelled to run the process to certify that it can be duplicated  An agency needs to be prepared to explain the process and must ensure that the process was followed in all cases Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE How Is Utility of Data Determined?  Based on the relevance of data to the analysis being performed  Information Quality Act Guidelines - data’s usefulness to the public (Data must have a perceived public benefit)  Must be a connection between strategic mission requirements and data collected/maintained by an agency  Intent to eliminate collection and retention of extraneous data Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Data From External Sources  Agencies must disclose what they know of the quality of the data  Transparency and reproducibility standards apply to external information  Agency can be challenged on “third party data” under the Information Quality Act Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Third Party Data Challenge  The response should be to inform the party that possesses the data about the concern and notify the complainant  While the agency is not responsible for correcting the data, they can be held responsible for relying on the data for decision-making purposes Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Data Integrity  Integrity refers to the protection of information from unauthorized access or revision, to ensure that the information is not compromised through corruption or falsification  Access and security controls must be sufficient to prevent contamination of the data Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Request for Correction of Information Who may request a correction of information? Must be an “affected person” Anyone who may use, be benefited by, or be harmed by the disseminated information Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Contents of Data Quality Correction Request  Name and contact information  A description of the information the person believes does not comply with the Guidelines  An explanation of how the information does not comply with the Guidelines  Recommendation of correction action Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE BLM Request for Information Correction Process  USGS? REQUEST PROCESS? Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Definitions Data Contamination  Process or phenomenon, other than the one of interest, that affects the variable value  Erroneous values CC image by Michael Coghlan on Flickr

Manage Quality Provided by DataONE Definitions: Types of Errors  Errors of Commission  Incorrect or inaccurate data entered  Examples: malfunctioning instrument, mistyped data  Errors of Omission  Data or metadata not recorded  Examples: inadequate documentation, human error, anomalies in the field CC image by Nick J Webb on Flickr

Manage Quality Provided by DataONE Defining QA/QC  Strategies for preventing errors from entering a data set  Activities to ensure quality of data before collection  Activities that involve monitoring and maintaining the quality of data during the study

Manage Quality Provided by DataONE QA/QC Before Collection  Define & enforce standards  Formats  Codes  Measurement units  Metadata  Assign responsibility for data quality  Be sure assigned person is educated in QA/QC

Manage Quality Provided by DataONE  Double entry  Data keyed in by two independent people  Check for agreement with computer verification  Record a reading of the data and transcribe from the recording  Use text-to-speech program to read data back CC image by weskriesel on Flickr QA/QC During Data Entry

Manage Quality Provided by DataONE  Design data storage well  Minimize number of times items that must be entered repeatedly  Use consistent terminology  Atomize data: one cell per piece of information  Document changes to data  Avoids duplicate error checking  Allows undo if necessary QA/QC During Data Entry

Manage Quality Provided by DataONE  Make sure data line up in proper columns  No missing, impossible, or anomalous values  Perform statistical summaries CC image by chesapeakeclimate on Flickr QA/QC After Data Entry

Manage Quality Provided by DataONE  Look for outliers  Outliers are extreme values for a variable given the statistical model being used  The goal is not to eliminate outliers but to identify potential data contamination QA/QC After Data Entry

Manage Quality Provided by DataONE  Methods to look for outliers  Graphical  Normal probability plots  Regression  Scatter plots  Maps  Subtract values from mean QA/QC After Data Entry

Manage Quality Provided by DataONE Points to Remember  Determine the relative importance of the Fields you are entering (compared to other fields you are entering)  Adjust any quality control factors (# in sample, for instance) to ensure that accuracy level is properly accounted for  Target training and review to those fields with the highest accuracy level requirement  Do not assume overall quality based on entries alone; ensure that the relative importance of certain entries are factored in  Anyone can lie (or at least mislead) with statistics Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE Error Rates  the overall number of entries and the errors for the entire data set. Assuming all entries to be equal, the total number of errors is 22 out of a total of 650 entries made. Error Rate = 3% meaning that the quality of the data entered is 97%.  97% is very likely to be the desired level of accuracy (typically 95% -97%  would be expected in quality organizations). Provided by Tom Chatfield, BLM

Manage Quality Provided by DataONE  Data contamination is data that results from a factor not examined by the study that results in altered data values  Data error types: commission or omission  Quality assurance and quality control are strategies for  preventing errors from entering a data set  ensuring data quality for entered data  monitoring, and maintaining data quality throughout the project  Identify and enforce quality assurance and quality control measures throughout the Data Lifecycle Summary

Manage Quality Provided by DataONE 1.D. Edwards, in Ecological Data: Design, Management and Processing, WK Michener and JW Brunt, Eds. (Blackwell, New York, 2000), pp Available at R. B. Cook, R. J. Olson, P. Kanciruk, L. A. Hook, Best practices for preparing ecological data sets to share and archive. Bull. Ecol. Soc. Amer. 82, (2001). 3.A. D. Chapman, “Principles of Data Quality:. Report for the Global Biodiversity Information Facility” (Global Biodiversity Information Facility, Copenhagen, 2004). Available at online-resources/download-publications/bookelets/ online-resources/download-publications/bookelets/ References