Biostatistics Analysis Center Center for Clinical Epidemiology and Biostatistics University of Pennsylvania School of Medicine Minimum Documentation Requirements.

Slides:



Advertisements
Similar presentations
1 GEOSS Monitoring & Evaluation. 2 1st GEOSS M&E WG meeting outcomes Agreed on 1.the final draft of Terms of Reference for the M&E WG 2.The plan for delivery.
Advertisements

Participation Requirements for a Guideline Panel Member.
Instructions and Reporting Requirements Module 3 Electronic Reporting For Facilities March 2014 North Carolina Central Cancer Registry State Center for.
Configuration Management
Participation Requirements for a Guideline Panel Co-Chair.
10. NLTS2 Documentation Overview. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training Modules.
Software Quality Assurance Plan
Edward P. Sloan, MD, MPH, FACEP Research Lecture The Successful Research Program: Ideas, The Research, and Presenting the Work.
Introduction to Research Design Statlab Workshop, Fall 2010 Jeremy Green Nancy Hite.
SYSTEM ANALYSIS & DESIGN (DCT 2013)
Managing a Club A club is trying to persuade more people to take part. Some features of the club are: Open to all ages Takes part in competitions and.
Hannele Keckman-Koivuniemi and Mari Kleemola : Data Processing in FSD : CHALLENGES IN A NEW ARCHIVE IASSIST2003 Ottawa,
1. Preparing Research Datasets Data Request Data Cleaning Dataset Preparation Documentation Beverly Musick 2.
10.5 Report Performance The process of collecting and distributing performance information, including status reports, progress measurements and forecasts.
Basic And Advanced SAS Programming
Data Management: Documentation & Metadata Types of Documentation.
Pet Fish and High Cholesterol in the WHI OS: An Analysis Example Joe Larson 5 / 6 / 09.
Software Development, Programming, Testing & Implementation.
Understanding SAS Data Step Processing Alan C. Elliott stattutorials.com.
The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.
Managing Your Own Data (…if you have to) Kathryn A. Carson, Sc.M. Senior Research Associate Department of Epidemiology Johns Hopkins Bloomberg School of.
Survey Methodology Survey data entry/cleaning EPID 626 Lecture 10.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
Flow Cytometry and Reproducible Analysis Cliburn Chan Department of Biostatistics and Bioinformatics, DUMC.
1 Our Expertise and Commitment – Driving your Success An Introduction to Transformation Offering November 18, 2013 Offices in Boston, New York and Northern.
Requirements Walk-through
State of Kansas INF50 Excel Voucher Upload Statewide Management, Accounting and Reporting Tool The following Desk Aid instructs users on overall functionality.
Financial Statement Modeling & Spreadsheet Engineering “Training in spreadsheet modeling improves both the efficiency and effectiveness with which analysts.
1 Data Management (1) Data Management (1) “Application of Information and Communication Technology to Production and Dissemination of Official statistics”
Spreadsheet-Based Decision Support Systems Chapter 22:
Dan Parish Program Manager Microsoft Session Code: OFC 304.
Data management in the field Ari Haukijärvi 2nd EHES training seminar.
State of Kansas INF50 Excel Voucher Upload Statewide Management, Accounting and Reporting Tool The following Desk Aid instructs users on overall functionality.
SWC Spring 2010 Application Version SWC Spring 2010 Select Folder 2.
PREPARING DATA FOR STATISTICAL ANALYSIS Data Cleaning Data Cleaning Dataset Preparation Dataset Preparation Documentation Documentation 9 September 2008.
The Marketing Research Project. Purposes of the Project 1.Give you practical experience at conducting a marketing research project. 2.Examine some factors.
Chapter 1: Introduction to SAS  SAS programs: A sequence of statements in a particular order  Rules for SAS statements: –Every SAS statement ends in.
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
1 Archiving Michael J. Levin Harvard Center for Population and Development Studies
ISU Basic SAS commands Laboratory No. 1 Computer Techniques for Biological Research Animal Science 500 Ken Stalder, Professor Department of Animal Science.
Building Marketing Databases. In-House or Outside Bureau? Outside Bureau: Outside agency that specializes in designing and developing customized databases.
Chapter: 7 Filing. FILE MANAGEMENT The purpose of good file management is to keep the paper flowing to its final destination. Four Easy Steps to Improve.
Chapter 1: Overview of SAS System Basic Concepts of SAS System.
IT1001 – Personal Computer Hardware & system Operations Week7- Introduction to backup & restore tools Introduction to user account with access rights.
Copyright © 2010, SAS Institute Inc. All rights reserved. SAS ® Using the SAS Grid.
Project Chartering & Approval Process
Copyright © The OWASP Foundation Permission is granted to copy, distribute and/or modify this document under the terms of the OWASP License. The OWASP.
Leading firms through the next generation of change™
Data Coordinators Conference – 2014 Laura Marroquin CASEWORKER/JCMS Specialist Everything New Data Coordinators Should Know.
April ADaM define.xml - Metadata Design Analysis Results Metadata List of key analyses (as defined in change order) Analysis Results Metadata per.
Based on Learning SAS by Example: A Programmer’s Guide Chapters 1 & 2
Data Coordinating Center University of Washington Department of Biostatistics Elizabeth Brown, ScD Siiri Bennett, MD.
Essential Skills Wales ICT Level 1. ESW ICT L1 Portfolio outcome for at least one activity. Evidence sufficient level 1 features (1.3.1) within the final.
Advanced Higher Computing Science The Project. Introduction Worth 60% of the total marks for the course Must include: An appropriate interface using input.
GCE Software Systems Development A2 Agreement Trial Implementing Solutions October 2015.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 5 & 6 By Ravi Mandal.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
A FRUIT AND VEGETABLE PRESCRIPTION PROGRAM
Writing a Critical Summary of an Article or Paper
Incorporating W3C’s DQV and PROV in CISER’s Data Quality Review and
Reporting the Course level RWR Assessment data
SAS Programming Introduction to SAS.
PREPARATION OF DOCUMENTATION FOR CONSTITUENT BODIES MEETINGS
Gayane Yenokyan, MD, MPH, PhD
File Managements.
To change this title, go to Notes Master
PA430 - Data coding March 7/8, 2000.
The Estonian experience with ex-ante evaluation – set-up and progress
OBSERVER DATA MANAGEMENT PRINCIPLES AND BEST PRACTICE (Agenda Item 4)
Presentation transcript:

Biostatistics Analysis Center Center for Clinical Epidemiology and Biostatistics University of Pennsylvania School of Medicine Minimum Documentation Requirements Amy Praestgaard March 6, 2008

2 Project Directory File Structure Located on the network to ensure regular back-up. Name of directory usually includes project number (e.g., BIO999 or EPI888) Has (at least) the following subdirectories:  Documentation  Data  Original  Analytic  Programs  Current  Archived  Results  Current  Archived

3 Documentation Subdirectory Contact sheet (REQUIRED) Must include names and contact information for PI, Biostatistics faculty members, and BAC staff members. Project registration is sufficient for many projects. Larger projects may include need to include project managers, research assistants, and CRCU contacts. BAC Project Plan (REQUIRED -- will overlap with overall project plan) Study objective(s), Outcome definition, Statement of analytic methodology (analysis plan). At minimum, a note in the issues log should contain the BAC project goal, even if as simple as “this project converts a SAS file to an SPSS file” Analytic efforts related to timeline for abstracts, meetings, manuscripts. Study information (STRONGLY RECOMMENDED) Study protocol or grant proposal, data collection forms, meeting minutes or summaries, To-do lists (if applicable), all abstract and manuscript drafts. Issues log (STRONGLY RECOMMENDED) Includes brief description, priority, and estimated resolution date. Keep resolved issues on the list, noted with date resolved.

4 Data Subdirectory BIO999\Data Good idea to separate the original data from the created Original (Raw) Data: BIO999\Data\Original Original versions of source data (e.g., DVD, external hard drive, Excel spreadsheet) must be stored in a secure location, like a locked desk drawer. An electronic “gold copy” of the source data must be retained in the data subdirectory. It may not hurt to include “GOLD” in the file name. Analytic Data: BIO999\Data\Analytic Original extract from source data “Cleaned” data, in original format  Incorporates data changes, deletions  Does not include derived variables Final analytic data set, including derived variables

5 Data Dictionary At minimum, must contain: Results from PROC CONTENTS on the analytic dataset containing labeled variables. Results from PROC MEANS that includes N for each variable. A separate document may be necessary for some projects. If the data base is very large, spend billable time creating labels only for the variables you will be using for the analysis unless the client specifically asks you to do it for all variables.

6 Program headers All programs have a header with program name, purpose, location, project name, faculty and BAC name, input, output, last modified, and other relevant details

7 Code Documentation Code should have a sufficient number of comments, especially on macros, complex merges, and code for analysis See the “Code Complete” book for a fuller discussion of good documentation Within a program, the following types of code must be annotated with a description of purpose and expected outcome: All macros or functions Any code generating a data set that is to be retained for interim or final analysis Any code generating output (e.g., frequency tables, test statistics, p- values) for an interim or final deliverable. Any code used for data validation, data changes, and derived variable calculation.

8 Program Archiving During active analytic periods, log and listing files must be archived as follows: With date clearly indicated in the file name, as in “gmm_8_8Dec07.log” and “gmm_8_8Dec07.lst.” In a separate “archive” or “history” section of the appropriate BAC project folder At least weekly, and more frequently if substantial changes are made. Log and list files can be run in Windows, no need to use unix to generate them Programs generally should have a date in their names in order to keep track of versions

9 Documenting Deliverables All interim and final deliverables must include the following information: Analyst name Deliveree Date produced (good practice to include in file name) Input data, output data, and program location Statistical software (including version) used to produce results Once distributed, the deliverable will be retained in the appropriate project folder