Database Management for Clinical Research Tables, Normalization, Queries, and Forms Michael A. Kohn, MD, MPP 3 September 2013.

Slides:



Advertisements
Similar presentations
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Extended Learning Module J (Office 2010 Version) Implementing.
Advertisements

EPI 218 Web-Enabled Research Data Management Platforms Michael A. Kohn, MD,MPP 5 September 2013.
Introduction for Clinical Database 陳勁辰2003/06/02.
Web-Based, Hosted Research Data Management Platforms 2/12/2008.
Chapter 7 UNDERSTANDING AND DESIGNING FORMS. Input Forms: Content and Organization Need for forms Event analysis and forms Relationship between input.
Database Management An Introduction.
Database Management: Getting Data Together Chapter 14.
Introduction to Databases CIS 5.2. Where would you find info about yourself stored in a computer? College Physician’s office Library Grocery Store Dentist’s.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Business Driven Technology Unit 2 Exploring Business Intelligence Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution.
BUSINESS DRIVEN TECHNOLOGY
Information systems and databases Database information systems Read the textbook: Chapter 2: Information systems and databases FOR MORE INFO...
An Introduction to Database Management Systems R. Nakatsu.
Database Software Application
Chapter 9 Database Management
Overview What is SQL Server? Creating databases Administration Security Backup.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
Page 1 ISMT E-120 Desktop Applications for Managers Introduction to Microsoft Access.
EPI 218 Database Management for Clinical Research Tables, Relationships, Normalization, Data Types, and Data Dictionaries Michael A. Kohn, MD, MPP 1 August.
Data Resource Management Chapter 5 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 4: Organizing and Manipulating the Data in Databases
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. ACCESS 2007 M I C R O S O F T ® THE PROFESSIONAL APPROACH S E R I E S Lesson 4 – Creating New.
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
Database Resources Final Project Database Demonstrations 2/9/2010.
Database Design Concepts With Access. Learning Outcomes  Identify and define the information that is needed to design a database  Create conceptual.
Data Collection and Management for Clinical Research Michael A. Kohn, MD, MPP 31 August 2010.
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
STORING ORGANIZATIONAL INFORMATION— DATABASES CIS 429—Chapter 7.
Data Management for Pharmaceutical Trials Michael A. Kohn, MD, MPP (Acknowledgment: Susanne Prokscha)
Chapter 16 Designing Effective Output. E – 2 Before H000 Produce Hardware Investment Report HI000 Produce Hardware Investment Lines H100 Read Hardware.
CIS 103 — Applied Computer Technology Last Edited: September 17, 2010 by C.Herbert Using Database Management Systems.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
1Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall. Exploring Microsoft Access 2010 by Robert Grauer, Keith Mast, Mary Anne Poatsy Chapter.
EPI 218 Web-Enabled Research Data Management Platforms Michael A. Kohn, MD,MPP Josh Senyak 22 August 2013.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
EPI 218 Web-Enabled Research Data Management Platforms Michael A. Kohn, MD,MPP 30 August 2012.
Storing Organizational Information - Databases
EPI 218 Web-Enabled Research Data Management Platforms Michael A. Kohn, MD,MPP 29 August 2013.
EPI 218 Database Management for Clinical Research Michael A. Kohn, MD, MPP January 10, 2010.
Microsoft Office 2007 Access Chapter 3 Maintaining a Database.
Created by, Author Name, School Name—State FLUENCY WITH INFORMATION TECNOLOGY Skills, Concepts, and Capabilities.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Databases Shortfalls of file management systems Structure of a database Database administration Database Management system Hierarchical Databases Network.
Epi 202: Designing Clinical Research Data Management for Clinical Research Thomas B. Newman, MD,MPH Professor of Epidemiology & Biostatistics and Pediatrics,
EPI 218 Database Management for Clinical Research Michael A. Kohn, MD, MPP January 6, 2009.
Epi 202: Designing Clinical Research Data Management for Clinical Research Thomas B. Newman, MD,MPH Professor of Epidemiology & Biostatistics and Pediatrics,
EPI 218 Database Management for Clinical Research Michael A. Kohn, MD, MPP January 8, 2008.
EPI 218 Queries and On-Screen Forms Michael A. Kohn, MD, MPP 9 August 2012.
McGraw-Hill/Irwin ©2005 The McGraw-Hill Companies, All rights reserved ©2005 The McGraw-Hill Companies, All rights reserved McGraw-Hill/Irwin.
REDCap Overview Institute for Clinical and Translational Science Fred McClurg Neil Nuehring.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
DAY 16: ACCESS CHAPTER 1-2 Rahul Kavi October 8,
Database and Information Management Chapter 9 – Computers: Understanding Technology, 3 rd edition.
Microsoft Office 2013: In Practice Chapter 2 Using Design View, Data Validation, and Relationships Copyright © 2014 by The McGraw-Hill Companies, Inc.
1 MS Access. 2 Database – collection of related data Relational Database Management System (RDBMS) – software that uses related data stored in different.
Access Chapter 1: Intro to Access Objectives Navigate among objects in Access database Difference between working in storage and memory Good database file.
0 / Database Management. 1 / Identify file maintenance techniques Discuss the terms character, field, record, and table Describe characteristics.
Chapter 9 Vocabulary Databases. 1.Table – a collection of information, or data arranged in columns and rows. 2.Record – all of the information about one.
Instructor: Pavlos Pavlikas1 How Data is Stored Chapter 8.
Data Management in Clinical Research Rosanne M. Pogash, MPA Manager, PHS Data Management Unit January 12,
Medidata Rave Start-Up Information
VOCAB REVIEW. A field that can be computed from other fields Calculated field Click for the answer Next Question.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. REDCap:
How Can REDCap Help my Research?
Introduction to Ms-Access Submitted By- Navjot Kaur Mahi
The ultimate in data organization
Databases and Information Management
Presentation transcript:

Database Management for Clinical Research Tables, Normalization, Queries, and Forms Michael A. Kohn, MD, MPP 3 September 2013

Clinical Research* Choose the study design, and define the study population, predictor variables, and outcome variables; measure these variables and anticipate problems with measurement; analyze the results Today, we discuss the “ nitty gritty ” of collecting, storing, updating, and monitoring the study measurements. *Private companies that make data management systems for clinical research understand “ clinical research ” to include only trials preparatory to FDA drug or device approval, not observational studies.

Outline Online Surveys Data Tables Rows = Records; Columns = Fields Data Dictionary (Metadata) On-screen forms ( “ EDC ”, “ eCRFs ” ) Normalization of Data Tables Database Queries Data management programs (platforms)

“All-that-apply” Questions

Single Best Response

Skip Logic

DCR-4 Website

Online Survey Tools REDCap Qualtrics SurveyMonkey Zoomerang, Google Forms, etc.

Components of Data Management 1) Collecting/entering 2) Monitoring/checking/cleaning 3) Reporting 4) Statistical analysis 5) Maintaining confidentiality 6) Back-ups

Default Option Collect data by hand-writing on paper forms Transcribe into Excel tables Import into Stata for monitoring, reporting and analysis

Rows = Records = Entities Columns = Fields = Attributes Data Tables

DCR Chapter 16 Exercise 2 The PHTSE (Pre-Hospital Treatment of Status Epilepticus) Study was a randomized blinded trial of lorazepam, diazepam, or placebo in the treatment of pre-hospital status epilepticus. The primary endpoint was termination of convulsions by hospital arrival. To enroll patients, paramedics contacted base hospital physicians by radio. The following are base-hospital physician data collection forms for 2 enrolled patients: Lowenstein DH, Alldredge BK, Allen F, Neuhaus J, Corry M, Gottwald M, et al. The prehospital treatment of status epilepticus (PHTSE) study: design and methodology. Control Clin Trials 2001;22(3): Alldredge BK, Gelb AM, Isaacs SM, Corry MD, Allen F, Ulrich S, et al. A comparison of lorazepam, diazepam, and placebo for the treatment of out-of-hospital status epilepticus. N Engl J Med 2001;345(9):631-7.

Demonstration: Creating a Data Table Enter a row of data in datasheet view (PHTSE table) Where is predictor on data collection form?

Demonstration: Data Dictionary Table design view: field (=column) names, data types, definitions, validation rules

Create a 9-field data dictionary for the data table

Demonstration: Same Table in Excel, Stata Excel Stata Etc Rows = Records = Entities Columns = Fields = Attributes Access and Stata have a special row at the top for column headings (=field names); Excel just uses the first row.

(Use EnollmentWithUnblindedGroupAssignments query)

On-screen Forms EDC = Electronic Data Capture eCRF = Electronic Case Report Form

Advantages of On-Screen Forms Data keyed directly into the computer data tables without a transcription step Include validation checks and provide immediate feedback when a response is out of range Incorporate skip logic

Demonstrate PHTSE Enrollment Form

Normalization

Table of Study Subjects Row = Enrollment Columns = ID#, KitNumber, Date/Time, HospArrSzAct Research nurse collects several variables for each of the patient’s hospital days: Intubation, Lowest O2 Saturation, Tmax, anti-seizure medications and doses, … Adverse event tracking Call logs Table of PHTSE Enrollments

If study subjects have an indeterminate number of hospital days, and each hospital day is associated with an indeterminate number of medications, “ normalize ” the records into 3 tables, one for subjects, one for hospital days, and one for medications. Keep adverse events in a table with one row per event and calls in another table with row per call. Normalization For more on normalization, take Epi 218.

Database Queries* SQL = Structured Query Language * Not to be confused with queries to study sites, author queries, etc.

Select Queries Select queries (aka “ Views ” ) organize, sort, filter, and display data. Queries use Standard Query Language (SQL), but you don ’ t have to learn it, because of graphical query design tools. A query can join data from two or more tables, display only selected fields, and filter for records that meet certain criteria.

Platforms for Epi 218 Access (Labs 1, 2 and 3) REDCap (Lab 4, 5) QuesGen (Lab 6) May use other data platforms for final project: -- Oncore -- Filemaker Pro -- OpenClinica -- Other Note that the list does NOT include Excel.

Microsoft Access Integrated desktop database management platform Uses SQL (Structured Query Language) Has an outstanding graphical query design tool Incorporates an excellent report writer Based on the principles of the Relational Model Relationships diagram has integrated referential integrity Very flexible, infinitely customizable NOT browser based (desktop application) Using advanced features usually requires hiring a developer

Microsoft Access UCSF MyResearch DEB Terminal Server

Provides a remote Windows desktop with Microsoft Office Professional Remote Desktop client software freely available for the Mac and already part of Windows Obtain DEB Terminal Server username and password from Instructions available on Epi 218 course syllabus page

REDCap Web-based research data collection system developed at Vanderbilt Available free through UCSF Academic Research Systems You are both the Principal Investigator and User 1. Model= “ Do-it-yourself ”

QuesGen Web-enabled research data collection and management platform developed (with UCSF input) by a private company based in Burlingame More full-featured and customizable than REDCap, but primarily “ pay-us-to- do-it ” rather than “ do-it-yourself ”

Transcribing into Excel tables No data dictionary (meta-data) No automatic data validation No on-screen forms No skip logic Significant security and file corruption issues Not acceptable for Epi 218 Final Project.

General description of database Data collection and entry Error checking and data validation Analysis (e.g., export to Stata) Security/confidentiality Back up Elements of a Data Management Plan

Final Project 1) What is your study? ("The [CUTE ACRONYM] study is a [DESIGN] study of the association between [PREDICTOR] and [OUTCOME] in [STUDY POPULATION]"). 2) What data points are you collecting? (Your data collection forms.) 3) Who collects the data? You? RAs? MDs? Chart abstractors? Maybe the study subjects enter the data themselves?

Final Project (cont’d) 4) How are the data collected? Written onto a paper form and then transcribed into a computer database? Entered directly into the computer? (If data are transcribed, who does it? Have you hired/will you hire somebody? Or have you enlisted/will you enlist some med students?) 5) Will the above-mentioned computer database be in Access, REDCap, QuesGen, Oncore, OpenClinica, SurveyMonkey, Medidate RAVE, or something else? 6) Try to provide a detailed data dictionary with the name, data type, description, and validation rules for each field (column) in the data table(s).

Final Project 7) If it's a multi-table database, even a hand-drawn relationships diagram would help but is not required. 8) How do you validate the data for correctness and monitor the data collection effort? (Usually you have some range checks on individual variables and you periodically query for outliers that are nonetheless within the allowed range.) 9) You should periodically analyze the data, not only to look for problems, but also to see where the study is headed. How do/will you do this? Query in Access and export to Stata? 10) How will you protect your subjects' identifying data? 11) How will you ensure that you don't lose your data file in a computer crash or if a water pipe leaks?

Answering these questions is an essential part of doing a clinical research study.