GPC Methods Call PopMedNet Implementation Bob Greenlee 02/09/15.

Slides:



Advertisements
Similar presentations
Testing Relational Database
Advertisements

ASYCUDA Overview … a summary of the objectives of ASYCUDA implementation projects and features of the software for the Customs computer system.
Configuration management
Test Automation Success: Choosing the Right People & Process
HP Quality Center Overview.
© 2005 by Prentice Hall Appendix 2 Automated Tools for Systems Development Modern Systems Analysis and Design Fourth Edition Jeffrey A. Hoffer Joey F.
Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
Lecture 5 Themes in this session Building and managing the data warehouse Data extraction and transformation Technical issues.
Introduction to the State-Level Mitigation 20/20 TM Software for Management of State-Level Hazard Mitigation Planning and Programming A software program.
University of Pittsburgh Department of Biomedical Informatics Healthcare institutions have established local clinical data research repositories to enable.
Requirements Specification
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Fundamentals of Information Systems, Second Edition
Computers: Tools for an Information Age
Data Warehouse success depends on metadata
10.5 Report Performance The process of collecting and distributing performance information, including status reports, progress measurements and forecasts.
Introduction to Systems Analysis and Design
Configuration Management
Design, Implementation and Maintenance
PopMedNet Software Development Life Cycle Chayim Herzig-Marx Harvard Pilgrim Health Care Institute Daniel Dee Lincoln Peak Partners.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
This chapter is extracted from Sommerville’s slides. Text book chapter
JumpStart the Regulatory Review: Applying the Right Tools at the Right Time to the Right Audience Lilliam Rosario, Ph.D. Director Office of Computational.
IDR Snapshot: Quantitative Assessment Methodology Evaluating Size and Comprehensiveness of an Integrated Data Repository Vojtech Huser, MD, PhD a James.
Biostatistics Analysis Center Center for Clinical Epidemiology and Biostatistics University of Pennsylvania School of Medicine Minimum Documentation Requirements.
Systems Analysis and Design: The Big Picture
Chapter 3 – Agile Software Development 1Chapter 3 Agile software development.
Knowledge Management in Distributed Networks July 27, 2015 Melanie Davies Kyle Erickson.
Demystifying the Business Analysis Body of Knowledge Central Iowa IIBA Chapter December 7, 2005.
Why Use MONAHRQ for Health Care Reporting? May 2014 Note: This is one of seven slide sets outlining MONAHRQ and its value, available at
PCORnet: PopMedNet Use Jessica Sturtevant Data Standards, Security and Network Infrastructure Operations Team July 27, 2015 PopMedNet User Group Conference.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
PopMedNet in Mini-Sentinel Tiffany Siu Woodworth PopMedNet User Group Conference July 27, 2015.
What is a Business Analyst? A Business Analyst is someone who works as a liaison among stakeholders in order to elicit, analyze, communicate and validate.
Rev. 0 CONFIDENTIAL Mod.19 02/00 Rev.2 Mobile Terminals S.p.A. Trieste Author: M.Fragiacomo, D.Protti, M.Torelli 31 Project Idea Feasibility.
1 What’s Next for Financial Management Line of Business (FMLoB)? AGA/GWSCPA 6 th Annual Conference Dianne Copeland, Director, FSIO May 8, 2007.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Building Marketing Databases. In-House or Outside Bureau? Outside Bureau: Outside agency that specializes in designing and developing customized databases.
Systems Analysis and Design in a Changing World, Fourth Edition
Fundamentals of Information Systems, Second Edition 1 Systems Development.
The HMO Research Network (HMORN) is a well established alliance of 18 research departments in the United States and Israel. Since 1994, the HMORN has conducted.
Configuration Management and Change Control Change is inevitable! So it has to be planned for and managed.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
MDPHnet & ESP Data Partner Participation Overview The following slides describe the necessary steps for a data partner to participate in the MDPHnet Network.
National Enrolment Service (NES) Overview October 2015 – June 2016.
Firmware - 1 CMS Upgrade Workshop October SLHC CMS Firmware SLHC CMS Firmware Organization, Validation, and Commissioning M. Schulte, University.
1 DAF Phase 3 Objective  Enable researchers to access data from multiple organizations and data sources within a Learning Health System (LHS) infrastructure.
Query Health Technical WG Update 12/1/2011. Agenda TopicTime Slot F2F Update (Actions, Decisions and FollowUps) 2:05 – 2:50 pm Wrap Up2:50 - 2:55 pm.
Team X Review. The members of Team X propose to develop a therapeutic activity system using a Microsoft Surface unit and a Windows 7 desktop. By working.
Module 4: Systems Development Chapter 13: Investigation and Analysis.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
Student Centered ODS ETL Processing. Insert Search for rows not previously in the database within a snapshot type for a specific subject and year Search.
State of Georgia Release Management Training
Library Online Resource Analysis (LORA) System Introduction Electronic information resources and databases have become an essential part of library collections.
1 Options Clearing Corporation Encore Data Distribution Services April 22, 2004.
Uses of the NIH Collaboratory Distributed Research Network Jeffrey Brown, PhD for the DRN Team Harvard Pilgrim Health Care Institute and Harvard Medical.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
LECTURE 5 Nangwonvuma M/ Byansi D. Components, interfaces and integration Infrastructure, Middleware and Platforms Techniques – Data warehouses, extending.
DAF Phase 3-Data Access for Research Frequently Asked Questions DRAFT VERSION
HPHC - PERFORMANCE TESTING Dec 15, 2015 Natarajan Mahalingam.
GCE Software Systems Development A2 Agreement Trial Implementing Solutions October 2015.
Analysis and Reporting Toolset (A&RT): Lessons on how to develop a system with an external partner David Smith AstraZeneca.
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
Software Project Configuration Management
Fundamentals of Information Systems, Sixth Edition
Metadata The metadata contains
The ultimate in data organization
REACHnet: Research Action for Health Network
Presentation transcript:

GPC Methods Call PopMedNet Implementation Bob Greenlee 02/09/15

Overview What is PopMedNet Experiences with PopMedNet in other research networks – FDA Minisentinel – Others PCORNet Plans/GPC Status Next Steps

WHAT IS POPMEDNET

PopMedNet Open source, no-cost licensing software application for creating, operating, governing distributed health data networks Facilitates secure, customized, distributed analysis of electronic data – medical product safety, comparative effectiveness, quality, resource utilization, cost-effectiveness, etc. Much of the functionality is data model agnostic (i2b2 query integration possible) Architecture: Portal (one, central) + datamart clients (many, local)

PopMedNet timeline (pre-PCORNet)

EXPERIENCE WITH POPMEDNET IN OTHER RESEARCH NETWORKS

PopMedNet in the FDA Minisentinel Minisentinel: 5+ year development award to build a national system of post market medical product safety surveillance – – 5-year Sentinel system awarded, will complete transition from MS to S ~ in late 2015 – Continues to grow and develop – Determination that MS/Sentinel is public health surveillance and outside jurisdiction of IRB/OHRP Minisentinel Distributed Database – Locally secure, distributed data within a standardized set of SAS tables in the MS Common Data Model format Tables: enrollment, demographics, dispensing, encounters, death, cause of death, lab results, vital signs No direct identifiers, but contains real dates, ages over 80 Developed in collaboration with MS data partners Borrows from HMO Research Network Virtual Data Warehouse model

PopMedNet in the FDA Minisentinel 4 main PopMedNet activities; all delivered through the MS PopMedNet datamart client – evolved collectively over the 5+ years Pre-refresh - central QA package to vet/approve site’s new CDM tables – 240+ output files Post-refresh - central programs run against CDM to – generates summary counts for MS Access database for use with automated ‘MS Distributed Query Tool’ – Generate ‘Common components’ – standard macro variables with site specific information (new) ~100 times per year – permit/review/release automated simple SQL queries using the Distributed Query Tool ~50 times per year – respond to analytic SAS programs to run against the full CDM – Originally many separately developed modular programs with different purposes/capabilities – Now being organized into the ‘MS Routine Query System’, with a master ‘Cohort Identification and Analysis’ program, and many accompanying options

Minisentinel – Simple Queries Simple Automated Queries – Prevalence Counts – number of events and members within a specified category Enrollment, drugs, diagnoses, procedures Restricted/stratified by age, sex, year, coverage, setting – Incidence Counts Same as above – Most frequent utilization – ranking top XX events within a specified prevalence category, stratified by age, sex, year, setting – Also supplies denominators 2 day turnaround – Can fully automate, or require approval at each step

MS Query - receipt

MS Query - Main Screen

MS Query - Request Details

MS Query - Output

Minisentinel – SAS program queries SAS programs to run against full CDM – Common analytic approaches in medical product safety surveillance Medical product exposures among individuals with/without conditions of interest Select events during exposure(s) to a drug/procedure group(s) of interest Health outcomes among individuals with or without conditions of interest Frequency and duration of treatment following an event of interest Drug use, diagnoses, and procedures before and after exposure of interest Drug use – uptake, persistence and patterns of use of newly approved drugs – Options for common observational research techniques Propensity Score Matching, Comorbidity score stratification, health care utilization stratification – Flexible inputs exposure, outcome, washout, temporal relationships, minimum supplies/durations, setting, diagnosis type, blackout periods, pre-existing conditions, lookback periods, query range, enrollment gaps, Charlson, utilization, rate ratio calculations, attrition – Usually summary results; 5+ day turnaround; ~50 per year

Minisentinel – SAS program queries MS datamart client does not yet execute SAS programs queries automatically against the CDM – delivered in zip package – requires minor local edits to adjust file input/output/reference table file paths – local execution against CDM tables – Package output into a zip and load back into datamart client for submission Ulitmately expect to run automatically against cdm from the datamart client, and will also obviate the need for summary count storage in Access for simple queries Automation has been done in the SPAN Network Scalable PArtnering Network (SPAN) for Comparative Effectiveness Research – Kaiser Permanente Colorado, AHRQ funded, ARRA program to develop a drn for cer – focus on adhd, obesity; – 10 HMORN and nonHMORN partners Similar automatic SAS enhancement being developed within NCI Cancer Research Network’s CRNNet

PCORNET PLANS/GPC STATUS

PCORNet Plans - 1 DRN Query Tool V5.0 Release Timeline Final release to testing environment– Thursday, 12/11 Software migration scripts completed – Monday, 12/15 Migration tested and user testing complete with approval – Wednesday, 12/17 Release to PCORnet – week of 1/5 PCORnet v5 official release– Friday, 1/9 or Monday, 1/12* *As is always the case with software development, critical bugs or “blockers” may be uncovered during testing that could delay release. From pcornent steering committee dec 19, 2014

PCORNet Plans - 1 DRN Query Tool V5.0 Upgrade Steps Confirm network architecture with PopMedNet team (CDRNs) Establish network architecture within the Query Tool (PMN team) Re-installation the DataMartClient (CDRNs) Update DataMart profiles if necessary (CDRNs) Update DataMart metadata if necessary (CDRNs) This activity will begin 2 nd week of January and will continue on a rolling basis until each CDRN is completed. The process for each CDRN should only take a few days once their network architecture is finalized. From pcornent steering committee dec 19, 2014

PCORNet Plans - 2 Test the Querying and Governance Infrastructure Establish network architecture within the network Send simple query via the PCORnet Query Tool to test – Querying functionality (will the query run) – Network data release and approval process (can the results be shared) Query will generate simple frequencies based on real data in the demographic table (PCORnet CDM 1.0) From pcornent steering committee dec 19, 2014

PCORNet Plans - 3 Data Characterization Program Development Steps Develop functional specifications for data characterization program based on the Mini-Sentinel approach* (completed) Develop technical specifications for data characterization program (in process – expected completion January) Write and test data characterization program in one SQL flavor** (expected completion in February) Create a validation package with code, test database, and results to enable development of subsequent versions by one or more collaborators (February) Implement data characterization program in other SQL flavors (March) * ** CDRN store their data a variety of relational database system. Each system requires unique SQL code to execute successfully, therefore, the data characterization program must be written and tested in multiple “flavors” of SQL (eg, MYSQL, Oracle, teradata, Postgres, etc) From pcornent steering committee dec 19, 2014

PCORNet Plans - 3 Data Characterization Process Review and check all ETL ADDs for each site – For early adopters, discuss ETL ADD and findings from the query infrastructure testing prior to complete data characterization execution (to account for timing) Distribute complete data characterization code to each site via the PCORnet Query Tool – Basic checks of all data domains and data elements – Intended to assess consistency with the data model, characterize available data, and identify major issues for discussion Create data characterization report for each site Discuss data characterization report with each site, update ETL ADDs as necessary From pcornent steering committee dec 19, 2014

PCORNet Plans - 3 Data Characterization Process Considerations Note that the release of CDM V2.0 and 2.1 may overlap with the data extraction process for some sites. If this happens we would recommend that CDRNs use their CDM V2.0 or 2.1 for data characterization rather than their CDM V1.0. Therefore, it is possible that some CDRNs will undergo only 1 data characterization process using CDM V2.0 or 2.1. This would extend the data characterization timeline for those sites. From pcornent steering committee dec 19, 2014

PCORNet Plans – 4 PCORNet CDM V2 From cdm v2 stakeholder input presentation, january 23, 2015

PCORNet Plans – 4 PCORNet CDM V2 From cdm v2 stakeholder input presentation, january 23, 2015

PCORNet Plans – 4 PCORNet CDM V2 CDM Guiding Principles Most Pertinent with v2.0 Comments #2: It is not expected that all CDRNs and PPRNs will be able to populate all parts of the PCORnet CDM. It is the responsibility of the CDRNs and PPRNs to communicate availability of each data domain and element. #7: The CDM will reflect variables and values found in the local data. If some data are coded in a way that is unique to a site, mapping the data to a standardized format will be necessary. Values in the source data before mapping will also be included in the CDM. Derived variables should be avoided. From cdm v2 stakeholder input presentation, january 23, 2015

PCORNet Plans – 4 PCORNet CDM V2 #4: The PCORnet CDM will be developed in a modular, incremental, and extensible fashion. New types of data will be needed, or newly available, during the life of PCORnet. Data domains and data elements will be added, revised, and deprecated throughout an iterative CDM lifecycle. Personnel from the CDRNs and PPRNs will work with the DSSNI Task Force (and other Task Forces as appropriate) to assist in these efforts. #5: Documentation will be clear and transparent so that its contents are understandable to all contributors. The CDM will be intuitive and easy for analysts and investigators to use. Investigators and analysts with prior experience using research data will not need additional skills or knowledge to use the CDM. From cdm v2 stakeholder input presentation, january 23, 2015

GPC Status on CDM/PopMedNet GPC Call with DSSNI/Coord. Center on 1/13/15 – Bob, laurel, jim c, russ, phillip, susan, keith – GPC’s DSSNI liaison is Jessica Sturtevant from the CC at Harvard – Verifying popmednet contacts for each site – Clarifying database management systems at each site – Acknowledged our plan to set up a master data sharing agreement with coordinating center/designee site (external investigator agreement) to support a large portion of data characterization/quality assurance/feasibility queries Coord Ctr Developing Data Characterization WorkPlan Coordinating Center/Steering Commitee developing PCORNet governance documentation – Rolling out to CDRNs as they are ready, rolling out within CDRNs as they are ready – Periodic contact from Jessica to our group Calls as needed

GPC Status on CDM/PopMedNet As of the first week of feb – 7 sites with the datamart client installed 2 on V5, 5 on V4 – Multiple sites have created V1 CDMs via i2b2 transformation using Nathan’s methodology – Basic sample queries being distributed Via Pitt, via CC through (for informal local QA) Through datamart client (purpose???) – Multiple sites have begun running and comparing counts of individuals and facts in cdm tables with i2b2

NEXT STEPS (?)

GPC and PopMedNet Form PopMedNet Implementation Group – Periodic communications from DSSNI liaison/CC to all – Regular communications from Marshfield Datamart client nodes at each GPC site CDM transformation at each site More extensive quality/completeness checking of CDM – Additional quality/completeness scripts – MS CDM comparisons External data sharing agreement Respond to infrastructure testing query via the datamart client Data characterization process CDM v2 development Phase 2, centralization of GPC query response

CDRN Phase 2 RFA CDM issues Data infrastructure in place at time of application: full range of quality-checked data for core population of >= 1 million CDM V 2.1 or current Describe how individuals were included in core population, e.g., – Health plan enrollment – Certain number of visits – Other indicators to characterize defined population Reasons for missing data, plans to collect if feasible Activities to complete the capture of longitudinal data – Evidence of agreements with suitable partners in lieu of complete data (to be pursued under irb approved studies) Ability to execute queries against the CDM, including SAS queries that can run without modification. Must be in place at time of award. Return simple queries within 1 week of receipt Ability to continue development of CDM consistent with and beyond V2.1 (unstructured data, etc.) Policies and practices for security, privacy, and confidentiality Creation and characterization of 3 cohorts, document numbers available for query Enables computable phenotypes Facilitates patient level data linkage/sharing within the CDRN Refresh at least quarterly