Patterns emerging from chaos

Slides:



Advertisements
Similar presentations
Dimitri Kutsenko (Entimo AG)
Advertisements

Experience and process for collaborating with an outsource company to create the define file. Ganesh Sankaran TAKE Solutions.
What’s New with CDISC Wayne R. Kubick CDISC CTO.
SUCCESSFULLY EMBRACING CHANGES TO CDISC STANDARDS Dr. Elke Sennewald Director, Biometrics Operating Standards Group.
Quick tour of CDISC’s ADaM standard
Kendle Implementation of Clinical Data Acquisition Standards Harmonization Dr Elke Sennewald Kendle 9th German CDISC User Group Meeting Berlin, 28 September.
CDISC (CDASH/SDTM) integration into OC/RDC
Metadata Management – Our Journey Thus Far
© 2008 Octagon Research Solutions, Inc. All Rights Reserved. 1 PhUSE 2010 Berlin * Accessing the metadata from the define.xml using XSLT transformations.
Emerging Technologies Semantic Web and Data Integration This meeting will start at 5 min past the hour As a reminder, please place your phone on mute unless.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Accenture Accelerated R&D Standards Metadata Management – version control and its governance Kevin Lee CDISC NJ Meeting at 01/28/2015 We help our Clients.
Beyond regulatory submission - Standards Metadata Management Kevin Lee CDISC NJ Meeting at 06/17/2015 We help our Clients deliver better outcomes, so.
23 August 2015Michael Knoessl1 PhUSE 2008 Manchester / Michael Knoessl Implementing CDISC at Boehringer Ingelheim.
Dominic, age 8, living with epilepsy SDTM Implementation Guide : Clear as Mud Strategies for Developing Consistent Company Standards PhUSE 2011 – CD02.
© 2011 Octagon Research Solutions, Inc. All Rights Reserved. The contents of this document are confidential and proprietary to Octagon Research Solutions,
Remapping of Codes (and of course Decodes) in Analysis Data Sets for Electronic Submissions Joerg Guettner, Lead Statistical Analyst Bayer Pharma, Wuppertal,
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Contents Integrating clinical trial data Working with CROs
© 2008 Octagon Research Solutions, Inc. All Rights Reserved. 2 Octagon Research Solutions, Inc. Leading the Electronic Transformation of Clinical R&D ©
Antje Rossmanith, Roche 14th German CDISC User Group, 25-Sep-2012
Overview and feed-back from CDISC European Interchange 2008 (From April 21 st to 25 th, COPENHAGEN) Groupe des Utilisateurs Francophones de CDISC Bagneux.
Confidential - Property of Navitas Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Second Annual Japan CDISC Group (JCG) Meeting 28 January 2004 Julie Evans Director, Technical Services.
Alun, living with Parkinson’s disease QS Domain: Challenges and Pitfalls Knut Müller UCB Biosciences Conference 2011 October 9th - 12th, Brighton UK.
Dave Iberson-Hurst CDISC VP Technical Strategy
Research based, people driven CDISC ADaM Datasets - from SDTM to submission CDISC Experience Exchange and ADaM Workshop 15 Dec 2008 Zoë Williams, LEO Pharma.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
© 2015 Akana., Inc All Rights Reserved. From Data to Knowledge Semantics & Implementations.
1 GLOBAL BIOMETRICS Biostatistics Clinical Data Management Epidemiology & Patient Reported Outcomes Statistical Programming and Analysis Strategic Planning,
Emerging Technologies Semantic Web and Data Integration This meeting will start at 5 min past the hour As a reminder, please place your phone on mute unless.
How Good is Your SDTM Data? Perspectives from JumpStart Mary Doi, M.D., M.S. Office of Computational Science Office of Translational Sciences Center for.
Metadata Driven Clinical Data Integration – Integral to Clinical Analytics April 11, 2016 Kalyan Gopalakrishnan, Priya Shetty Intelent Inc. Sudeep Pattnaik,
Generation of real-time SDTM datasets and metadata through a generic SDTM converter mechanism CDISC (CDASH/SDTM) integration into OC/RDC Peter Van Reusel.
Submission Standards: The Big Picture Gary G. Walker Associate Director, Programming Standards, Global Data Solutions, Global Data Management.
Dave Iberson-Hurst CDISC VP Technical Strategy
Building Block macro principle for Standard Program Library Design
Implementation of Quality indicators for administrative data
Experience and process for collaborating with an outsource company to create the define file. Ganesh Sankaran TAKE Solutions.
Towards connecting geospatial information and statistical standards in statistical production: two cases from Statistics Finland Workshop on Integrating.
Software Design and Architecture
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Secondary Uses Primary Use EHR and other Auhortities Clinical Trial
Beyond regulatory submission - Standards Metadata Management Kevin Lee CDISC NJ Meeting at 06/17/2015 We help our Clients deliver better outcomes, so.
Accenture Accelerated R&D Standards Metadata Management – version control and its governance Kevin Lee CDISC NJ Meeting at 01/28/2015 We help our Clients.
MAKE SDTM EASIER START WITH CDASH !
CPT and Disclosure: Connecting Critical Processes
Creating ADaM Friendly Analysis Data from SDTM Using Meta-data by Erik Brun & Rico Schiller (CD ) H. Lundbeck A/S 13-Oct
Traceability between SDTM and ADaM converted analysis datasets
Quality Control of SDTM Domain Mappings from Electronic Case Report Forms Noga Meiry Lewin, MS Senior SAS Programmer The Emmes Corporation Target: 38th.
Research on FHIR Fast Healthcare Interoperability Resources
What’s New in Colectica 5.3 Part 1
Chapter 2 Database Environment Pearson Education © 2009.
SDMX: A brief introduction
A SAS macro to check SDTM domains against controlled terminology
Logical information model LIM Geneva june
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Fabienne NOEL CDISC – 2013, December 18th
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
CPT and Disclosure: Connecting Critical Processes
Carolina Mendoza-Puccini, MD
SDMX Information Model: An Introduction
Visualizing Subject Level Data in Clinical Trial Data
Question Banks, Reusability, and DDI 3.2 (Use Parameters)
Metadata The metadata contains
Safety Analytics Workshop – Computational Science Symposium 2019
Chapter 2 Database Environment Pearson Education © 2009.
CDISC 360: Evolving our standards towards end to end automation
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Palestinian Central Bureau of Statistics
Presentation transcript:

Patterns emerging from chaos 6th June 2018, PhUSE US Connect, Raleigh Alan Cantrell Hello and welcome to “Patterns Emerging from Chaos”. Thank you for allowing me this opportunity to present today.

Historical changes to structure Metadata Linkage Agenda Introduction Historical changes to structure Metadata Linkage Metadata changes and impact Governance and patterns Conclusion On today’s agenda; I’m going to provide A brief introduction to the connectivity within the PAREXEL MDR How changes to our structure historically impact our processes Linkage in our metadata How metadata can change and the impact with linkage Patterns and their governance And wrap up with a conclusion

TRANSLATION. displayname INTRODUCTION (HUB) AE Domain Definition Pattern Coded Item Concept ACNOTH Other Action Taken ISO 21090 based CLINICAL Recognizable CONCEPTS Data is mapped to and from a hub Usable for CDISC and other data Models (SPOKES) TRANSLATION. displayname displayname CDASH / Data Collection AEACNOTH Other Action Taken GROUP. displayname During 2016, the PAREXEL MDR was implemented to define, store and manage our metadata for data collection, tabulation and analysis; this metadata was connected by the use of pattern metadata. Domains are defined through a hub & spoke. Each domain definition within the hub contains the specification of the variables relevant for the domain, combined into semantically consistent groups (e.g. Value and Unit are linked together). Hubs are agnostic of any standards. Each hub is linked to several spokes, such as data collection (CDASH), data tabulation (SDTM) and analysis (ADaM) that includes the variables or variable group for that specific spoke, linked to the hub. Mapping through the hub and spokes is defined by patterns. With the collection, tabulation and analysis metadata (Spokes) in place along with the connections to the domain definitions (Hub); the metadata can be extracted from the MDR and consumed by SAS program code along with the data it is describing.   SDTM / Data Tabulation AEACNOTH Other Action Taken Mapping by declaration to concepts Allows End to End (E2E) data flow dry-run without data Study teams worry about the what not the how ADaM / Data Analysis AEACNOTH Other Action Taken AEACNOFL Other Action Taken Flag

Historical changes to structure - i Source Metadata Specification Target Metadata A specification documents the requirements and rules for mapping from source to a target The specification is dependent on the source and target metadata. If either changes the specification needs to be adapted – it acts as a bridge When programming is performed; a specification documenting the requirements and rules for mapping from a source to a target is created; this is based on the source structure and target structure being defined at a moment in time. In the case of an SDTM mapping specification; the source would be the database extract structure and content; the target would be the CDISC SDTM model.   Although the target structure would be unlikely to change on a project with a set CDISC SDTM version being followed; the source structure could be affected by a protocol amendment during the project life span; a change in expectations by the clinical trial sponsor due to an internal standards change. As a CRO in a position to deliver multiple datasets / outputs on multiple projects for a sponsor; it is important to be in a position to react to these changes in an efficient manner ensuring consistency. Source metadata could be updated by a protocol amendment or change in expectations by sponsor

Historical changes to structure - ii Source Metadata Specification Target Metadata Example Protocol Version 1 Adverse Event Start Date collected in format ddmmmyyyy, e.g. 06JUN2018 Protocol Amendment 1 Adverse Event Start Time is added to the data collection instrument, e.g. 14:00 and Adverse Event Start Date collection format is changed to yyyymmdd, e.g. 20180606 An example; In the original protocol; the adverse event date may need to be collected in the format of ddmmmyyyy, e.g. 06JUN2018. However in the protocol amendment; it may require the adverse event start time to also be collected and perhaps the adverse event date format will also change. Typically changes like this then result in various programming updates downstream in the SDTM….so how can we adapt better to these.?

Metadata linkage - i Patterndateitem A pattern defines the structure of attributes with a common theme Each pattern has an associated SAS macro developed to allow the attributes to be understood when called item.ddmmmyyyy Time.hhmm Item.YYYYMMDDHHMM StartDateCode We can use linkage in our metadata to help… The PatternDateItem patterns facilitates the definition of various connected attributes… For example attributes here could be,,, Date (in format ddmmmyyyy) Time (in format hhmm) Datetime (in format yyyymmddhhmm) Date (in format yyyymmdd) Where appropriate the value of one attribute can be understood by another; e.g. reading in data using the attribute “ddmmmyyyy” can be output as data using the attribute “yyyymmdd”. With each pattern defined; a corresponding SAS macro has been created. In this case the macro is “gmPatternDateItem”; it has parameters for Understanding the input variable (aeStDat_raw) and the corresponding pattern attribute. Understanding the output varoiable (aestdtc) and the corresponding pattern attribute. Creating a sasprogram to hold the mapping code (in this case called startDateCode) When the attributes are selected for a particular study and the study instance is created; each appropriate pattern macro code is created and executed to create SAS code for the mapping. to allow The PatternDateItem pattern facilitates the reading in of variable metadata “aeStDat_raw” with the attribute “item.ddmmmyyyy” – therefore expecting the input date to be of the format ddmmmyyyy, e.g. 25JAN2018. The pattern also facilitates the reading out of variable metadata “aeStDtC” with the attribute “item.yyyymmdd” – therefore outputting the date value in the format yyyymmdd, e.g. 20180125.   * Date variables; ** Start Date; %gmPatternDateItem( varsIn = aeStDat_raw = item.ddmmmyyyy , varsOut = aeStDtC = item.yyyymmdd , codeOut = startDateCode ); Each pattern macro is called to generate SAS code in a consistent way; e.g. in this case; SAS code startDateCode is generated. Each of the generated SAS codes are then executed to facilitate the creation of an output such as the AE SDTM dataset. The pattern attributes are defined within the MDR and together form a concept data type. A concept data type DateTimeItem is defined and linked to a hub variable STDTC in the AE domain. The concept data type supports the collection and exchange of dates and times; including partial dates; e.g. when only year and month are known for a date and the day value is unknown. Within the EDC adverse event form; the variables AESTDAT (AE Start Date) and AESTTIM (AE Start Time) are connected to the hub item AE.STDTC with attributes of date.ddmmmyyyy and time.hhmm respectively. Within the SDTM AE domain; the variable AE.AESTDTC (Start Date/Time of Adverse Event) is connected to the hub item AE.STDTC with an attribute of yyyymmddhhmm. Similarly the ADaM variables are also connected to the hub item and allows imputation to also be performed when the date collected in the form is partial. The analysis start date (perhaps imputed) and analysis start date imputation flag are also created as attributes. %gmPatternDateItem( varsIn = aeStDtC = item.yyyymmdd , varsOut = aStDt = item.imputeFirst.sas @ aStDtF = item.imputeFirst.flag Similar to SDTM generation; each pattern macro is called to generate SAS code in a consistent way; e.g. in this case; SAS code startDateCode is generated. Each of the generated SAS codes are then executed to facilitate the creation of an output such as the ADAE ADaM dataset. Item.YYYYMMDD

Metadata linkage - ii Patterndateitem The associated pattern code facilitates the reading in of variable metadata and outputting of variable metadata item.ddmmmyyyy In turn the pattern code when called enables the generation of code to implement this transformation using the data ITEM.IMPUTEFIRST.FLAG ITEM.IMPUTEFIRST.SAS StartDateCode In a similar way; the patterndateitem macro can be used to generate ADaM variable values. In this case the attribute yyyymmdd is being used to support the in value And the attributes for containing SAS code to impute the missing date and an attribute for flag are being used to define the output variables. Similar to SDTM generation; each pattern macro is called to generate SAS code in a consistent way; e.g. in this case; SAS code startDateCode is generated. Each of the generated SAS codes are then executed to facilitate the creation of an output such as the ADAE ADaM dataset. Patterns allow reusability; so we have the same pattern macro being used for both SDTM and ADaM. And they can be reused in other datasets; perhaps medical history; concomitant medications. Item.YYYYMMDD * Date variables; %gmPatternDateItem( varsIn = aeStDtC = item.yyyymmdd , varsOut = aStDt = item.imputeFirst.sas @ aStDtF = item.imputeFirst.flag , codeOut = startDateCode);

Metadata changes and impact - i Adverse events domain definition STDTC StartDateCode DCI Adverse events SDTM AE ADaM ADAE Start date AESTDTC AESTDTC Start time AESTDTC ASTDTC ASTDT ASTDTF As from above; the form items “Adverse Event Start Date” and “Adverse Event Start Time” and the SDTM variable “Start Date/Time of Adverse Event” are connected to a central variable AE.STDTC instead of being directly connected to each other in a linear way. Through this connection; the attributes for the form items can be changed while still ensuring the data produced in the SDTM is consistent.   With an attribute being changed such as the date collection format in the form; the connection to the hub variable AE.STDTC is still maintained. As this connection is still maintained and valid; it ensures that the connection from the hub to the SDTM variable AESTDTC is still able to be mapped from the updated form item. Through this approach compared to the “linear” approach mapping directly from form item to SDTM variable and SDTM variable to ADaM variable; less manual intervention is required to adapt code in multiple places as it is generated based on the attributes selected in the study implementation. Also have consistency as the same patterns can be reused for other date times in SDTM and ADaM. StartDateCode

Metadata changes and impact - ii Pre-protocol amendment Changing the format of the source metadata retains the connectivity between the data collection instrument and SDTM but updates the related source code Post-protocol amendment As from above; the form items “Adverse Event Start Date” and “Adverse Event Start Time” and the SDTM variable “Start Date/Time of Adverse Event” are connected to a central variable AE.STDTC instead of being directly connected to each other in a linear way. Through this connection; the attributes for the form items can be changed while still ensuring the data produced in the SDTM is consistent.   With an attribute being changed such as the date collection format in the form; the connection to the hub variable AE.STDTC is still maintained. As this connection is still maintained and valid; it ensures that the connection from the hub to the SDTM variable AESTDTC is still able to be mapped from the updated form item. Through this approach compared to the “linear” approach mapping directly from form item to SDTM variable and SDTM variable to ADaM variable; less manual intervention is required to adapt code in multiple places as it is generated based on the attributes selected in the study implementation.

Metadata changes and impact - iii In addition, a change to the Statistical Analysis Plan could impact the method for imputing the analysis date and flags such as ASTDT / ASTDTF

Governance and patterns - i Metadata is stored within the MDR and is version controlled Updated metadata is used to generate updated mapping code Code related to metadata is retained using subversion During the pattern code execution; checks are performed on the metadata; the data and that the metadata is consistent between the study instance metadata; metadata in pattern code generated StartDateCode StartDateCode StartDateCode As the metadata connecting the hub and spokes (in this case EDC forms and SDTM datasets) is contained within the PAREXEL MDR and is version controlled when creating a study implementation; any updates (in accordance with protocol / sponsor change requests) made to the form items / form design are therefore also controlled.   As the SAS code used to generate and populate outputs such as SDTM and ADaM datasets is based on the corresponding version controlled metadata; this ensures that the SAS code has implicit governance.

Governance and patterns - ii By developing and releasing pattern definitions in a controlled environment we ensure Consistency across projects and programs Increase efficiency as patterns are reusable in multiple places Patterns have the ability to evolve with the changing landscape – if additional attributes are required these can be added The additional attributes and new patterns are managed centrally to ensure SME guidance is followed As the metadata connecting the hub and spokes (in this case EDC forms and SDTM datasets) is contained within the PAREXEL MDR and is version controlled when creating a study implementation; any updates (in accordance with protocol / sponsor change requests) made to the form items / form design are therefore also controlled.   As the SAS code used to generate and populate outputs such as SDTM and ADaM datasets is based on the corresponding version controlled metadata; this ensures that the SAS code has implicit governance.

Patterns emerging from chaos Conclusion Using patterns to connect form items, SDTM and ADaM variables to a central hub ensures changes to metadata allow connectivity to remain in place. Patterns emerging from chaos quality efficiency consistency So in conclusion; the use of patterns and their attributes to connect the forms, tabulation and analysis variables to a central hub ensures changes to metadata allow connectivity to remain in place.

Questions For more information: Alan Cantrell Associate Director, Statistical Programming Alan.Cantrell@PAREXEL.com Come and see us at PAREXEL’s booth! Visit our MDR resource portal at: https://www.parexel.com/solutions/clinical-research/global-data-operations/clinical-metadata-repository-mdr