© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 1 Octagon Research Solutions, Inc. Leading the Electronic Transformation of Clinical R&D ©

Slides:



Advertisements
Similar presentations
ADaM Implementation Guide: It’s Almost Here. Are You Ready?
Advertisements

How Come It Takes Me So Long to Get Answers to Simple Questions About My Business? Technologies for Business Intelligence Introduction to Microsoft Access.
Mapping Site Instruments. Introduction The Mapped Instrument is a tool that guides you in matching your site specific Instrument and materials to the.
Tutorial 6 Creating a Web Form
The Importance of CDASH
Quick tour of CDISC’s ADaM standard
Programming Types of Testing.
Kendle Implementation of Clinical Data Acquisition Standards Harmonization Dr Elke Sennewald Kendle 9th German CDISC User Group Meeting Berlin, 28 September.
Cross Check between Define.xml and blankcrf.pdf
Standardisation of Trial Design Definitions in CDW at Novo Nordisk
MyFloridaMarketPlace Analysis 2.0 Functional Overview December 18 - December 19, 2007.
Aki Hecht Seminar in Databases (236826) January 2009
QA/QC: A Checklist for Quality and Control
Metadata Management – Our Journey Thus Far
© 2008 Octagon Research Solutions, Inc. All Rights Reserved. 1 PhUSE 2010 Berlin * Accessing the metadata from the define.xml using XSLT transformations.
Chapter 7 Structuring System Process Requirements
Copyright © 2010, SAS Institute Inc. All rights reserved. Define.xml - Tips and Techniques for Creating CRT - DDS Julie Maddox Mark Lambrecht SAS Institute.
JumpStart the Regulatory Review: Applying the Right Tools at the Right Time to the Right Audience Lilliam Rosario, Ph.D. Director Office of Computational.
Introduction to Systems Analysis and Design Trisha Cummings.
PhUSE SDE, 28-May A SAS based Solution for define.xml Monika Kawohl Statistical Programming Accovion.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Remapping of Codes (and of course Decodes) in Analysis Data Sets for Electronic Submissions Joerg Guettner, Lead Statistical Analyst Bayer Pharma, Wuppertal,
Database Design - Lecture 1
© 2008 Octagon Research Solutions, Inc. All Rights Reserved. 2 Octagon Research Solutions, Inc. Leading the Electronic Transformation of Clinical R&D ©
Antje Rossmanith, Roche 14th German CDISC User Group, 25-Sep-2012
Overview and feed-back from CDISC European Interchange 2008 (From April 21 st to 25 th, COPENHAGEN) Groupe des Utilisateurs Francophones de CDISC Bagneux.
Output and User Interface Design
Confidential - Property of Navitas Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Information Systems Today (©2006 Prentice Hall) 3-1 CS3754 Class Note 12 Summery of Relational Database.
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Cleansing Ola Ekdahl IT Mentors 9/12/08.
DAY 22: MICROSOFT ACCESS – REVIEW Akhila Kondai November 04, 2013.
XP New Perspectives on Integrating Microsoft Office XP Tutorial 3 1 Integrating Microsoft Office XP Tutorial 3 – Integrating Word, Excel, Access, and PowerPoint.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
ITGS Databases.
Microsoft Access 2002 Illustrated Complete Forms Using.
© 2011 Clinovo. All Rights Reserved. The contents of this document are confidential and proprietary to Clinovo 1 Clinovo 1208 E. Arques Avenue, Suite 114.
For the Chicago Chapter BOUG Meeting – August 20, 2010
Copyright © 2015, SAS Institute Inc. All rights reserved. Future Drug Applications with No Tables, Listings and Graphs? PhUSE Annual Conference 2015, Vienna.
Database Systems Database Systems: Design, Implementation, and Management, Rob and Coronel.
1Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall. Exploring Microsoft Office Access 2010 by Robert Grauer, Keith Mast, and Mary Anne.
Common Sense Validation Using SAS Lisa Eckler Lisa Eckler Consulting Inc. TASS Interfaces, December 2015.
John Porter Sheng Shan Lu M. Gastil Gastil-Buhl With special thanks to Chau-Chin Lin and Chi-Wen Hsaio.
Karen Cannell APEX: Tight Tabular Forms Karen Cannell
INVITATION TO Computer Science 1 11 Chapter 2 The Algorithmic Foundations of Computer Science.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Distribution A: Approved for public release; distribution is unlimited. Case Number: 88ABW , 31 Mar 2015 A Tool that Uses the SAS PRX Functions.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Tutorial 6 Creating a Web Form
MyFloridaMarketPlace Analysis 2.0 Functional Overview.
The NCCS Data Web: An Introduction The National Center for Charitable Statistics at the Urban Institute January.
MindMajix SAS Clinical TrainingSAS Clinical Training.
Best Practices for Implementing a Paradata Warehouse Presentation at the Washington Statistical Society Mini-Conference on Paradata Washington, D.C. Jason.
ADaM or SDTM? A Comparison of Pooling Strategies for Integrated Analyses in the Age of CDISC Joerg Guettner, Lead Statistical Analyst, Bayer Pharma, Wuppertal,
What does data from a normal distribution look like? The shape of histograms developed from small samples drawn from a normal population are somewhat.
Introduction To DBMS.
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Lecture 2 Introduction to Programming
Testing a Solution.
Traceability between SDTM and ADaM converted analysis datasets
In-Depth Report from Optimizing Data Standards Working Group
Algorithm Discovery and Design
Data Model.
DESIGN OF CRF CRF.
DESIGN OF CRF CRF.
PhilaSUG Spring Meeting June 05, 2019
Module 2 - Xtrata Pro Product Overview Module 2 – Product Overview
CLINICAL DATA ANALYST INTERNSHIP, AT DATAPLUS PHARMA SERVICES
Presentation transcript:

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 1 Octagon Research Solutions, Inc. Leading the Electronic Transformation of Clinical R&D © 2009 Octagon Research Solutions, Inc. All Rights Reserved.

2 Data Profiling Octagon Research Solutions

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 3 Metadata Profiling Metadata (structure) –Likeness of nomenclature among study databases –Answer some planning questions: Claim: “The studies are 90% identical.” Are they? If they indeed are, can you to create pool(s) of source data to gain efficiency? Not our main focus today

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 4 Data Profiling Data (content) –Statistics, e.g., min, max, average –Relationship –Pattern Fact: Data are often “bad, worse, or ugly” Goal: Get a realistic pulse on quality of the data

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 5 Case Study (“Slightly” Altered for Illustration Purposes) Background –Central lab, i.e., eDT CHEM for biochemistry (20807 records), along with 4 other labs –No annotated CRF Mapping document initially authored using variable label

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 6 Case Study (con’t) Sponsor decisions: –Match standard results with original results, i.e., no unit conversion; therefore, LBSTRSC = LBORRES –LPARM to (LBTEST and LBTESTCD) will be done through a sponsor-supplied lookup table Easy enough, right?

High-level mapping based on source dataset metadata

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 8 Case Study (con’t) Programmer noticed errors –LBSTRESN is a numeric variable, but CHEM.LVALUE contains non-numeric data Programmer determined the mapping specifications document is not detailed enough, began to involve the analyst

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 9 Case Study (con’t) Let’s look some options at their disposal (novice to veteran): –SAS System Viewer –A creative method by an Excel-savvyA creative method by an Excel-savvy –SAS PROC FREQSAS PROC FREQ

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 10 Case Study (con’t) SAS System Viewer –Read-only, great for displaying data –Unreliable as a data browser Analyze data in Excel –Very manual –Changes of data ownership, possible “lost in translations”? “Smart” behaviors, e.g., “01JAN :00” to “1/1/ :00:00 PM”, auto-trimming, etc SAS PROC FREQ –CHEM.LVALUE: records reduced to 1237 unique values

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 11 Case Study (con’t) 4 th option –A data pattern analyzer

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 12 Case Study (con’t) –Reduced records to only 11 patterns Aha, we found the needle in the haystack! 0.3% of LVAULE is not numeric.

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 13 Case Study (con’t) –Drilled down to the actual values with non- numeric data patterns

Through issue/resolution with the sponsor, added detailed instructions for LVALUE to accommodate the non-numeric values

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 15 Another Data Pattern Example #1 Source: Character variable AEV.STOP (AE stop date), being mapped to AE Realized source is “somewhat” a free-form field –Critical data point, must handle case-by-case using regular expression (regex) technique

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 16 Another Data Pattern Example #2 Source: Character variable DOSE.DOSE_ACT (Actual dose), being mapped to EX Realized source does not always contain numbers –Used both EX.EXDOSE and EX.EXDOSTXT

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 17 Wrapping Up Integrated data profiling – a tool demo The bigger picture: –Data rules (e.g., pre-defined business rules, data standards, etc) –Data corrections Although ETL is a solution platform for CDISC SDTM data conversion, too much of it is symptom of a problem

© 2009 Octagon Research Solutions, Inc. All Rights Reserved. 18 Thank you! Anthony Chow (610) x5526