National Data Management Trends and Challenges Mike Conlon, PhD Director of Biomedical Informatics, Chief Operating Officer UF Clinical and Translational.

Slides:



Advertisements
Similar presentations
Chiari Surgical Outcomes Trial PI: Bermans J. Iskandar and Timothy George with collaboration from John Kestle ASAP-funded pilot study ASAP-funded pilot.
Advertisements

Tips to a Successful Monitoring Visit
UC BRAID: Co-creating and evaluating performance in a regional laboratory for conducting translational science UC BRAID Executive Committee: Steven Dubinett.
Clinical research and the electronic medical record: Interdisciplinary research agendas Michael G. Kahn MD, PhD Biomedical Informatics Core Director Colorado.
1 The HIPAA Privacy Rule and Research This presentation will probably involve audience discussion, which will create action items. Use PowerPoint to keep.
VIVO: Vision for Research Information, 2020 Brussels, September 10, 2012 Michael Conlon, PhD Clinical and Translational Science Institute University of.
Research Discovery, Social Networks and VIVO Chicago, October 8, 2012 Michael Conlon, PhD Clinical and Translational Science Institute University of Florida.
Overview of Biomedical Informatics Rakesh Nagarajan.
Connecting the Dots Creating a learning health system linking clinical quality improvement, Maintenance of Certification, and research Maureen Smith, MD,
Research-driven data standards CIMI 11 th April 2013.
Stanley Estime, MSCI December 9, 2014 Record Keeping: What is Regulatory Documentation and how should it be maintained? Tel:
Opportunities for Big Data in Medical Records Mike Conlon, PhD
VIVO: Enabling National Networking of Scientists Michael Conlon, PhD Principal Investigator
Johns Hopkins Technology Transfer 1 Translational Biomedical Research: Moving Discovery from Academic Centers to the Community Translational Biomedical.
Research and Development Protocol Submission and Continuing Review Processes Kimberly Summers, PharmD Assistant Chief for Clinical Research South Texas.
Consent2Share Linking Cohort Discovery to Consent David R Nelson MD Assistant Vice President for Research Professor of Medicine Director, Clinical and.
Mike Conlon Here’s Mike on a conference call from his home. Mike spends a lot of time on conference calls from his home, and from coffee shops in and around.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
VIVO is supported by NIH award U24 RR The UF CTSI is supported in part by NIH awards UL1 RR029890, KL2 RR and TL1 RR The UF CTSI VIVO:
VIVO: Enabling Networking of Scientists Mike Conlon University of Florida.
Joint Research & Enterprise Office Training The team, the procedures, the monitor and the Sponsor Lucy H H Parker Clinical Research Governance Manager.
MODULE B: Case Report Forms Jane Fendl & Denise Thwing April 7, Version: Final 07-Apr-2010.
VIVO: Sharing Data for Research Discovery Mike Conlon University of Florida
The CTSI and the VA: Opportunities for Discovery Mike Conlon, PhD Associate Director, UF CTSI Presented at VA Research Inspires! Gainesville, Florida May.
Anticipated FY2016 Appropriations Agency$ Million NIH200 Cancer70 Cohort130 FDA10 Office of the Natl Coord. for Health IT (ONC) 5 TOTAL215 Mission: To.
The UF CTSI is supported in part by NIH awards UL1 RR029890, KL2 RR and TL1 RR The UF Study Registry and UF StudyConnect: Understanding and.
Role of the Oncology Research Team Carmen B. Jacobs, BS, RN,OCN, CCRP U.T.M.D. Anderson Cancer Center Houston, Texas U.S.A.
Department of Health and Human Services National Institutes of Health National Center for Research Resources Division of Research Infrastructure Extending.
Siteman Cancer Center at Barnes-Jewish Hospital and Washington University School of Medicine Cancer Center Administration Database.
VIVO: A SEMANTIC WEB NETWORK ENABLING COLLABORATION AMONG SCIENTISTS Mike Conlon, University of Florida John Ruffing, Weill Cornell Medical College Friday.
Chapter 6 – Data Handling and EPR. Electronic Health Record Systems: Government Initiatives and Public/Private Partnerships EHR is systematic collection.
What is VIVO? Research and Scholarship Discovery Across the University of Florida – A service of the Clinical and Translational Science Institute A Grant.
Method How are data being collected? Data collection is done manually from paper IRB records. Every member of the CTSI Study Registry team is thoroughly.
H I P A A T R A I N I N G Self Directed Module 7 Research Disclosures For Data Custodians START Click to begin…
National Center for Research Resources NATIONAL INSTITUTES OF HEALTH T r a n s l a t I n g r e s e a r c h f r o m b a s i c d i s c o v e r y t o i m.
An Introduction to the University of Florida Clinical and Translational Science Institute Mike Conlon, PhD COO, UF CTSI The UF CTSI is supported in part.
VIVO Update and Many Flavors of Search Mike Conlon University of Florida.
A Director of Biomedical Informatics and Chief Operating Officer, UF Clinical and Translational Science Institute. b Chief Technology Officer, Department.
Medication Therapy Management Programs in Community Pharmacy Community Pharmacy October 17, 2006 Kurt A. Proctor, Ph.D., RPh Chief Operating Officer Community.
The HMO Research Network (HMORN) is a well established alliance of 18 research departments in the United States and Israel. Since 1994, the HMORN has conducted.
Health Informatics for Clinical and Translational Science: The University of Florida Experience Mike Conlon, PhD Director, Biomedical Informatics Chief.
Integrated Data Repository Mike Conlon, PhD Associate Director, UF CTSI The UF CTSI is supported in part by NIH awards UL1 TR000064, KL2 TR and TL1.
Stanley Estime, MSCI October 19, 2015 Record Keeping: What is Regulatory Documentation and how should it be maintained? Tel:
Paul Kelly Facility Research Compliance Officer for the Ralph H. Johnson VA Medical Center.
Building on VIVO and going the next step: Adding or Linking to Local and National repositories and/or research data; research resources and core facilities;
May 2007 CTMS / Imaging Interoperability Scenarios March 2009.
Improving Research Data Sharing and Reuse: Scientists and Repositories Michael Conlon, PhD Emeritus Faculty Member, University of Florida VIVO Project.
Research Tools Brought to you by the Clinical and Translational Science Institute Presented by: Terri Shkuda Systems Analyst Research Informatics The Penn.
Informatics Tools and Services Biomedical Informatics Core Tim Aro.
Uses of the NIH Collaboratory Distributed Research Network Jeffrey Brown, PhD for the DRN Team Harvard Pilgrim Health Care Institute and Harvard Medical.
Data Coordinating Center University of Washington Department of Biostatistics Elizabeth Brown, ScD Siiri Bennett, MD.
Making Clinical Trials More Efficient Site Management Organization (SMO)
IRB BASICS BETTY WILSON, MS, CIP. CONCEPTS SUBMISSION APPROVAL RECRUITING WORKING WITH PRINCIPAL INVESTIGATOR (PI) /SPONSOR POLICIES DEVIATIONS RECORD.
VIVO is... A community of 133 sites in 26 countries Organizations represented on VIVO governance groups: Brown, Cornell, Duke, George Washington University,
Writing Grants for VIVO Partnerships and Innovations Mike Conlon.
C3PR: An Introduction for Users A Tool Demonstration from caBIG™ Vijaya Chadaram Duke Cancer Center April 29, 2008.
OnCore Current Status and Implementation Project Plan
The Role and Responsibilities of the Clinical Research Coordinator
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
The Essentia Institute of Rural Health (EIRH) is
NIH Clinical Trial Requirements
VIVO: Faculty Research Information System and Discovery
TRI Resources for Researchers
Regulatory Binder: Maintaining Essential Study Documentation
The HIPAA Privacy Rule and Research
An Investigator’s Guide to the Clinical and Translational Science Awards and the Trial Innovation Network (TIN)  What can we do for you? May 2017.
eResearch at Emory Phase I: eIRB
Sustaining Networks of Researchers:
Trial Funding and Engagement: The NIH Sponsored CTSA Program
From Innovation to Commercialization Access to Data
Presentation transcript:

National Data Management Trends and Challenges Mike Conlon, PhD Director of Biomedical Informatics, Chief Operating Officer UF Clinical and Translational Science Institute The UF CTSI is supported in part by NIH awards UL1 TR000064, KL2 TR and TL1 TR000066

Data Production

We represent the natural world in computer systems as data ProviderNameSpecialityCert 1640Anansi, …Urology Abbot, …OBGYN Park, …PO Box …None MRNLabOrderFulfill PO Box … MRNDiagnosisDateProvider J H Park, …PO Box …1834 MRNCPTDateAssess Normal Poor Park, …PO Box …Normal MRNNameAddressInsurer Johns, …1234 W … Smith, …Apt 6, … Park, …PO Box …None

We collect data for a purpose In research: to answer a question In mining: to find patterns In clinical settings: to treat and bill

Data Management Data Representation Data Capture Data Flow Data Preservation Data Curation Data Citation Data Management Plans

Data management (and data management plans) have a presumption of reuse. Perhaps we should call the plans data reuse plans.

Data Representation Is the key to reuse Use Standard vocabularies and ontologies The development and choice of vocabularies and ontologies is difficult

Data Preservation is typically beyond the capability of the research team. Engage specialists (Data Coordinating Center, Informatics Team, Library, Research Computing, …) The economics of preservation are troubling.

Data Curation Improving the quality of the data – correcting errors, resolving discrepancies, matching to vocabularies Making decisions about on- going preservation Requires domain expertise/collaboration

Data Curation does not end when the project ends The economics of data curation are very troubling

Data Citation

Some Projects

Integrated Data Repository

I2B2 for Cohort Discovery

Consent2Share Initiated on 9/11/12 Consent form given with admissions packet (pt. specific bar code) Consent asks 2 questions Can we store your excess tissue with PHI? Can we re-contact you for a future study? Collected by admissions clerk, data entered into EPIC, consent form scanned with other documents Patient’s physician can access patient response, answer questions Informed Consent Hotline to answer initial questions CTSI patient research advocate for more detailed queries Results to date (>8,000): 86% patients returned signed forms 85% checked “yes” for samples 79% checked “yes” for recontact

Clinic Paper Form EMR Scanned Image IDR Data Elements and Dates Data Flow for Consent2Share Next: Continue roll-out to additional clinics

CTSI Biorepository Biospecimen collection, processing and storage. Stored biospecimens can be used by any researcher with IRB-approved protocols. Prospective biospecimen collection to fulfill investigator needs for IRB-approved protocols. Storage for biospecimens collected by investigators. Stored biospecimens belong solely to the investigator. Oversight of the release of biospecimens from the UF Department of Pathology for other IRB-approved research protocols. Pathology services including those provided by the Molecular Pathology Core and confirmation of diagnosis by a board-certified pathologist upon request One of two Hamilton Storage Technologies’ SAM -80°C automated sample management systems (Robotic freezers). The biorepository also has eight Forma Thermo Scientific -80°C Freezers with back- up CO 2 and sensaphone alarm systems including back-up storage space, centrifuge for basic bodily fluid processing, QiaCube for small volume RNA, DNA and protein purification, Agilent Bioanalyzer for RNA, DNA and protein quality control analysis, OnCore BioSpecimen Management

Biorepository Data Flow Bar codes Collection information Sample Vials & worksheets All sample information Oncore BMS Sample information linked to phenotypic information IDR

Three in One Continue to build integrated data repository

VIVO: An International Resource for Scholarship

UF Co-Funded Network Main Component: what changes have occurred between 2008 and 2012? More of Health Science Center comes under the CTSI umbrella The CTSI has a broader reach in the whole network Increasingly the CTSI incorporates all researchers in relevant areas (areas not relevant to CTSI research fields naturally remain out of its network) Chris McCarty & Raffaele Vacca, UF Bureau of Business and Economic Research Data source: UF Division of Sponsored Research (DSR) database Each node represents one Contract PI, Project PI or Co-PI linked by a common PeopleSoft Contract number Nodes are sized by Total Awarded in UF fiscal year (July-June)

Collaboration and Coordination At UF – Libraries, CTSI, AHC IT, Office of Research, Enterprise Systems, Registrar, Business Services Federal – OSTP, NIH, NLM, NSF, USDA, EPA, FDA, NASA, FDP, … Partners -- Symplectic, Pivot, Elsevier, Thomson-Reuters, ORCID, CiteSeer, CrossRef, OCLC, DuraSpace, CNI, Total-Impact, … Ontology– EuroCRIS, CASRAI, NCBO, Eagle-I, CTSAconnect, … Professional Societies – APA, AAAS, AIRI, AAMC, ABRF, … International – Australia, China, Netherlands, UK, Canada, Brazil, … Semantic Web community – DERI, Tim Berners-Lee, MyExperiment, Concept Web Alliance, Open Phacts (EU), Linked Data, … Social Network Analysis Community – Northwestern, Davis, UCF, … Universities -- Melbourne, Duke, Penn, Colorado, Eindhoven, Pittsburgh, Leicester, Cambridge, Stony Brook, Weill, Indiana, Scripps, Washington U, Ponce, Northwestern, Iowa, Harvard, UCSF, Florida, Stanford, MIT, Brown, Johns Hopkins, OHSU, Minnesota, and the CTSA consortium Application and service providers – over 100 Software downloads (over 30,000) and contact list (over 1,600)

Study Registry and StudyConnect Study Registry: All (9,400) human subject studies approved by 4 UF IRBs from 2008 to date. StudyConnect: Web site with 400 active studies for potential research participants to find opportunities

Two Purposes: 1) Research management: understand the collection of human subject studies 2) Clinical research: facilitate recruitment

The Translational Research Continuum at UF: An Expansion of T3 and T4 Research Basic Discovery Clinical Efficacy Clinical Effectiveness Clinical Practice T1 What works under controlled conditions? (Translation to Humans) How can we change practice? (Translation to Practice) What is the effect on population health? (Translation to Population Health) T2 T3 T4 What works in real world settings? (Translation to Patients) 2011 (n=912) 2008 (n= 862) CTSI Study Registry includes 9,401 human subject research protocols across four IRBs (medical and non-medical) 4,229 / 9,401 are considered medical and/or health related protocols 3,422 / 4,229 are categorized as translational research (T1 – T4)

CTSI REDCap Services No-charge, unlimited self-service access to REDCap and REDCap Survey Training in REDCap data entry and study set-up Support Services Configuration Service Participation in the national consortium REDCap created for standard sign- on method (GatorLink, Incommon/Shibboleth)

REDCap and data management “Makes it easy to do the right thing” Time and event calendar, case report forms represent research visits and primary capture naturally Future work will introduce vocabularies and ontologies Future work will interface REDCap to electronic medical records Future work will interface REDCap to devices for data capture

Personalized Medicine Program- Launched June 25, 2012

Personalized Medicine Information Flow Challenge: genetic polymorphism of CYP2C19 leads to reduced ability to activate clopidogrel (Plavix) and increased risk of cardiovascular complication UF Pathology Lab IT Hospital Orders & Receipts Hospital Best Practice Alerts Team IDR Team CTS IT

Molecular Medicine

Data Got Big (IDR Slide) Big Data Management

Clinical research challenge Create a connected, accessible, efficient, effective, compliant environment for CR that speeds development, execution and reporting of CR studies Weill Cornell reported that their clinical research faculty were required to use more than 40 systems to conduct a CR study

Clinical Research Environment Focus CTSAs on conducting only high-quality translational research Increase efficiency and decrease cost through institution-wide accountability and transparency Clinical study oversight, timely recruitment, effective enrollment, and follow up, retention of participants, data quality and security, availability of highly trained study staff and ancillary services, and timely submission of adverse events Broad culture of responsibility for safe and ethical conduct of human subjects research throughout the institutions participating in the CTSA Appropriateness of specific study designs Development of realistic recruitment goals Steps to ensure timely feasibility assessment and closure of studies that don’t meet goals Efficient institutional workflows for conduct of translational studies Expectation of prompt analysis of results and dissemination of those results Address tracking of all clinical studies at participating institutions from inception through review, activation, conduct, closure, analysis, and dissemination of results Identify and address inefficiencies, troubled studies and bottlenecks Ensure registration of all applicable trials with ClinicalTrials.gov Facilitate participation by their research teams in multi-site studies, including willingness to adopt centralized IRB arrangements similar to those utilized in many NIH networks Describe processes for clinical trial agreements and contracts and timelines for completion

Next Step: Internal Planning Grant, The Clinical Research Initiative

Data Sharing

Information is stored using the Resource Description Framework (RDF) as subject- predicate-object “triples” Jane Smith professor in author of has affiliation with Dept. of Genetics College of Medicine Journal article Book chapter Book Genetics Institute SubjectPredicateObject A Web of Data – The Semantic Web

processOrg<-function(uri){ x<-xmlParse(uri) u<-NULL name<-xmlValue(getNodeSet(x,"//rdfs:label")[[1]]) subs<-getNodeSet(x,"//j.1:hasSubOrganization") if(length(subs)==0) list(name=name,subs=NULL) else { for(i in 1:length(subs)){ sub.uri<- getURI(xmlAttrs(subs[[i]])["resource"]) u<-c(u,processOrg(sub.uri)) } list(name=name,subs=u) } VIVO produces human and machine readable formats Software reads RDF from VIVO and displays

A Consumption Scenario Find all faculty members whose genetic work is implicated in breast cancer VIVO will store information about faculty and associate to genes. Diseaseome associates genes to diseases. Query resolves across VIVO and data sources it links to.

Data Reasoning “Linked Life Data” integrates common public datasets that describe the relationships between gene, protein, interaction, pathway, target, drug, disease and patient and currently consist of more than 5 billion RDF statements. The dataset interconnects more than 20 complete data sources and helps to understand the “bigger picture” of a research problem by linking previously unrelated data from heterogeneous knowledge. From the LarKC (Large Knowledge Collider)

Questions?