CCEGA Informatics Project: Developing Shared Infrastructure and Data Models Project Leader: Brad Hemminger School of Information and Library Science.

Presentation transcript:

CCEGA Informatics Project: Developing Shared Infrastructure and Data Models
Project Leader: Brad Hemminger
School of Information and Library Science
University of North Carolina at Chapel Hill

Participants
Brad Hemminger, bmh at ils.unc.edu
Kaye Balke, balke at ils.unc.edu
Kirk Wilhelmsen, kirk at neurology.unc.edu
David Threadgill, dwt at med.unc.edu
Dong Xiang, dxiang at .unc.edu
Min Xu, xumin at med.unc.edu
Joel Kingsolver, jgking at bio.unc.edu
Paul Brown, paul.brown at unc.edu
Lavanya Ramakrishnan, lavanya at renci.org
Roger Akers, akers at unc.edu
Peter DeSaix, pdesaix at .unc.edu
Clark Jeffries, clark_jeffries at med.unc.edu
Xiaojun Guan, xguan at renci.org
Kevin Gamiel, kgamiel at renci.org
Erik Scott, escott at renci.org
Barrie Hayes, bhayes at .unc.edu

Project Aims
Goal: Development of a common data model and informatics infrastructure for UNC
Determine needs of research labs on campus
Develop common data model, incorporating applicable global standards
Determine issues that affect whether research labs would utilize a common infrastructure and common data model
Understand and address security and privacy issues
Develop “workable” infrastructure on campus

Lab Surveys
Bioinformatics research labs at UNC were invited to provide details of their data infrastructure, in particular their data models (and example data). PIs and database administrators from the projects met with our full committee for interviews, and afterwards we followed up to obtain database schemas and example database records.

Labs that provided in-depth interviews and complete data models
Kirk Wilhelmsen (alcoholism and addiction projects)
Paul Brown (Cell Biology, multiple projects)
Roger Akers (Epidemiology Specimen Tracking)
Lineberger (multiple cancer projects)
Mike Knowles (Pulmonary and Cystic Fibrosis)
Kari North (case-control and family-based studies of cardiovascular disease)
Proteomics Center (earlier project)

Global Standards
While no overarching standard provides common definitions for all the necessary data elements, standards exist in many individual domains (microarrays, genetic sequences, proteins, etc.). Additionally, larger-scale efforts are under way, such as CDISC (clinical trials) and caBIG (cancer). caBIG has a whole workgroup devoted to vocabularies and common data elements (VCDE).

Issues affecting user acceptance
Most research projects prefer to have their own database
–Specific projects
–No need to tie into other researchers' data
–No need to preserve data generated by study
–Easier to build themselves
–More control when managed themselves
Core facilities
–Require specific control, privacy of data
Clinical facilities
–Rigorous requirements regarding sharing of data (ELSI, HIPAA)

Reasons for Sharing
More studies are required to share data between projects (larger studies, multicenter studies)
More projects depend on outside resources (databanks)
Free or inexpensive disk space
Dependable archiving of data
Assistance in designing data models for study

Common Data Model
Began with a general framework developed in previous work
Built new model from the ground up
–Took all data elements from all the research labs and pooled them to define the overall set of elements, including which elements from different labs mapped to the same “common” elements.
–Produced a set of core elements that were common to many projects and important for sharing.
Integrated new model with overall design principles from the general framework to develop the final “common data model”.
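
The pooling step described above can be sketched in a few lines of Python. This is only an illustration, not the project's actual tooling: the lab names, local field names, common element names, and the "shared by at least two labs" threshold are all hypothetical assumptions.

```python
from collections import defaultdict

# Hypothetical input: for each lab, a mapping from its local field name
# to the "common" element that field was judged to correspond to.
LAB_MAPPINGS = {
    "lab_a": {"subj_id": "subject_id", "dob": "birth_date", "samp": "sample_id"},
    "lab_b": {"patient": "subject_id", "specimen_no": "sample_id",
              "freezer": "storage_location"},
    "lab_c": {"participant_id": "subject_id", "birthdate": "birth_date"},
}


def pool_elements(lab_mappings):
    """Pool every lab's fields under the common element they map to."""
    pooled = defaultdict(dict)
    for lab, mapping in lab_mappings.items():
        for local_name, common_name in mapping.items():
            pooled[common_name][lab] = local_name
    return dict(pooled)


def core_elements(pooled, min_labs=2):
    """Return common elements used by at least `min_labs` labs (assumed cutoff)."""
    return sorted(name for name, labs in pooled.items() if len(labs) >= min_labs)


pooled = pool_elements(LAB_MAPPINGS)
core = core_elements(pooled)
print(core)
```

Elements that appear in only one lab (here `storage_location`) stay in the pooled set but are left out of the core set, which mirrors the distinction between the overall set of elements and the core elements important for sharing.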

Final Common Model
Developed by taking common data elements and putting them into a database system for testing.
–Database schema design (see printout)
–Integrate standards in definition of data elements
–Incorporate into actual database
Test model database by incorporating actual data from volunteer labs (Kirk, Roger)
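
As a rough sketch of what testing a common model against lab data might look like, the example below builds a tiny two-table schema in an in-memory SQLite database and loads one record from each of two labs through a field map. The schema, table names, field maps, and records are hypothetical; the actual schema is the one referred to in the printout.

```python
import sqlite3


def build_common_db():
    """Create an in-memory database with a minimal, assumed common schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript("""
        CREATE TABLE subject (
            subject_id TEXT PRIMARY KEY,
            source_lab TEXT NOT NULL
        );
        CREATE TABLE sample (
            sample_id   TEXT PRIMARY KEY,
            subject_id  TEXT NOT NULL REFERENCES subject(subject_id),
            sample_type TEXT
        );
    """)
    return conn


def load_record(conn, lab, record, field_map):
    """Translate one lab record into common elements and insert it."""
    common = {field_map[k]: v for k, v in record.items() if k in field_map}
    # Prefix IDs with the lab name so identical local IDs do not collide.
    subject_key = f"{lab}:{common['subject_id']}"
    sample_key = f"{lab}:{common['sample_id']}"
    conn.execute("INSERT OR IGNORE INTO subject VALUES (?, ?)", (subject_key, lab))
    conn.execute("INSERT INTO sample VALUES (?, ?, ?)",
                 (sample_key, subject_key, common.get("sample_type")))


conn = build_common_db()
load_record(conn, "lab_a",
            {"subj": "17", "samp": "S-001", "tissue": "blood", "notes": "unmapped"},
            {"subj": "subject_id", "samp": "sample_id", "tissue": "sample_type"})
load_record(conn, "lab_b",
            {"patient": "17", "specimen_no": "S-001"},
            {"patient": "subject_id", "specimen_no": "sample_id"})
rows = conn.execute(
    "SELECT sample_id, subject_id, sample_type FROM sample ORDER BY sample_id"
).fetchall()
print(rows)
```

Fields with no entry in the field map (such as `notes`) are simply dropped here; a real test would more likely log them so that gaps in the common model become visible.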

Next Steps
The aim of this P20 planning project is to prepare for further grants in this area and to help lay the groundwork for building a common biomedical informatics infrastructure at UNC.
In Jan 2007, we submitted a CTSA (Clinical and Translational Science Award) grant proposal. This grant aims to integrate all biomedical informatics infrastructure on campus.

CTSA--overview
The TraCS Biomedical Informatics Core will unite the silos of biomedical informatics research excellence at UNC and across North Carolina to maximize re-use of data, knowledge and processes. With the establishment of the North Carolina Collaboratory for Biomedical Informatics (NCCBI), TraCS will support research, patient care, education and policy-making while building upon, leveraging and extending the current biomedical informatics infrastructure at UNC-CH. This core involves several external partners with a strong presence in NC and world-wide: Red Hat, IBM, SAS, Allscripts, Quintiles and NCHICA. We are committed to achieving a national leadership role in the design and development of best practices for the inclusion of clinical data into shared repositories of biomedical data.

CTSA--tie in clinical data
To support the goals of the TraCS Institute, the Biomedical Informatics Core will create a statewide interdisciplinary and inter-institutional collaboratory (collaborative laboratory): the North Carolina Collaboratory for Biomedical Informatics (NCCBI). It will build on the transformative technology used by the NIH to create Entrez for the NCBI. The long-term goal is to create a shared biomedical informatics data repository connecting clinical enterprises across the State of North Carolina to create a demonstration project for clinical data that will be a model for sharing and re-use of clinical data. This repository will contain appropriately de-identified data from clinical trials and clinical care. With the establishment of the NCCBI, the TraCS Biomedical Informatics Core will transform the excellent but fragmented biomedical informatics capabilities at UNC-CH into a coherent and connected system that facilitates routine re-use of research knowledge, data and processes throughout UNC and North Carolina, serving as a prototype for the nation.

CTSA
In short, the CTSA proposal builds on the work of the P20 and offers us the potential to truly transform the way scientists and clinicians work at UNC, and to bring about unprecedented integration and data sharing.

Example of integrating data
View the integration spreadsheet and look at the example (samples) of before and after.

Security
Possible security design requirements:
Identification tables of entities (as in Trusted Broker doc)
Translation tables among entities
Authentication (two-way) between broker and entities
Authorization of entities by broker
Encrypted channels (SSL, IPSec, other)
Protection against various denial-of-service attack types (limiting multiple accesses or very frequent access requests from any one researcher, etc.)
Multiple types of access requirements for the human trusted broker (something you have, you know, or you are)
Other requirements on trusted broker (bonded staff, permission to modify databases requiring at least two separate trusted brokers cooperating, etc.)
Remote backup system...
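
Three of the requirements above (translation tables, authorization by the broker, and limiting very frequent requests from one researcher) can be illustrated with a small Python sketch. It is not the design from the Trusted Broker document: the class name, method names, pseudonym format, and rate-limit parameters are all assumptions made for illustration, and authentication, encryption, and backup are out of scope here.

```python
import secrets
import time
from collections import defaultdict, deque


class TrustedBroker:
    """Hypothetical broker: maps lab-local IDs to pseudonyms for authorized users."""

    def __init__(self, max_requests=3, window_seconds=60.0):
        self._to_pseudonym = {}              # translation table: (lab, local_id) -> pseudonym
        self._authorized = set()             # researchers the broker has authorized
        self._recent = defaultdict(deque)    # researcher -> timestamps of recent requests
        self.max_requests = max_requests
        self.window_seconds = window_seconds

    def authorize(self, researcher):
        self._authorized.add(researcher)

    def _allow(self, researcher, now):
        """Sliding-window limit on requests from any one researcher."""
        recent = self._recent[researcher]
        while recent and now - recent[0] >= self.window_seconds:
            recent.popleft()
        if len(recent) >= self.max_requests:
            return False
        recent.append(now)
        return True

    def pseudonym_for(self, researcher, lab, local_id, now=None):
        now = time.monotonic() if now is None else now
        if researcher not in self._authorized:
            raise PermissionError(f"{researcher} is not authorized")
        if not self._allow(researcher, now):
            raise RuntimeError("rate limit exceeded")
        key = (lab, local_id)
        if key not in self._to_pseudonym:
            # Random pseudonym, so the local ID cannot be derived from it.
            self._to_pseudonym[key] = secrets.token_hex(8)
        return self._to_pseudonym[key]


broker = TrustedBroker(max_requests=2, window_seconds=60.0)
broker.authorize("alice")
first = broker.pseudonym_for("alice", "lab_a", "17", now=0.0)
second = broker.pseudonym_for("alice", "lab_a", "17", now=1.0)
print(first == second)
```

Because the translation table lives only inside the broker, researchers see stable pseudonyms while the link back to lab-local identifiers stays with the trusted party.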

Example Centers Included
General Clinical Research Center, the Collaborative Studies Coordinating Center, the Lineberger Comprehensive Cancer Center, the Carolina Center for Exploratory Genetic Analysis, the Carolina Center for Genome Sciences, the Carolina Exploratory Center for Cheminformatics Research, the Biomedical Imaging Research Center, the Carolina Environmental Bioinformatics Center, the Center for Bioinformatics, the Renaissance Computing Institute, and the Odum Institute for Research in Social Science.

Summary--Timeline
Initial workshop beginning the project (spring 2005)
Analysis of data requirements, policies, and existing infrastructure at UNC. Internal interviews with labs (spring through fall 2005)
Development of complete list of data elements; review with labs and finalize elements for common model (fall 2005-spring 2006)
Development of draft model (fall 2006-spring 2007)
Testing of draft model using example labs' data (fall 2007)
Review by labs and researchers at UNC. Share with outside experts to solicit critiques. (fall 2007)
Use this work to develop new grants to fund actual deployment of common data models, policies and infrastructure at UNC. (spring 2007-current)