A National Virtual Specimen Database for Early Cancer Detection

Presentation transcript:

1 A National Virtual Specimen Database for Early Cancer Detection
June 26, 2003
- Daniel Crichton, NASA Jet Propulsion Laboratory
- Sean Kelly, NASA Jet Propulsion Laboratory
- Mark Thornquist, Fred Hutchinson Cancer Research Center
- Sudhir Srivastava, National Cancer Institute
- Heather Kincaid, Fred Hutchinson Cancer Research Center
- Donald Johnsey, National Cancer Institute
- Marcy Winget, Fred Hutchinson Cancer Research Center

2 Vision
- Development of a world-wide knowledge and informatics environment for sharing cancer specimen data across repositories
- Data and computers interconnected to form a virtual database
- Integrated cancer resources: specimens, images, assays, biomarkers, etc.

3 Early Detection Research Network (EDRN)
- 5-year collaboration supported by NCI
- Goal: Identify, evaluate, and validate promising biomarkers to support the early detection of cancer
- Comprised of:
  - 18 Biomarker Laboratories
  - 9 Clinical and Epidemiology Centers
  - 3 Biomarker Validation Laboratories
  - Data Management and Coordinating Center

4 EDRN Resource Network Exchange (ERNE)
- Virtual Specimen Repository (real-time access to distributed repositories)
- Informatics infrastructure created for EDRN
- Existing sites' specimen databases maintained locally
- Uses EDRN Common Data Elements (CDEs)
- Maps institutions' local data definitions to EDRN CDEs
- Secure and Confidential
- Secure Dynamic Portal

5 Informatics Deployment

6 Information Infrastructure Progress
Initiation (10/00 - 3/01)
- Connect Moffitt and San Antonio
- Finalize EDRN CDEs used in knowledge system
- Create Dynamic Portal
- Present Feasibility at EDRN S.C. Meeting
- Discuss Informatics at 2nd EDRN S.C. Meeting
- Present Mock Knowledge System at EDRN S.C. Meeting
Feasibility (4/ /01)
Pilot (10/01 - 9/02)
- Implement four sites
- Finalize IRB Protocol template
- Create Online Mapping Tool
- Present at EDRN S.C. Meeting
Implementation (9/02 - 6/03)
- Implement three additional sites
- Present at EDRN S.C. Meeting

7 EDRN Bioinformatics Architecture
[Diagram: three layers connected through an API]
1. Bioinformatics tools and applications use the "API"
  - Visualization Tools
  - Analysis Tools
  - Web Search Tools
2. Middleware creates the informatics infrastructure connecting systems and data
  - "OODT" Middleware
  - Metadata Mediation
  - Standard Metadata
3. Repositories for storing and retrieving many data types
  - EDRN Data Repositories
  - SPORE Data Repositories
  - Other Data Repositories

8 Informatics Infrastructure
- Connect local databases via the Internet
- Query multiple institutional databases concurrently
- Metadata-based distributed framework
- Object Oriented Data Technology (OODT) framework (JPL)
  - Combines semantic data model with distributed services to create a “grid” architecture
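The idea of querying several institutional databases concurrently can be sketched as a simple fan-out and merge. This is only an illustration, not the OODT implementation: the site names, the in-memory record lists, and the functions `query_site` and `query_all` are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for institutional specimen databases.
SITES = {
    "site_a": [
        {"specimen_type": "serum", "organ": "lung"},
        {"specimen_type": "tissue", "organ": "colon"},
    ],
    "site_b": [{"specimen_type": "serum", "organ": "prostate"}],
    "site_c": [],
}


def query_site(site, criteria):
    """Return (site, records at that site matching every criterion)."""
    hits = [
        row for row in SITES[site]
        if all(row.get(key) == value for key, value in criteria.items())
    ]
    return site, hits


def query_all(criteria):
    """Send the same query to every site at once and combine the results."""
    with ThreadPoolExecutor(max_workers=len(SITES)) as pool:
        results = pool.map(lambda site: query_site(site, criteria), SITES)
        return {site: hits for site, hits in results}


combined = query_all({"specimen_type": "serum"})
```

Each site is queried in its own worker thread, so one slow site does not block the requests sent to the others; the caller only sees the combined result.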

9 OODT Framework
- Developed by NASA to support science data management for the robotic planetary program
- Defines a reusable architectural pattern that enables:
  - information clustering and retrieval across distributed data resources
  - intelligent query algorithm for scalability
  - interoperability between disparate data models
  - reusable software components
  - domain independence
  - plug-in for various distributed computing implementations

10 Critical OODT Components
- Query Server – Manages and routes concurrent queries to distributed resources. Combines results.
- Profile Server – Enables resource discovery, providing information about what data resources are available (a resource is really an electronic object)
- Product Server – Enables access and retrieval of data products from an online data source
- Servers written in Java and supported on Windows, Linux, Solaris, Mac OS X, etc.
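How the three server roles fit together can be sketched roughly as follows. The actual servers are written in Java; the Python classes, method names (`register`, `discover`, `retrieve`, `query`), and site names below are assumptions made for illustration and are not the OODT API.

```python
from dataclasses import dataclass, field


@dataclass
class ProductServer:
    """Gives access to the records held by one (hypothetical) site."""
    name: str
    records: list

    def retrieve(self, element, value):
        return [r for r in self.records if r.get(element) == value]


@dataclass
class ProfileServer:
    """Records which data elements each product server can answer for."""
    profiles: dict = field(default_factory=dict)

    def register(self, server, elements):
        self.profiles[server.name] = set(elements)

    def discover(self, element):
        return sorted(n for n, els in self.profiles.items() if element in els)


class QueryServer:
    """Routes a query to the servers found via the profile server and merges results."""

    def __init__(self, profile_server, product_servers):
        self.profile_server = profile_server
        self.product_servers = {s.name: s for s in product_servers}

    def query(self, element, value):
        merged = []
        for name in self.profile_server.discover(element):
            for record in self.product_servers[name].retrieve(element, value):
                merged.append({"source": name, **record})
        return merged


site_a = ProductServer("site_a", [{"SPECIMEN_TYPE": "serum"}, {"SPECIMEN_TYPE": "tissue"}])
site_b = ProductServer("site_b", [{"ORGAN": "lung"}])
profiles = ProfileServer()
profiles.register(site_a, ["SPECIMEN_TYPE"])
profiles.register(site_b, ["ORGAN"])
router = QueryServer(profiles, [site_a, site_b])
hits = router.query("SPECIMEN_TYPE", "serum")
```

In this sketch the query server never contacts `site_b` for a `SPECIMEN_TYPE` query, because the profile server reports that only `site_a` describes that element.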

11 Software Component Deployment
[Diagram: user queries flow through the EDRN Secure Website to Product Servers at the participating sites]
- User query / Science Tools
- EDRN Secure Website: Web server, search.jsp, QueryClient
- DMCC – Fred Hutchinson Cancer Research Center
- EDRN Profile Server
- EDRN CDE Mapping Database
- Product Servers, each shown with a Specimen Database: Moffitt, San Antonio, MD Anderson, Colorado, Creighton, GLNE, Pittsburgh, New York, Brigham and Womens

12 Semantic Architecture
- Define a common data model for EDRN
  - Common Data Elements
  - Relationships between elements
- Institutions have existing specimen repositories with locally defined data models
  - Map local data elements to CDEs using EDRN CDE mapping and repository tools
  - 39 CDEs shared
- Use standards
  - ISO/IEC
  - Resource Description Framework (RDF)
- Use standard definitions for data exchange
  - Communicate using a standard XML schema
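Mapping locally named fields to common data elements and exchanging the result as XML might look like the sketch below. The local field names, the CDE names, and the XML element and attribute names are all hypothetical; the slides do not show the actual EDRN schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from one site's local column names to CDE names.
FIELD_MAP = {"sex": "GENDER", "spec_type": "SPECIMEN_TYPE"}


def to_cde_record(local_row):
    """Keep only mapped fields and rename them to their CDE names."""
    return {FIELD_MAP[k]: v for k, v in local_row.items() if k in FIELD_MAP}


def to_xml(cde_record):
    """Serialize a CDE record as XML (element names are illustrative only)."""
    root = ET.Element("specimen")
    for name in sorted(cde_record):
        child = ET.SubElement(root, "cde", {"name": name})
        child.text = str(cde_record[name])
    return ET.tostring(root, encoding="unicode")


local_row = {"sex": "Female", "spec_type": "serum", "local_id": "123"}
xml_text = to_xml(to_cde_record(local_row))
```

Fields with no mapping, such as the hypothetical `local_id`, are simply left out of the exchanged record, so only shared elements leave the site.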

13 Gender Mapping Example
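The figure for this slide is not part of the transcript. As a rough illustration of what a value-level mapping could look like, the sketch below translates site-specific gender codes into a shared set of values. The local codes, the permissible values, and the choice to map unrecognized codes to "Unknown" are all assumptions, not the EDRN definitions.

```python
# Hypothetical permissible values for a shared gender data element.
CDE_GENDER_VALUES = {"Male", "Female", "Unknown"}

# Hypothetical per-site mappings from local codes to the shared values.
SITE_GENDER_MAPS = {
    "site_a": {"M": "Male", "F": "Female"},
    "site_b": {1: "Male", 2: "Female", 9: "Unknown"},
}


def to_cde_gender(site, local_value):
    """Translate a site's local gender code; unmapped codes become 'Unknown'."""
    mapped = SITE_GENDER_MAPS[site].get(local_value, "Unknown")
    if mapped not in CDE_GENDER_VALUES:
        raise ValueError(f"mapping for {site} yields a non-permissible value: {mapped}")
    return mapped


examples = [to_cde_gender("site_a", "F"), to_cde_gender("site_b", 1), to_cde_gender("site_a", "X")]
```

With a mapping of this kind, a query for one shared value can match records from sites that store the same information under different local codes.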

14 Security and Confidentiality
- Highly Sensitive Information
- Health Insurance Portability and Accountability Act (HIPAA)
  - Removed Personal Health Information (PHI)
- Security Measures
  - 128-bit strong encryption using Secure Socket Layer (SSL)
  - Access limited to remote connections from specific IP(s) on specific ports
  - Firewalls augmented with rule set
- Institutions' IRBs
  - Common Protocol
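Limiting access to specific IPs on specific ports is normally enforced in the firewall rule set, but the check itself can be sketched in a few lines. The addresses below come from ranges reserved for documentation, and the port number and the `is_allowed` function are hypothetical.

```python
import ipaddress

# Hypothetical allowlist: (permitted source network, permitted port).
ALLOWED = [
    (ipaddress.ip_network("192.0.2.10/32"), 8443),
    (ipaddress.ip_network("198.51.100.0/28"), 8443),
]


def is_allowed(remote_ip, port):
    """Accept a connection only from a listed source address on its listed port."""
    address = ipaddress.ip_address(remote_ip)
    return any(address in network and port == allowed_port
               for network, allowed_port in ALLOWED)


checks = {
    "listed host, listed port": is_allowed("192.0.2.10", 8443),
    "unlisted host": is_allowed("192.0.2.11", 8443),
    "listed host, wrong port": is_allowed("198.51.100.7", 80),
}
```

Anything not explicitly listed is rejected, which matches the default-deny posture the slide describes.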

15 Dynamic Portal

16 Advanced Search

17 Results

18 Number of Participants by Specimen Type

19 ERNE Achievements
- Deployed Software Infrastructure to 10 institutions
  - Process of connecting new sites well understood
- Software Infrastructure Maturing
  - Extensive nightly testing and monitoring of infrastructure
- Team Maturing and Growing
- Policy Challenges
- Institutional Access
- Science Support

20 More Information
- EDRN –
- OODT –
- Contact: Heather Kincaid: Dan Crichton: Don Johnsey:

21 Quick Search

22 Dynamic Portal
- JSP-based implementation that queries informatics infrastructure
  - Uses CDE terms for constructing query expression
- Shows available servers
- Limits available choices based on selected criteria
- Quick Search
- Advanced Search
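Two of the portal behaviors listed here, building a query expression from CDE terms and narrowing the remaining choices once criteria are selected, can be sketched as follows. The real portal is JSP-based; this Python sketch uses a made-up expression syntax, hypothetical CDE names, and sample records invented for the example.

```python
# Invented sample records keyed by hypothetical CDE names.
RECORDS = [
    {"GENDER": "Female", "SPECIMEN_TYPE": "serum"},
    {"GENDER": "Female", "SPECIMEN_TYPE": "tissue"},
    {"GENDER": "Male", "SPECIMEN_TYPE": "serum"},
]


def build_expression(criteria):
    """Join selected CDE terms into one expression (illustrative syntax, no escaping)."""
    parts = [f"{term} = '{value}'" for term, value in sorted(criteria.items())]
    return " AND ".join(parts)


def available_choices(records, criteria, term):
    """List the values of `term` still possible given the criteria chosen so far."""
    matching = [r for r in records
                if all(r.get(k) == v for k, v in criteria.items())]
    return sorted({r[term] for r in matching if term in r})


expression = build_expression({"SPECIMEN_TYPE": "serum", "GENDER": "Female"})
choices = available_choices(RECORDS, {"SPECIMEN_TYPE": "tissue"}, "GENDER")
```

Sorting the terms keeps the generated expression stable regardless of the order in which the user picked the criteria.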

23 Quick Search Results