0 caBIG and caGrid: Interoperable Computing Infrastructure for the Nation’s [and World’s] Cancer Research Enterprise Peter A. Covitz, Ph.D. Chief Operating.

Slides:



Advertisements
Similar presentations
Introduction The cancerGrid metadata registry (cgMDR) has proved effective as a lightweight, desktop solution, interoperable with caDSR, targeted at the.
Advertisements

27 June 2005caBIG an initiative of the National Cancer Institute, NIH, DHHS caBIG the cancer Biomedical Informatics Grid Arumani Manisundaram caBIG - Project.
Open Grid Forum 19 January 31, 2007 Chapel Hill, NC Stephen Langella Ohio State University Grid Authentication and Authorization with.
CACORE TOOLS FEATURES. caCORE SDK Features caCORE Workbench Plugin EA/ArgoUML Plug-in development Integrated support of semantic integration in the plugin.
CVRG Presenter Disclosure Information Tahsin Kurc, PhD Center for Comprehensive Informatics Emory University CardioVascular Research Grid Core Infrastructure.
WEB SERVICES DAVIDE ZERBINO.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 12 Slide 1 Distributed Systems Design 2.
CaBIG™ Terminology Services Path to Grid Enablement Thomas Johnson 1, Scott Bauer 1, Kevin Peterson 1, Christopher Chute 1, Johnita Beasley 2, Frank Hartel.
CaGrid Service Metadata Scott Oster - Ohio State
Frank Hartel, PhD Enterprise Vocabulary Services National Cancer Institute NCI Enterprise Vocabulary Services (EVS) and Semantic Integration at NCI - An.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
0 The Cancer Biomedical Informatics Grid From Village to City Peter A. Covitz, Ph.D. Director, Core Infrastructure National Cancer Institute Center for.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
Technical Introduction to caGrid Service Development caGrid 1.3 Justin Permar caGrid Knowledge Center
CaBIG: the cancer Biomedical Informatics Grid Ken Buetow NCICB/NCI/NIH/DHHS.
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
CaGrid Executive Introduction caGrid 1.3 Justin Permar caGrid Knowledge Center kc.nci.nih.gov/CaGrid/KC.
Department of Biomedical Informatics Development of Ontology-anchored Grid-based Data Services to Facilitate Integrative Clinical and Translational Science.
Adapting an Existing Data Service to be caBIG™ Silver-level Compliant Peter Hussey LabKey Software, Inc, Seattle, WA USA Contact: Abstract.
Silver to Grid Data Services Session III: Deploying a Data Service on caGrid and using caGrid Service APIs caBIG™ Annual Meeting June 23-25, 2008.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois Shannon Hastings Department of Biomedical Informatics Ohio State University.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
LexEVS Overview Mayo Clinic Rochester, Minnesota June 2009.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Interfacing Registry Systems December 2000.
1 A National Virtual Specimen Database for Early Cancer Detection June 26, 2003 Daniel Crichton NASA Jet Propulsion Laboratory Sean Kelly NASA Jet Propulsion.
H Using the Open Metadata Registry (OpenMDR) to generate semantically annotated grid services Rakesh Dhaval, MS, Calixto Melean,
Middleware Support for Virtual Organizations Internet 2 Fall 2006 Member Meeting Chicago, Illinois Stephen Langella Department of.
Web Services Based on SOA: Concepts, Technology, Design by Thomas Erl MIS 181.9: Service Oriented Architecture 2 nd Semester,
CaBIG ® VCDE Workspace Tactics thru June 14, 2010: How working groups fit together, and other activities Brian Davis April 1, 2010 VCDE WS Teleconference.
Clinical Data Interchange Standards Consortium (CDISC) uses NCIt for its Study Data Tabulation Model (SDTM) and other global data standards for medical.
Open Terminology Portal (TOP) Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer Institute, Center for Biomedical Informatics.
Shannon Hastings Multiscale Computing Laboratory Department of Biomedical Informatics.
Cancer MetaData Standards Peter A. Covitz, Ph.D. HL7 RCRIM October 1, 2002.
CaCORE Software Development Kit George Komatsoulis 25-Feb-2005.
0 Cancer Biomedical Informatics Grid (caBIG) – An Approach towards Data Access and Integration Avinash Shanbhag Director, Core Infrastructure Engineering.
CaDSR Software Users Meeting 3.1 Requirements Review 9/19/2005 caDSR Software Team Host: Denise Warzel NCICB, Assistant Director, caDSR.
Grid Services I - Concepts
CaGrid Overview and Core Services caGrid Knowledge Center February 2011.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
1 Cancer Models Database (caMOD). 2 History  January 2000 – Prototype is presented during the Mouse Models of Human Cancers (MMHCC) Steering Committee.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
WEB SERVICE DESCRIPTION LANGUAGE (WSDL). Introduction  WSDL is an XML language that contains information about the interface semantics and ‘administrivia’
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
What is NCIA? National Cancer Imaging Archive Searchable repository of in vivo cancer images in DICOM format Publicly available at no cost over the Internet.
Adapting an Existing Data Service to be caBIG™ Silver-level Compliant Peter Hussey LabKey Software, Inc, Seattle, WA USA Contact: Abstract.
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
Patterns in caBIG Baris E. Suzek 12/21/2009. What is a Pattern? Design pattern “A general reusable solution to a commonly occurring problem in software.
Protégé 3.4 Plug-in for Editing and Maintaining the NCI Thesaurus Protégé Conference June 23, 2009 Amsterdam Sherri de Coronado, Gilberto Fragoso.
Design for a High Performance, Configurable caGrid Data Services Platform Peter Hussey LabKey Software, Inc, Seattle, WA USA Contact:
CaBIG™ Terminology Services Path to Grid Enablement Thomas Johnson 1, Scott Bauer 1, Kevin Peterson 1, Christopher Chute 1, Johnita Beasley 2, Frank Hartel.
0 caCORE: A Common Framework for Cancer Data Management Denise Warzel Associate Director, Core Infrastructure National Cancer Institute Center for Bioinformatics.
Welcome to the caBIG Community! The cancer Biomedical Informatics Grid (caBIG ® ) offers more than 120 open source tools, technologies and infrastructure.
0 Vision and Infrastructure Behind the Cancer Biomedical Informatics Grid Peter A. Covitz, Ph.D. Director, Core Infrastructure National Cancer Institute.
Challenges and issues with information sharing: The four pillars of semantic interoperability Douglas B. Fridsma, MD, PhD, FACP University of Pittsburgh.
Collaborative and Open Source Software Development NCI’s caBIG™ Collaborative Environment Sharon Gaheen, SAIC Program Manager Himanso Sahni, SAIC Chief.
National Cancer Institute caDSR Briefing for Small Scale Harmonication Project Denise Warzel Associate Director, Core Infrastructure caCORE Product Line.
Semantic Interoperability: caCORE and the Cancer Data Standards Repository (caDSR)  Jennifer Brush.
International Planetary Data Alliance Registry Project Update September 16, 2011.
VCDE WS in EY2 Where we are, where we’re going ICR WS Teleconference Brian Davis – VCDE WS Lead March 26, 2008.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois
NCI Center for Biomedical Informatics and Information Technology (CBIIT) The CBIIT is the NCI’s strategic and tactical arm for research information management.
Java Web Services Orca Knowledge Center – Web Service key concepts.
Public Health Information Network Annual Meeting Atlanta, GA
Wsdl.
Vision and Infrastructure Behind the
WEB SERVICES DAVIDE ZERBINO.
Presentation transcript:

0 caBIG and caGrid: Interoperable Computing Infrastructure for the Nation’s [and World’s] Cancer Research Enterprise Peter A. Covitz, Ph.D. Chief Operating Officer National Cancer Institute Center for Bioinformatics TM

1  The Center for Bioinformatics is the NCI’s strategic and tactical arm for research information management  We collaborate with both intramural and extramural groups  Mission to integrate and harmonize disparate biomedical research data  Production, service-oriented organization. Evaluated based upon customer and partner satisfaction.

2 The Problem 1,372,910 new cancer cases and 570,280 deaths due to cancer expected in the U.S. in 2005 Jemal et al., CA Cancer J Clin 2005; 55:10-30

3 A National Response  Enable investigators and research teams nationwide to combine and leverage their findings and expertise.  Create scalable, actively managed organization that will connect members of the NCI-supported cancer enterprise by building a biomedical informatics network The Cancer Biomedical Informatics Grid™ (caBIG™)

4 Scenario from caBIG Strategic Plan A researcher involved in a phase II clinical trial of a new targeted therapeutic for brain tumors observes that cancers derived from one specific tissue progenitor appear to be strongly affected. The trial has been generating proteomic and microarray data. The researcher would like to identify potential biochemical and signaling pathways that might be different between this cell type and other potential progenitors in cancer, deduce whether anything similar has been observed in other clinical trials involving agents known to affect these specific pathways, and identify any studies in model organisms involving tissues with similar pathway activity.

5 Semantic interoperability Syntactic interoperability Interoperability ability of a system to access and use the parts or equipment of another system

6 SYNTACTIC SEMANTIC caBIG Compatibility Guidelines

7 caCORE Model Driven Architecture + Computable Semantics = Platform for Syntactic and Semantic Interoperability

8

9 caCORE Bioinformatics ObjectsEnterprise VocabularyCommon Data Elements SECURITYSECURITY

10 Bioinformatics Objects

11  What do all those UML data Classes and Attributes actually mean, anyway?  UML model components are mapped to semantic concepts drawn from Enterprise Vocabulary sources, then registered in the Cancer Data Standards Repository (caDSR).  caDSR is a metadata registry, implements ISO/IEC standard for Common Data Elements (CDEs). Common Data Elements

12 Preferred Name Synonyms Definition Relationships Concept Code Enterprise Vocabulary Description Logic

13 caCORE Software Development Kit

14 caCORE SDK Components  UML Modeling Tool (any with XMI export)  Semantic Connector (concept binding utility)  UML Loader (model registration in caDSR)  Codegen (middleware code generator)  Security Adaptor (Common Security Module) caCORE SDK Generates a caBIG Silver-Compliant System caCORE SDK Generates a caBIG Silver-Compliant System

15 Java Applications Data Access Objects Web Application Server Interfaces Java SOAP XML HTTP Clients SOAP Clients Data Clients Perl Clients Enterprise Vocabulary Common Data Elements Middleware APIAPI APIAPI APIAPI APIAPI Data Access Objects Domain Objects [Gene, Disease, etc.] Domain Objects [Gene, Disease, Agent, etc.] caCORE Architecture Biomedical Data Authorization

16 From Silver to Gold: caGrid

17 Use cases not satisfied by caCORE alone  Advertisement –Service Provider composes service metadata describing the service and publishes it to grid.  Discovery –Researcher (or application developer) specifies search criteria describing a service of interest –The research submits the discovery request to a discovery service, which identifies a list of services matching the criteria, and returns the list.  Invocation –Researcher (or application developer) instantiates the grid service and access its resources

18 Gold Cancer Center Cancer Center NCI OTHER caBIG SERVICE PROVIDERS OTHER TOOLKITS Silver

19 caGrid 1.0 Architecture Grid Communication Protocol Service Description Service Business Process Service Registry Security Semantic service Resource Management Functions Quality of Service ID Resolution Transport GSI DORIAN GT4 Workflow GLOBUS Toolkit caDSR EVSGT4 Portal caDSR GME GTS Index IntroduceFQE Grid ID

20 Data Object Semantics, Metadata, and Schemas  Object oriented, APIs, well-defined data types  Classes defined in UML and converted into ISO/IEC 11179, registered in the caDSR  Definitions drawn from Enterprise Vocabulary Services (EVS), relationships semantically described  XML serialization of objects adhere to XML schemas registered in the Global Model Exchange (GME)

21 Service Data Elements  Two types of top-level grid services defined –Data Services –Analytical Services  Service Data Elements (SDEs) describe services so clients can discover what they do

22 Integrating with other Grids  caGrid intentionally focused on federated data and analytic service interoperability, not computing power  Adoption of standard grid tooling intended to facilitate integration other grids with compute power focus  Seeking partnership with established compute grids to install caGrid Analytical Service nodes that would be transparently available to caGrid users

23 Acknowledgements  caCORE –Denise Warzel –George Komatsoulis –Avinash Shanbhag –Frank Hartel –Dianne Reeves –Sherri De Coronado –Gilberto Fragoso –SAIC –Terrapin Systems –Oracle –Ekagra –ScenPro –Apelon –MSD  caGrid –Avinash Shanbhag, NCI –Joel Saltz and colleagues, Ohio State U. –Ian Foster and colleagues, U. Chicago/Argonne –Booz Allen Hamilton –SAIC –SemanticBits

24 Links  caBIG: –  caGrid –  caCORE –