Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois Shannon Hastings Department of Biomedical Informatics Ohio State University.

Presentation transcript:

Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois Shannon Hastings Department of Biomedical Informatics Ohio State University

National Cancer Institute 2015 Goal
- Relieve suffering and death due to cancer by the year 2015

Cancer Biomedical Informatics Grid (caBIG™)
The cancer Biomedical Informatics Grid (caBIG™) is a voluntary network, or grid, connecting individuals and institutions to enable the sharing of data and tools, creating a World Wide Web of cancer research. The goal is to speed the delivery of innovative approaches for the prevention and treatment of cancer. The infrastructure and tools created by caBIG™ also have broad utility outside the cancer community.
- National Cancer Institute Initiative
- Over 800 Participants
- Over 80 Organizations
- Over 70 Projects

Origins of caBIG
- Need: Enable investigators and research teams nationwide to combine and leverage their findings and expertise in order to meet the NCI 2015 Goal.
- Strategy: Create a scalable, actively managed organization that will connect members of the NCI-supported cancer enterprise by building a biomedical informatics network.

caBIG Community Organization

caBIG Overview
- Common, widely distributed infrastructure that permits the cancer research community to focus on innovation
- Shared, harmonized set of terminology, data elements, and data models that facilitate information exchange
- Collection of interoperable applications developed to common standards
- Cancer research data is available for mining and integration

Interoperability
- The ability of multiple systems to exchange information and to be able to use the information that has been exchanged.
  - Syntactic interoperability
  - Semantic interoperability
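The difference between the two kinds of interoperability can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two sites, their field names, and the concept identifiers are hypothetical and are not real caDSR or EVS entries. Both sites agree on a wire format (syntactic interoperability), but their records only become comparable once local field names are mapped to shared concepts (semantic interoperability).

```python
import json

# Hypothetical mappings from each site's local field names to shared concept
# identifiers. The identifiers are made up; they are not real EVS or caDSR codes.
SITE_A_CONCEPTS = {"dx": "CONCEPT:diagnosis", "age_yrs": "CONCEPT:age_in_years"}
SITE_B_CONCEPTS = {"diagnosis": "CONCEPT:diagnosis", "age": "CONCEPT:age_in_years"}


def parse(message: str) -> dict:
    """Syntactic interoperability: both sides agree on the wire format (JSON here)."""
    return json.loads(message)


def to_concepts(record: dict, concept_map: dict) -> dict:
    """Semantic interoperability: re-key a record by shared concept identifiers."""
    return {concept_map[name]: value for name, value in record.items()}


site_a = parse('{"dx": "melanoma", "age_yrs": 54}')
site_b = parse('{"diagnosis": "melanoma", "age": 54}')

# Both messages parse, yet the raw records do not line up field for field.
same_raw = site_a == site_b

# After mapping to shared concepts the two records are directly comparable.
same_meaning = (
    to_concepts(site_a, SITE_A_CONCEPTS) == to_concepts(site_b, SITE_B_CONCEPTS)
)
```

In this sketch the dictionary lookups stand in for the role that curated data elements and controlled vocabularies play on the grid.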

caBIG Compatibility Guidelines
[Figure: compatibility guidelines, with panels labeled SYNTACTIC and SEMANTIC]

What is caGrid?
- Development project of the Architecture Workspace, aimed at helping define and implement Gold Compliance (the highest level of caBIG compatibility)
- Gold compliance creates the G in caBIG
- Gold => Grid => connecting Silver Compliant Systems
- Gold compliance places no requirements on implementation technology
- Specifications will be created defining requirements for interoperability
- caGrid provides core infrastructure and tooling to provide “a way” to achieve Gold compliance

caGrid Conceptual View
[Figure: research centers and NCICB with their tools and data sources (Microarray, Gene Database, Protein Database, Image, caArray), Grid Data Services and Analytical Services, a Grid-Enabled Client, and a Grid Portal, all connected through the Grid Services Infrastructure (Metadata, Registry, Query, Invocation, Security, etc.)]

caGrid Data Description Infrastructure
- Client and service APIs are object oriented, and operate over well-defined and curated data types
- Objects are defined in UML and converted into ISO/IEC 11179 Administered Components, which are in turn registered in the Cancer Data Standards Repository (caDSR)
- Object definitions draw from vocabulary registered in the Enterprise Vocabulary Services (EVS), and their relationships are thus semantically described
- XML serialization of objects adheres to XML schemas registered in the Global Model Exchange (GME)
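The idea that an object's XML serialization must adhere to a registered schema can be sketched in a few lines of Python. This is a simplified illustration, not the caGrid or GME API: the namespace URI, the `Gene` class, and the `SCHEMA_REGISTRY` dictionary are hypothetical stand-ins, and the check only compares element names rather than performing real XML Schema validation.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, fields

# Hypothetical namespace standing in for a schema registered in the GME.
GENE_NS = "gme://example.org/hypothetical/gene"

# Stand-in for a schema lookup: namespace -> element names the schema requires.
SCHEMA_REGISTRY = {GENE_NS: {"symbol", "organism"}}


@dataclass
class Gene:
    """Hypothetical domain object, as it might be modeled in UML."""
    symbol: str
    organism: str


def serialize(gene: Gene) -> str:
    """Serialize the object as namespace-qualified XML, one element per attribute."""
    root = ET.Element(f"{{{GENE_NS}}}Gene")
    for attribute in fields(gene):
        child = ET.SubElement(root, f"{{{GENE_NS}}}{attribute.name}")
        child.text = getattr(gene, attribute.name)
    return ET.tostring(root, encoding="unicode")


def conforms(xml_text: str) -> bool:
    """Check that the root namespace is registered and all required elements exist."""
    root = ET.fromstring(xml_text)
    namespace = root.tag[1:].partition("}")[0] if root.tag.startswith("{") else ""
    required = SCHEMA_REGISTRY.get(namespace)
    if required is None:
        return False
    present = {child.tag.rpartition("}")[2] for child in root}
    return required <= present


gene_xml = serialize(Gene(symbol="TP53", organism="Homo sapiens"))
is_valid = conforms(gene_xml)
```

A document with an unregistered namespace, or one missing a required element, is rejected by `conforms`, which is the behavior a schema registry is meant to enable for any consumer of the data.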

Conceptual View of the Problem

caGrid Components
- Leverage existing technologies:
  - caDSR, EVS, Mobius GME: Common data elements, controlled vocabularies, schema management
  - Globus Toolkit (currently version 4.0.3)
    - Core grid services infrastructure
    - Service deployment, service registry, invocation, base security infrastructure
- Additional Core Infrastructure
  - Higher-level security services (Dorian, GTS, GridGrouper)
  - Grid service access to metadata components (caDSR, GME, etc.)
  - Workflow, Identifier services
- Service Provider Tooling (Introduce)
  - Graphical service development and configuration environment
  - Abstractions from service infrastructure for Data and Analytical services
  - Deployment wizards
- Client Tooling
  - High-level APIs for interacting with core components and services
  - Graphical Tools

Grid Authentication and Authorization with Reliably Distributed Services (GAARDS)
- The GAARDS Security Infrastructure provides services and tools for the administration and enforcement of security policy in an enterprise Grid.
- Developed on top of the Globus Toolkit
- Extends the Grid Security Infrastructure (GSI)
- Provides enterprise services and administrative tools for:
  - Grid User Management
  - Identity Federation
  - Trust management
  - Group/VO management
  - Access Control Policy management and enforcement
  - Integration between existing security domains and the grid security domain
- Security Infrastructure for the Cancer Biomedical Informatics Grid (caBIG™)

GAARDS Services
- Dorian
  - Grid User Account Management
  - Integration point between external security domains and the grid.
  - Allows accounts managed in external domains to be federated and managed in the grid.
  - Dorian allows users to use their existing credentials (external to the grid) to authenticate to the grid
- Grid Trust Service (GTS)
  - Creation and management of a federated trust fabric.
  - Supports applications and services in deciding whether or not signers of digital credentials/user attributes can be trusted.
  - Supports the provisioning of trusted certificate authorities and corresponding CRLs.
- Grid Grouper
  - Group management service for the grid
  - Provides a group-based authorization solution for the Grid
  - Enforces authorization policy based on membership in groups
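How a service might combine a trust decision with a group-based authorization decision can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the GTS or Grid Grouper API: the class names, the CA name, the serial numbers, and the group name are all hypothetical, and real credentials would be X.509 certificates rather than plain strings.

```python
from dataclasses import dataclass, field


@dataclass
class TrustFabric:
    """Stand-in for what a trust service provisions: trusted CAs and their CRLs."""
    trusted_cas: set = field(default_factory=set)
    revoked: dict = field(default_factory=dict)  # CA name -> revoked serial numbers

    def is_trusted(self, issuer: str, serial: int) -> bool:
        # The signer must be a trusted CA and the credential must not be revoked.
        return issuer in self.trusted_cas and serial not in self.revoked.get(issuer, set())


@dataclass
class GroupRegistry:
    """Stand-in for a group management service."""
    members: dict = field(default_factory=dict)  # group name -> member identities

    def is_member(self, identity: str, group: str) -> bool:
        return identity in self.members.get(group, set())


def authorize(issuer, serial, identity, required_group, fabric, groups) -> bool:
    """Allow access only for a trusted credential whose holder is in the group."""
    return fabric.is_trusted(issuer, serial) and groups.is_member(identity, required_group)


# Hypothetical data for illustration only.
fabric = TrustFabric(
    trusted_cas={"CN=Example Dorian CA"},
    revoked={"CN=Example Dorian CA": {1002}},
)
groups = GroupRegistry(members={"imaging-researchers": {"/CN=alice"}})

allowed = authorize("CN=Example Dorian CA", 1001, "/CN=alice", "imaging-researchers", fabric, groups)
denied_revoked = authorize("CN=Example Dorian CA", 1002, "/CN=alice", "imaging-researchers", fabric, groups)
```

Keeping the trust check and the membership check in separate components mirrors the division of responsibilities described above, where each concern is handled by its own service.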

Accessing caGrid workflow
[Figure: workflow inputs and a BPEL workflow document are passed to the Workflow Management Service and its BPEL engine, which call a data service (uchicago.edu) and analytic services (osu.edu, duke.edu) and return workflow results]
- Workflow management service
- Sharing workflows
- Get workflow status
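The submit, status, and results interactions of a workflow management service can be sketched in Python. This is an in-memory illustration under stated assumptions, not the caGrid workflow service or a BPEL engine: the `WorkflowManager` class, the status names, and the three step functions (one data step, two analytic steps) are hypothetical, and the steps run locally instead of invoking remote grid services.

```python
import itertools
from enum import Enum


class Status(Enum):
    PENDING = "pending"
    DONE = "done"
    FAILED = "failed"


class WorkflowManager:
    """Hypothetical stand-in for a workflow management service."""

    def __init__(self):
        self._ids = itertools.count(1)
        self._workflows = {}

    def submit(self, steps, inputs):
        """Register a workflow (an ordered list of callables) and return its id."""
        workflow_id = next(self._ids)
        self._workflows[workflow_id] = {
            "steps": list(steps), "inputs": inputs,
            "status": Status.PENDING, "result": None,
        }
        return workflow_id

    def run(self, workflow_id):
        """Feed the output of each step into the next, recording the outcome."""
        workflow = self._workflows[workflow_id]
        value = workflow["inputs"]
        try:
            for step in workflow["steps"]:
                value = step(value)
        except Exception:
            workflow["status"] = Status.FAILED
        else:
            workflow["result"] = value
            workflow["status"] = Status.DONE

    def status(self, workflow_id):
        return self._workflows[workflow_id]["status"]

    def result(self, workflow_id):
        return self._workflows[workflow_id]["result"]


# Hypothetical steps standing in for one data service and two analytic services.
def fetch_values(sample_id):
    return [2.0, 4.0, 8.0]

def normalize(values):
    return [v / max(values) for v in values]

def value_range(values):
    return max(values) - min(values)


manager = WorkflowManager()
workflow_id = manager.submit([fetch_values, normalize, value_range], inputs="sample-1")
status_before = manager.status(workflow_id)
manager.run(workflow_id)
status_after = manager.status(workflow_id)
```

Because the workflow is stored under an identifier, a client can poll its status or hand the same identifier to a collaborator, which is the kind of sharing and status retrieval listed on the slide.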

Introduce Graphical Development Environment (GDE)
- GUI for creating and manipulating a grid service
- Provides a simple means of creating a service skeleton that a developer can then implement, build, and deploy
- Automatic code generation of a complete caBIG-compliant grid service that is configured to provide:
  - Security
  - Advertisement
  - Discovery
  - Complete Client API
- Provides a set of tools which enable the developer to add/remove/modify/import methods of the service as well as create sub-services.
- Automatic code generation of all the required code, Globus grid service code/configuration, service configuration, implementation of the client, and stubbed implementation of the service

Introduce Generated Grid Service Architecture
- Base service is a GT4-based, WSRF-capable grid service.
- Utilizes compositional inheritance (in lieu of non-standard port type extensions) to enable the service to inherit required features such as providing service security metadata and access to resource properties.
- Utilizes JNDI for server-side configuration properties, and for resources and resource properties.
- Provides client-side and service-side wrappers which implement the service designer's interface as opposed to the document literal interface generated by Axis.
- Provides metadata registration to the index service by configuring the Resource to register its service groups to a predefined caGrid MDS-based Index Service.
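Two of the ideas above, composing features into a service instead of extending a port type, and wrapping a document-style operation behind the designer's own interface, can be sketched in Python. This is an illustration under stated assumptions rather than code generated by Introduce: all class, method, and message names are hypothetical, and plain dictionaries stand in for the request and response documents.

```python
class SecurityMetadataProvider:
    """Reusable feature: describes how clients must secure their calls."""

    def get_service_security_metadata(self) -> dict:
        return {"transport_security": "required"}


class ResourcePropertyProvider:
    """Reusable feature: exposes named resource properties."""

    def __init__(self, properties: dict):
        self._properties = dict(properties)

    def get_resource_property(self, name: str):
        return self._properties[name]


class GeneQueryService:
    """Hypothetical service that gains features by composition and delegation."""

    def __init__(self):
        self._security = SecurityMetadataProvider()
        self._properties = ResourcePropertyProvider({"serviceName": "GeneQuery"})
        self._symbols = ["TP53", "TP63", "BRCA1"]

    # Delegating methods make the composed features part of the service interface.
    def get_service_security_metadata(self) -> dict:
        return self._security.get_service_security_metadata()

    def get_resource_property(self, name: str):
        return self._properties.get_resource_property(name)

    # Document-style operation: one request document in, one response document out.
    def find_genes(self, request: dict) -> dict:
        prefix = request["findGenesRequest"]["symbolPrefix"]
        matches = [s for s in self._symbols if s.startswith(prefix)]
        return {"findGenesResponse": {"symbols": matches}}


class GeneQueryClient:
    """Wrapper exposing the designer's interface instead of raw documents."""

    def __init__(self, service: GeneQueryService):
        self._service = service

    def find_genes(self, symbol_prefix: str) -> list:
        request = {"findGenesRequest": {"symbolPrefix": symbol_prefix}}
        return self._service.find_genes(request)["findGenesResponse"]["symbols"]


client = GeneQueryClient(GeneQueryService())
tp_genes = client.find_genes("TP")
```

The caller works with a string argument and a list result, while the wrapping and unwrapping of documents stays inside the client class.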

Collaborating Architects and Developers
- Ohio State University
- Argonne National Lab
- Duke University
- Georgetown University
- Semantic Bits

Project Resources and Communication
- caBIG at NCI
- Globus Dev
- caGrid 1.0 GForge Home:
  - Feature Requests
  - Bug Reports
  - Discussion Forums
  - Public Wiki
  - Quality Dashboards
  - Downloads / Source Repository
- caGrid Users Mailing List
