ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.

Slides:



Advertisements
Similar presentations
©2006 University of Southampton IT Innovation Centre and other members of the SIMDAT consortium A SIMDAT Perspective on Grid Standards and Specifications.
Advertisements

The Next Generation Grid Kostas Tserpes, NTUA Beijing, 22 of June 2005.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
CVRG Presenter Disclosure Information Joel Saltz MD, PhD Director Comprehensive Informatics Center Emory University Translational Research Informatics.
CVRG Presenter Disclosure Information Tahsin Kurc, PhD Center for Comprehensive Informatics Emory University CardioVascular Research Grid Core Infrastructure.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Semantic Web Agents: Hope or Hype Nicholas Gibbins School of Electronics and Computer Science University of Southampton.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
Overview of Biomedical Informatics Rakesh Nagarajan.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Ontology Based Clinical Trial Builder Norbert Graf, Fatima Schera, Gabriele Weiler University of the Saarland, Fraunhofer Institute, Germany Clinical Trial.
CaGrid Service Metadata Scott Oster - Ohio State
The my Grid project aims to provide middleware layers that make the Information Grid appropriate for the needs of bioinformatics. my Grid is building high.
Advanced Data Mining and Integration Research for Europe ADMIRE – Framework 7 ICT ADMIRE Overview European Commission 7 th.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
1 FACS Data Management Workshop The Immunology Database and Analysis Portal (ImmPort) Perspective Bioinformatics Integration Support Contract (BISC) N01AI40076.
University of ViennaP. Brezany 1 Knowledge Discovery in Grid Datasets – Goals, Design Concepts and the Architecture Peter Brezany University of Vienna.
Developing Reusable Software Infrastructure – Middleware – for Multiscale Modeling Wilfred W. Li, Ph.D. National Biomedical Computation Resource Center.
Fighting Cancer with the (help of) Grid Jarek Nabrzyski, PSNC
Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois Shannon Hastings Department of Biomedical Informatics Ohio State University.
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
PrognoChip Mediator, FORTH-ICS 1 Integrating Clinical and Genomic Information through the PrognoChip Mediator Implemented by Manos Kalaitzakis, Dimitris.
Integrated Biomedical Information for Better Health Workprogramme Call 4 IST Conference- Networking Session.
EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM.
1 A National Virtual Specimen Database for Early Cancer Detection June 26, 2003 Daniel Crichton NASA Jet Propulsion Laboratory Sean Kelly NASA Jet Propulsion.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
The National Biomedical Imaging Archive (NBIA) In Action: An Introduction for Users A Tool Demonstration from caBIG® Presented by: Eliot Siegel, MD Maryland.
Jarek Nabrzyski, Ariel Oleksiak Comparison of Grid Middleware in European Grid Projects Jarek Nabrzyski, Ariel Oleksiak Poznań Supercomputing and Networking.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Middleware Support for Virtual Organizations Internet 2 Fall 2006 Member Meeting Chicago, Illinois Stephen Langella Department of.
SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
INFSO-RI Enabling Grids for E-sciencE Biomedical applications V. Breton, CNRS-IN2P3.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
©2008 University of Southampton IT Innovation Centre and other members of the SIMDAT Consortium SIMDAT Grid Technology Mike Boniface
CaGrid Overview and Core Services caGrid Knowledge Center February 2011.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Enabling the Future Service-Oriented Internet (EFSOI 2008) Supporting end-to-end resource virtualization for Web 2.0 applications using Service Oriented.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
Interoperability from the e-Science Perspective Yannis Ioannidis Univ. Of Athens and ATHENA Research Center
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Databases, Ontologies and Text mining Session Introduction Part 2 Carole Goble, University of Manchester, UK Dietrich Rebholz-Schuhmann, EBI, UK Philip.
Data Integration in Bioinformatics Using OGSA-DAI The BioDA Project Shirley Crompton, Brian Matthews (CCLRC) Alex Gray, Andrew Jones, Richard White (Cardiff.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
The financial support of the European Commission is gratefully acknowledged. Material in this presentation reflects only the author’s views and the Commission.
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
In Vivo Imaging Middleware and Applications RSNA 2007 Berkant Barla Cambazoglu The Ohio State University Department of Biomedical Informatics.
1 The Virtual Physiological Human ToolKit Jonathan Cooper University of Oxford On behalf of the ToolKit team  VPH NoE, 2009 All Hands Meeting 2009 Oxford.
Data and storage services on the NGS.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
High throughput biology data management and data intensive computing drivers George Michaels.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
IFIC Grid Course, Valencia , Particle Therapy Motivation Project description Legal and ethical framework Status Faustin.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
0 caBIG and caGrid: Interoperable Computing Infrastructure for the Nation’s [and World’s] Cancer Research Enterprise Peter A. Covitz, Ph.D. Chief Operating.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
WP1:Definition & Production of the GRDI2020 Roadmap Roadmap Report To address the Technological, Organizational and Policy problems which hinder the building.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
EUDAT: collaborative pan-European infrastructure providing research data services, training and consultancy This work is licensed.
Databases, Ontologies and Text mining Session Introduction Part 2
Similarities between Grid-enabled Medical and Engineering Applications
Grid Application Model and Design and Implementation of Grid Services
Presentation transcript:

ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH

The ACGT vision & principles The ultimate objective of the ACGT project is the provision of a unified technological infrastructure which will facilitate integrated access to multi-level biomedical data development or re-use of open source analytical tools, accompanied with the appropriate meta-data allowing their discovery and orchestration into complex workflows. ACGT will deliver a European Biomedical GRID infrastructure offering seamless mediation services for sharing data and data- processing methods and tools, and advanced security; ACGT focuses on clinical trials on Cancer (Wilms tumor, Breast) and is based on the principles of Open access (among trusted partners) Open source is not a standards generating exercise but a standards adopting one.

Enabling dynamic Virtual Organizations Ontologies and mediation tools Basic GRID technology and security D D D U U U D D Clinical Trials … U User Data and Public Databases Layer Knowledge Discovery Tools D D Simulation and Visualization Tools U D D D D Distributed multilevel Biomedical Data The ACGT Integration Layer, theACGT Tools and Services User Applications and services layer in support of U

The ACGT Virtual Organizations Grid Portal V O V irtual O rganizations Grid Services Infrastructure (VO Manag., Metadata, Registry, Publishing, Query, Invocation, Security, etc.) Tool 1 Tool 2 Grid Data Service Analytical Services Clinical data Research Center Microarray Grid Data Service Image Tool 2 Tool 3 Research Center Analytical Services Grid Data Service Grid-Enabled Client Research Center Gene Database Protein Database Tool 3 Tool 4 Grid Data Services Analytical Services Grid Data Service Public data & tools Tool n

Discovery and Orchestration of Services 2D/3D visualization for in silico models ACGT experiment topology Microarray data processing for molecular classification of disease Analytical Services Data access Services DM/KDD and visualization services

The ACGT clinical trials Multicentric TOP trial – Breast Cancer SIOP 2002 – paediatric nephroblastoma In Silico modeling and simulation of tumor growth & response to treatment Breast Ca Multicentric Oncogenomic Study 1 Nephroblastoma Multi-centric Oncogenomic Study 2 UoC UoO UoS JBI IEO FORTH In Silico Oncology Study 3 Prolipsis

Main challenges in ACGT Grid middleware services, enabling large-scale (semantic, structural, and syntactic) interoperation among biomedical resources and services; Master ontology (on Cancer) through semantic modelling of biomedical concepts using existing ontologies and ontologies developed for the needs of the project; Open source bioinformatic tools and other analytical services; Semantic annotation and advertisement of biomedical resources, to allow metadata-based discovery and query of tools, and services; Orchestration of data access and analytical services into complex eScience workflows for post genomic clinical research and trials on cancer; Meta-data descriptions of clinical trials to provide adequate provenance information for future re-use, comparison, and integration of results;

Major Challenge: Semantic Interoperability The bottleneck is not so much about: computational needs, the volume of data, or performance issues in accessing/transferring data; It’s integration and semantic interoperability;

Data Integration Impediments Heterogeneity Syntactic: Relational (SQL) Databases, web accessible databases, … Structural: Different schemas and formats Semantic: Different vocabularies and semantics Security related: Different access policies: some data sources require authentication, whereas others are public Sensitive and confidential data: patient names or other identifying traits should be hidden (anonymization, pseudonymization)

Required Services The primary services required for supporting the identified scenarios fall into four categories: services for access and retrieval of data, that is: internal phenotypical (clinical and imaging) DBs and other “-omic” DBs, as well as external biomedical databases; services that are the analytical and visualization tools, that is: computational analysis, simulations, knowledge extraction, exposed as Grid (web) services; services for forming and executing eScience Workflows, that is: workflow management services, information management services, and distributed database query processing; semantic services for discovering services and workflows, and managing metadata, such as: ontologies metadata provenance

The ACGT Consortium Funding: ~18 MEuro, Time plan: 1/2/2006 – 31/1/2010

Thank you!