Un sistema avanzato per la gestione di immagini biomedicali in ambiente GRID G. Aloisio, S. Fiore SPACI Consortium & University of Salento, Italy, Lecce.

Slides:



Advertisements
Similar presentations
Data Management Expert Panel - WP2. WP2 Overview.
Advertisements

CVRG Presenter Disclosure Information Tahsin Kurc, PhD Center for Comprehensive Informatics Emory University CardioVascular Research Grid Core Infrastructure.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Plateforme de Calcul pour les Sciences du Vivant SRB & gLite V. Breton.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
11 DICOM Image Communication in Globus-Based Medical Grids Michal Vossberg, Thomas Tolxdorff, Associate Member, IEEE, and Dagmar Krefting Ting-Wei, Chen.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
THE DICOM 2013 INTERNATIONAL CONFERENCE & SEMINAR March 14-16Bangalore, India DICOM Medical Image Management the Challenges and Solutions – Cloud as a.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
Seamless Medical Image Processing on the Grid on the Example of Segmentation and Partition of the Airspaces Andrzej Rutkowski 1, Michał Chlebiej 1, Marcelina.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
Grid Security Issues Shelestov Andrii Space Research Institute NASU-NSAU, Ukraine.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Group 7 Laura Antonelli William Bacchi Gerardo Giordano Pietro Hiram Guzzi Francisco Pinto Khalid Tijani The 3rd International Summer School on Grid Computing.
The Climate-G testbed towards a large scale data sharing environment for climate change S. Fiore Scientific Computing and Operations Division, CMCC, Italy.
Tools for collaboration How to share your duck tales…
The GRelC Project: architecture, history and a use case in the environmental domain G. Aloisio - S. Fiore The Climate-G testbed is an interdisciplinary.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
The Climate-G testbed towards a large scale data sharing environment for climate change S. Fiore Scientific Computing and Operations Division, CMCC, Italy.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
MEDIGRID project, DataGrid FR meeting, April 18, 2002, Johan Montagnat, WP10 ACI GRID 2002 MEDIGRID: high performance medical image processing on a computational.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
WP 10 ATF meeting April 8, 2002 Data Management and security requirements of biomedical applications Johan Montagnat - WP10.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
In Vivo Imaging Middleware and Applications RSNA 2007 Berkant Barla Cambazoglu The Ohio State University Department of Biomedical Informatics.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
REST API to develop application for mobile devices Mario Torrisi Dipartimento di Fisica e Astronomia – Università degli Studi.
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA GRIDICOM G. Foti 1), S. Scifo 2), R. Barbera 3), F. Portuese 4), S. Parisi 5)
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
A Data Engine for Grid Science Gateways Enabling Easy Transfers and Data Sharing Dr. Marco Fargetta (1), Mr. Riccardo Rotondo (2,*), Prof. Roberto Barbera.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann,
Enhancements to Galaxy for delivering on NIH Commons
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Grid based telemedicine application
Accessing the VI-SEEM infrastructure
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Grid Computing Security Mechanisms: the state-of-the-art
Vincenzo Spinoso EGI.eu/INFN
NA4 Medical Imaging Geneva, September 26, 2006 Johan Montagnat.
Cross-health enterprises Medical Data Management on the EGEE grid
Medical Data Manager use case: 3D medical images analysis workflow.
VirtualGL.
Joseph JaJa, Mike Smorul, and Sangchul Song
Grid Training done in/by the Italian Federation in 2007 Roberto Barbera Univ. of Catania and INFN NA3 Partner Review Meeting at EGEE’07 Budapest,
Introduction to Data Management in EGI
Similarities between Grid-enabled Medical and Engineering Applications
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
GSAF Grid Storage Access Framework
University of Technology
GSAF Grid Storage Access Framework
GRID COMPUTING PRESENTED BY : Richa Chaudhary.
Goals Introduce the Windows Server 2003 family of operating systems
Google Sky.
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
Presentation transcript:

Un sistema avanzato per la gestione di immagini biomedicali in ambiente GRID G. Aloisio, S. Fiore SPACI Consortium & University of Salento, Italy, Lecce

Outline Scenario Related Works Issues Medical Imaging Environment System Architecture Data Storage Workflow Management Metadata Management: GRelC Case Studies: CMCC & Climate-G Security Web Gateway Conclusions First what is Metadata? A common definition is that metadata is data about data. On the GRID, this is mainly information describing files that is necessary for running jobs, that is, file metadata. But So, in a way accessing metadata is mainly about accessing databases. But having clients going directly to the database is not the most convenient solution. Better than that, is to have a simple interface for metadata access on the GRID. This interface should be defined in terms of metadata concepts, like entries, keys and values, instead of DB concepts. This has several advantages. It is easier to use by clients, since it exposes only metadata concepts and effectively hides the database. Having a simple interface that reveals DB functionality solves most of the problems. Simplified relational database interface.

Scenario, issues and needs Huge amount of medical data produced by several Centres (i.e. Hospitals) Medical Interconnection Network: data sharing among several centres Data integration, digital libraries, sharing are FP7 keywords Need to move towards open, distributed and service-based environments Patients Challenging issues Security Data distribution Data format heterogeneity Metadata management Metadata schema Transparent access to the system Scalable approach …. Infrastructure Clinicians 3

Grid for Biomedical Imaging, 1999 Parallel Rendering Computational Grid Globus Pre Processing Raw data Server HPC Data Acquisition Collaborative working

Surrounding context MediGrid …and more…

Issues (high level) Medical image storage and processing TAC, MRI, fMRI, PET, SPECT DICOM image format Privacy issues Very large data sets handling storage management (PACS) Metadata management Security of data Process complex algorithms with large computing power and memory requirements parallel processing workflow Management System (WFMS) TAC MRI fMRI SPECT PET

AGIR - gLite based www.aci-agir.org

Medicus - Globus based http://dev.globus.org/wiki/Incubator/MEDICUS

Computational Providers Medical Imaging Environment Info Data Virtualization Info DICOM Images DICOM Images Computational Providers DICOM Images Cloud Environment Info Grid Environment

Main Issues (architectural level) Full security support (data security/privacy) Integration effort is needed to achieve high level results Transparent management of data & metadata Preservation of data locality (distribution of data) Distribution of metadata Access via Web Gateway Workflow management for medical purposes high level tools for end users medical imaging methods composition Grid support (storage, computation) on a large scale

Medical Imaging - Data Virtualization Info Data Virtualization Info DICOM Images DICOM Images Distributed Data & Metadata Locality/Autonomy Scalability Legacy systems (PACS) Security Metadata DB Encryption Authorization/policies/roles Data Storage Encryption Network Communication protection Interchange Protocols DICOM Images Info

Computational Providers Medical Imaging - Analysis Info Data Virtualization Info DICOM Images DICOM Images Computational Providers Analysis Distributed Grid Based Cloud Based Complex Medical Workflow Secure Data anonymisation Data Encryption Authorization/Authentication Cloud Environment Grid Environment

System Architecture Security Web Gateway Orchestrator/Collective Services Grid Middleware Metadata Service Data Service Comput. Service MediGrid Fabric Layer (data and metadata) 13

Medical imaging - Storage Management of Medical Images Secure on site management of DICOM images Encryption of data Anonymisation of data for analysis outside the centre Access control through authorization Storage interfaces between distributed environment and local storage devices Secure and efficient data transport protocols Backup of data ….

Biomedical Imaging Processing Issues Huge amount of data 1 radiology department: 10TB/year; 1 CT dataset: 500MB-2GB. Compute-intensive analysis Registration noisy; Difficulty in the segmentation; Volume Reconstruction. Need to explore several imaging analysis algorithms, validation methods, visualization pipelines Grid Workflow Management System Scheduler Grid Data Mng CE Node

Computational Providers Medical Imaging - Workflow Info Info DICOM Images DICOM Images Workflow Example Denoising Segmentation Rendering Cloud Environment Computational Providers Grid Environment

Metadata on the GRID Metadata is data about data Metadata enables search and discovery Metadata on the GRID Mainly information about files Usually stored on DBs Need simple SOA & Grid interfaces for Metadata access Advantages Easier to use by clients Robust paradigm Exploiting Grid Security paradigm Addressing Interoperability First what is Metadata? A common definition is that metadata is data about data. On the GRID, this is mainly information describing files that is necessary for running jobs, that is, file metadata. But So, in a way accessing metadata is mainly about accessing databases. But having clients going directly to the database is not the most convenient solution. Better than that, is to have a simple interface for metadata access on the GRID. This interface should be defined in terms of metadata concepts, like entries, keys and values, instead of DB concepts. This has several advantages. It is easier to use by clients, since it exposes only metadata concepts and effectively hides the database. Having a simple interface that reveals DB functionality solves most of the problems. Simplified relational database interface.

Why are metadata so important? Metadata is the key to manage, route and retrieve your data properly PACS systems can be “federated” exploiting metadata Search and discovery of “clinical case studies” can be afforded by metadata system Metadata information increases the global knowledge about available patient data From data to metadata and from metadata to knowledge to support Physicians for diagnostic and therapeutic purposes How to manage “efficiently”, “securely” and transparently” metadata? METADATA SERVICE

Metadata Service Metadata service Compatibility with distrib. systems/security Full security support for metadata authorization (policies, roles, etc.) authentication (based on grid certificates) Access/Integration of metadata sources Distributed vs centralized approach Grid Access interface Common interface for RDBMS and XML-DB Database backend independence GSI support

Metadata Management: Stack

GRelC Project (starting date 2001) Grid Relational Catalog (GRelC) is a project which aims at designing and developing a set of efficient, secure and transparent Data Grid Services DB XML

Grid Metadata Handling System: architecture in the small GRelC DAIS

International Testbeds Lecce (Italy) Bejing (China) GRelC Data Access Data Sources (DB) gandalf.unile.it Linux x86 sara.unile.it Mac OS X sigma2.unile.it Linux IA64 gridsurfer.unile.it FreeBSD galileo.hpcc.unical.it sepac00.projects.cscs.ch spacina.na.infn.it

Test Performance

CMCC: a fully distributed data grid environment Grid Layer VOMS Server CMCC Data Grid Portal Metadata Server (GRelC DAIS) Grid Storage GridFTP GRelC Storage CMCC CA P2P layer Data and Metadata resources RDBMS (Oracle,MySQL, Postgres) XML DBs (eXist, XIndice, etc.) Search & Discovery Two step process Grid SS Grid SS Grid SS

A new research effort: Climate-G The main goal of Climate-G is to create a unified environment for climate change, able to concentrate in the same context big amount of data geographically spread among several centres, rich metadata descriptions, efficient data access services, advanced data analysis and visualization tools, etc. exploiting and joining knowledge and skills in the fields of climate change and computational science 26

Climate-G partners Università del Salento 27

Metadata Distribution and virtualization For each site: Relational DB (index) XML DB (entire schema) Virtualization/Integration layer: GRelC DAIS Virtualization allows to conceal: Data distribution Number of sites, RDBMS and XML back-ends P2P Topology Data Integration aspects technological details …

Security What about security? Authentication (mutual process) Authorization Access control lists Role Membership Services Data encryption (to ensure secure data communication) Data anonymisation (to move data outside the Centers) A certificate for each actor (user, service, host) Each action is associated with an actor Firewall protection …

Medical Web Gateway Main Functionalities Search & Discovery Data access & viz Workflow management Users and roles mng …. Features Easy to use interfaces Platform independent Secured by design Integrated environment Data and … Analysis tools Visualization tools Post-processing tools

Computational Providers Medical Imaging Environment Info DICOM Images Computational Providers Cloud Environemnt Grid Environment Data Virtualization

Questions? First what is Metadata? A common definition is that metadata is data about data. On the GRID, this is mainly information describing files that is necessary for running jobs, that is, file metadata. But So, in a way accessing metadata is mainly about accessing databases. But having clients going directly to the database is not the most convenient solution. Better than that, is to have a simple interface for metadata access on the GRID. This interface should be defined in terms of metadata concepts, like entries, keys and values, instead of DB concepts. This has several advantages. It is easier to use by clients, since it exposes only metadata concepts and effectively hides the database. Having a simple interface that reveals DB functionality solves most of the problems. Simplified relational database interface.