NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Slides:



Advertisements
Similar presentations
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
Advertisements

LNL CMS M.Biasotto, Bologna, 29 aprile LNL Analysis Farm Massimo Biasotto - LNL.
Grid Data Management Assaf Gottlieb - Israeli Grid NA3 Team EGEE is a project funded by the European Union under contract IST EGEE tutorial,
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
SEE-GRID-2 The SEE-GRID-2 initiative is co-funded by the European Commission under the FP6 Research Infrastructures contract no
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
Ninth EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
EGEE-II INFSO-RI Enabling Grids for E-sciencE gLite Data Management System Yaodong Cheng CC-IHEP, Chinese Academy.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Computational grids and grids projects DSS,
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware Data Management in gLite.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) GISELA Additional Services Diego Scardaci
Group 1 : Grid Computing Laboratory of Information Technology Supervisors: Alexander Ujhinsky Nikolay Kutovskiy.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
Jan 31, 2006 SEE-GRID Nis Training Session Hands-on V: Standard Grid Usage Dušan Vudragović SCL and ATLAS group Institute of Physics, Belgrade.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
CGW 04, Stripped replication for the grid environment as a web service1 Stripped replication for the Grid environment as a web service Marek Ciglan, Ondrej.
E-science grid facility for Europe and Latin America Data Management Services E2GRIS1 Rafael Silva – UFCG (Brazil) Universidade Federal.
INFSO-RI Enabling Grids for E-sciencE A Grid Approach to Distributed Image Analysis for Early Diagnosis of Alzheimer Disease Livia.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
SEE-GRID-SCI Storage Element Installation and Configuration Branimir Ackovic Institute of Physics Serbia The SEE-GRID-SCI.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
1 Egrid portal Stefano Cozzini and Angelo Leto. 2 Egrid portal Based on P-GRADE Portal 2.3 –LCG-2 middleware support: broker, CEs, SEs, BDII –MyProxy.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
INFSO-RI Enabling Grids for E-sciencE University of Coimbra GSAF Grid Storage Access Framework Salvatore Scifo INFN of Catania EGEE.
User Interface UI TP: UI User Interface installation & configuration.
Stephen Burke – Sysman meeting - 22/4/2002 Partner Logo The Testbed – A User View Stephen Burke, PPARC/RAL.
12th EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Istituto Nazionale di Astrofisica Information Technology Unit INAF-SI Job with data management Giuliano Taffoni.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Algiers, EUMED/Epikh Application Porting Tutorial, 2010/07/04.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) LFC Installation and Configuration Dong Xu IHEP,
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
GRID commands lines Original presentation from David Bouvet CC/IN2P3/CNRS.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Data Management Maha Metawei
User Domain Storage Elements SURL  TURL LFC Domain (LCG File Catalogue) SA1 – Data Grid Interoperation Enabling Grids for E-sciencE EGEE-III INFSO-RI
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Scuola Grid INFN, Trieste, 1-12 Dic Managing Confidential Data in the gLite Middleware – The Secure Storage.
Riccardo Zappi INFN-CNAF SRM Breakout session. February 28, 2012 Ingredients 1. Basic ingredients (Fabric & Conn. level) 2. (Grid) Middleware ingredients.
GFAL Grid File Access Library
GFAL Grid File Access Library
GFAL: Grid File Access Library
The EDG Testbed Deployment Details
StoRM: a SRM solution for disk based storage systems
Java API del Logical File Catalog (LFC)
Scuola Grid INFN, Martina Franca, Nov
gLite Data management system overview
Roberto Barbera (a nome di Livia Torterolo)
Attività grid di Biomedicina in Italia e in Europa
GSAF Grid Storage Access Framework
Data Management Ouafa Bentaleb CERIST, Algeria
Architecture of the gLite Data Management System
INFNGRID Workshop – Bari, Italy, October 2004
Presentation transcript:

NA4 Generic Applications Meeting January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis of Alzheimer Disease Livia Torterolo

Contents Introduction to SPM analysis for diagnosis in ADIntroduction to SPM analysis for diagnosis in AD Development of SPM service through a web portalDevelopment of SPM service through a web portal GRID implementation of SPM serviceGRID implementation of SPM service Current Status and PlansCurrent Status and Plans

Partners GRID node has been installed at BioLab, University of Genoa. The first part of work has been done in collaboration with: San Raffaele of Milano San Raffaele of Milano University of Milano – Bicocca University of Milano – Bicocca GRID implementation has carried out by Bio-Lab team.

Contents Introduction to SPM analysis for diagnosis in ADIntroduction to SPM analysis for diagnosis in AD Development of SPM service through a web portalDevelopment of SPM service through a web portal GRID implementation of SPM serviceGRID implementation of SPM service Current Status and PlansCurrent Status and Plans

Introduction to Alzheimer Disease AD is the most common form of dementia, accounting for more than half of all dementias in elderly peopleAD is the most common form of dementia, accounting for more than half of all dementias in elderly people Clinically, AD is characterized by a progressive loss of cognitive abilitiesClinically, AD is characterized by a progressive loss of cognitive abilities Memory loss is typically the earliest sign of ADMemory loss is typically the earliest sign of AD

Analysis of PET images Conventional Reporting: qualitative analysis of flux images Qualitative analysis of images doesn’t reveal anomalies in the distribution of the tracer in cortical, subcortical and cerebellar regions. The metabolic study is in the normal range. Qualitative analysis of images reveals a reduction in the accumulation of the tracer in correspondence of the inferior parietal lobe, bilaterally. Hypometabolism in these regions. Qualitative analysis of images doesn’t reveal anomalies in the distribution of the tracer in cortical, subcortical and cerebellar regions. The metabolic study is in the normal range. F18-FDGTc99-ECD Qualitative analysis of images reveals a reduction in the accumulation of the tracer in correspondence of the inferior parietal lobe, bilaterally. Parietal hypoperfusion.

SPM (Statistical Parametric Mapping ) Functionality: Image displayImage display Image segmentationImage segmentation Image realignment and co-registrationImage realignment and co-registration Nonlinear spatial normalizationNonlinear spatial normalization SmoothingSmoothing Statistical analysis – parameter estimation {GLM}Statistical analysis – parameter estimation {GLM} Statistical analysis – interface {TGF}Statistical analysis – interface {TGF} Results display (Graphical, tabular and image format)Results display (Graphical, tabular and image format) by Institute of Neurology at University College London

Contents Introduction to SPM analysis for diagnosis in ADIntroduction to SPM analysis for diagnosis in AD Development of SPM service through a web portalDevelopment of SPM service through a web portal GRID implementation of SPM serviceGRID implementation of SPM service Current Status and PlansCurrent Status and Plans

Why a portal? SPM requirements: the knowledge of functions used in the image analysis in order to provide the correct values of parameters and in order to understand the resultsthe knowledge of functions used in the image analysis in order to provide the correct values of parameters and in order to understand the results a large set of images of normal patients to be used for comparison during the statistical parametric mapping. The accuracy of ipoperfusion maps is strictly related to the number of normal studiesa large set of images of normal patients to be used for comparison during the statistical parametric mapping. The accuracy of ipoperfusion maps is strictly related to the number of normal studies

SET of CONTROLS 1 (PET, SPECT IMAGES) STATISTICAL TOOL (SPM) SET of CONTROLS 2 (PET, SPECT IMAGES) SET of CONTROLS 3 (PET, SPECT IMAGES) SET of CONTROLS n (PET, SPECT IMAGES) IMAGE of PATHOLOGICAL SUBJECT (PET or SPECT IMAGE) STATISTICAL ANALYSIS OF THE UPLOADED IMAGE Web-based application

Design of the application’s structure Image Acquisition Image Transfer Parameter fitting for the statistical analysis Maps Visualization Management Node User Node Porting on GRID Query to data catalogue Job submission Statistical Analysis Results transfer Repository Node Extraction of image information Results transfer Extraction of image information Results transfer 2 Repository Node 1

Contents Introduction to SPM analysis for diagnosis in ADIntroduction to SPM analysis for diagnosis in AD Development of SPM service through a web portalDevelopment of SPM service through a web portal GRID implementation of SPM serviceGRID implementation of SPM service Current Status and PlansCurrent Status and Plans

SPM: Why GRID? Doctors would have an easy access to normal PET/SPECT databases without moving images from hospitalsDoctors would have an easy access to normal PET/SPECT databases without moving images from hospitals Waiting time for statistical parametric mapping depends on the number of normal images stored in every database It is very important to split this application into independent jobs and to run them at the same time to increase software performancesWaiting time for statistical parametric mapping depends on the number of normal images stored in every database It is very important to split this application into independent jobs and to run them at the same time to increase software performances

GILDA testbed GILDA testbed:

LCG Installation Installation and Set up of the LCG Grid Environment v Installation and Set up of the LCG Grid Environment v machines are dedicated to LCG tests: 1 CE, 1 SE, 3 WNs, 1 UI, 1 LCG install server 12 CPUs, 7Giga (RAM), ≈ 400Giga of storage BioLab site is now on maintainance to upgrade from LCG to LCG 2.6.0BioLab site is now on maintainance to upgrade from LCG to LCG 2.6.0

DIST – GENOVA site

LCG node: Technical characteristics (I) is powered by an autonomous power line and a uninterruptible power supply with a capacity of 3000 VA is providedis powered by an autonomous power line and a uninterruptible power supply with a capacity of 3000 VA is provided the UPS is big enough to maintain online the node for about ‘the UPS is big enough to maintain online the node for about ‘ storage element has a redundant power supply and redundant hot-swap cooling system (fan)storage element has a redundant power supply and redundant hot-swap cooling system (fan) storage is provided by 4 U320 scsi rpm disks (about 400 GByte) and a raid 5 controller with 128 mb r/w onboard cachestorage is provided by 4 U320 scsi rpm disks (about 400 GByte) and a raid 5 controller with 128 mb r/w onboard cache

LCG node: Technical characteristics(II) network is provided by a 24 port Gigabit switch and each node has a dual gigabit network adapter (only one channel is used by now)network is provided by a 24 port Gigabit switch and each node has a dual gigabit network adapter (only one channel is used by now) node is connected to Internet through the local department LANnode is connected to Internet through the local department LAN 1Gbit/sec connection to the GARR POP in Genoa1Gbit/sec connection to the GARR POP in Genoa the University of Genoa is connected to the Milan Internet Exchange (Internet backbone access point) with a 155 Mbps linethe University of Genoa is connected to the Milan Internet Exchange (Internet backbone access point) with a 155 Mbps line

Installation phase with this type of hardware (brand HP Proliant DL 140 and DL 380 server) we didn't have any troubleswith this type of hardware (brand HP Proliant DL 140 and DL 380 server) we didn't have any troubles node’s installation has been performed without problemsnode’s installation has been performed without problems very good system administration support from staff in Cataniavery good system administration support from staff in Catania

Development phase: Migration of the application to LCG Main objectives: to distribute PET/SPECT images on different Storage Elements available on the GRID and register data - with related metadata - on LCG File Catalog (LFC).to distribute PET/SPECT images on different Storage Elements available on the GRID and register data - with related metadata - on LCG File Catalog (LFC). to access images from User Interface using logical file names (LFN) without copying them on Worker Nodes.to access images from User Interface using logical file names (LFN) without copying them on Worker Nodes.

Data Management and File Access

LCG_utils CLI:CLI: lcg-* commands as lcg-cp, lcg-cr, lcg-del, lcg-la, lcg-lr, ecc…. C API:C API: int lcg_cp (char *src_file, char *dest_file, char *vo, int nbstreams, char * conf_file, int insecure, int insecure); int lcg_cp (char *src_file, char *dest_file, char *vo, int nbstreams, char * conf_file, int insecure, int insecure); int lcg_del (char *file, int aflag, char *se, char *vo, char *conf_file, int insecure, int verbose); int lcg_del (char *file, int aflag, char *se, char *vo, char *conf_file, int insecure, int verbose);…… Used to store application’s data and images on distributed storage elements and register them in LFC catalog

Grid File Access Library (GFAL) GFAL provides calls for catalog interaction, storage management and file access and can be very handy when an application requires access to some part of a big Grid file but does not want to copy the whole file locally we used GFAL API to access to distributed images without copying them locally

GFAL: File I/O API Examples: gfal_openint gfal_open (const char * filename, int flags, mode_t mode); gfal_readssize_t gfal_read (int fd, void *buf, size_t size); gfal_closeint gfal_close (int fd);….. GFAL accepts GUIDs, LFNs, SURLs and TURLs as file names, and, in the first two cases, it tries to find the closest replica of the file. the application code has been modified to access images using their logical file names

A GFAL call

LCG File Catalog (LFC) Functionalities: Use of lcg_utilsUse of lcg_utils Use of GFAL callsUse of GFAL calls Use of GSI certificationUse of GSI certification Access to grid file in SEs from “anywhere”Access to grid file in SEs from “anywhere” Several replicas of files in different sitesSeveral replicas of files in different sites Copy of data from/to local file system to GRIDCopy of data from/to local file system to GRID

Re-Design of the application With reference to different LCG tools mentioned above, application code has been modified and structured in the following way: Registration and storage of data files (PET/SPECT images) on SEs available using lcg_utils. Registration and storage of data files (PET/SPECT images) on SEs available using lcg_utils. Development of a C program with GFAL C API in order to access distributed images using their LFNs and to extract some information necessary to SPM analysis without copying them locally. Development of a C program with GFAL C API in order to access distributed images using their LFNs and to extract some information necessary to SPM analysis without copying them locally. Job Submission: creation of a JDL file to submit the executable (and not the images) with GFAL call to the GRID. Job Submission: creation of a JDL file to submit the executable (and not the images) with GFAL call to the GRID. Statistical Analysis: running of SPM analysis from results obtained from job submission. Statistical analysis is performed outside GRID environment. Statistical Analysis: running of SPM analysis from results obtained from job submission. Statistical analysis is performed outside GRID environment.

Final GRID structure

Problems If you want to access a file on a classic SE using GFAL from your UI, you won't be able to do this from the UI (because of insecure RFIO that needs uid correspondance), but you should be able to do it from a WN in the same site (there is indeed such correspondance between uids in WNs and in SEs). So the options are: 1. Copy the file with gridftp to your UI, then access it locally 2. Send your application (that contains the GFAL calls) in a job to a CE in the same site as the SE where the file you want to access sits.If you want to access a file on a classic SE using GFAL from your UI, you won't be able to do this from the UI (because of insecure RFIO that needs uid correspondance), but you should be able to do it from a WN in the same site (there is indeed such correspondance between uids in WNs and in SEs). So the options are: 1. Copy the file with gridftp to your UI, then access it locally 2. Send your application (that contains the GFAL calls) in a job to a CE in the same site as the SE where the file you want to access sits. Support for Metadata using LFC catalog:Support for Metadata using LFC catalog: There is an only field for metadata There is an only field for metadata No way to do metadata queries No way to do metadata queries

Contents Introduction to SPM analysis for diagnosis in ADIntroduction to SPM analysis for diagnosis in AD Development of SPM service through a web portalDevelopment of SPM service through a web portal GRID implementation of SPM serviceGRID implementation of SPM service Current Status and PlansCurrent Status and Plans

Current Status and Plans Integration of GRID application with neuroinformatics web interfaceIntegration of GRID application with neuroinformatics web interface To find a better way to manage metadataTo find a better way to manage metadata Integrate the application in GENIUSIntegrate the application in GENIUS To value the possibility of parallelize the statistical analysisTo value the possibility of parallelize the statistical analysis Thanks for your attention!