EDG Applications The European DataGrid Project Team


EU DataGrid - Applications 2 EDG Application Areas
- High Energy Physics
- Biomedical Applications
- Earth Observation Science Applications

EU DataGrid - Applications 3 High Energy Physics
- 4 experiments on LHC
- CMS, ATLAS, LHCb
- ~6-8 PetaBytes/year
- ~10^8 events/year
- ~10^3 batch and interactive users

EU DataGrid - Applications 4 CERN's Network in the World
- Europe: 267 institutes, 4603 users
- Elsewhere: 208 institutes, 1632 users

EU DataGrid - Applications 5 Data Flow in LHC

EU DataGrid - Applications 6 Example: CMS Monte Carlo Production

EU DataGrid - Applications 7 CMS jobs description
- CMKIN: MC generation of the proton-proton interaction for a physics channel (dataset)
- CMSIM: Detailed simulation of the CMS detector, processing the data produced during the CMKIN step
[Diagram: the CMKIN job writes its output data to a Grid Storage Element; the CMSIM job reads from the Grid Storage Element and writes its own output data to a Grid Storage Element]

Job      size/event   time*/event
CMKIN    ~0.05 MB     ~ sec
CMSIM    ~1.8 MB      ~6 min
* PIII 1 GHz, 512 MB (46.8 SI95)
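As a rough illustration of what the per-event figures on this slide imply, the sketch below scales them to a given number of events. The helper name `estimate_production`, the linear scaling, and the decimal MB-to-GB conversion are assumptions made for this example. The CMKIN time per event is not legible in the transcript, so only the CMSIM CPU time is estimated.

```python
# Per-event figures quoted on the slide (PIII 1 GHz reference machine).
CMKIN_MB_PER_EVENT = 0.05
CMSIM_MB_PER_EVENT = 1.8
CMSIM_MIN_PER_EVENT = 6.0


def estimate_production(n_events):
    """Return rough totals for producing n_events through CMKIN and CMSIM.

    Hypothetical helper: assumes every event passes through both steps and
    that output size and CPU time scale linearly with the event count.
    """
    return {
        "cmkin_output_gb": n_events * CMKIN_MB_PER_EVENT / 1000.0,
        "cmsim_output_gb": n_events * CMSIM_MB_PER_EVENT / 1000.0,
        "cmsim_cpu_hours": n_events * CMSIM_MIN_PER_EVENT / 60.0,
    }


if __name__ == "__main__":
    for key, value in estimate_production(1000).items():
        print(f"{key}: {value:.2f}")
```

On these assumptions, the CMSIM step dominates both storage and CPU time, which is consistent with the slides running CMSIM jobs close to their input data.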

EU DataGrid - Applications 8 CMS production components interfaced to EDG middleware
- Production is managed from the EDG User Interface with IMPALA/BOSS
- CMS Virtual Organization server at NIKHEF (Amsterdam)
[Diagram: CMS side: RefDB, BOSS DB, UI with IMPALA/BOSS. EDG side: Workload Management System, CEs with CMS software, SEs. Arrow labels: JDL, parameters, "Push data or info", "Pull info"]

EU DataGrid - Applications 9 CMS production components interfaced to EDG middleware
- CMKIN jobs running on all EDG Testbed sites with CMS software installed
- CMSIM jobs running on CE close to the input data
- Produced data: scripts for batch replication to a dedicated SE
[Diagram: CMS side: RefDB, BOSS DB, UI with IMPALA/BOSS. EDG side: Workload Management System, Replica Manager, CEs with CMS software, WN, SEs. Arrow labels: JDL, parameters, data registration, input data location, "Push data or info", "Pull info"]

EU DataGrid - Applications 10 CMS production components interfaced to EDG middleware
- Job monitoring and bookkeeping: BOSS DBs, EDG Logging & Bookkeeping service
[Diagram: CMS side: RefDB, BOSS DB, UI with IMPALA/BOSS. EDG side: Workload Management System, Replica Manager, CEs with CMS software, WN, SEs. Arrow labels: JDL, parameters, data registration, input data location, job output filtering, runtime monitoring, "Push data or info", "Pull info"]

EU DataGrid - Applications 11 CMS use of the system (Statistics)
[Plot: Events; number of events (Nb. of evts) versus time; labels: CEs, SEs]
Production within EDG is part of the Official CMS production
production/www/html/general/

EU DataGrid - Applications 12 Summary of CMS work and plans for use of EDG middleware
- RESULTS
  - We can distribute and run CMS s/w in the EDG environment
  - We have generated ~250K events for physics with ~10000 jobs in a 3-week period
- OBSERVATIONS and PLANNING for the future
  - We were able to quickly add new sites to provide extra resources
  - There was a fast turnaround in bug fixing and installing new software
  - The stress test was labor intensive (since the software was still under development)
  - Release EDG 2.0 should fix the major problems and allow for enhanced scalability, and we look forward to evaluating it and using it in our Data Challenge work
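The quoted totals can be turned into average rates with a few lines of arithmetic. This is only a back-of-the-envelope sketch: the inputs are the slide's approximate figures, "3 weeks" is taken as 21 days, and the function name `throughput` is an assumption for illustration.

```python
# Approximate totals quoted on the slide.
EVENTS = 250_000
JOBS = 10_000
DAYS = 3 * 7  # "3-week period", taken as 21 days


def throughput(events=EVENTS, jobs=JOBS, days=DAYS):
    """Average rates implied by the totals (hypothetical helper)."""
    return {
        "events_per_job": events / jobs,
        "jobs_per_day": jobs / days,
        "events_per_day": events / days,
    }


if __name__ == "__main__":
    for key, value in throughput().items():
        print(f"{key}: {value:.1f}")
```

With these inputs the averages come to 25 events per job and roughly 476 jobs per day.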

EU DataGrid - Applications 13 EDG EO challenge: Processing / validation of 1y of GOME data
- Level 1: Raw satellite data from the GOME instrument (~75 GB, ~5000 orbits/y)
- ESA (IT), KNMI (NL): Processing of raw GOME data to ozone profiles. 2 alternative algorithms, ~28000 profiles/day
- Level 2 (example of 1 day total O3)
- IPSL (FR): Validate some of the GOME ozone profiles (~10^6/y) coincident in space and time with ground-based measurements
- Visualization & analysis
- LIDAR data (7 stations, 2.5 MB per month)
- DataGrid environment

EU DataGrid - Applications 14 EO WebMap Portal

EU DataGrid - Applications 15 Processing Sequence
Components: Web Portal, EO Product Catalogue, EO Product Archive, EO Grid Engine, EO Replica Catalogue, EDG User Interface, EDG Resource Broker, EDG Computing Element, EDG Storage Element
1. Search Level-1 catalogue
2. Retrieve Level-2 products
3. Level-2 products already registered in RC?
   Yes? 4. Return available Level-2 products
   No? 5. Perform GRID processing on-the-fly
6. Transfer Level-1 data from Archive to the Grid
7. Register Level-1 data
8. Submit jobs to process Level-1 data
9. Process Level-1 data
10. Transfer Level-2 data to SE
11. Register Level-2 data
12. Return new Level-2 products
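The branch at step 3 of the processing sequence can be sketched as a small in-memory model. This is not the portal's implementation: the function `request_level2`, the dictionary stand-ins for the replica catalogue and the archive, and the logical file name scheme are all assumptions made for illustration. The step numbers recorded in `steps` follow the slide.

```python
def request_level2(orbit, replica_catalogue, archive, process_level1):
    """Return (level2_product, steps) for one orbit.

    replica_catalogue: dict mapping logical names to data (stand-in for RC + SE)
    archive:           dict mapping orbit ids to Level-1 data
    process_level1:    callable standing in for the grid job
    """
    # Steps 1-3: catalogue search and replica catalogue lookup. The
    # catalogue search itself is not modelled here, only recorded.
    steps = [1, 2, 3]
    l2_name = f"level2/{orbit}"  # hypothetical naming scheme

    if l2_name in replica_catalogue:       # "Yes?" branch
        steps.append(4)                    # 4. return available product
        return replica_catalogue[l2_name], steps

    steps.append(5)                        # "No?" branch: process on the fly
    level1 = archive[orbit]                # 6. transfer Level-1 from archive
    steps.append(6)
    replica_catalogue[f"level1/{orbit}"] = level1   # 7. register Level-1
    steps.append(7)
    level2 = process_level1(level1)        # 8-9. submit and run the job
    steps.extend([8, 9])
    replica_catalogue[l2_name] = level2    # 10-11. store on SE and register
    steps.extend([10, 11])
    steps.append(12)                       # 12. return the new product
    return level2, steps


if __name__ == "__main__":
    rc, archive = {}, {"orbit-1": [1, 2, 3]}
    print(request_level2("orbit-1", rc, archive, sum))  # processed on the fly
    print(request_level2("orbit-1", rc, archive, sum))  # served from catalogue
```

The second call for the same orbit takes the short path because the first call registered the Level-2 product, which is the point of checking the replica catalogue before submitting jobs.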

EU DataGrid - Applications 16 GOME Ozone Profile Validation
Goals of the DataGrid application: validate satellite data with all ground-based data available in an easy way:
- Comparison of ozone profiles provided by satellite with lidar data in different locations and times (see the web portal)
- Statistical comparison and analysis in order to improve algorithms
[Diagram: ERS/GOME satellite and the lidar at the Haute Provence Observatory observing the ozone layer (10 km to 50 km)]

EU DataGrid - Applications 17 Validation Processing Sequence
- GRID Portal: satellite data validation for a lidar site
- Lidar data catalogue: queries and data information retrieval from the Lidar metadata catalogue
- Level 2 Catalogue: queries and data information retrieval from the Gome Level 2 orbit or pixel metadata catalogues
- Submission of the job in the GRID: GRID Computing Element, Storage Elements with Lidar data, Storage Elements with Gome L2 data
- When completed: comparison between lidar and satellite ozone profiles

EU DataGrid - Applications 18 Validation Output
Figure 1: Estimation of the bias between Gome and Lidar using one month of data.
Figure 2: Example of 2 profiles: comparison between Gome profile and lidar profile for the 2nd October 2000.
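The slides do not say how the bias in Figure 1 is computed. One common choice, assumed here purely for illustration, is the mean difference (GOME minus lidar) at each altitude level over all coincident profile pairs. The function name `mean_bias` and the list-of-lists data layout are likewise hypothetical.

```python
from statistics import mean


def mean_bias(gome_profiles, lidar_profiles):
    """Mean (GOME - lidar) difference at each altitude level.

    Each argument is a list of profiles; a profile is a list of ozone
    values on a shared altitude grid. Profiles are paired by position,
    i.e. pair i is assumed to be coincident in space and time.
    """
    if len(gome_profiles) != len(lidar_profiles):
        raise ValueError("profiles must be paired one-to-one")
    n_levels = len(gome_profiles[0])
    return [
        mean(g[k] - l[k] for g, l in zip(gome_profiles, lidar_profiles))
        for k in range(n_levels)
    ]


if __name__ == "__main__":
    gome = [[2.0, 4.0], [4.0, 6.0]]    # made-up values, two altitude levels
    lidar = [[1.0, 4.0], [2.0, 5.0]]
    print(mean_bias(gome, lidar))
```

The sample values are invented to exercise the function and are not taken from the GOME or lidar datasets.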

EU DataGrid - Applications 19 Perspectives for Biomedical Applications
- Grids open new perspectives in large scale genomics analysis
  - Complete genome annotation
  - Cross-genomes analysis
  - Data mining on distributed databases
  - Pipelining of huge automatic bio-informatics analysis
- Medical image processing
  - Large databases processing
  - Anatomy and physiology modeling
  - Epidemiological studies

EU DataGrid - Applications 20 Biomedical Applications
- Bio-informatics
  - Phylogenetics: BBE Lyon (T. Sylvestre)
  - Search for primers: Centrale Paris (K. Kurata)
  - Statistical genetics: CNG Evry (N. Margetic)
  - Bio-informatics web portal: IBCP (C. Blanchet)
  - Parasitology: LBP Clermont, Univ B. Pascal (N. Jacq)
  - Data-mining on DNA chips: Karolinska (R. Médina, R. Martinez)
  - Geometrical protein comparison: Univ. Padova (C. Ferrari)
- Medical imaging
  - MR image simulation: CREATIS (H. Benoit-Cattin)
  - Medical data and metadata management: CREATIS (J. Montagnat)
  - Mammographies analysis: ERIC/Lyon 2 (S. Miguet, T. Tweed)
  - Simulation platform for PET/SPECT based on Geant4: GATE collaboration (L. Maigne)
Legend: Applications deployed; Applications tested on EDG; Applications under preparation

EU DataGrid - Applications 21 Medical Imaging
1. query
2. visualisation
3. similarity search
4. scores
5. best results visualisation
[Diagram: medical images and metadata (LFN, image, patient, hospital...)]

EU DataGrid - Applications 22 Graphic layer Job Monitoring Grid File Browsing File registration and retrieval

EU DataGrid - Applications 23 Graphical Interfaces
[Screenshot labels: Image registration; Image retrieval; Local files; Grid files; Metadata; Query over metadata; Query result]

EU DataGrid - Applications 24 Image Registration
[Diagram: Imager, SE, metadata (LFN, image, patient, hospital...)]

EU DataGrid - Applications 25 Similarity search
[Screenshot labels: Similarity computation; Results visualization; Job monitoring; Source image; Ranked list of images; Most similar images; Low score images]

EU DataGrid - Applications 26 Future: Interfacing medical data with the Grid
[Diagram labels]
- Medical (trusted) site: Imager, Medical server (core, grid-server interface, header blanking, encryption)
- Interfaces: Client 1 interface, Client 2 interface, RS interface, RC interface, Metadata interface
- Grid middleware: Storage Element, Replica Catalog, Replication Service, MSS, Master File, Replica
- File metadata: ACL, size, checksum...
- Application metadata: ACL, encryption key, sensitive metadata...

EU DataGrid - Applications 27 Parallel Processing
Magnetic Resonance Images simulation using the grid
3 levels of parallelism:
- Parallel isochromat computations
- Multi-slice MRI computation
- Parallel magnetization kernel
[Diagram labels: Virtual object; MRI sequence; Magnetisation computation kernel; Reconstruction algorithm; MRI Image]
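Of the three levels listed, the multi-slice level is the easiest to illustrate: each slice is independent, so slices can be computed concurrently and gathered in order. The sketch below uses a local thread pool as a stand-in for grid jobs. The function names are hypothetical, and `simulate_slice` is a placeholder that simply sums numbers; it is not the magnetisation kernel from the slides.

```python
from concurrent.futures import ThreadPoolExecutor


def simulate_slice(isochromats):
    """Placeholder kernel: sums per-isochromat signal contributions.

    A real magnetisation kernel would evolve each isochromat over the MRI
    sequence; this stand-in only marks where that work would go.
    """
    return sum(isochromats)


def simulate_volume(slices, max_workers=4):
    """Compute every slice independently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(simulate_slice, slices))


if __name__ == "__main__":
    volume = [[1, 2], [3, 4], [5]]   # three slices of made-up contributions
    print(simulate_volume(volume))
```

In a grid setting each slice would more plausibly be submitted as a separate job rather than a local thread; the thread pool is used here only so that the example stays self-contained.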

EU DataGrid - Applications 28 Summary
- Use Cases
  - High Energy Physics
  - Earth Observation
  - Biomedical Applications

EU DataGrid - Applications 29 Further Information
- High Energy Physics
- Bio-Informatics
- Earth Observation
