Presentation is loading. Please wait.

Presentation is loading. Please wait.

2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion.

Similar presentations


Presentation on theme: "2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion."— Presentation transcript:

1 2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion of technologies and techniques used in production Grids for Data Management – Identification of gaps and issues Erwin Laure & Miron Livny Workshop Organizers

2 EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Data Management in EGEE Erwin Laure EGEE Technical Director Erwin.Laure@cern.ch

3 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 3 EGEE Production Grid Infrastructure Steady growth over the lifetime of the project Improved reliability EGEE in a Nutshell Data Transfer MB/s 04/2006 08/2006

4 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 4 EGEE in a Nutshell >200 VOs from several scientific domains –Astronomy & Astrophysics –Civil Protection –Computational Chemistry –Comp. Fluid Dynamics –Computer Science/Tools –Condensed Matter Physics –Earth Sciences –Fusion –High Energy Physics –Life Sciences Further applications under evaluation 98k jobs/day Applications have moved from testing to routine and daily usage ~80-90% efficiency

5 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 5 gLite Middleware Distribution Combines components from different providers –Condor and Globus 2 (via VDT) –LCG –EDG/EGEE –Others After prototyping phases in 2004 and 2005 convergence with LCG-2 distribution reached in May 2006 –gLite 3.0 Focus on providing a deployable MW distribution for EGEE production service Currently working on gLite 3.1 –Major updates –Support for Scientific Linux 4, GT4 LCG-2 prototyping product 2004 2005 product gLite 2006 gLite 3.0

6 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 6 Application Example High Energy Physics 4 LHC Experiments: ALICE, ATLAS, CMS, LHCb 40 Million Particle collisions per second Online filter reduces to a few 100 “good” events per second recorded on disk and magnetic tape at 100-1,000 MegaBytes/sec ~15 PetaBytes per year for all four experiments Data analyzed by 100s of research groups world wide

7 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 7 Pre-SC4 April tests CERN  T1s – –SC4 target 1.6 GB/s reached – but only for one day –Sustained data rate 80% of the target But – experiment-driven transfers (ATLAS and CMS) sustained 50% of the SC4 target under much more realistic conditions CMS transferred a steady 1 PByte/month between Tier-1s & Tier-2s during a 90 day period ATLAS distributed 1.25 PBytes from CERN during a 6-week period 0.8 GBytes/sec 1.6 GBytes/sec 1.3 GBytes/sec LHC Data Distribution

8 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 8 Data Management Requirements for Biomedical Applications Strict security –ACLs at user and group level  Both on data and metadata (e.g. file catalogs) –Encryption Flexible authentication mechanisms –Access via portals Confidentiality –User not easily traceable (apart from auditing)

9 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 EGEE User Forum: The gLite File Transfer Service 9 Medical Imaging DICOM server Processing on Grid (Retrieval, analysis) Grid Interface User Interface Worker Node DICOM clients

10 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 10 Grid-enabled virtual docking Millions of potential drugs to test against interesting proteins! High Throughput Screening 1-10$/compound, several hours Data challenge on EGEE ~ 2 to 30 days on ~5000 computers Hits screening using assays performed on living cells Leads Clinical testing Drug Selection of the best hits Too costly for neglected disease! Molecular docking (FlexX, Autodock) ~1 to 15 minutes Targets: PDB: 3D structures Compounds: ZINC: 4.3M Chembridge: 500 000

11 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 11 Environment (2/2) Web Service Middleware API WISDOM Client Results Database Monitoring Database AMGA AMGA client Input location Result update Secure Results Store Computing Element Storage Element Resource Broker User Interface Glite Production Infrastructure AMGA client Input/Output Files transfer

12 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 12 EGEE Data Management CatalogingStorageData transfer Data Management User Tools (RLS) LFC SRM (Classic SE) gridftp RFIO/ DCAP

13 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 13 DM Interaction Overview File and Replica Catalog LFC Database WMS Storage Element SRM Storage GFALgridFTP File Transfer Service FTS Transfer Agent Database VOMS MyProxy Get credential Store credential File I/O File namespace and Metadata mgmt File replication Proxy renewalReplica Location

14 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 14 Encrypted Data Storage DICOM-SE SRMv2 gridftp I/O DICOM trigger Hydra KeyStore AMGA metadata image patient data file ACL keys 1. patient look-up 3. get TURL 2. keys 4. read GFAL

15 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 Erwin Laure - Data Mgmt in EGEE - HPDC workshop 2007 15 Some Main Issues Storage systems typically don’t throttle I/O –FTS used for coordinating source and destination and throttle transfers but still see problems with overloading source system SRM implementations show different behavior on errors –Problem for interfacing tools like FTS Scratch space on Worker Nodes is not managed –Impossible to find out how much scratch space would be available for a job –Job may leave garbage that fills up scratch space Consistent ACLs (DN based) among different storage systems and catalogs Interoperability between SRM based systems and other storage systems applications use (in particular SRB)


Download ppt "2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion."

Similar presentations


Ads by Google