EGEE is a project funded by the European Union under contract IST-2003-508833 NA4 Applications F.Harris(Oxford/CERN) NA4/HEP coordinator www.eu-egee.org.

Slides:



Advertisements
Similar presentations
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
Advertisements

EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Roberto Barbera EGEE NA4 Generic Applications coordinator.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
EGEE is proposed as a project funded by the European Union under contract IST The EGEE International Grid Infrastructure and the Digital Divide.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
EGEE (EU IST project) – Networking Activities 4: biomedical applications NA4 Biomedical Activities Johan Montagnat First EGEE Meeting Cork,
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
CMS Report – GridPP Collaboration Meeting VI Peter Hobson, Brunel University30/1/2003 CMS Status and Plans Progress towards GridPP milestones Workload.
Frédéric Hemmer, CERN, IT DepartmentThe LHC Computing Grid – October 2006 LHC Computing and Grids Frédéric Hemmer IT Deputy Department Head October 10,
IST E-infrastructure shared between Europe and Latin America Biomedical Applications in EELA Esther Montes Prado CIEMAT (Spain)
EGEE is a project funded by the European Union under contract IST Generic Applications: strategy, organization, tools and manpower Roberto.
A short introduction to the Worldwide LHC Computing Grid Maarten Litmaath (CERN)
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 25 th April 2012.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
EGEE is proposed as a project funded by the European Union under contract IST EU eInfrastructure project initiatives FP6-EGEE Fabrizio Gagliardi.
INFSO-RI Enabling Grids for E-sciencE EGEE and Industry Bob Jones EGEE-II Project Director Final EGEE Review CERN, May 2006.
7April 2000F Harris LHCb Software Workshop 1 LHCb planning on EU GRID activities (for discussion) F Harris.
SouthGrid SouthGrid SouthGrid is a distributed Tier 2 centre, one of four setup in the UK as part of the GridPP project. SouthGrid.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
SA1/SA2 meeting 28 November The status of EGEE project and next steps Bob Jones EGEE Technical Director EGEE is proposed as.
EGEE is a project funded by the European Union under contract IST HEP Use Cases for Grid Computing J. A. Templon Undecided (NIKHEF) Grid Tutorial,
EGEE is a project funded by the European Union under contract IST Middleware Planning for LCG/EGEE Bob Jones EGEE Technical Director e-Science.
Ian Bird LHC Computing Grid Project Leader LHC Grid Fest 3 rd October 2008 A worldwide collaboration.
…building the next IT revolution From Web to Grid…
Tony Doyle - University of Glasgow 8 July 2005Collaboration Board Meeting GridPP Report Tony Doyle.
Induction: Adding new applications to EGEE infrastructure –April 26-28, Adding new applications to EGEE infrastructure Roberto Barbera EGEE is.
The LHC Computing Grid – February 2008 The Challenges of LHC Computing Dr Ian Bird LCG Project Leader 6 th October 2009 Telecom 2009 Youth Forum.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
Ian Bird LCG Deployment Area Manager & EGEE Operations Manager IT Department, CERN Presentation to HEPiX 22 nd October 2004 LCG Operations.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
LCG ARDA status Massimo Lamanna 1 ARDA in a nutshell ARDA is an LCG project whose main activity is to enable LHC analysis on the grid ARDA is coherently.
INFSO-RI Enabling Grids for E-sciencE User Survey Objectives and Results F.Jacq CNRS-IN2P3 EGEE Conference - Athens 21 th April.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
EGEE is a project funded by the European Union under contract IST Presentation of NA4 Generic Applications Roberto Barbera NA4 Generic Applications.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Torsten.
NA4 all activity meeting V. Breton on behalf of NA4.
INFSO-RI Enabling Grids for E-sciencE GILDA and GENIUS Guy Warner NeSC Training Team An induction to EGEE for GOSC and the NGS NeSC,
HEP Applications within EGEE - an overview and relations with GAG F Harris (Oxford/CERN) EGEE is proposed as a project funded by the European Union under.
EGEE is a project funded by the European Union under contract IST Generic Applications in EGEE-NA4 Roberto Barbera NA4 Generic Applications.
INFSO-RI Enabling Grids for E-sciencE All activity meeting Vincent Breton On behalf of NA4.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
EGEE is a project funded by the European Commission under contract IST NA4/HEP work F Harris (Oxford/CERN) M.Lamanna(CERN) NA4 Open meeting.
LHC Computing, CERN, & Federated Identities
Guy Wormser IN2P3/CNRS, EGEE Applications Manager September 2003 EGEE is proposed as a project funded by the European Union under contract IST
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
EGEE is a project funded by the European Union under contract IST EGEE Summary NA2 Partners April
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
NA4 all activity meeting V. Breton on behalf of NA4.
EGEE is a project funded by the European Union under contract IST EGEE Networking Activities Mike Mineter Training team National e-Science.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
Organisation of HEP Applications within NA4 F Harris (Oxford/CERN) EGEE is proposed as a project funded by the European Union under contract IST
INFSO-RI Enabling Grids for E-sciencE NA4 status and issues.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Training in EGEE-II Mike Mineter (Some slides from Brendan Hamill)
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA: a status report Roberto Barbera NA4 Generic Applications.
EGEE is a project funded by the European Union under contract IST Generic Applications Requirements Roberto Barbera NA4 Generic Applications.
INFSO-RI Enabling Grids for E-sciencE GILDA t-Infrastructure Antonio Fuentes Bermejo
First South Africa Grid Training June 2008, Catania (Italy) GILDA t-Infrastructure Valeria Ardizzone INFN Catania.
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Guy Warner NeSC Training Team Induction to Grid Computing.
EGEE all-activity meeting: status of NA4 V. Breton on behalf of NA4.
EGEE is a project funded by the European Union under contract IST NA4/NA2 activities Roberto Barbera TB INFN Grid,
Bob Jones EGEE Technical Director
JRA3 Introduction Åke Edlund EGEE Security Head
The LHC Computing Grid Visit of Mtro. Enrique Agüera Ibañez
SA1 Execution Plan Status and Issues
Ian Bird GDB Meeting CERN 9 September 2003
NA4 Applications F.Harris(Oxford/CERN) NA4/HEP coordinator
LHC Data Analysis using a worldwide computing grid
Collaboration Board Meeting
Future EU Grid Projects
Presentation transcript:

EGEE is a project funded by the European Union under contract IST NA4 Applications F.Harris(Oxford/CERN) NA4/HEP coordinator

EGEE Induction May Talk Outline The basic goals of NA4 The organisation The participants and their roles The flavour of the work for the NA4 sub-groups  Biomed  HEP  ‘Generic’ applications  Testing  Industry Forum Milestones and deliverables Relations with other EGEE activities Concluding comments

EGEE Induction May NA4: Identification and support of early-user and established applications on the EGEE infrastructure To identify through the dissemination partners and a well defined integration process a portfolio of early user applications from a broad range of application sectors from academia, industry and commerce. To support development and production use of all of these applications on the EGEE infrastructure and thereby establish a strong user base on which to build a broad EGEE user community. To initially focus on two well-defined application areas – Particle Physics and Life sciences, while developing a process for supporting other application areas

EGEE Induction May NA4 organisational structure

EGEE Induction May NA4 technical organization NA4 AWG (V. Breton) LCGEGEE PEB HEP F. Harris M. Lamanna Biomed J. Montagnat C. Blanchet Generic R. Barbera ARDA Data challenges Biomed technical team Generic technical team Test team R. Météry Eric Fede

EGEE Induction May Roles and staffing Federation Role FTE Funded FTE Unfunded CERNHEP Applications (coord.) 44 (9) UK+Ireland NA3 Liaison 0,5 ItalyGeneric app (coord) 22 FranceGeneral coord., BioMed, Test team, Industry diss. 77 Northern Europe Generic applications 11 Germany + Switzerland Generic applications 11 Central Europe Generic applications 11 South West Europe BioMed 22 Russia HEP, BioMed 33 Totals 21,5

EGEE Induction May What do all applications expect from the grid? Access to a world-wide virtual computing laboratory with vast resources Possibility to organise in VOs(virtual organisations) with members being given access rights according to their roles in the VO, and facilities to protect data Transparency in data and job management via easy to use application interfaces Definite added value in performance for both interactive and batch computation

EGEE Induction May Biomedical Applications Some key applications and their characteristics Bioinformatics: gene/proteome databases distributions Medical applications (screening, epidemiology...): image databases distribution Parallel algorithms for medical image processing, simulation, etc Interactive application (human supervision or simulation) Security/privacy constraints Heterogeneous data formats (genomics, proteomics, image formats) Frequent data updates Complex data sets (medical records) Long term archiving requirements

EGEE Induction May BLAST – comparing DNA or protein sequences BLAST is the first step for analysing new sequences: to compare DNA or protein sequences to other ones stored in personal or public databases. Ideal as a grid application. Requires resources to store databases and run algorithms Can compare one or several sequence against a database in parallel Large user community

EGEE Induction May BLAST gridification DB BLAST Seq1 > dcscdssdcsdcdsc bscdsbcbjbfvbfvbvfbvbvbhvbhs vbhdvbhfdbvfd Seq2 > bvdfvfdvhbdfvb bhvdsvbhvbhdvrefghefgdscgdfg csdycgdkcsqkc … Seqn > bvdfvfdvhbdfvb bhvdsvbhvbhdvrefghefgdscgdfg csdycgdkcsqkchdsqhfduhdhdhq edezhhezldhezhfehflezfzejfv DB BLAST dedzedzd zedezdze cdscsdcsc dssdcsdc dscbscds bcbjbf DB BLAST dedzedzd zedezdze cdscsdcsc dssdcsdc dscbscds bcbjbf DB BLAST dedzedz dzedezd zecdscsd cscdssdc sdcdscbs cdsbcbjb f RESULT dedzedzdzedezdzecdscsdcscdssdcsd cdscbscdsbcbjbfvbfvbvfbvbvbhvbh svbhdvbhfdbvfdbvdfvfdvhbdfvbhd bhvdsvbhvbhdvrefghefgdscgdfgcsd ycgdkcsqkcqhdsqhfduhdhdhqedezh dhezldhezhfehflezfzeflehfhezfhehf ezhflezhflhfhfelhfehflzlhfzdjazslzd hfhfdfezhfehfizhflqfhduhsdslchlkc hudcscscdscdscdscsddzdzeqvnvqvn q! Vqlvkndlkvnldwdfbwdfbdbd wdfbfbndblnblkdnblkdbdfbwfdbfn dedzedzd zedezdze cdscsdcsc dssdcsdc dscbscds bcbjbf UI Computing element Input file Computing element dedzedzd zedezdze cdscsdcsc dssdcsdc dscbscds bcbjbf Seq1 > dedzedzdzedezdze cdscsdcscdssdcsdc dscbscdsbcbjbdfn dfjvbndfbnbnfbjn bjxbnxbjk:nxbf dedzedzd zedezdze cdscsdcsc dssdcsdc dscbscds bcbjbf Seq2 > dedzedzdzedezdze cdscsdcscdssdcsdc dscbscdsbcbjbdfn dfjvbndfbnbnfbjn bjxbnxbjk:nxbf dedzedzd zedezdze cdscsdcsc dssdcsdc dscbscds bcbjbf Seqn > dedzedzdzedezdze cdscsdcscdssdcsdc dscbscdsbcbjbdfn dfjvbndfbnbnfbjn bjxbnxbjk:nxbf

EGEE Induction May Monte carlo simulation for radiotherapy planning StorageElement ComputingElement StorageElement ComputingElement StorageElement ComputingElement StorageElement ComputingElement GATE Image: text file Binary file: Image.raw Size 19M Scanner slices: DICOM format Databa se User interface CCIN2P3 RAL NIKHEF MARSEILLE Anonymisation Concatenation Submission of jdls to the CEs Copy the medical image from the SE to the CE Retrieving of root output files from CEs

EGEE Induction May HEP applications and the grid Have been running large distributed computing systems for many years Now the focus for the future is on computing for LHC and hence we have the LCG (LHC computing grid project) In addition to the 4 LHC experiments(ATLAS,ALICE,CMS,LHCb) other current HEP experiments use grid technology e.g. Babar,CDF,D0.., and don’t forget Theory and other new HEP experiments.. LHC experiments are currently executing large scale data challenges(DCs) involving thousands of processors world-wide and generating many Terabytes of data Moving to so-called ‘chaotic’ use of grid with individual user analysis (thousands of users operating within experiment VOs)..see ARDA

EGEE Induction May CMS ATLAS LHCb ALICE Storage – Raw recording rate 0.1 – 1 GByte/s Accumulating at 5-8 PetaByte/year 10 PetaByte of disk Processing – 200,000 of today’s fastest PCs LHC Experiments

EGEE Induction May LHC Computing Grid Project - a Collaboration The physicists and computing specialists from the LHC experiment The projects in Europe and the US that have been developing Grid middleware The regional and national computing centres that provide resources for LHC The research networks Researchers Software Engineers Service Providers Building and operating the LHC Grid – a collaboration between

EGEE Induction May RAL IN2P3 BNL FZK CNAF PIC ICEPP FNAL LHC Computing Model (simplified!!) Tier-0 – the accelerator centre  Filter  raw data  Reconstruction  summary data (ESD)  Record raw data and ESD  Distribute raw and ESD to Tier-1 Tier-1 –  Permanent storage and management of raw, ESD, calibration data, meta- data, analysis data and databases  grid-enabled data service  Data-heavy analysis  Re-processing raw  ESD  National, regional support “online” to the data acquisition process high availability, long-term commitment managed mass storage Tier-2 –  Well-managed disk storage – grid-enabled  Simulation  End-user analysis – batch and interactive  High performance parallel analysis (PROOF) USC NIKHEF Krakow CIEMAT Rome Taipei TRIUMF CSCS Legnaro UB IFCA IC MSU Prague Budapest Cambridge Tier-1 small centres Tier-2 desktops portables

EGEE Induction May RAL IN2P3 BNL FZK CNAF USC PIC ICEPP FNAL NIKHEF Krakow Taipei CIEMAT TRIUMF Rome CSCS Legnaro UB IFCA IC MSU Prague Budapest Cambridge Data distribution ~70 Gbits/sec

EGEE Induction May Characteristics of CMS Data Challenge DC04 (just completed)……run with LCG-2 and CMS resources world-wide ( US Grid3 was a major component) Pre-Challenge Production (Phase 1) – simulation generation and digitisation  After 8 months of continuous running: 750,000 jobs 3,500 KSI2000 months 700,000 files 80 TB of data Data Challenge (Phase 2)  Ran the full data reconstruction and distribution chain at 25 Hz  Achieved 2,200 jobs/day (about 500 CPU’s) running at Tier-0 Total 45,000 jobs Tier-0 and files/s registered to RLS (with POOL metadata) Total 570,000 files registered to RLS 4 MB/s produced and distributed to each Tier-1

EGEE Induction May ALICE Distr. Analysis EGEE middleware Resource Providers Community ATLAS Distr. Analysis CMS Distr. Analysis LHCb Distr. Analysis SEAL PROOF GAE POOL ARDA Collaboration Coordination Integration Specification Priorities Planning Experience   Use Cases EGEE NA4 Application identification and support LCG-GAG Grid Application Group ARDA (Architectural Roadmap for Distributed Analysis)

EGEE Induction May NA4 Generic Applications This is a key activity in the process of getting new scientific and industrial communities interested and committed to use the continental grid infrastructure built by the EGEE Project. GENIUS is a well established tool which will be fundamental in the process of interfacing new applications with the EGEE middleware hiding its complex internals to non-experts users from new communities. GILDA is a complete suite of grid elements (test-bed, CA, VO, monitoring system, web portal) and applications fully dedicated to dissemination purposes. This could also represent the ideal grid testbed where to start the porting of new generic applications. GILDA is the dissemination tool which will be used by NA3 during courses and tutorials so the important aspect of induction of the grid paradigm to new communities is also covered. It is now important to have the first meeting of the EGAAP board and define the first Generic Applications to be interfaced.

EGEE Induction May The GILDA home page (

EGEE Induction May The Generic Application questionnaire (contribution to MNA4.1) Questionnaire to get information and first requirements from new communities interested in using the EGEE Infrastructure ( questionnaire.doc) questionnaire.doc Feed-backs received so far (  Astrophysics (EVO and Planck satellite)  Earth Observation (ozone maps, seismology, climate)  Digital Libraries (DILIGENT Project)  Grid Search Engines (GRACE Project)  Industrial applications (SIMDAT Project) Interest also from Computational Chemistry (Italy and Czech Republic), Civil Engineering (Spain), and Geophysics (Switzerland and France) communities

EGEE Induction May EGEE Industry Forum Objectives The main role of the Industry Forum in the Enabling Grids for E-Science in Europe (EGEE) project is to raise awareness of the project amongst industry and to encourage businesses to participate in the project. This will be achieved by making direct contact with industry, liaising between the project partners and industry in order to ensure businesses get what they need in terms of standards and functionality and ensuring that the project benefits from the practical experience of businesses. The members of the EGEE Industry Forum are companies of all sizes who have their core business or a part of their business within Europe. The Industry Forum will be managed by a steering group consisting of EGEE project partners and representatives from business

EGEE Induction May NA4 Testing Group Three types of tests will be developed: (based on user requirements and on the experience gathered by the ongoing activities like LHC DCs and ARDA prototyping) Tests of service availability: This set of tests will check the EGEE services availability. All the services providing by the GRID should be tested : Job submission and management, files management, Information service,... Tests of functionality: To verify that the functionalities required are available, usable and complete; for example, file creation, moving and deletion, information publication, errors recovery. Tests to measure performances: Their goal is to characterize the testbed from the end users/application perspective. Part of them will be time measurements ( time to submit X job, time to replicateY files,...), others will address scalability measurements ( how many jobs can be accepted by service Z, files limits size,...) while others will be more abstract (information availability, errors message access,...). This work should be done in close collaboration with ARDA, JRA1 and SA1

EGEE Induction May Milestones for NA4 applications MNA4.1M6 First applications migrated to the EGEE infrastructure HEP data challenges for 4 LHC experiments and for D0 Biomedicine – GATE simulation in nuclear medicine + others Plus the first ‘generic’ applications MNA4.2M12 First external review of Applications Identification and Support with feedback MNA4.3M24 Second external review of Applications Identification and Support with feedback

EGEE Induction May Deliverables for NA4 applications DNA4.1M3 Definition of Common Application Interface (specially important for new applications…probably based on GENIUS flavoured work) DNA4.2M6 Target Application Sector Strategy document (work for getting new applications onto EGEE) DNA4.3M9 EGEE Application Migration Progress report (revision M15 and M21) All applications will include evaluations on production and pre-production services of LCG (current and ‘new’ middleware) DNA4.4M24Final Report of Application Identification and Support Activity

EGEE Induction May NA4 relations with other EGEE activities and other bodies (1) SA1 grid operations  How to get new VOs onto LCG from different domains?  How to integrate new resources(sites) into LCG coming from different application areas?  Rationalistion of test procedures  Working with national agencies (e.g. GridPP Application monitoring) NA3 training  Estimating requirements for courses  Design and implementation of courses JRA1 middleware  All applications input requirements and monitor their satisfaction with feedback to middleware (process goes through the PTF-Project Technical Forum) JRA2 quality assurance  NA4 have a representative on this group to define process for monitoring quality of EGEE services

EGEE Induction May NA4 relations with other EGEE activities and other bodies (2) JRA3 security  Security of medical (and other application) data  Security for sites SA2,JRA4 networking  Global HEP requirements through LCG  Biomed and other applications must similrly give global needs  NA4 will give individual application use cases especially where problems have been encountered LCG  NA4/HEP are presented on the LCG/GAG(grid Applications Group) This is HEP source of requirements and giving feedback to middleware on ‘customer satisfaction’. Some GAG people are on the PTF.

EGEE Induction May Conclusions NA4 is up and running now  HEP is using LCG-2 for data challenges  ARDA is well under way and waiting for first new middleware prototype  Biomedicine has applications ready to go onto LCG-2 and pre-production services  Generic group is very active with GILDA and excellent relations with NA3  Testing group has active dialogue with JRA1 and ARDA for rationalising testing effort  Industry forum has developed links with several companies (see EGEE Cork presentations) NA4 open meeting Jul at Catania with emphasis on inter-activity dialogue (with middleware,operations,security,networking) NA4 Web site