AliE Pablo Saiz/CERN P.Buncic, J-E. Revsbech R.Piskac, V.Sego, L. Aphecetche ALICE Collaboration ALICE Environment on the GRID.

Slides:



Advertisements
Similar presentations
1 ALICE Grid Status David Evans The University of Birmingham GridPP 14 th Collaboration Meeting Birmingham 6-7 Sept 2005.
Advertisements

1 ALICE Grid Status David Evans The University of Birmingham GridPP 16 th Collaboration Meeting QMUL June 2006.
Data Management Expert Panel - WP2. WP2 Overview.
1 Databases in ALICE L.Betev LCG Database Deployment and Persistency Workshop Geneva, October 17, 2005.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
1 Grid services based architectures Growing consensus that Grid services is the right concept for building the computing grids; Recent ARDA work has provoked.
23/04/2008VLVnT08, Toulon, FR, April 2008, M. Stavrianakou, NESTOR-NOA 1 First thoughts for KM3Net on-shore data storage and distribution Facilities VLV.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Workload Management Massimo Sgaravatto INFN Padova.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
EU 2nd Year Review – Jan – WP9 WP9 Earth Observation Applications Demonstration Pedro Goncalves :
AliEn Tutorial MODEL th May, May 2009 Installation of the AliEn software AliEn and the GRID Authentication File Catalogue.
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
AliEn uses bbFTP for the file transfers. Every FTD runs a server, and all the others FTD can connect and authenticate to it using certificates. bbFTP implements.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz (IT-ES) AliEn job agents.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Wenjing Wu Andrej Filipčič David Cameron Eric Lancon Claire Adam Bourdarios & others.
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
NOVA Networked Object-based EnVironment for Analysis P. Nevski, A. Vaniachine, T. Wenaus NOVA is a project to develop distributed object oriented physics.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
NOVA A Networked Object-Based EnVironment for Analysis “Framework Components for Distributed Computing” Pavel Nevski, Sasha Vanyashin, Torre Wenaus US.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
Overview of Grid Webservices in Distributed Scientific Applications Dennis Gannon Aleksander Slominski Indiana University Extreme! Lab.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
JAliEn Java AliEn middleware A. Grigoras, C. Grigoras, M. Pedreira P Saiz, S. Schreiner ALICE Offline Week – June 2013.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
AliEn central services Costin Grigoras. Hardware overview  27 machines  Mix of SLC4, SLC5, Ubuntu 8.04, 8.10, 9.04  100 cores  20 KVA UPSs  2 * 1Gbps.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES L. Betev, A. Grigoras, C. Grigoras, P. Saiz, S. Schreiner AliEn.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
Accessing the VI-SEEM infrastructure
Open Source distributed document DB for an enterprise
ALICE Physics Data Challenge 3
The GENIUS portal Roberto Barbera University of Catania and INFN
Simulation use cases for T2 in ALICE
LCG middleware and LHC experiments ARDA project
Module 01 ETICS Overview ETICS Online Tutorials
Offline framework for conditions data
Presentation transcript:

AliE Pablo Saiz/CERN P.Buncic, J-E. Revsbech R.Piskac, V.Sego, L. Aphecetche ALICE Collaboration ALICE Environment on the GRID

AliE Content Alice at LHC Alice Computing Model Building AliEn AliEn Components Deploying AliEn AliEn Roadmap Conclusions

AliE LHC

AliE CERN - LHC

AliE Construction

AliE Problem Typical next generation HEP experiment Large scale simulation & reconstruction effort Heavily distributed processing and event storage  ~1000 scientists in ~100 of institutions Complex analyses of distributed data Large files (one event up to 2GB) 10^9 files/year (x n, n>2) 2 PB/year Experiment lifetime years GRID Widely accepted as a solution

AliE Alice Use Cases Simulation, Data Challenges & Reconstruction Centrally managed production of background events Distributed processing and event storage Event mixing Not necessarily centrally managed Once background events exist, the subsequent requests for event mixing must be routed to the location which holds required input Analysis Using AliEn API, PROOF will locate optimal site(s) for macro execution, try to execute it in parallel, collect the output and return it to the user (or register it in the catalogue)

AliE ALICE Computing Model

AliE AliE AliROOT ROOT User Simulation, Reconstruction, Calibration, Analysis System GUI Persistent IO Utility libs World Interfaces & Distributed computing environment C++ anything Nice! I only have to learn C++

AliE Challenge Can we provide, building on top of available public domain and open source components and standards, a functional distributed computing infrastructure to community of ALICE users which will remain operational even if underlying technologies keep changing?

AliE Building AliEn

AliE Building AliEn

AliE Open Source Components SASL/OpenSSL/OpenCA as authentication protocol Globus/GSS as an implementation of authentication compatible with other Grid projects CONDOR ClassAds language for job description (compatible with EU DataGrid) OpenLDAP for configuration management Apache for Web Portal MySQL as relational database backend Bbftp as file transfer protocols

AliE Gluing it together… Already existing pieces of code (NA49 file catalogue) in perl5 Good interface to different databases Easy Web integration Simple Object Access Protocol (also known as Service Oriented Access Protocol) Good Perl implementation (SOAP::Lite) on client and server side Possibility to provide client access from many different platforms and languages (Java,C,C++…) Provides standard means to invoke procedures (services) in distributed environment

AliE Components AliEn Services Modules & libraries

AliE “Web of Services”

AliE Statistics… SLOCCount is Open Source Software/Free Software, licensed under the FSF GPL.Please credit this data as "generated using 'SLOCCount' by David A. Wheeler."

AliE Statistics… SLOCCount is Open Source Software/Free Software, licensed under the FSF GPL.Please credit this data as "generated using 'SLOCCount' by David A. Wheeler.“

AliE Benefits of development based on OpenSource components are more than obvious… AliEn vs OpenSource

AliE AliEn Components

AliE AliEn SASL implementation SASL is the Simple Authentication and Security Layer, a method for adding authentication support to connection-based protocols AliEn now has perl module with implementation GSSAPI This allows us to use all SASL authentication schemes old AliEn authentication (token, AFS password, SSH) X509 certificates Globus/GSI (credential delegation) AliEn distribution includes necessary Globus/MDS/GSI software This allows us to develop secure Peer-To-Peer File Transfers based on machine/protocol/user certificates and LDAP based configuration management

AliE Authentication Client Proxy Server Database LDAP Request methods List of methods SASL Authentication Checking if user exists Data X509(AliEn/Globus) PKI/RSA (ssh) Token (AliEn) AFS password

AliE File catalogue ALICE USERS ALICE SIM Tier1 ALICE LOCAL |--./ | |--cern.ch/ | | |--user/ | | | |--a/ | | | | |--admin/ | | | | | | | | | |--aliprod/ | | | | | |--f/ | | | | |--fca/ | | | | | |--p/ | | | | |--psaiz/ | | | | | |--as/ | | | | | | | | |--dos/ | | | | | | | | |--local/ |--simulation/ | | / | | |--V3.05/ | | | |--Config.C | | | |--grun.C | |--36/ | | |--stderr | | |--stdin | | |--stdout | | |--37/ | | |--stderr | | |--stdin | | |--stdout | | |--38/ | | |--stderr | | |--stdin | | |--stdout | | | | | |--b/ | | | | |--barbera/ Files, commands (job specification) as well as job input and output and metadata are stored in the catalogue

AliE Command Interface

AliE GUI: AliEn Xfiles

AliE Web Portal Generic Web portal Virtual Organizations Alice Atlas NA49 Demo Mammogrid

AliE Task Queue “Pull” rather than “push”architecture

AliE AliEnTasksCEs alien job-submit job.jdl Broker Yes: Select Match ? No: Next CE contacts CPUServer and presents its own ClassAd, Resource Broker will match them against job ClassAds and select the most appropriate job to run on that CE Resource Broker

AliE Resource Broker Resource Broker Optimizer

AliE Class Ads &JDL Requirements = ( other.Type == "machine" ) &&(member(other.Packages,"AliRoot") ); Packages = "AliRoot"; Arguments = "--round run event version v –grun G+F"; Executable = "/Alice/bin/AliRoot.sh"; InputFile = { "LF:/alice/simulation/ /v /00071/Config.C", "LF:/alice/simulation/ /v /00071/grun.C" }; Type = "Job"; An Example – JDL file to run Alice Simulation job:

AliE Class Ads &JDL Requirements = ( other.Type == “Job" ); Type = “machine"; Host = “alienx.cern.ch”; CE =“Alice::CERN::LXBATCH”; Packages = { "AliRoot“, “ROOT”, “AliRoot:: ” }; CloseSE = { “Alice::CERN::Castor”, “Alice::CERN::File”, “Alice::CERN::scratch” }; Class Ads of CE:

AliE Computer Local Center GRID CENTER Computer Monitoring In order to develop and deploy more refined Resource Broker we need monitoring framework Frequent data updates, large data volume for large number of computers The idea is to implement hierarchy of clients and servers where each client (child) maintains the history of measurements reports the summary information to upper layer (parent) using SOAP protocol

AliE Deploying AliEn

AliE First implementation of Alice World Computing Model

AliE Production Summary 5682 events validated, 118 failed (2%) Up to 300 concurrently running jobs worldwide (5 weeks) 5 TB of data generated and stored at the sites with mass storage capability (CERN 73%,CCIN2P3 14%, LBL, 14%, OSC 1%) GSI, Karlsruhe, Dubna, Nantes, Budapest, Bologna, Zagreb, Birmingham, Utrecht, Calcutta in addition ready by now 13 clusters, 9 sites 10^5 CPU hours

AliE AliEn Roadmap

AliE AliEn as a meta-GRID AliEn User Interface AliEn stackiVDGL stackEDG stack

AliE Roadmap… Optimization and test suite PROOF interface & support for interactive jobs EDG interface GRID partitioning Queue optimization (based on AliEn monitoring) Implementation of Web services SOAP (Simple Object Access Protocol) WSDL (Web Services Description Language) UDDI (Universal Description Discovery & Integration) Virtual datasets

AliE Summary AliEn framework is a lightweight, simplified but functionally equivalent alternative to full blown GRID based on standard components (SOAP, Web services) It has been tested in production will be continuously developed with aim to provide long term stable interface to GRID(s) for Alice users AliEn will used to provide GRID component for MammoGRID – 3 year, 2M Euro project funded by EC, starting in September Summary of AliEn features (visit Authentication module which supports various authentication methods (Globus/GSI) Distributed file catalogue built on top of RDBMS with user interface that mimics the file system Secure file transport and replication Service Task queue which holds commands to be executed in the system and Resource Broker Configuration and Information Service Computing and Storage elements Metadata catalogue Monitoring framework C/C++/perl API Web portal