Grid Computing with Debian, Globus and ARC Mattias Ellert, Uppsala Universitet (.se) Steffen Möller, Universität zu Lübeck (.de) Anders Wäänänen, Niels.

Slides:



Advertisements
Similar presentations
Plateforme de Calcul pour les Sciences du Vivant SRB & gLite V. Breton.
Advertisements

MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
AustrianGrid, LCG & more Reinhard Bischof HPC-Seminar April 8 th 2005.
UNICORE UNiform Interface to COmputing REsources Olga Alexandrova, TITE 3 Daniela Grudinschi, TITE 3.
CSC Grid Activities Arto Teräs HIP Research Seminar February 18th 2005.
WEB HOSTING & UPLOADING SITI ZULAIHA BINTI MOHD RAIS PGC
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Grid Engine Riccardo Rotondo
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Topaz : A GridFTP extension to Firefox M. Taufer, R. Zamudio, D. Catarino, K. Bhatia, B. Stearn University of Texas at El Paso San Diego Supercomputer.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
National Computational Science National Center for Supercomputing Applications National Computational Science NCSA-IPG Collaboration Projects Overview.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
EMI 1 Release The EMI 1 (Kebnekaise) release features for the first time a complete and consolidated set of middleware components from ARC, dCache, gLite.
Quick Introduction to NorduGrid Oxana Smirnova 4 th Nordic LHC Workshop November 23, 2001, Stockholm.
Grid Middleware Tutorial / Grid Technologies IntroSlide 1 /14 Grid Technologies Intro Ivan Degtyarenko ivan.degtyarenko dog csc dot fi CSC – The Finnish.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Internet2 AdvCollab Apps 1 Access Grid Vision To create virtual spaces where distributed people can work together. Challenges:
Alex Read, Dept. of Physics Grid Activities in Norway R-ECFA, Oslo, 15 May, 2009.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
SESEC Storage Element (In)Security hepsysman, RAL 0-1 July 2009 Jens Jensen.
The Institute of High Energy of Physics, Chinese Academy of Sciences Sharing LCG files across different platforms Cheng Yaodong, Wang Lu, Liu Aigui, Chen.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
RI EGI-TF 2010, Tutorial Managing an EGEE/EGI Virtual Organisation (VO) with EDGES bridged Desktop Resources Tutorial Robert Lovas, MTA SZTAKI.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
Federated Access to Storage EGI CF 2012 Luke Howard, Daniel Kouril, Michal Prochazka.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
Packaging Grid Software for Linux Distributions HEPiX spring meeting 2009 Umeå Mattias Ellert, Uppsala Universitet Steffen Möller, Universität zu Lübeck.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Accessing the VI-SEEM infrastructure
Grid and Cloud Computing
Aleksandr Konstantinov, Weizhong Qiang NorduGrid collaboration
The EDG Testbed Deployment Details
Computing Clusters, Grids and Clouds Globus data service
OGF PGI – EDGI Security Use Case and Requirements
Oxana Smirnova, Jakob Nielsen (Lund University/CERN)
StoRM: a SRM solution for disk based storage systems
How to connect your DG to EDGeS? Zoltán Farkas, MTA SZTAKI
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Creating and running applications on the NGS
Evaluation of “data” grid tools
EMI Interoperability Activities
Middleware independent Information Service
Infrastructure Support
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Viet Tran Institute of Informatics Slovakia
University of Technology
Introduction to Cloud Computing
Farid Ould-Saada KnowARC’06 Conference Oslo 7-9/11/06
Grid Security M. Jouvin / C. Loomis (LAL-Orsay)
From Prototype to Production Grid
Wide Area Workload Management Work Package DATAGRID project
Grid Engine Riccardo Rotondo
Grid Engine Diego Scardaci (INFN – Catania)
gLite The EGEE Middleware Distribution
Grid Computing Software Interface
Presentation transcript:

Grid Computing with Debian, Globus and ARC Mattias Ellert, Uppsala Universitet (.se) Steffen Möller, Universität zu Lübeck (.de) Anders Wäänänen, Niels Bohr Institutet (.dk)

www.knowarc.eu2 Grid Computing Seamless integration of distributed computing and storage resources from the user’s point of view Computing grid vs. power grid analogy  Power grid: users plug in their electrical devices and don’t need to care which power plant provides the electricity (unless they want to)  Computing grid: the user prepares a computing task and sends it to the “grid” and doesn’t need to care which cluster performs the calculation (unless it wants to)

www.knowarc.eu3 Volunteer vs. Grid Computing Volunteer Computing: BOINC  “single regular users fetch prepared workunits”  regular Debian client package  unofficial server packages Computational Grids  “big compute clusters wait for arbitrary jobs”  no previous packages for any Linux distributions  common IT backbone for High Energy Physics

www.knowarc.eu4 Mutual Trust Network of trust  Users trust sites Data security, validity of installations  Sites trust users All usage can be traced back to the user X.509 certificates  Certificate Authorities (CAs) guarantee identities  User creates time-limited variants of these certificates (proxies) to delegate their rights to jobs

www.knowarc.eu5 Mutual Trust (cont’d) International Grid Trust Federation (IGTF)  CAs that trust eachother’s policies  Users with a user certificate issued by a member CA can authenticate to resources that have host certificates issued any other member CA Virtual organisations  Clusters in the grid delegate decision over admissions to virtual organisations  Easiest: a website collecting the individuals’ certificates descriptive names

www.knowarc.eu6 Typical Grid Usage Submission of Job  Task should be described in a job description – executable, input data, output data, software and hardware requirements,... Status information  Query the state of clusters and jobs Retrieval of results  Download to client or (if specified in the job description) automatically upload to storage Data management  Keep track of large sets of input and output files

www.knowarc.eu7 Remaining Challenges Make grid access easier  Local vs. grid accounts Increase flexibility  Migration of jobs  Preparation of runtime environments Increase public awareness  Universities and research groups  Industry  Computer clubs  Presentations like this one ;-)

www.knowarc.eu8 Current Technologies / Projects Globus  can be used as a complete grid middleware  is a library of core functionalities for many Unicore  both Grid and Grid Infrastructure EGEE  uses the gLite grid middleware and Globus NorduGrid  with or without Globus  compatible with the others

www.knowarc.eu9Globus A set of libraries and tools for grid computing used by many grid projects  Globus security infrastructure (GSI) Authentication and authorization based on short lived proxy certificates standardized as RFC 3820  GridFTP Extensions to the FTP protocol to support GSI authentication, third-party transfers, multiple data channels for parallel transfers, partial file transfers “proposed recommendation” document in the Global Grid Forum (GFD-R-P.020)

www.knowarc.eu10 Packaging Globus Source  Distributed as >100 MB tarball  Contains ~300 inter-dependent packages within Split into individual packages to become manageable  Strong consistency between Globus and Debian packages Build uses the Grid Packaging Toolkit (GPT) Patches communicated back to upstream

www.knowarc.eu11 Packaging Globus Redundancies with system libraries are all eliminated from the source tree  e.g. openssl, openldap, libltdl Glue packages are provided instead  providing GPT metadata information for system packages to satisfy build dependencies Status  First packages uploaded to Debian new queue, also uploaded to Fedora

www.knowarc.eu12 Packaging Globus Regular package for Grid Package Toolkit Use GPT packaging metadata information to autogenerate Debian folders in source code management system Manual curation of these folders  preparation of patches  provisioning of better descriptions

www.knowarc.eu13 NorduGrid – ARC Advanced Resource Connector Grid middleware built on top of the Globus libraries, with higher level services Used by the Nordic Data Grid Facility (NDGF) to provide computing resources for  High Energy Physics researchers at the CERN Large Hadron Collider  Bioinformatics  Quantum chemistry ...

www.knowarc.eu14 NorduGrid – ARC Monitor of clusters contributing

www.knowarc.eu15 Packaging NorduGrid – ARC Available today from version 0.6.x  “Production” release  full Globus dependency  Globus packages should be accepted first version 1.x  ongoing development  optional Globus dependency  Debian packages will offer the more compatible Globus-dependent version

www.knowarc.eu16 Implications for Debian Increased connectivity  between users of Debian  in between clusters of Linux distributions Promotion as an extended concept of the Debian society  the sharing of packaging may be extended towards a sharing of resources Debian Technologies  packages are perfect descriptions for runtime environments  availability on many heterogeneous platforms

www.knowarc.eu17Acknowledgments KnowARC –  European Commission 5 th framework programme project NDGF – The developers of Globus –  Charles Bacon in particular, for his integration of patches The developers of NorduGrid ARC –