December 8 & 9, 2005, Austin, TX SURA Cyberinfrastructure Workshop Series: Grid Technology: The Rough Guide Low Level Grid Services (Job Management, Data.

Presentation transcript:

December 8 & 9, 2005, Austin, TX SURA Cyberinfrastructure Workshop Series: Grid Technology: The Rough Guide Low Level Grid Services (Job Management, Data Management, Monitoring Services) Ravi K Madduri Argonne National Laboratory University of Chicago

Services Overview
Installation
Data Management
  – GridFTP, RFT, RLS, DAIS
Resource Management
  – Schedulers, logs, sudo
Information Services
  – Index service hierarchies, Ganglia/Hawkeye

Full Toolkit Installation
Binaries available for many platforms
  – Apple
  – Linux: Debian, Fedora, SuSE, RHEL, Red Hat
  – FreeBSD
  – HP/UX, Tru64
  – AIX
  – Solaris
  – Windows (Java code only)
Source code also available
See http:// for the installation guide, quickstart, and prerequisite documentation

Overview of GT4 Data Services
GridFTP
  – High-performance data transfer protocol
The Reliable File Transfer Service (RFT)
  – Data movement services for GT4
The Replica Location Service (RLS)
  – Distributed registry that records locations of data copies
The Data Access and Integration Service (DAIS)
  – Service to access relational and XML databases

What is GridFTP?
A secure, robust, fast, efficient, standards-based, widely accepted data transfer protocol
A protocol
  – Multiple independent implementations can interoperate
      – This works. Both the Condor Project at the University of Wisconsin and Fermilab have home-grown servers that work with ours.
      – Lots of people have developed clients independent of the Globus Project.
The Globus Toolkit supplies a reference implementation:
  – Server
  – Client tools (globus-url-copy)
  – Development libraries

GT4 GridFTP Implementation
Based on XIO
Extremely modular to allow integration with a variety of data sources (files, mass stores, etc.)
Striping support is provided in 4.0
IPv6 support is included (EPRT, EPSV)

Configuring GridFTP
The right configuration results in better performance
Add entries to /etc/services and (x)inetd
Configuration options:
  – Binding to a specific interface/address
  – Striped backend
  – TCP tuning parameters

What is RFT?
WS-RF compliant, fault-tolerant, high-performance data transfer service
  – Soft state
  – Notifications/Query
Reliability on top of the high performance provided by GridFTP
  – Fire and forget
  – Integrated automatic failure recovery
      – Network-level failures
      – System-level failures, etc.
  – Essentially a data transfer scheduler with FIFO as its queue policy
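The "fire and forget" and FIFO points above can be illustrated with a small toy scheduler. This is only a sketch under stated assumptions: `FifoTransferQueue`, `Transfer`, and `flaky_copy_factory` are hypothetical names invented for this example and are not part of the RFT interface, the copy function is a stand-in for a real GridFTP transfer, and the persistent state a real service would need in order to recover from its own restart is left out.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Transfer:
    source: str
    destination: str
    attempts: int = 0
    status: str = "Pending"


class FifoTransferQueue:
    """Toy stand-in for a reliable transfer scheduler (not the RFT API)."""

    def __init__(self, copy_func, max_attempts=3):
        self.copy_func = copy_func        # hypothetical callable that performs one copy
        self.max_attempts = max_attempts
        self.pending = deque()            # FIFO queue policy
        self.finished = []

    def submit(self, source, destination):
        # "Fire and forget": the caller keeps a handle and can check it later.
        transfer = Transfer(source, destination)
        self.pending.append(transfer)
        return transfer

    def run(self):
        while self.pending:
            transfer = self.pending.popleft()
            transfer.attempts += 1
            try:
                self.copy_func(transfer.source, transfer.destination)
                transfer.status = "Done"
            except OSError:
                if transfer.attempts < self.max_attempts:
                    # Automatic recovery: requeue behind the work already waiting.
                    transfer.status = "Retrying"
                    self.pending.append(transfer)
                    continue
                transfer.status = "Failed"
            self.finished.append(transfer)


def flaky_copy_factory(failures_before_success):
    """Build a fake copy function that fails a set number of times per source."""
    remaining = dict(failures_before_success)

    def copy(source, destination):
        if remaining.get(source, 0) > 0:
            remaining[source] -= 1
            raise OSError("simulated network failure")

    return copy


queue = FifoTransferQueue(flaky_copy_factory({"gsiftp://a/file1": 2}))
first = queue.submit("gsiftp://a/file1", "gsiftp://b/file1")
second = queue.submit("gsiftp://a/file2", "gsiftp://b/file2")
queue.run()
print(first.status, first.attempts)
print(second.status, second.attempts)
```

In this sketch a failed transfer goes to the back of the queue, so one flaky source does not block the transfers queued behind it; whether the real service orders retries this way is not stated on the slide.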

What is RFT? (continued)
[Figure. Labels: RFT Client, RFT Service, SOAP Messages, Notifications (Optional), GridFTP Server, Protocol Interpreter, Master DSI, Slave DSI, IPC Link, IPC Receiver, Data Channel]

Data Transfer Comparison
[Figure comparing globus-url-copy with an RFT Client and RFT Service. Labels: Control, Data, SOAP Messages, Notifications (Optional)]

Replica Management in Grids
Data-intensive applications produce terabytes or petabytes of data
  – Hundreds of millions of data objects
Replicate data at multiple locations for reasons of:
  – Fault tolerance
      – Avoid single points of failure
  – Performance
      – Avoid wide area data transfer latencies
      – Achieve load balancing

A Replica Location Service
A Replica Location Service (RLS) is a distributed registry that records the locations of data copies and allows replica discovery
  – RLS maintains mappings between logical identifiers and target names
  – Must perform and scale well: support hundreds of millions of objects, hundreds of clients
E.g., LIGO (Laser Interferometer Gravitational Wave Observatory) Project
  – RLS servers at 8 sites
  – Maintain associations between 3 million logical file names & 30 million physical file locations
RLS is one component of a Replica Management system
  – Other components include consistency services, replica selection services, reliable data transfer, etc.
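The core idea of mapping one logical identifier to several target names can be sketched in a few lines. This is a minimal, single-process illustration only: `ReplicaCatalog` and its methods are hypothetical names for this example, not the RLS client API, the URLs are made up, and nothing here addresses the distribution or scale requirements listed above.

```python
from collections import defaultdict


class ReplicaCatalog:
    """Toy in-memory map from logical names to target names (not the RLS API)."""

    def __init__(self):
        self._targets = defaultdict(set)

    def add_mapping(self, logical_name, target_name):
        self._targets[logical_name].add(target_name)

    def remove_mapping(self, logical_name, target_name):
        targets = self._targets.get(logical_name)
        if targets is None:
            return
        targets.discard(target_name)
        if not targets:
            # Drop logical names that no longer have any replica.
            del self._targets[logical_name]

    def lookup(self, logical_name):
        """Replica discovery: every known location of one logical name."""
        return sorted(self._targets.get(logical_name, ()))


catalog = ReplicaCatalog()
catalog.add_mapping("lfn://example/run42.dat",
                    "gsiftp://site-a.example.org/data/run42.dat")
catalog.add_mapping("lfn://example/run42.dat",
                    "gsiftp://site-b.example.org/store/run42.dat")
print(catalog.lookup("lfn://example/run42.dat"))
```

A replica selection service, mentioned above as a separate component, would sit on top of `lookup` and pick one of the returned locations.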

Goals for OGSA-DAI
Aim to deliver application mechanisms that:
  – Meet the data requirements of Grid applications
      – Functionality, performance and reliability
      – Reduce development cost of data-centric Grid applications
      – Provide consistent interfaces to data resources
  – Acceptable and supportable by database providers
      – Trustable, imposed demand is acceptable, etc.
      – Provide a standard framework that satisfies standard requirements
A base for developing higher-level services
  – Data federation
  – Distributed query processing
  – Data mining
  – Data visualisation

Data Management Q & A

Grid Monitoring Services Overview
Index Service
  – Aggregate the data
Trigger Service
  – Notify when data changes
Information Providers
  – Provide the data
WebMDS
  – Client to visualize data

What Is Grid Monitoring?
A way to discover what services and resources are available to use (Discovery)
A way to understand the status/attributes of those services (Monitoring)
A system to warn you when things fail
Sharing of community data between sites using a standard interface for querying and notification

Why Is Grid Monitoring Hard?
Lack of central control
  – Different local systems according to local policy
  – Different interfaces and monitoring requirements
Shared resources
  – Contention, variability
Communication
  – Different sites imply different sys admins, users, institutional policies

MDS4 Monitoring and Discovery System
Grid-level monitoring system used most often for resource selection
  – Aids the user/agent in identifying host(s) on which to run an application
Uses standard interfaces to provide publishing of data, discovery, and data access, including subscription/notification
  – WS-ResourceProperties, WS-BaseNotification, WS-ServiceGroup
Functions as an hourglass to provide a common interface to lower-level monitoring tools

[Figure. Labels:]
GLUE Schema attributes (cluster info, queue info, FS info)
Information users: schedulers, portals, etc.
Cluster monitors (Ganglia, Hawkeye, Clumon, and Nagios soon)
Services (GRAM, RFT, RLS)
Queueing systems (PBS, LSF, Torque)
WS standard interfaces for subscription, registration, notification

MDS4 Components
Higher-level services
  – Index Service: a way to aggregate data
  – Trigger Service: a way to be notified of changes
  – Both built on a common aggregator framework
Information providers
  – Monitoring is a part of every WSRF service
  – Non-WS services can also be used
Clients
  – WebMDS
All of the tools are schema-agnostic, but interoperability needs a well-understood common language
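The relationship between the two higher-level services and the shared aggregator can be pictured with a toy model. Everything in it is an assumption made for illustration: `Aggregator`, `IndexService`, `TriggerService`, and `queue_provider` are hypothetical names, providers are plain Python callables rather than WSRF resource properties, and polling stands in for the subscription/notification interfaces that MDS4 actually uses.

```python
class Aggregator:
    """Shared base: polls registered information providers into a snapshot."""

    def __init__(self):
        self.providers = {}   # provider name -> zero-argument callable returning a dict
        self.snapshot = {}    # provider name -> most recently collected data

    def register(self, name, provider):
        self.providers[name] = provider

    def refresh(self):
        """Poll every provider and return the names whose data changed."""
        changed = []
        for name, provider in self.providers.items():
            data = provider()
            if self.snapshot.get(name) != data:
                changed.append(name)
            self.snapshot[name] = data
        return changed


class IndexService(Aggregator):
    """Aggregates data and answers queries over the snapshot."""

    def query(self, predicate):
        return {name: data for name, data in self.snapshot.items() if predicate(data)}


class TriggerService(Aggregator):
    """Runs an action when changed data satisfies a condition."""

    def __init__(self):
        super().__init__()
        self.rules = []       # list of (condition, action) pairs

    def add_rule(self, condition, action):
        self.rules.append((condition, action))

    def refresh(self):
        changed = super().refresh()
        for name in changed:
            for condition, action in self.rules:
                if condition(self.snapshot[name]):
                    action(name, self.snapshot[name])
        return changed


queue_state = {"free_cpus": 8}


def queue_provider():
    # Hypothetical information provider reporting one queue attribute.
    return {"queue": "batch", "free_cpus": queue_state["free_cpus"]}


index = IndexService()
index.register("cluster-a", queue_provider)
index.refresh()

alerts = []
trigger = TriggerService()
trigger.register("cluster-a", queue_provider)
trigger.add_rule(lambda data: data["free_cpus"] == 0,
                 lambda name, data: alerts.append(name))
trigger.refresh()                  # free_cpus is 8, so the rule does not fire
queue_state["free_cpus"] = 0
trigger.refresh()                  # data changed and the condition now holds
print(index.query(lambda data: data["free_cpus"] > 0))
print(alerts)
```

Note that neither class inspects the keys of the provider data; only the caller-supplied predicates do, which mirrors the point above that the tools are schema-agnostic while their users need an agreed schema.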

Sample Deployment

WebMDS User Interface
Web-based interface to WSRF resource property information
User-friendly front-end to the Index Service
Uses standard resource property requests to query resource property data, and XSLT transforms to format and display them
Customized pages can be made simply by using HTML form options and creating your own XSLT transforms
Sample page:
  – o=indexinfo&xsl=servicegroupxsl

WebMDS Service

Information Services Q & A

GRAM
Overview
Submitting a test job
Resource Specification Language (RSL)
Data Staging
Multi-jobs

GRAM Overview
Intended for jobs where arbitrary programs, stateful monitoring, credential management, and file staging are important
If the application is lightweight, with modest input/output, it may be a better candidate for hosting directly as a WSRF service

GRAM Prerequisites
A secure container
For staging jobs, access to an RFT service and a GridFTP server
  – Note that even stderr/stdout are considered staging, so RFT and GridFTP are used in all but the most basic jobs
sudo for running as other accounts
Can be integrated with PBS, LSF, Condor

Submitting A Test Job
  globusrun-ws -submit -c /bin/true
  echo $?
This will run locally. Specify a remote host with -F:
  globusrun-ws -submit -F host2 -c /bin/true
The return code will be the job's exit code if supported by the scheduler

Data Staging
GRAM allows jobs to stage-in and stage-out data
To perform this task it uses RFT
RFT in turn uses GridFTP servers
Simplest stage-in/stage-out example is stdout/stderr

Streaming Results
  globusrun-ws -S -s -c /bin/date
-S is short for "-submit"
-s is short for "-streaming"
  – The output will be sent back to the terminal; control will not return until the job is done

Resource Specification Language
For more complicated jobs, we'll use RSL to specify the job
  /bin/echo
  this is an example_string
  Globus was here
  ${GLOBUS_USER_HOME}/stdout
  ${GLOBUS_USER_HOME}/stderr
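The XML markup around the values on this slide did not survive in the transcript, so the following is only a sketch of how such a job description might be assembled. The element names `job`, `executable`, `argument`, `stdout`, and `stderr` are assumptions and should be checked against the GRAM job description schema before use; `build_job_description` is a hypothetical helper, and treating the two phrases as separate arguments is also a guess.

```python
import xml.etree.ElementTree as ET


def build_job_description(executable, arguments, stdout, stderr):
    """Build an XML job description; all element names here are assumed."""
    job = ET.Element("job")
    ET.SubElement(job, "executable").text = executable
    for argument in arguments:
        ET.SubElement(job, "argument").text = argument
    ET.SubElement(job, "stdout").text = stdout
    ET.SubElement(job, "stderr").text = stderr
    return job


job = build_job_description(
    executable="/bin/echo",
    arguments=["this is an example_string", "Globus was here"],
    stdout="${GLOBUS_USER_HOME}/stdout",
    stderr="${GLOBUS_USER_HOME}/stderr",
)

# ET.indent is only available on Python 3.9 and later, so guard the call.
if hasattr(ET, "indent"):
    ET.indent(job)

xml_text = ET.tostring(job, encoding="unicode")
print(xml_text)
```

The `${GLOBUS_USER_HOME}` placeholders are passed through untouched as literal text; the sketch does not expand them.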

Submitting Using XML
Create the file containing the RSL
You may validate the RSL ahead of time
  – globusrun-ws -validate -f rslfile.xml
If the file validates, submit using -submit

At Most Once Submission
You may specify a UUID with your job submission
If you're not sure the submission worked, you may submit the job again with the same UUID
If the job has already been submitted, the new submission will have no effect
If you do not specify a UUID, one will be generated for you
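The at-most-once behaviour described above amounts to keying submissions by their UUID. The sketch below shows that idea with a toy in-memory factory; `JobFactory`, `submit`, and `launch_count` are hypothetical names for this example and say nothing about how GRAM stores or checks submission IDs.

```python
import uuid


class JobFactory:
    """Toy job factory that ignores repeated submissions with the same ID."""

    def __init__(self):
        self.jobs = {}          # submission ID -> command
        self.launch_count = 0   # how many jobs were actually started

    def submit(self, command, submission_id=None):
        if submission_id is None:
            # No ID supplied: generate one for the caller.
            submission_id = str(uuid.uuid4())
        if submission_id in self.jobs:
            # Already submitted: the repeat has no effect.
            return submission_id
        self.jobs[submission_id] = command
        self.launch_count += 1
        return submission_id


factory = JobFactory()
my_id = str(uuid.uuid4())
factory.submit("/bin/true", submission_id=my_id)
factory.submit("/bin/true", submission_id=my_id)   # retry after an uncertain result
generated_id = factory.submit("/bin/date")
print(factory.launch_count)
```

Because the client chooses the ID before the first attempt, it can safely retry after a timeout without knowing whether the first request arrived.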

Staging Data
GRAM's RSL allows many fileStageIn/fileStageOut directives
The transfers will be executed by RFT
  – May specify additional RFT options using the RFTOptions tag
There is no GASS cache staging option anymore

Batch Submission
Your client does not have to stay attached to the execution of the job
-batch will disconnect from the job and output an EPR
  – You may redirect the EPR to a file with -o
Use the EPR file with -monitor or -status
You may also kill the job using -kill

Specifying Scheduler Options
RSL lets you specify various scheduler options
  – what queue to submit to
  – which project to select for accounting
  – max CPU and wallclock time to spend
  – min/max memory required
All defined online under the schema document for GRAM

Multijobs
You may specify more than one element in a
At that point, you want to specify the in the RSL rather than the command line
Will be used by MPICH-G to support MPI jobs

Resource Management Q & A

For more information
The Globus Toolkit™