VDT 1 The Virtual Data Toolkit Todd Tannenbaum (Alain Roy)

Slides:



Advertisements
Similar presentations
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
Advertisements

Configuration management
Configuration management
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
A. Arbree, P. Avery, D. Bourilkov, R. Cavanaugh, S. Katageri, G. Graham, J. Rodriguez, J. Voeckler, M. Wilde CMS & GriPhyN Conference in High Energy Physics,
A Computation Management Agent for Multi-Institutional Grids
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
GRID workload management system and CMS fall production Massimo Sgaravatto INFN Padova.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
R. Cavanaugh GriPhyN Analysis Workshop Caltech, June, 2003 Virtual Data Toolkit.
Experience with ATLAS Data Challenge Production on the U.S. Grid Testbed Kaushik De University of Texas at Arlington CHEP03 March 27, 2003.
GRID Workload Management System Massimo Sgaravatto INFN Padova.
Status of Globus activities within INFN (update) Massimo Sgaravatto INFN Padova for the INFN Globus group
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Zach Miller Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
CONDOR DAGMan and Pegasus Selim Kalayci Florida International University 07/28/2009 Note: Slides are compiled from various TeraGrid Documentations.
Vladimir Litvin, Harvey Newman Caltech CMS Scott Koranda, Bruce Loftis, John Towns NCSA Miron Livny, Peter Couvares, Todd Tannenbaum, Jamie Frey Wisconsin.
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
Grappa: Grid access portal for physics applications Shava Smallen Extreme! Computing Laboratory Department of Physics Indiana University.
1 Todd Tannenbaum Department of Computer Sciences University of Wisconsin-Madison
December 8 & 9, 2005, Austin, TX SURA Cyberinfrastructure Workshop Series: Grid Technology: The Rough Guide Grid Technology: The Rough Guide Grid Building.
Chapter Fourteen Windows XP Professional Fault Tolerance.
10/20/05 LIGO Scientific Collaboration 1 LIGO Data Grid: Making it Go Scott Koranda University of Wisconsin-Milwaukee.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
HEP Experiment Integration within GriPhyN/PPDG/iVDGL Rick Cavanaugh University of Florida DataTAG/WP4 Meeting 23 May, 2002.
Patrick R Brady University of Wisconsin-Milwaukee
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
OSG Services at Tier2 Centers Rob Gardner University of Chicago WLCG Tier2 Workshop CERN June 12-14, 2006.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
OSG Middleware Roadmap Rob Gardner University of Chicago OSG / EGEE Operations Workshop CERN June 19-20, 2006.
LCG and HEPiX Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
Condor Birdbath Web Service interface to Condor
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
D. Olson, L B N L 1 STAR Collab. Mtg. 13 Aug 2003 Grid Enabling a small Cluster Doug Olson Lawrence Berkeley National Laboratory STAR Collaboration Meeting.
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
GriPhyN Status and Project Plan Mike Wilde Mathematics and Computer Science Division Argonne National Laboratory.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Welcome and Condor Project Overview.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
05/29/2002Flavia Donno, INFN-Pisa1 Packaging and distribution issues Flavia Donno, INFN-Pisa EDG/WP8 EDT/WP4 joint meeting, 29 May 2002.
Production Tools in ATLAS RWL Jones GridPP EB 24 th June 2003.
VDT 1 The Virtual Data Toolkit 7.th EU DataGrid Internal Project Conference Heidelberg / Germany Todd Tannenbaum (Miron Livny) (Alain.
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
The GriPhyN Planning Process All-Hands Meeting ISI 15 October 2001.
Alain Roy Computer Sciences Department University of Wisconsin-Madison Packaging & Testing: NMI & VDT.
Testing Grid Software on the Grid Steven Newhouse Deputy Director.
Alain Roy Computer Sciences Department University of Wisconsin-Madison Condor & Middleware: NMI & VDT.
Miron Livny Computer Sciences Department University of Wisconsin-Madison The Role of Scientific Middleware in the Future of HEP Computing.
Virtual Data Management for CMS Simulation Production A GriPhyN Prototype.
Yannick Patois - Datagrid Software Repository Presentation - March, n° 1 Datagrid Software Repository Presentation CVS, packages and automatic.
GriPhyN Project Paul Avery, University of Florida, Ian Foster, University of Chicago NSF Grant ITR Research Objectives Significant Results Approach.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Peter Couvares Computer Sciences Department University of Wisconsin-Madison Condor DAGMan: Introduction &
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
EGEE is a project funded by the European Union under contract IST Installation and configuration of gLite services Robert Harakaly, CERN,
A closer look at the VDT RPMs Alain Roy OSG Software Coordinator.
The Great Migration: From Pacman to RPMs Alain Roy OSG Software Coordinator.
OSG Status and Rob Gardner University of Chicago US ATLAS Tier2 Meeting Harvard University, August 17-18, 2006.
OSG Facility Miron Livny OSG Facility Coordinator and PI University of Wisconsin-Madison Open Science Grid Scientific Advisory Group Meeting June 12th.
U.S. ATLAS Grid Production Experience
TYPES OF SERVER. TYPES OF SERVER What is a server.
CMS report from FNAL demo week Marco Verlato (INFN-Padova)
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Pegasus and Condor Gaurang Mehta, Ewa Deelman, Carl Kesselman, Karan Vahi Center For Grid Technologies USC/ISI.
Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002
Status of Grids for HEP and HENP
Presentation transcript:

VDT 1 The Virtual Data Toolkit Todd Tannenbaum (Alain Roy)

VDT 2 What is the VDT? A packaging of software –Grid software (Globus, Condor-G…) –Virtual data software (Chimera) –Utilities An easy installation mechanism Testing and hardening Support

VDT 3 Who makes the VDT? Grid Physics Network (GriPhyN) –Constructs the VDT International Virtual Data Grid Laboratory (IVDGL) –Testing and hardening Very tight collaboration between GriPhyN and IVDGL

VDT 4 Who makes the VDT? (2) Core VDT Team: –Miron Livny: The boss –Alain Roy –Carey Kireyev VDT Testing –Xin Zhao –Brian Moe Pacman –Saul Youssef

VDT 5 Who uses the VDT? GriPhyN collaborators –USCMS:In use today –USAtlas:In use today –LIGO:Will use soon –SDSS:Will use soon European Data Grid –Uses subset of software –Uses just RPMs LCG

VDT 6 What exactly is in VDT? VDT 1.1.8: –Globus advisories + patches –Condor & Condor-G –Chimera/Pegasus –RLS –GLUE Schema –CA Certificates –Fault Tolerant Shell –EDG’s Make Gridmap –EDG’s CRL Update –ClassAds –Netlogger

VDT 7 What exactly is in VDT? VDT 1.1.8: –Globus advisories + patches –Condor & Condor-G –Chimera/Pegasus –RLS –GLUE Schema –CA Certificates –Fault Tolerant Shell –EDG’s Make Gridmap –EDG’s CRL Update –ClassAds –Netlogger

VDT 8 Grid Software Installation Typical Grid Software Installation Experience… VDT Installation Experience!

VDT 9 VDT Installation 2 Methods –Pacman –RPM

VDT 10 Pacman Installation Goal: –Type a single command –Everything downloads –Everything installs –Everything is configured –No questions asked We’re close: –A few questions if you’re root –Basic configuration, may need changing

VDT 11 Pacman Installation (2) Download Pacman – Install VDT –cd –pacman -get VDT-Server –pacman -get VDT-Client –ls condor/ globus/ post-install/ setup.sh edg/ gpt/ replica/ vdt/ ftsh/ perl/ setup.csh vdt-install.log Use

VDT 12 Pacman post-installation Post-install directory: –Notes on configuration choices made –Instructions for editing configuration Configuration scripts: –Globus configuration –Condor configuration

VDT 13 RPM Installation Subset of whole VDT –Globus –Condor-G Nice RPMs: –We repackage Globus –A dozen Globus RPMs, not hundreds No configuration No post-installation help

VDT 14 Testing VDT team is building test suite Interaction with LCG testing group Working with NMI* to leverage: –NMI test suite Stress testing Application testing (CMS pipeline) –NMI test infrastructure *NMI = NSF Middleware Initiative –

VDT 15 Support Send us questions or problems –We will solve them if we can –We will interact with the developers, if necessary

VDT 16 Interaction with EDG EDG gets Globus and Condor-G RPMs from VDT We do what we can to solve problems and get changes to Globus and Condor We want to make a great package for you

VDT 17 What exactly is in VDT? VDT 1.1.8: –Globus advisories + patches –Condor & Condor-G –Chimera/Pegasus –RLS –GLUE Schema –CA Certificates –Fault Tolerant Shell –EDG’s Make Gridmap –EDG’s CRL Update –ClassAds –Netlogger

VDT 18 Chimera Virtual Data System Much scientific data is not obtained from measurements but rather derived from other data by the application of computational procedures Chimera catalog can be used by application environments to describe a set of application programs ("transformations"), and then track all the data files produced by executing those applications ("derivations"). Chimera contains the mechanism to locate the "recipe" to produce a given logical file, in the form of an abstract program execution graph. These abstract graphs are then turned into and executable DAG for the Condor-G DAGMan meta-scheduler by the bundled Pegasus planner. Enables on-demand execution of computation schedules constructed from database queries.

VDT 19 NetLogger “Networked Application Logger” API w/ calls you add to existing source code to generate time- stamped monitoring events (sent to a file, network server, syslogd, or RAM) Visualization Tools Storage and Retrieval Tools –Store all events into a database

VDT 20 Fault Tolerant Shell (FTSH) The Grid is a hard environment. FTSH –The ease of scripting with very precise error semantics. –Exception-like structure allows scripts to be both succinct and safe. –A focus on timed repetition simplifies the most common form of recovery in a distributed system. –A carefully-vetted set of language features limits the "surprises" that haunt system programmers.

VDT 21 Simple Bourne script… #!/bin/sh cd /work/foo rm –rf data cp -r /fresh/data. What if ‘/work/foo’ is unavailable??

VDT 22 Getting Grid Ready… #!/bin/sh for attempt in cd /work/foo if [ ! $? ] then echo "cd failed, trying again..." sleep 5 else break fi done if [ ! $? ] then echo "couldn't cd, giving up..." return 1 fi

VDT 23 Or with FTSH #!/usr/bin/ftsh try 5 times cd /work/foo rm -rf bar cp -r /fresh/data. end

VDT 24 Or with FTSH #!/usr/bin/ftsh try for 3 days or 100 times cd /work/foo rm -rf bar cp -r /fresh/data. end

VDT 25 Or with FTSH #!/usr/bin/ftsh try for 3 days every 1 hour cd /work/foo rm -rf bar cp -r /fresh/data. end

VDT 26 Or with FTSH #!/usr/bin/ftsh try for 3 days every 1 hour cd /work/foo rm -rf bar cp -r /fresh/data. end

VDT 27 Or with FTSH #!/usr/bin/ftsh try for 3 days every 1 hour cd /work/foo rm -rf bar cp -r /fresh/data. end

VDT 28 Or with FTSH hosts="mirror1.wisc.edu mirror2.wisc.edu mirror3.wisc.edu" forany h in ${hosts} echo "Attempting host ${host}" wget end echo "Got file from ${h}"

VDT 29 FTSH All the usual constructs –Redirection, loops, conditionals, functions, expressions, nesting, … And more –Logging –Timeouts –Process Cancellation –Complete parsing at startup –File cleanup Used on Linux, Solaris, Irix, Cygwin, … Simplify your life!

VDT 30 VDT’s Future Additional Software –MyProxy, Java ClassAds Access to new versions –Globus 3.0 Extra VDT to help early adopters Condor-G will submit to GT2 or GT3 Helping You –What can we do to make life easier for you?

VDT 31 Where do you learn more? Support: –Alain –Miron