Interoperability Achieved by GADU in using multiple Grids. OSG, Teragrid and ANL Jazz Presented by: Dinanath Sulakhe Mathematics and Computer Science Division.

Slides:



Advertisements
Similar presentations
Generic MPI Job Submission by the P-GRADE Grid Portal Zoltán Farkas MTA SZTAKI.
Advertisements

Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Virtual Data and the Chimera System* Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer Science.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
CMS Applications Towards Requirements for Data Processing and Analysis on the Open Science Grid Greg Graham FNAL CD/CMS for OSG Deployment 16-Dec-2004.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
Ian Foster Computation Institute Argonne National Lab & University of Chicago Education in the Science 2.0 Era.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
Workload Management Massimo Sgaravatto INFN Padova.
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
Grids and Globus at BNL Presented by John Scott Leita.
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
Managing Workflows with the Pegasus Workflow Management System
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
CONDOR DAGMan and Pegasus Selim Kalayci Florida International University 07/28/2009 Note: Slides are compiled from various TeraGrid Documentations.
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
GRAM: Software Provider Forum Stuart Martin Computational Institute, University of Chicago & Argonne National Lab TeraGrid 2007 Madison, WI.
Job Submission Condor, Globus, Java CoG Kit Young Suk Moon.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Grid Computing I CONDOR.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
G RID M IDDLEWARE AND S ECURITY Suchandra Thapa Computation Institute University of Chicago.
Pegasus-a framework for planning for execution in grids Ewa Deelman USC Information Sciences Institute.
CSIU Submission of BLAST jobs via the Galaxy Interface Rob Quick Open Science Grid – Operations Area Coordinator Indiana University.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Grid Workload Management Massimo Sgaravatto INFN Padova.
Evolution of the Open Science Grid Authentication Model Kevin Hill Fermilab OSG Security Team.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Grid Compute Resources and Job Management. 2 Local Resource Managers (LRM)‏ Compute resources have a local resource manager (LRM) that controls:  Who.
Mar 28, 20071/18 The OSG Resource Selection Service (ReSS) Gabriele Garzoglio OSG Resource Selection Service (ReSS) Don Petravick for Gabriele Garzoglio.
Turning science problems into HTC jobs Wednesday, July 29, 2011 Zach Miller Condor Team University of Wisconsin-Madison.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Using SWARM service to run a Grid based EST Sequence Assembly Karthik Narayan Primary Advisor : Dr. Geoffrey Fox 1.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
TeraGrid Advanced Scheduling Tools Warren Smith Texas Advanced Computing Center wsmith at tacc.utexas.edu.
VO-Ganglia Grid Simulator Catalin Dumitrescu, Mike Wilde, Ian Foster Computer Science Department The University of Chicago.
Pegasus: Running Large-Scale Scientific Workflows on the TeraGrid Ewa Deelman USC Information Sciences Institute
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
Review of Condor,SGE,LSF,PBS
GriPhyN Virtual Data System Grid Execution of Virtual Data Workflows Mike Wilde Argonne National Laboratory Mathematics and Computer Science Division.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
GLIDEINWMS - PARAG MHASHILKAR Department Meeting, August 07, 2013.
Pegasus-a framework for planning for execution in grids Karan Vahi USC Information Sciences Institute May 5 th, 2004.
VO Privilege Activity. The VO Privilege Project develops and implements fine-grained authorization to grid- enabled resources and services Started Spring.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Eileen Berman. Condor in the Fermilab Grid FacilitiesApril 30, 2008  Fermi National Accelerator Laboratory is a high energy physics laboratory outside.
GADU: A System for High-throughput Analysis of Genomes using Heterogeneous Grid Resources. Mathematics and Computer Science Division Argonne National Laboratory.
Grid Compute Resources and Job Management. 2 Grid middleware - “glues” all pieces together Offers services that couple users with remote resources through.
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
Managing LIGO Workflows on OSG with Pegasus Karan Vahi USC Information Sciences Institute
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Open Science Grid Consortium Storage on Open Science Grid Placing, Using and Retrieving Data on OSG Resources Abhishek Singh Rana OSG Users Meeting July.
Job submission overview Marco Mambelli – August OSG Summer Workshop TTU - Lubbock, TX THE UNIVERSITY OF CHICAGO.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Workload Management Workpackage
Pegasus and Condor Gaurang Mehta, Ewa Deelman, Carl Kesselman, Karan Vahi Center For Grid Technologies USC/ISI.
Wide Area Workload Management Work Package DATAGRID project
Mats Rynge USC Information Sciences Institute
Presentation transcript:

Interoperability Achieved by GADU in using multiple Grids. OSG, Teragrid and ANL Jazz Presented by: Dinanath Sulakhe Mathematics and Computer Science Division Argonne National Laboratory Computational Institute University of Chicago

GADU Applications… Its all about Comparative analysis Insights of Biology are gained by Comparative Analysis: Unknown genes are compared against known. Unknown genes are compared against known. Similar genes tend to perform same functions. Similar genes tend to perform same functions. Comparative analysis to know what is same and different between two strains of an Organism: Example: What is different a organism living Boiling temperature such as 108 deg Celsius and the one living in extreme freezing conditions. Example: What is different a organism living Boiling temperature such as 108 deg Celsius and the one living in extreme freezing conditions. Difference between Pathogenic and non-pathogenic organisms. Mycobecterium Tuberculosis is a Pathogen causing TB, is only 12 genes different from the non-pathogenic BCG used as vaccine against TB. Mycobecterium Tuberculosis is a Pathogen causing TB, is only 12 genes different from the non-pathogenic BCG used as vaccine against TB. Tools BLAST, Blocks, Chisel, Interpro etc.. An embarrassingly parallel workload. An embarrassingly parallel workload.

GADU’s evolution.. GADU Just evolved into what it is today. Chiba City at Argonne. Jazz Cluster at Argonne. Grid2003 to OSG Teragrid All of them togeather.

Some Results and Highlights GADU can successfully use OSG and Teragrid resources simultaneously. GADU can successfully use OSG and Teragrid resources simultaneously. Individual clusters such as ANL Jazz is also used parallely. Individual clusters such as ANL Jazz is also used parallely. Site selection and scheduling across multiple grids. Easily add a new site into the pool of sites. StatusSite NameSite TestMaxNodesGridcat ASGC_OSG18199Pass FNAL_FERMIGRID12 Pass FNAL_GPFARM266749Pass GRASE-CCR-U Pass NebraskaFAIL_TIMEOUT252Pass OSG_LIGO_PSU28312Pass Purdue-ITaP131224Pass Purdue-Physics1463Pass STAR-BNLFAIL_TIMEOUT672Pass UFlorida-PG279268Pass UMATLASFAIL_TIMEOUT771Pass UTA_DPCC18154Inactive UWMadisonCMSFAIL_TIMEOUT90Pass grow-UNI-PFAIL_TIMEOUT17Pass TG_UC44316NONE TG_NCSA551000NONE TG_PURDUEFAIL_FTP1024NONE Last Run.. ( Last week) Ran BLAST Jobs 70% OSG 30% Teragrid

Grid Resources.. Open Science Grid and Teragrid. Authentication. OSG OSG OSG : GADU VOMS Server. DOE Grid Certificates are automatically picked by the Sites. TeraGrid TeraGrid Individual Accounts via Allocations. Manually adding DOE Grid certificates to each site. (gx-map). Application Deployment. OSG OSG OSG variables, $OSG_APP and $OSG_DATA is used to install GADU’s applications and pre-stage the databases such as NR. TeraGrid TeraGrid GADU has a Community space on each of the sites available. Applications and installed within this community space.

Resource Independent GADU. GADU uses Pegasus based VDS and Condor-G Globus GRAM Interface Pegasus DAGMan Condor-G tc.data Pool.config Abstract Workflow as VDL Condor Submit files Submit Host WN Job management system Gatekeeper JobManager WN Remote Resources WN Job management system Gatekeeper JobManager WN Job management system Gatekeeper JobManager WN Information Services GADU’s automated Analysis Server, expressing, executing and tracking the scientific workflows on Grid. Database Controller Query Interface

Resource Independent GADU. GADU uses Pegasus based VDS and Condor-G The Workflow Generator in GADU is responsible for producing a workflow suitable for execution in the Grid environment. This task is accomplished through the use of the “virtual data language” (VDL). Once the VDL for the workflow is written, VDS converts it into condor submit files and a DAG that can be submitted to the site selected by the site selector. TR FileBreaker(input filename, none nodes, output sequences[], none species) { argument = ${species}; argument = ${filename}; argument = ${nodes}; profile globus.maxwalltime = "300"; } TR BLAST( none OutPre, none evalue, input query[], none type ) { argument = ${OutPre}; argument = ${evalue}; profile globus.maxwalltime = "300"; } DV jobNo_1_1separator->FileBreaker ], species="Aeropyrum_Pernix" ) …. VDL for BLAST workflow

Resource Independent GADU. 4 Million sequences Fig. Example of a Dag representing the workflow. ATGCATGCA 1000 sequences ATGCATGCA

Resource Independent GADU. Representing a Site and the applications on it.. #SITE Transformation PFNTYPE ANL_Jazz BLAST /soft/apps/BLAST/bin/blastallnull ANL_Jazz Blocks /soft/apps/run-Blocks.plnull ANL_Jazz Chisel /soft/apps/chisel/runChisel.plnull ANL_Jazz IPRSCAN /soft/apps/iprscan_wrapper.pl null ANL_Jazz globus-url-copy /soft/apps/packages/globus-2.2.4/bin/globus-url-copy GLOBUS_LOCATION=/soft/apps/packages/globus-2.2.4/; LD_LIBRARY_PATH=/soft/apps/packages/globus-2.2.4/lib; PATH=/soft/apps/packages/globus-2.2.4/bin pool ANL_Jazz { lrc "rls://gnare.mcs.anl.gov“ gridftp "gsiftp:// jmayor1.lcrc.anl.gov:2812/soft/apps/gadu" gridlaunch "/soft/apps/gadu/bin/kickstart" workdir "/soft/apps/gadu/vdldata" universe vanilla "jmayor1.lcrc.anl.gov:2121/jobmanager-pbs" universe globus "jmayor1.lcrc.anl.gov:2121/jobmanager-pbs" universe transfer " jmayor1.lcrc.anl.gov:2812/jobmanager-fork" } …. pool.config tc.data

Resource Independent GADU. GADU uses Pegasus based VDS and Condor-G Globus GRAM Interface Pegasus DAGMan Condor-G tc.data Pool.config Abstract Workflow as VDL Condor Submit files Submit Host WN Job management system Gatekeeper JobManager WN Remote Resources WN Job management system Gatekeeper JobManager WN Job management system Gatekeeper JobManager WN Information Services GADU’s automated Analysis Server, expressing, executing and tracking the scientific workflows on Grid. Database Controller Query Interface

Requirements... Information Services. VDS like System can to provide an Architecture independent mechanism to use different sites (Grids) Information Services at various levels Authentication – To check if the certs are valid at this site. Architecture – Is it an ia-32 cluster or an ia-64 ? Gatekeeper, GridFtp Server. Environment Variables – $OSG_APP, $TG_COMMUNITY Number of CPUs Number of Used CPUs. Number of Idle CPUs. VO (user) specific jobs running at a given site. VO (user) specific jobs sitting in QUEUE at a given site (why?) We a need standards and protocols for these Information Services and identify more information variables that needs to published by the Grids. Gridcat or MDS or something else. Currently GADU uses GridCat to collect site specific information for OSG and manually adds information for TeraGrid and Jazz. We are working on an MDS based information interface on TeraGrid. In order to automatically add a new Grid site, we need information about the site:

Another Big Challenge.. Site Selection. GADU has access to 60 OSG Sites and 5 TeraGrid Sites. One challenge in using the Grid reliably for high-throughput analysis is monitoring the state of all Grid sites and how well they have performed for job requests from a given submit host. We view a site as “available” if our submit host can communicate with it, if it is responding to Globus job-submission commands, and if it will run our jobs promptly, with minimal queuing delays GADU Server OSG

Another Big Challenge.. Site Selection. GADU has access to 60 OSG Sites and 5 TeraGrid Sites. Web Interface to Control the Selection of Sites for GADU: Web Interface showing live status of usage: Grid may not worry about this…

Next Steps.. Working with Teragrid Information Services group – MDS based interface. Continue to improve GADU’s implementation of Site Selection. Trying to generalize Site Selection using the Information Services such as MDS and Gridcat. Continue to deploy faster scientific applications for the Bioinformatics Group at Argonne.

Bioinformatics Group: Natalia Maltsev, PI Alex Rodriguez Elizabeth Glass Mark D’ Souza Mustafa Syed Yi Zhang Globus and VDS Mike Wilde Nika Nefedova Jens Voeckler Ian Foster Rick Stevens VDT Support. Condor Support. Systems at MCS. Acknowledgements Open Science Grid Thanks to Ruth Pordes and OSG team for their wonderful support TeraGrid Charlie Catlett Special thanks to David O’Neal, Joeseph Insley, and Sergiu Sanielevici