Presentation is loading. Please wait.

Presentation is loading. Please wait.

Pegasus WMS Extends DAGMan to the grid world

Similar presentations


Presentation on theme: "Pegasus WMS Extends DAGMan to the grid world"— Presentation transcript:

1 Pegasus WMS Extends DAGMan to the grid world
User creates an abstract workflow Pegasus maps abstract workflow to executable workflow DAGMan runs executable workflow Doesn’t need full Condor (schedd only)

2 Generating mosaics of the sky (Bruce Berriman, Caltech)
Size of the mosaic in degrees square* Number of jobs Number of input data files Number of intermediate files Total data footprint Approx. execution time (20 procs) 1 232 53 588 1.2GB 40 mins 2 1,444 212 3,906 5.5GB 49 mins 4 4,856 747 13,061 20GB 1hr 46 mins 6 8,586 22,850 38GB 2 hrs. 14 mins 10 20,652 3,722 54,434 97GB 6 hours Kent – fixed small typo *The full moon is 0.5 deg. sq. when viewed form Earth, Full Sky is ~ 400,000 deg. sq. 2

3 Abstract Workflow (DAX)
Pegasus workflow description—DAX Workflow “high-level language” Devoid of resource descriptions Devoid of data locations Refers to codes as logical transformations Refers to data as logical files

4 Basic Workflow Mapping
Select where to run the computations Change task nodes into nodes with executable descriptions Select which data to access Add stage-in and stage-out nodes to move data Add nodes that register the newly-created data products Add nodes to create an execution directory on a remote site Write out the workflow in a form understandable by a workflow engine Include provenance capture steps

5 Pegasus Workflow Mapping
1 4 Original workflow: 15 compute nodes devoid of resource assignment 5 8 9 4 10 12 13 15 8 3 Resulting workflow mapped onto 3 Grid sites: 11 compute nodes (4 reduced based on available intermediate data) 13 data stage-in nodes 8 inter-site data transfers 14 data stage-out nodes to long- term storage 14 data registration nodes (data cataloging) 7 9 12 10 15 13 5

6 Pegasus WMS Workflow Description in XML (DAX) Properties
Replica Catalog Pegasus Workflow Mapper Site Catalog Transformation Catalog TeraGrid Open Science Grid Campus resources Local machine Condor DAGMan Condor Schedd Submit Host Pegasus WMS restructures and optimizes the workflow, provides reliability 6

7 Pegasus links Pegasus home page: pegasus.isi.edu
Tutorial materials available at: For more questions:


Download ppt "Pegasus WMS Extends DAGMan to the grid world"

Similar presentations


Ads by Google