EDAGRID TUTORIAL
Dr. Tuan-Anh Nguyen, Ho Chi Minh City University of Technology
Prof. Pierre Kuonen, University of Applied Sciences of Fribourg, Switzerland

Overview
Part one: EDAGrid middleware
- How to use the EDAGrid middleware to submit jobs
- How to describe a job flow
Part two: EDAGrid programming
- Writing a simple Grid program with POP-C++
- Executing the program on EDAGrid
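
The Grid environment
[Figure: the Grid environment, shown as a "?" surrounded by heterogeneous resources: two supercomputers, a PC, a workstation, an SMP machine, a cluster and "My laptop"]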

EDAGrid infrastructure: Objectives
- Middleware for a computing Grid
- Execute parallel applications transparently, with QoS requirements on the resources
Use case
- User -> EDAGrid middleware: “Execute my app on 4 processors with at least 500 MFlops and a 1 GB/s network”

Login
- User name: acomp2007
- Password: guess
Create a subdirectory and change into it
- mkdir
- cd
Server information
- Grid FTP server: gsiftp://serverkstn.cse.hcmut.edu.vn:2811
- Your home: /home/acomp2007

Describe jobs
XML-based job descriptions
- Resource characteristics
- Application characteristics
- Staging description
- ...
A single job description
- Number of CPUs
- How many CPUs per node
- Linpack benchmark
- Network bandwidth
- ...
- Where the executables are
- Arguments
- Compatible platforms
- How to stage data in/out
- ...

Describe a simple job
- Edit job.xml (the XML markup is not preserved; the surviving element values are 1, /bin/sleep and 10)
- Submit the job: eda_job_submit -w job.xml

Resource requirement
Add resource requirements to the job description
- 100 MFlops, but at least 60 MFlops is also acceptable
- Physical memory of at least 256 MB
- Job (XML markup not preserved): /bin/sleep 10
Other resource attributes listed
- NetworkBandwidth (MBit/s)
- IndividualCPUFree5 (0...1)
- IndividualCPUFree10 (0...1)
- IndividualCPUFree15 (0...1)
- IndividualCPUCount
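One way to read the requirement above is as a preferred value (100 MFlops) plus hard minimums (60 MFlops, 256 MB of memory). The sketch below only illustrates that reading. check_node is a hypothetical helper, not an EDAGrid command, and the matching actually performed by the middleware may differ.

```shell
#!/bin/sh
# Hypothetical helper (not part of EDAGrid): classify a node against the
# requirements on the slide. Arguments: <mflops> <memory_mb>, both integers.
check_node() {
    mflops=$1
    mem_mb=$2
    if [ "$mem_mb" -lt 256 ] || [ "$mflops" -lt 60 ]; then
        echo "rejected"      # below one of the hard minimums
    elif [ "$mflops" -ge 100 ]; then
        echo "preferred"     # meets the requested 100 MFlops
    else
        echo "acceptable"    # between 60 and 100 MFlops
    fi
}

check_node 120 512
check_node 80 256
check_node 80 128
```

The three calls print preferred, acceptable and rejected, in that order.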

Stage data in/out
- Grid FTP server: gsiftp://serverkstn.cse.hcmut.edu.vn:2811
Sort data
- Stage in: gsiftp://serverkstn.cse.hcmut.edu.vn:2811/PATH_TO_YOUR_FILE -> file:///${GLOBUS_USER_HOME}/data
- Run: /bin/sort ${GLOBUS_USER_HOME}/data --output=${GLOBUS_USER_HOME}/result
- Stage out: file:///${GLOBUS_USER_HOME}/result -> gsiftp://serverkstn.cse.hcmut.edu.vn:2811/PATH_TO_YOUR_OUTPUT...
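Before submitting, the sort step of this job can be tried locally. The following is a minimal sketch under stated assumptions: the directory stage_demo stands in for ${GLOBUS_USER_HOME}, the sample data is invented, and the sort found on the PATH is assumed to accept --output (as GNU sort does). Staging in and out is done by the middleware and is not reproduced.

```shell
#!/bin/sh
# Local dry run of the sort step only. stage_demo is an assumed stand-in
# for ${GLOBUS_USER_HOME}; it is not created by EDAGrid.
demo_home=stage_demo
mkdir -p "$demo_home"

# Invented sample input standing in for the staged-in file.
printf 'pear\napple\nmango\n' > "$demo_home/data"

# Same form as the job's command line: sort <input> --output=<result>
sort "$demo_home/data" --output="$demo_home/result"

cat "$demo_home/result"
```

If the local result looks right, the same arguments can be carried over to the job description with ${GLOBUS_USER_HOME} in place of stage_demo.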

Describe a multijob workflow
[Figure: a workflow made of several jobs: Job 1, Job 2, Job 3, ...]

Parallel sort
[Figure: workflow: data -> split -> sort 1 and sort 2 -> merge -> result]
Note:
- To sort (all platforms): /bin/sort --output=
- To merge (all platforms): /bin/sort --merge --output=
- To split (only available on YOUR machine): /home/edagrid/demo/splittext
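The workflow above (split, two independent sorts, merge) can be rehearsed on one machine before it is described as a multijob. This is a sketch under assumptions: the directory psort_demo and the sample data are invented, and head/tail stand in for the splittext tool, whose interface is not given here.

```shell
#!/bin/sh
# Local sketch of the split -> sort 1 / sort 2 -> merge workflow.
# psort_demo and the sample data are assumptions made for illustration.
work=psort_demo
mkdir -p "$work"
printf '7\n3\n9\n1\n8\n2\n' > "$work/data"

# Split: head/tail stand in for the tutorial's splittext tool.
head -n 3 "$work/data" > "$work/part1"
tail -n +4 "$work/data" > "$work/part2"

# Sort 1 and sort 2: independent jobs, run here as background processes.
sort "$work/part1" --output="$work/sorted1" &
sort "$work/part2" --output="$work/sorted2" &
wait

# Merge the two sorted parts into the final result.
sort --merge "$work/sorted1" "$work/sorted2" --output="$work/result"

cat "$work/result"
```

On the Grid, each of the four steps would be its own job, with the two sort jobs free to run on different nodes because neither depends on the other.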