Presentation is loading. Please wait.

Presentation is loading. Please wait.

Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra.

Similar presentations


Presentation on theme: "Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra."— Presentation transcript:

1 Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra

2 2008-11-14 P - J - L 2 Outline Introduction The GrADS System The Migration Framework and Self Adaptivity Experiments and Results Conclusion

3 2008-11-14 P - J - L 3 Introduction  The optimization was achieved by hand-tuning the software system to fit the needs of the computing environment.  The hand-tuning process was not portable across different computing environments.  Hand customization does not take into account the run-time load dynamics of the system and the input parameters of the application.  build self-adaptive software system

4 2008-11-14 P - J - L 4 Introduction (Cont.)  In our framework, the migration of applications depends on  the amount of increase or decrease in loads on the resources,  the time of the application execution when load is introduced into the system,  the performance benefits that can be obtained for the application due to migration.  The framework has been implemented and tested in the GrADS system.

5 2008-11-14 P - J - L 5 The GrADS System

6 2008-11-14 P - J - L 6 Interactions in Migration framework

7 2008-11-14 P - J - L 7 The Migration Framework and Self Adaptivity  Migrator  A user-level checkpointing library called SRS (Stop Restart Software) is used to provide migration capability to the end application.  The SRS library is implemented on top of MPI and hence can be used only with MPI based parallel programs.  The SRS library consists of 6 main functions: SRS Init(), SRS Finish() –after MPI Init() and before MPI Finalize() SRS Restart Value() –the application in the start mode and certain other parts in the restart mode

8 2008-11-14 P - J - L 8 The Migration Framework and Self Adaptivity (Cont.)  Migrator  The SRS library consists of 6 main functions: SRS Check Stop() –at different phases of the application to check if an external component wants the application to be stopped SRS Register() –to register the variables that will be checkpointed by the SRS library SRS Read() –reads in the checkpointed data in the restart mode –The user, through SRS Read(), also specifies the previous and current data distributions.

9 2008-11-14 P - J - L 9 The Migration Framework and Self Adaptivity (Cont.)  Contract Monitor  The Contract Monitor is a component that uses the Autopilot infrastructure to monitor the progress of applications in GrADS. Autopilot is a real-time adaptive control infrastructure  The contract monitor compares the actual execution times with the predicted execution times. large differences between the actual and the predicted performance of the end application –contacts the rescheduler and requests it to migrate the application

10 2008-11-14 P - J - L 10 The Migration Framework and Self Adaptivity (Cont.)  Rescheduler  migration on request contract monitor contacts the rescheduler requesting it to migrate the application  opportunistic migration the rescheduler periodically queries the GrADS Information Repository (GIR) for recently completed applications rescheduler determines if performance benefits can be obtained for an currently executing application by migrating it to use the resources

11 2008-11-14 P - J - L 11 The Migration Framework and Self Adaptivity (Cont.)  In both case, the rescheduler first contacts the Network Weather Service (NWS) to obtain the updated information for the machines in the Grid. ret_new : it were to execute on the machines in the new schedule ret_current : it were to continue executing on the original set of machines If rescheduling gain > 30% The rescheduler sends STOP signal to the AP, stores the stop status in GIR. Application manager then waits for the RESUME signal. The rescheduler stores the RESUME value in the GIR. If rescheduling gain < 30%, in the migration on request mode the rescheduler contacts the contract monitor prompting the contract monitor to adjust its tolerance limits. The worst case time

12 2008-11-14 P - J - L 12 Experiments and Results 4 8  In all the experiments

13 2008-11-14 P - J - L 13 Problem Sizes and Migration

14 2008-11-14 P - J - L 14 Load Amount and Migration

15 2008-11-14 P - J - L 15 Load Introduction Time and Migration

16 2008-11-14 P - J - L 16 Opportunistic Migration

17 2008-11-14 P - J - L 17 Conclusion  Many existing migration systems that migrate applications under resource load conditions implement simple policies that cannot be applied to Grid systems.  We have implemented a migration framework that takes into account both the system load and application characteristics.  We have also implemented a framework that opportunistically migrates executing applications to make use of additional free resources.

18


Download ppt "Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra."

Similar presentations


Ads by Google