Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n Gromacs & Plantisc Searching for uses cases, improvements and realization of portlets
Team Marco Rinaldi Livio Saluci Alessandro Brischetto Valeria Di Mauro Rossella Biondi Anastasia Scandurra
Outline Gromacs Searching for use cases to facilitate demo on the Science Gateway Improvements of the existing portlet on Science Gateway Plantisc Creation of the portlet for the Science Gateway
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n GROMACS GROningen MAchine for Chemical Simulations
Application description GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles. It is primarily designed for biochemical molecules like proteins, lipids and nucleic acids that have a lot of complicated bonded interactions, but since GROMACS is extremely fast at calculating the non-bonded interactions (that usually dominate simulations) many groups are also using it for research on non-biological systems, e.g. polymers.
Use cases We tried a lot of use cases to find the most interesting for a portlet example execution
Integration strategy mdrun is the main command of the AutoDock portfolio. It begins the basic molecular dynamics calculus after all the other parameters have been set. GRID infrastructure is perfect for the execution of a mdrun command line:.TPR FILE SG GRID.EDR FILE Input file generated by user with GROMACS that contains a description of the system Execute the mdrun, a time consuming operation Output file that can be used to generate an xvg (plottable file)
Orchestration of SG comp. + ext. components We improved the SG portlet with a new pilot script capable to automatically generate also the xvg and a jpg plot of the output result to do the jpg plot we integrated octave SG GRID.EDR FILE.XVG.JPG PLOT TYPE.TPR FILE
Orchestration of SG comp. + ext. components.TPR.EDR mdrun g_energy OCTAVE SCIENCE GATEWAY.XVG PLOT TYPE.JPG
Execution workflow Example of TPR generation: Download the 1AKI.pdb from pdb2gmx -f 1AKI.pdb -o 1AKI_processed.gro -water spce editconf -f 1AKI_processed.gro -o 1AKI_newbox.gro -c -d 1.0 -bt cubic genbox -cp 1AKI_newbox.gro -cs spc216.gro -o 1AKI_solv.gro -p topol.top tutorials/lysozyme/Files/ions.mdp tutorials/lysozyme/Files/ions.mdp grompp -f ions.mdp -c 1AKI_solv.gro -p topol.top -o ions.tpr genion -s ions.tpr -o 1AKI_solv_ions.gro -p topol.top -pname NA - nname CL -nn 8 tutorials/lysozyme/Files/minim.mdp tutorials/lysozyme/Files/minim.mdp grompp -f minim.mdp -c 1AKI_solv_ions.gro -p topol.top -o em.tpr
Execution workflow Example of TPR upload:.TPR FILE PLOT TYPE
Execution workflow Example of Gromacs mdrun result:.EDR.XVG.JPG energy minimization
Execution workflow Other TPR elaborations results available from our use cases: densitypressuretemperature
Remaining problems Necessary portlet modifications: Add into UI the possibility to select the plot type (integer number) and save this information into a g_energy_input.txt file that is used by the new pilot script Possible portlet improvements: The generation of the TPR file must be done by the user, so he must have GROMACS installed on his machine. Add the possibility to build a TPR file inserting all the possible parameters directly on the portlet.
Summary and conclusion The creation of TRP files it’s a critical part of the GROMACS workflow. We created some TPR sample files to help people to try the GROMACS portlet and verify the functionalities ( stin/gmx-tutorials/) stin/gmx-tutorials/ We suggest the implementation of TPR creation into the portlet, although it’s not easy because exists a large variety of parameters combinations.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n PLANTICS Plant TIssue Culture Simulation
Application description Plant Tissue culture is a method for plant propagation under in vitro conditions Different types and parts of plants (known as explants) may be cultivated in vitro. These may be organs (roots, stems, shoot tips, leaves and fruits); tissues; cells (suspension cultures) and special tissues and organs such as embryos, canthers, pollen and protoplasts Plant tissue culture is time and material intensive, running into several months of laboratory efforts in trying to build hormonal combinations that will be best for mass propagation of a particular species
Application description Potentially, modeling or computer simulation can provide a useful method for gaining insight into these complex processes by reducing the time needed to screen numerous hormonal combinations. Plantisc is a simulation application based on multiple regression models deployed on a grid computing infrastructure The application simulates the plant tissue culture experiments and predicts the amount and combinations of auxins and cytokinins needed to yield optimal growth of propagules The results obtained from the simulation show over 67% prediction accuracy as compared to the laboratory experiments
Integration strategy Plantics is written in Pyton, using libraries like NumPy and SciPy. The GRID infrastructure is perfect for the execution of a time consuming computation.
Orchestration of SG comp. + ext. comp. Single job using parameters selected from the portlet web interface INPUT PARAMS SG GRID OUTPUT FILE Execute the plantics computation ??
Orchestration of SG comp. + ext. comp. Multi job using files uploaded by the user we used the parametric parallel because different inputs can be executed with the same PLANTICS software and final ouputs can be compared to find the best combination of auxins and cytokinins INPUT FILE SG GRID OUTPUT FILE Execute the plantics computation ?? INPUT FILE OUTPUT FILE
Execution workflow Single job
Execution workflow Multi-job
Remaining problems We created the portlet UI and managed the form input commands Next steps: Install Python on the SG machines Install Plantics on the SG machines Run Plantics from the portlet
Summary and conclusion The Plantics portlet will be completed when the command line execution instruction will be available. The Plantics portlet, divided into two execution mode, will be very useful to do a single test o compare a large number of tests.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n Questions ?