EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Docking and molecular dynamics – complete.

Slides:



Advertisements
Similar presentations
Development of test suites for the certification of EGEE-II Grid middleware Task 2: The development of testing procedures focused on special details of.
Advertisements

P-GRADE and WS-PGRADE portals supporting desktop grids and clouds Peter Kacsuk MTA SZTAKI
Workshop on HPC in India Grid Middleware for High Performance Computing Sathish Vadhiyar Grid Applications Research Lab (GARL) Supercomputer Education.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks World-wide in silico drug discovery against.
INFSO-RI Enabling Grids for E-sciencE WISDOM mini-workshop Vincent Breton (CNRS-IN2P3, LPC Clermont-Ferrand) ISGC 2007 March 28th,
EGEE-II INFSO-RI Enabling Grids for E-sciencE Supporting MPI Applications on EGEE Grids Zoltán Farkas MTA SZTAKI.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
Extending a molecular docking tool to run simulations on clouds Damjan Temelkovski Dr. Tamas Kiss Dr. Gabor Terstyanszky University of Westminster.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Support for Astrophysical applications in.
Using the WS-PGRADE Portal in the ProSim Project Protein Molecule Simulation on the Grid Tamas Kiss, Gabor Testyanszky, Noam.
KISTI’s Activities on the NA4 Biomed Cluster Soonwook Hwang, Sunil Ahn, Jincheol Kim, Namgyu Kim and Sehoon Lee KISTI e-Science Division.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Building Grid-enabled Virtual Screening Service.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Application Case Study: Distributed.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks “High throughput” protein structure prediction.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Combining the strengths of UMIST and The Victoria University of Manchester Utility Driven Adaptive Workflow Execution Kevin Lee School of Computer Science,
BIOINFOGRID: Bioinformatics Grid Application for Life Science Giorgio Maggi INFN and Politecnico di Bari
Protein Molecule Simulation on the Grid G-USE in ProSim Project Tamas Kiss Joint EGGE and EDGeS Summer School.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
The EDGeS project receives Community research funding 1 SG-DG Bridges Zoltán Farkas, MTA SZTAKI.
INFSO-RI Enabling Grids for E-sciencE Supporting legacy code applications on EGEE VOs by GEMLCA and the P-GRADE portal P. Kacsuk*,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Provenance Challenge gLite Job Provenance.
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
INFSO-RI Enabling Grids for E-sciencE In silico docking on EGEE infrastructure, the case of WISDOM Nicolas Jacq LPC of Clermont-Ferrand,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Performance Improvements to BDII - Grid Information.
Enabling Grids for E- sciencE EGEE and gLite are registered trademarks EGEE-III INFSO-RI Analysis of Overhead and waiting times.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
EGEE-II INFSO-RI Enabling Grids for E-sciencE WISDOM in EGEE-2, biomed meeting, 2006/04/28 WISDOM : Grid-enabled Virtual High Throughput.
INFSO-RI Enabling Grids for E-sciencE Grid-enabled drug discovery to address neglected diseases N. Jacq – CNRS-IN2P3 EGAAP meeting.
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status report on Application porting at SZTAKI.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE User Forum, Manchester, 10 May ‘07 Nicola Venuti
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Design of an Expert System for Enhancing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks INFSO-RI Enabling Grids for E-sciencE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Services for advanced workflow programming.
INFSO-RI Enabling Grids for E-sciencE EGEE Review WISDOM demonstration Vincent Bloch, Vincent Breton, Matteo Diarena, Jean Salzemann.
Enabling Grids for E-sciencE INFSO-RI Institute of mathematical problems of biology RAS Expertise in mathematical modeling and experimental data.
INFSO-RI Enabling Grids for E-sciencE CHARON System Jan Kmuníček, Petr Kulhánek, Martin Petřek CESNET, Czech Republic.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Progress on first user scenarios Stephen.
1 DIRAC Job submission A.Tsaregorodtsev, CPPM, Marseille LHCb-ATLAS GANGA Workshop, 21 April 2004.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMPROXY usage Álvaro Fernández IFIC (CSIC)
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
INFSO-RI Enabling Grids for E-sciencE Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update Authorization Service Christoph Witzig,
Development of test suites for the certification of EGEE-II Grid middleware Task 2: The development of testing procedures focused on special details of.
EGEE-II INFSO-RI Enabling Grids for E-sciencE P-GRADE overview and introduction: workflows & parameter sweeps (Advanced features)
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The LCG interface Stefano BAGNASCO INFN Torino.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE III User Forum – Clermont Ferrand Analysis of Metagenomes on the EGEE Grid Gabriel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite – UNICORE interoperability Daniel Mallmann.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
Introduction to Computing Element HsiKai Wang Academia Sinica Grid Computing Center, Taiwan.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Work Load Management & Simple Job Submission Practical Shu-Ting Liao APROC, ASGC EGEE Tutorial.
2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
SEE-GRID-SCI WRF-ARW model: Grid usage The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures.
SCAI Activities in the - GRID-field - Molecular Modelling field
WISDOM-II, status of preparation
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Docking and molecular dynamics – complete 16 stage workflow with gLite WMS Astrid Maaß, Jisamma Kallumadikal Fraunhofer Institute SCAI, St. Augustin, Germany

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 2 plasmepsin Application Background ZINC compound db docking Wide in Silico Docking on Malaria Further analysis Targets P.f. PLM P.f. GST P.f. DHFR P.f. Tub P.v. DHFR Plasmodium falciparum in human red blood cells plm hits 5000

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 3 Motivation Docking information –binding mode  drug design –Estimate of inhibitory potential Pro: –reduce number of candidate molecules rapidly! Contra: –results may be inaccurate in detail For promising candidates: –combine docking with more realistic scoring function: All-in-one workflow = FlexX + Amber –gain accuracy, maintain all information!

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 4 rmsd ≤ 1.5Å Reference Data Trypsin (20)Avidin (16)HIV-Pr (15) after Docking 0.86 / 1.88 Å0.62 / 2.19 Å0.76 / 5.91 Å after MD0.90 / 1.81 Å0.95 / 1.00 Å0.88 / 2.38 Å Quality of predictions: correlation with exp. affinities/ average rmsd (quality of placements) exp R≤1 calc

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 5 Docking-MD-Workflow: Purpose ΔG bind = ΔE = E PL –(E P + E L ) ΔE VDW + ΔE ELE + ΔE SOLV + Δn rot D. Huang & A. Caflisch: Efficient evaluation of binding free energy using continuum electrostatics solvation; J. Med. Chem., 2004, 47, Docking Set up system for MD (assign charges, atomtypes) Energy minimization Format conversion Clustering Energy minimization MD 1 MD2 Calc. complex energy Calc. Ligand energy Calc. Protein energy Calc. ΔG FlexX Amber

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 6 Preliminary Results Current status: –Test run completed successfully for avidin with default settings: R: 0.51 → 0.71, rmsd: 2.91Å → 2.51 Å –reranking in progress ZINC

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 7 Workflow The workflow consists of 16 stages, –where each stage depends on the previous stage –executes the following softwares  FlexX Commercial product Licence manager FLEXlm is used Limited access control  Amber 8 (Sander & Tleap + Elan)  Convert (inhouse development)  Cluster (inhouse development) FlexX Tleap Sander Convert Cluster …

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 8 Architecture Components Intermediate Job Meta-data management Software Applications Storage Element gLite WMS EGEE Infrastruture gLite CE Input XML gLite UI

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 9 Middleware gLite –gLite WMS provides the feature ‚DAG‘ Job  Procedures become nodes of the dag FlexX, Tleap, Sander, Convert, Cluster and Intermediate  Dependency feature The output of the previous job will go as input to the next job  Each procedure is responsible for upload and download of its outputs and inputs, from an appropriate storage element Distributed Data Management Available to access data irrespective of the global location of the sites where the jobs are running ComputingElement StorageElement

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 10 Workflow The ‚BIO‘ Workflow –Flexx writes (conformations) are unknown –Cluster reduces the no: of conformations  Output of each stage is unpredictable Dynamic distribution of conformations required Organisation of the huge quantity of produced data

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 11 Processing HOW IT WORKS (Grid)

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 12 Workflow The ‚All in One‘ Workflow –The intermediate job provides dynamic division of the jobs –Higher performance –Organisation of Data Goes through each Ligand Keeps a count of the generated conformation Divide this count with the nodes Packs the input Register the input on the Grid (LFC)

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 13 Workflow The ‚All in One‘ Workflow –Subjobs: 9 (Reserved) –Input: 1 Protein, 10 Ligands –Flexx write: 25 (varies) Dynamic ‚Bio‘ Workflow is combined with the gLite feature ‚DAG‘ Nodes are equally loaded 25 24

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 14 User Input XML (User input) –Experiment Name –VO name –Middleware –SE, LFC –Proteins, Ligands, Sites & Receptor –Job Information  Job Type  Subjobs required (approximate)  Input Script (batch files)

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 15 Monitoring / Meta-data Management Meta-data Management Invokes the handle „ Runs the query –Select, Insert or Update (Id, JobId, NodeName, JobType, InputLfn, OutputLfn, Destination, Hostname, StartTime, StopTime, CPUTime, SizeOfInput, SizeOfOutput, SizeOfApps, SizeOfSandboxStart, SizeOfSandboxStop) Returns the response of the request (Result Set) scaimeta.scai.fraunhofer.de Invokes the web service(Request) Response

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 16 Estimation STAGES –Considering 10 ligands (20 conformations) as input,  The number of subjobs (nodes) for each stage be 5  The dag consists of 80 nodes 16 stages * 5 = 80  Each node will get 2 ligands 10 ligands / 5 nodes = 2  The approximate time required for a workflow to complete the process is 15 hours without getting stuck up in long queues or failed sites without any application / software failures without any grid problems  Achieved on DECH VO by extending the proxy limit to 72 hours

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 17 Estimation Job Name Job Input/job Appr.time/L Appr.time/job Flexx5 2L 3 min6 min Tleap5 2L 15 min30 min Sander(Min1)5 2L 21 min42 min Convert5 2L 42 sec2 min Cluster5 2L 6 sec12 sec Sander(Min2)5 2L 3.5 min7 min Sander(Md1)5 2L 1.5 hrs3 hrs Sander(Md2)5 2L 3.8 hr8 hrs Convert5 2L 6 sec12 sec Tleap25 2L 9 min18 min Sander(Min3)5 2L 6 sec12 sec Convert5 2L 12 sec24 sec Tleap35 2L 8 min16 min Sander(Min4)5 2L 7 min14 min Tleap45 2L 6 min12 min Sander(Min4)5 2L 6 min12 min

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 18 Limitation Time Constraint (Biomed) –Limited Proxy (24 hrs)  To successfully run the workflow we have split it into 3 parts and submitted one after the other to finish it within the available time frame –Successfully run on DECH VO  Proxy was extented to 72 hours gLite updating creates problems which require the restart of WMS which aborts the running jobs –Updates are often not tested for complex jobs Long Black List Jobs are forwarded to sites which are having queues of length 4444 (to failed sites). These jobs are getting scheduled for a long time Jobs are getting aborted due to the following errors –Job got an error while in the CondorG queue. –File not available.Cannot read JobWrapper output, both from Condor and from Maradona. –Got a job held event, reason: Repeated submit attempts (GAHP reports:) –93 the gatekeeper failed to find the requested service –Got a job held event, reason: Unspecified gridmanager error –unable to register job – system load too high

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 19 Challenges Developing a GUI –for the ease of the user Integrating gLite API Automatic restart of aborted jobs Control of FlexX licences

Enabling Grids for E-sciencE EGEE-II INFSO-RI {astrid.maass, 20 Thank You Thanks to... the EGEE-team at SCAI: –Jiri Kraus –Andre Gemünd –Christoph Lenzen –Horst Schwichtenberg –Klaere Cassirer the BiosolveIT for providing FlexX licences