Center for Bioinformatics, University of Tübingen

Slides:



Advertisements
Similar presentations
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
Advertisements

1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
Distributed Systems Architectures
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
11 Application of CSF4 in Avian Flu Grid: Meta-scheduler CSF4. Lab of Grid Computing and Network Security Jilin University, Changchun, China Hongliang.
RXQ Customer Enrollment Using a Registration Agent (RA) Process Flow Diagram (Move-In) Customer Supplier Customer authorizes Enrollment ( )
Taverna: From Biology to Astronomy Dr Katy Wolstencroft University of Manchester my Grid OMII-UK.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
Create an Application Title 1A - Adult Chapter 3.
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
Sandra Gesing Division for Simulation of Biological Systems Eberhard-Karls-Universität Tübingen Portals for Life.
Photo Slideshow Instructions (delete before presenting or this page will show when slideshow loops) 1.Set PowerPoint to work in Outline. View/Normal click.
Break Time Remaining 10:00.
EU market situation for eggs and poultry Management Committee 20 October 2011.
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Chapter 11: The X Window System Guide To UNIX Using Linux Third Edition.
Sandra Gesing Eberhard-Karls-Universität Tübingen Requirements on a portal for MoSGrid (Molecular Simulation.
15. Oktober Oktober Oktober 2012.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
Mobility Tool Fremtidens afrapportering 2013 – Erasmus Mobilitet / IP 2014 – Erasmus+ aktioner.
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Analyzing Genes and Genomes
Systems Analysis and Design in a Changing World, Fifth Edition
Essential Cell Biology
Clock will move after 1 minute
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 13 Slide 1 Application architectures.
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
© Paradigm Publishing, Inc Access 2010 Level 2 Unit 2Advanced Reports, Access Tools, and Customizing Access Chapter 8Integrating Access Data.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
South Dakota Library Network MetaLib User Interface South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD © South Dakota.
TIDE Presentation Florida Standards Assessments 1 FSA Regional Trainings Updated 02/09/15.
Peter Rice Bioinformatics and Grid: Progress and Potential Peter Rice, EBI ISGC, April 2005.
Taverna the story from up-above Antoon Goderis The University of Manchester, UK DART workshop, Brisbane,
UNICORE Integration into a Science Gateway for MoSGrid Sandra Gesing Faculty of Science, Center for Bioinformatics 11/17/2010.
Workflows within Taverna Stuart Owen University of Mancester, UK
The Representation of Scientific Data
Science, Workflows and Collections Professor Carole Goble The University of Manchester, UK
07/06/11 New Features of WS-PGRADE (and gUSE) 2010 Q Q2 Miklós Kozlovszky MTA SZTAKI LPDS.
Taverna: A Workbench for the Design and Execution of Scientific Workflows Dr Katy Wolstencroft myGrid University of Manchester.
Going with the Flow Distributed Computing for Systems Biology Using Taverna Prof Carole Goble The University of Manchester, UK
Convert generic gUSE Portal into a science gateway Akos Balasko 02/07/
An Identity Crisis in the Life Sciences Jun Zhao, Carole Goble and Robert Stevens The University of Manchester, UK Thanks to: Tom Oinn, Matthew Pocock,
Taverna Workbench Stuart Owen University of Mancester, UK
First International Workshop on Portals for Life Sciences Sandra Gesing
EScience Case Studies Using Taverna Dr. Georgina Moulton The University of Manchester
The Semantic Web, Service Oriented Architectures, the my Grid Experience Carole Goble
Convert generic gUSE Portal into a science gateway Akos Balasko.
Application Specific Module Tutorial Zoltán Farkas, Ákos Balaskó 03/27/
Selected Workflow and Semantic Experiences from my Grid Professor Carole Goble The University of Manchester, UK
An Introduction to Taverna caBIG monthly workspace call and Taverna, Franck Tanoh.
Introduction to Workflows with Taverna and myExperiment Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft and Aleksandra Pawlik.
Workflow and Analysis Approaches for Molecular Dynamics Simulations Catania, September 21 st, 2010 Jens Krüger *, Georg Birkenheuer, Sebastian Breuers,
Martin Wewior, RRZK 1 Technologies for Portlets for Molecular Simulations A Portlet for Quantum Chemistry Martin Wewior, Lars Packschies,
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Introduction to Workflows with Taverna and myExperiment Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft.
Sandra Gesing University of Notre Dame, Center for Research Computing 6 March 2015 UNICORE Use Case Integration for XSEDE and PRACE.
WS-PGRADE for Molecular Sciences and XSEDE
Presentation transcript:

Center for Bioinformatics, University of Tübingen The MoSGrid Portal – A workflow-enabled Grid Portal for Molecular Simulations Sandra Gesing Center for Bioinformatics, University of Tübingen sandra.gesing@uni-tuebingen.de 28.04.2010

Outline Motivation MoSGrid (Molecular Simulation Grid) The MoSGrid portal Domain specific workflows MSML (Molecular Simulation Markup Language) Future work MoSGrid Portal

Motivation Numerous applications for molecular simulations and docking, e.g. Materials science Structural biology Drug design Sophisticated tools and algorithms support scientists High-performance computing facilities are available MoSGrid Portal

Motivation Drawbacks of using molecular simulations and docking Usability of tools is limited Complexity of methods Lack of graphical user interfaces Complexity of infrastructures Many end users lack computer science background ⇒ Need for self-explanatory and intuitive user interfaces ⇒ A portal for molecular simulations and docking MoSGrid Portal

Portals Single point of entry Possibility to customize views and tools Store user preferences No installation of software on the user’s side No firewall issues MoSGrid Portal

Unifying Diversity MoSGrid Portal 12181 acatttctac caacagtgga tgaggttgtt ggtctatgtt ctcaccaaat ttggtgttgt 12241 cagtctttta aattttaacc tttagagaag agtcatacag tcaatagcct tttttagctt 12301 gaccatccta atagatacac agtggtgtct cactgtgatt ttaatttgca ttttcctgct 12361 gactaattat gttgagcttg ttaccattta gacaacttca ttagagaagt gtctaatatt 12421 taggtgactt gcctgttttt ttttaattgg gatcttaatt tttttaaatt attgatttgt 12481 aggagctatt tatatattct ggatacaagt tctttatcag atacacagtt tgtgactatt 12541 ttcttataag tctgtggttt ttatattaat gtttttattg atgactgttt tttacaattg 12601 tggttaagta tacatgacat aaaacggatt atcttaacca ttttaaaatg taaaattcga 12661 tggcattaag tacatccaca atattgtgca actatcacca ctatcatact ccaaaagggc 12721 atccaatacc cattaagctg tcactcccca atctcccatt ttcccacccc tgacaatcaa 12781 taacccattt tctgtctcta tggatttgcc tgttctggat attcatatta atagaatcaa Slide copied from: Stuart Owen „Workflows with Taverna“ MoSGrid Portal

MoSGrid Molecular Simulation Grid (D-Grid project) Goal Providing users with Grid access to molecular simulation tools and docking tools via a workflow-enabled portal Implementation of high-performance computing Workflows Annotations of results Data mining Use of the D-Grid-infrastructure MoSGrid Portal

MoSGrid Partners Universität zu Köln Eberhard-Karls-Universität Tübingen Universität Paderborn Konrad-Zuse-Zentrum für Informationstechnik Berlin Technische Universität Dresden Technische Universität Dortmund Bayer Technology Services GmbH, Leverkusen Origines GmbH, Martinsried GETLIG&TAR, Falkensee BioSolveIT, Sankt Augustin COSMOlogic GmbH&Co. KG, Leverkusen MoSGrid Portal

High-level middleware service level MoSGrid in a Nutshell Portal WS-PGRADE Workflow Structure Recipe Result High-level middleware service level gUSE Cloud File System Grid resources UNICORE 6 XtreemFS Result MoSGrid Portal

Credential Management User management based on Liferay features Community management Organization management X.509 user certificates SAML (Security Assertion Markup Language) Minimize credential data transfers Set of maximum hops for trust delegation Usable for single sign-on infrastructures (e.g., Shibboleth) MoSGrid Portal

Credential Management MoSGrid Portal

WS-PGRADE MoSGrid Portal

WS-PGRADE MoSGrid Portal

WS-PGRADE MoSGrid Portal

High-level middleware service layer gUSE Architecture grid User Support Environment User interface WS-PGRADE Application repository Information system Logging Submitters Workflow storage engine High-level middleware service layer gUSE grid User Support Environment Grid resources middleware layer UNICORE 6 MoSGrid Portal

gUSE Submitter Interface GridService actionJobSubmit actionJobAbort actionJobOutput actionJobStatus actionJobResource JOB1 JOB2 JOB3 JOB4 Workflow engine JOBn Submitter GridService MoSGrid Portal

gUSE Submitter for UNICORE Resources actionJobSubmit UNICORE Atomic Services JOB1 JOB2 JOB3 JOB4 1 - Security 2 - Registry 3 - Submit job 5 - Start job Workflow engine JOBn UNICORE submitter (UCC lib) 4 - Upload data Uspace MoSGrid Portal

ASM (Application Specific Module) Library for managing WS-PGRADE workflows Listing of users and workflows in the local repository Import of Workflows in the user space Upload/download of input and output files Setting the parameters of a job in a workflow Submission of workflows Monitoring of workflows Deletion of workflows Usable in portlets und Java tools ⇒ Implicit use of gUSE submitter MoSGrid Portal

Distributed Data Management XtreemFS is an object-based grid and cloud filesystem Ability to minimize data transfer Low latency, local availability through replication Grid Security Infrastructure (GSI) support MoSGrid Portal

Distributed Data Management XtreemFS integration Portlet UNICORE GSI support Data flow WS-PGRADE XtreemFS Frontend nodes Compute nodes UNICORE mediates data transfers XtreemFS UNICORE TSI MoSGrid Portal

Domain Molecular Dynamics Study and simulation of molecular motion Provide a molecular dynamics service on multiple levels Direct upload of job descriptions Workflows and standard recipes for repeating tasks Analysis of relevant properties MoSGrid Portal

Equilibration of Proteins Proteins from databases (e.g., the Protein Data Bank, PDB) do not necessarily represent a near-native conformation/configuration For all kind of production runs a minimization and an equilibration is an indispensable prerequisite Eases the work of experienced users Lowers the hurdle for novice users MoSGrid Portal

UseCase: Gromacs_EQ structure (pdb/gro) pdb2gmx structure (pdb) topology (top/itp) editconf genbox box (pdb) EM.mdp (mdp) Solvated (pdb) adj. Top. (top/itp) grompp topol.tpr mdout.mdp MoSGrid Portal

ener.edr state.cpt mdrun xmgrace g_energy traj.trr traj.xtc md.log SYSTEM_EM.pdb Analysis.jpg grompp topol.tpr mdout.mdp FULL.mdp (mdp) mdrun xmgrace g_energy ener.edr traj.trr traj.xtc state.cpt md.log SYSTEM_EQ. pdb Analysis.jpg MoSGrid Portal

MD Portlet MoSGrid Portal

Domain Quantum Chemistry Study and simulation of molecular electronic behavior relative to their chemical reactivity Survey - MoSGrid Community First implementation for Gaussian Then support for Turbomole GAMESS-US Further relevant QC applications MoSGrid Portal

Domain Quantum Chemistry Gaussian Jobs Single input file Defines molecular geometry and task Result Not structured output Platform dependent checkpoint file Integrated multi-step job option Not usable for generalized workflows MoSGrid Portal

Domain Quantum Chemistry First prototype Workflow controlled by portlet Three phases Pre-processing Job execution Post-processing MoSGrid Portal

Domain Quantum Chemistry Workflows Assisted job creation Guiding GUI Most common options available Pre-created job description Upload of Gaussian job description file Monitoring of jobs Post-processing and presentation of results MoSGrid Portal

Domain Quantum Chemistry Preprocessing Portlet (GUI) supports common options Automatic generation of job description Submission of job MoSGrid Portal

Domain Quantum Chemistry Post-processing Parsing of result file Python scripts executed by portlet Relevant information about molecular properties Data in CSV-Format saved and accessible MoSGrid Portal

Domain Docking CADDSuite (Computer-aided Drug Design) MoSGrid Portal

Domain Docking Galaxy available for local ressources in Tübingen MoSGrid Portal

MolDB Stores molecules in binary format, which allows for fast export Automatically creates and stores can. smiles, fingerprints, and functional groups counts for imported molecules Automatically saves and restores docking-/rescoring-results DB can be filtered to all stored molecule properties before exporting molecules Current speed for import/export: ~100 compounds/sec. MoSGrid Portal

MSML Molecular Simulation Markup Language Based on CML (Chemical Markup Language) Common interpretation by humans and computers Follows the minimum information principle Description: http://xml-cml.org/convention/dictionary XSL transformation Used for validation purposes validator.xml-cml.org MoSGrid Portal

Future Work WS-PGRADE MD- and QC-Portlet CADDSuite MSML Integration of the UNICORE IDB to offer drop-down boxes of available tools MD- and QC-Portlet Adoption to gUSE workflow engine via the ASM libraries CADDSuite Export of workflows from Galaxy to WS-PGRADE MSML Further development WS-PGRADE Anstatt MoSGrid Portal

Involved Projects SHIWA (SHaring Interoperable Workflows for Large Scale Scientific Simulations on Available DCIs) EU project Duration: 01.07.2010 – 30.06.2012 Tübingen participates via Galaxy workflow export CompChem Virtual Organization EGEE project Available ressources MoSGrid Portal

Future Projects SCI-BUS (SCIentific gateway Based User Support) EU project Duration: 01.10.2011 – 30.09.2014 Pan-European ressources Tübingen participates with the extension of the MoSGrid portal with an interactive molecule editor and a semantic search MoSGrid Portal

Acknowledgements Oliver Kohlbacher Ákos Balaskó Georg Birkenheuer Sebastian Breuers Richard Grunzke Sonja Herres-Pawlis Valentina Huber Miklos Kozlovszky Jens Krüger István Márton Patrick Schäfer Bernd Schuller Johannes Schuster Anna Szikszay Fabri Klaus-Dieter Warzecha Martin Wewior MoSGrid Portal

MoSGrid Portal