Avian Flu Data Challenge Hsin-Yen Chen ASGC 29 Aug. 2007 APAN24.

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

The Access Grid Ivan R. Judson 5/25/2004.
Grid Computing and Applications in China Kai Nan Computer Network Information Center (CNIC) Chinese Academy of Sciences (CAS) 17 Mar 2008.
Biosciences Working Group Update & Report Back Wilfred W. Li, Ph.D., UCSD, USA Habibah Wahab, Ph.D., USM, Malaysia Hosted by IOIT Hanoi, Vietnam, Oct 29,
Inetrconnection of CNGrid and European Grid Infrastructure Depei Qian Beihang University Feb. 20, 2006.
Addressing emerging diseases on the grid
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks World-wide in silico drug discovery against.
INFSO-RI Enabling Grids for E-sciencE WISDOM mini-workshop Vincent Breton (CNRS-IN2P3, LPC Clermont-Ferrand) ISGC 2007 March 28th,
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
INFSO-RI Enabling Grids for E-sciencE Using Grid Computing to Accelerate Structure-Based Design Against Influenza A Neuraminidases.
CoMPAS Pro: Comprehensive Meta Prediction and Annotation Services for Proteins Sebastian J. Schultheiß Christoph Malisi.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Bioinformatics is the application of computer science and information technologies to the processing and analysis of biological data. Bioinformatics is.
Protein Structure Computation Gwyn Skone OUCL PRG Student Conference 2006.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Using the WS-PGRADE Portal in the ProSim Project Protein Molecule Simulation on the Grid Tamas Kiss, Gabor Testyanszky, Noam.
Java Adaptive Mathematical Modeling Engine (JAMME) Leeland Artra, Cell Systems Initiative (CSI) Zheng Li, Department of Bioengineering University of Washington,
FKPPL workshop May 2012 BUI The Quang Prof. Vincent Breton Prof. Doman Kim Prof. NGUYEN Hong Quang Prof. PHAM Quoc Long Grid enabled in silico drug discovery.
InsilicoCell: an integrated platform for biological model development and analysis Thai Quang Tung Korea Institute of Science and Technology Information.
1 EGEE Grid in Asia Simon C. Lin Academia Sinica Grid Computing Centre Taipei, Taiwan 16 November 2007 Do-Son ACGrid School in Hanoi, Vietnam.
Application of e-infrastructure to real research.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Building Grid-enabled Virtual Screening Service.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Application Case Study: Distributed.
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
DR Software: Essential Foundational Elements and Platform Components UCLA Smart Grid Energy Research Center (SMERC) Industry Partners Program (IPP) Meeting.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Protein Molecule Simulation on the Grid G-USE in ProSim Project Tamas Kiss Joint EGGE and EDGeS Summer School.
1 IDGF International Desktop Grid Federation How can you benefit from joining IDGF? Hannover, Peter Kacsuk, MTA SZTAKI, EDGI.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
PRAGMA Avian Flu Grid: Drug Discovery against Infectious Diseases Wilfred W. Li, Ph.D., UCSD, USA Habibah Wahab, Ph.D., USM, Malaysia.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Page 1 SCAI Dr. Marc Zimmermann Department of Bioinformatics Fraunhofer Institute for Algorithms and Scientific Computing (SCAI) Grid-enabled drug discovery.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
EGEE-II INFSO-RI Enabling Grids for E-sciencE WISDOM, a grid enabled virtual screening initiative Yannick Legré LPC Clermont-Ferrand,
INFSO-RI Enabling Grids for E-sciencE In silico docking on EGEE infrastructure, the case of WISDOM Nicolas Jacq LPC of Clermont-Ferrand,
Grid Application Platform (GAP) Developed By ASGC ISGC 2007.
EGEE-II INFSO-RI Enabling Grids for E-sciencE WISDOM in EGEE-2, biomed meeting, 2006/04/28 WISDOM : Grid-enabled Virtual High Throughput.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
6 February 2009 ©2009 Cesare Pautasso | 1 JOpera and XtremWeb-CH in the Virtual EZ-Grid Cesare Pautasso Faculty of Informatics University.
INFSO-RI Enabling Grids for E-sciencE EGEE Review WISDOM demonstration Vincent Bloch, Vincent Breton, Matteo Diarena, Jean Salzemann.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
The User Perspective Michelle Osmond. The Research Challenge Molecular biology, biochemistry, plant biology, genetics, toxicology, chemistry, and more.
KnowARC objectives & challenges Balázs Kónya/Lund University Oslo, 1 st KnowARC Conference.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Simula Research Laboratory Lokaliteter & Forskning
INFSO-RI Enabling Grids for E-sciencE Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček.
HA Neuramindase (NA) and replication of virions A enzyme, cleaves host receptors help release of new virions NA Modeling HTS against Inf-A NA on Grid Ying-Ta.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
Presented by The Harness Workbench: Unified and Adaptive Access to Diverse HPC Platforms Christian Engelmann Computer Science Research Group Computer Science.
Susanna Guatelli Geant4 in a Distributed Computing Environment S. Guatelli 1, P. Mendez Lorenzo 2, J. Moscicki 2, M.G. Pia 1 1. INFN Genova, Italy, 2.
M. Paganoni, APAN29, Feb 11 th, Marco Paganoni INFN, Italy APAN 29 conference Sydney, Feb. 11 th, 2010 EUAsiaGrid.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
A Collaborative e-Science Architecture towards a Virtual Research Environment Tran Vu Pham 1, Dr. Lydia MS Lau 1, Prof. Peter M Dew 2 & Prof. Michael J.
Grids and Clouds Interoperation: Development of e-Science Applications Data Manager on Grid Application Platform Hsin-Yen Chen & Wei-Long Ueng Academia.
MSF and MAGE: e-Science Middleware for BT Applications Sep 21, 2006 Jaeyoung Choi Soongsil University, Seoul Korea
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Milanesi Luciano Catania, Italy 13/03/2007 Bioinformatics challenges in European projects in Grid. Milanesi Luciano National Research Council Institute.
Biosciences Working Group Update Wilfred W. Li, Ph.D., UCSD, USA Habibah Wahab, Ph.D., USM, Malaysia Hosted by UCSD San Diego, USA, March 3, 2010.
Docking and Virtual Screening Using the BMI cluster
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Introduction to Grids and the EGEE project.
Design of AMGA Web Application Based on Belle II User Requirements Taesang Huh, Geunchul Park, Jae-Hyuck Kwak, Seungwoo Rho, Soonwook Hwang, JungHyun Kim,
Integrating Scientific Tools and Web Portals
EGEE support for HEP and other applications
WISDOM-II, status of preparation
Hsin-Yen Chen 29 Aug APAN24 ASGC
In silico docking on grid infrastructures
Presentation transcript:

Avian Flu Data Challenge Hsin-Yen Chen ASGC 29 Aug APAN24

translation / step=2.0 Å quaternion / step =20 degree torsion / step= 20 degree number of energy evaluation =1.5 X 10 6 max. number of generation =2.7 X 10 4 run number =50 translation / step=2.0 Å quaternion / step =20 degree torsion / step= 20 degree number of energy evaluation =1.5 X 10 6 max. number of generation =2.7 X 10 4 run number =50 2D compound library 3D structure “drug-like” Lipinski’s RO5 ionization tautermization 3D structure library structure generation energy minimization 308,585 (6 known drugs) 8 structures (including 1 original type) Targets Compound selection Grid Data Challenge Drug Analysis: Modeling Complex Molecular docking (Autodock) ~137 CPU years, 600 GB data Data challenge on EGEE, Auvergrid, TWGrid ~6 weeks on ~2000 computers

Lessons learned from the 1st Grid DC In general, grid is helpful; however … the application interface is not friendly for end-users. Lack of a friendly user interface to launch the in-silico docking process on the Grid Requirements concerning the post data analysis An easy-to-use system to simplify the access of the docking results An automatic refinement pipeline emulating the real wet-lab screening process (initial screening → filtering → refinement screening) Compound preparation issue Compounds should be carefully selected to ensure they are purchasable from vendors. Compounds should be better annotated with chemical properties.

2nd Avian Flu Data Challenge Objective Biology goals Re-analyzing the mutations based on the X-ray structures Comparing the open and close conformations of Neuraminidase Grid goal Realizing the 2-step docking emulating the wet-lab workflow Stress testing the new system pushing to a production grid application service

Challenge overview 8 NA targets Close and open conformations from PDB Mutations at E119V, H274Y, R292K 500,000 compounds + 12 positive controls 500,000 compounds 300,000 from in-house collection of AS-GRC 200,000 from SPEC library 2-step pipeline 1st step to quickly filter out 50% non-interesting compounds (~ 100 CPU years) 2nd step to refine the rest 50% (~ 100 CPU years) Docking program Autodock v3 Docking system DIANE, WISDOM with improved environment for data analysis (integrated with GAP)

Partners Grid collaborators EGEE CERN, Switzerland IN2P3/CNRS, France ITB/CNR, Italy Asian-Pacific partners KISTI, Korea NGO, Singapore Laboratories Genomic Research Center, Academia Sinica, Taiwan Chonnam National University, South Korea Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, China

GAP in DC2 Why GAP ? Light-weight client runs on user ’ s desktop High-level interface for job configuration and data visualization Easy to manage the distributed dockings performed by WISDOM and DIANE

Grid Application Platform (GAP)

The layered GAP architecture Interfacing computing resources High-level application logic Re-usable interface components Reduce the effort of developing application services Reduce the effort of adapting new technologies Concentrate efforts on applications

Grid Application Platform: The architecture overview Service Oriented Architecture Multi-user Environment Common Interface to Heterogeneous Environment Portable & light- weight Client

Common interface to different resources common interfacedifferent resources

Demo VQSClient command-line shell the VQSClient is based on a JAVA interpreter Configure the properties of the current VQSClient shell VQS [1]: config();